ELECTIONS


I. THE FUNCTIONS OF ELECTIONS 
II. ELECTORAL SYSTEMS


W. J. M. Mackenzie 
Stein Rokkan 


I 
THE FUNCTIONS OF ELECTIONS 


Elections may be regarded as one procedure for 
aggregating preferences of a particular kind. 

Liberal democratic theories attribute special au- 
thority to the amalgamation of the expressed pref- 
erences of individuals through recognized proce- 
dures. They reject the idea that social choice can 
be made by some sort of group mind or interper- 
sonal entity built out of individuals but different 
from them in kind. They also reject the idea that 
social choice is a mere illusion, that is, the notion 
that what appears to be a choice between alterna- 
tives is really no more than the consequence of the 
interplay of various forces. 

But it has been argued, in the attack on welfare 
economics (Arrow 1951; Little 1950), that the 
preference schedules of individuals cannot be amal- 
gamated without paradox except on one of two 
conditions: either through the operation of a mar- 
ket or through the compliance of individual partic- 
ipants with decisions by a recognized authority. 
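
The simplest illustration of such a paradox is the cyclical majority. The sketch below is an assumption-laden example, not a statement of Arrow's theorem itself: the three preference schedules and the helper names (`majority_prefers`, `pairwise_results`) are hypothetical and chosen only to show how pairwise majority voting can fail to yield a consistent collective ranking.

```python
from itertools import combinations

# Hypothetical preference schedules: each voter ranks three
# alternatives, most preferred first.
schedules = [
    ["A", "B", "C"],
    ["B", "C", "A"],
    ["C", "A", "B"],
]


def majority_prefers(x, y, schedules):
    """Return True if a strict majority of voters rank x above y."""
    wins = sum(1 for s in schedules if s.index(x) < s.index(y))
    return wins > len(schedules) / 2


def pairwise_results(schedules):
    """Map each pair of alternatives to its majority winner (or None on a tie)."""
    alternatives = sorted(schedules[0])
    results = {}
    for x, y in combinations(alternatives, 2):
        if majority_prefers(x, y, schedules):
            results[(x, y)] = x
        elif majority_prefers(y, x, schedules):
            results[(x, y)] = y
        else:
            results[(x, y)] = None
    return results


results = pairwise_results(schedules)
for pair, winner in results.items():
    print(pair, "->", winner)
```

With these schedules A defeats B, B defeats C, and C defeats A, so no alternative can be called the collective first choice even though every individual schedule is perfectly consistent.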

Liberal theories would certainly accept the idea 
that in certain cases to be defined social choice is 
made and should be made through the market or 
by relying on authority. But they postulate also that 
there are and should be public decisions in which 
citizens make an explicit choice between alterna- 
tive courses of public action. This can be done in 


practice only through forms of procedure generally 
accepted as binding within the political society. 

Voting is one of these procedures but not the 
only one. It is relevant to quote an authority on the 
practice of the Dominican order in the Middle Ages 
(Galbraith 1925, p. 33) to the effect that choice 
might be made by vote, by explicit agreement after 
negotiation, or “as if by the inspiration of God.” 
Certainly one finds everywhere, even in the most 
developed societies, choice by bargaining between 
factions and choice by acclamation, and there may 
be other procedures as well. It appears, however, 
that in “liberal” societies voting is held in reserve 
as a procedure possessing special authority within 
the group, organization, or state. Conversely, elec- 
tions are by no means the only occasion for pro- 
cedure by vote. Voting on propositions is of great 
practical importance in many different social and 
political situations, and it raises similar problems 
of formal analysis (Black 1958). 

Voting in nationwide elections has a position of 
special importance in Western democracies. Its 
authority is strengthened because similar proce- 
dures are used for social choice in many institu- 
tions, large and small, public and private, through- 
out the society. (It is not greatly weakened by the 
existence of formal paradoxes of voting, even 
though these anomalies are of some tactical im- 
portance to groups seeking victory for their own 
interests.) This predominance has led to the export 
of voting in elections to countries where voting pro- 
cedure has not historically possessed the same 
social authority as in the West: countries of the 
Soviet bloc on the one hand and developing coun-
tries on the other. This may give rise to situations
in which the procedure exists but the element of 
choice does not. 


Definition of elections 


Thus, it is not always easy to answer the ques- 
tion, What is a real election? —and it may be use- 
ful to attempt a formal definition. What follows is 
based on English usage of political terms and may 
not have general validity, but it will serve to indi- 
cate important points for discussion. 

One requires, first, the concept of recognized 
positions or roles (“offices”) which confer certain 
powers and duties within an organization. Individ- 
uals may be assigned to office either by choice or 
by a method independent of choice, such as a rule 
of inheritance, or seniority, or regulated trial by 
competition. Next, a general concept is needed, 
such as “to choose a man for a job” or perhaps “to 
decide between candidates for a job.” Within this 
concept, one must distinguish among “electing,” 
“appointing,” and “co-opting” a man. In English 
each word has overtones of political evaluation. 
“Election” (provided it is “free”) would be deemed 
“democratic” and therefore good, but for certain 
positions only. “Appointment” would be regarded 
as “patronage” that tends to increase the power of 
the patron, except insofar as it is hedged by rules 
specifying the field of “qualified” candidates. “Co- 
option” smacks of oligarchy, the self-perpetuation 
of a ruling group, unless similarly regulated. 

On this basis election might be defined as a form 
of procedure, recognized by the rules of an organ- 
ization, whereby all or some of the members of the 
organization choose a smaller number of persons 
or one person to hold office of authority in the 
organization. 

This definition raises a number of points. (1) It 
attempts to embrace both formal procedure and 
social significance—both “rules” and “choice.” 
Ideally, both elements should be present in an 
election. To mark a ballot paper and drop it in a 
ballot box is not “electing” unless the actor “chooses” 
in some socially significant sense. But equally a 
choice is not a “vote in an election” unless the 
chooser conforms to the specified legal procedure. 
J. L. Austin made the same point (1961; 1962) 
when he said that “I vote for Mr. A” is not a state- 
ment but a verbal act or performative utterance 
and that the same act can be achieved without 
words where this is the proper procedure. Never- 
theless, it may be convenient to use the word “elec- 
tion” for something that falls short of such com- 
pleteness; for instance, where procedure is followed 
but no choice is present, or where there is a sig- 


nificant element of choice without close conformity 
to a socially recognized procedure (Akzin 1960). 

(2) The rather loose word “organization” is 
chosen here deliberately. The word “election” is not 
used only for “state” elections to a hierarchy of 
public bodies. Indeed, it could be maintained that 
state elections are effective only where electoral 
procedure is regarded as a usual procedure through- 
out the society and is therefore written into the 
rules of all sorts of nonpublic bodies, such as busi- 
ness companies, trade unions, free churches, sports 
clubs, and so on. Nor would it do to replace “an 
organization” by “a society.” This might imply that 
a voter can choose only within his society, whereas 
multiple membership of overlapping organizations 
is characteristic of complex societies, and one man 
may be a voter in many different capacities and 
under different rule systems. 

(3) Two phrases in the definition—“the rules 
of an organization” and “the members of an organ- 
ization choose”—refer to fundamental conceptual 
problems in social science. All that need be said
here is that ordinary language about elections deals 
with persons acting within systems of ethical norms 
and legal procedures. It is possible to reject this 
language, as would happen if either economic de- 
terminism or behaviorism were strictly applied in 
social science. Such studies might have substantial 
predictive value in relation to electoral behavior, 
but they would leave unanswered some fundamen- 
tal questions about what men think they are doing 
when they participate in elections. 

(4) The word “office” implies a position desig- 
nated by the same system of rules that determines 
the electoral procedure. The general problem is that 
in all social systems persons must somehow be 
linked to offices; election is one of many different 
procedures used to ensure legal succession to office 
in different organizations and societies.

(5) It remains to distinguish election from ap- 
pointment or co-option. There are ambiguities in 
usage here. For instance, fellows of a college would 
use the word “elect” both for choosing a master 
and for choosing a junior colleague; critics of the 
college system might accept the former usage but 
would describe the latter as “co-option.” Political 
advantage may be drawn from these ambiguities at 
various levels of political debate; in England, at 
least, “election” is a good word, “patronage” is a 
bad word, and “co-option” lies in between. This
usage suggests the following distinctions: 

(a) In an election the choosers are a relatively 
numerous body. Choice by one voter would, of 
course, be an appointment. But how many choosers 
are needed to make an election? 


(b) There is a question of proportion as well as 
of absolute number. If ten choosers voted to fill one 
office, one might call it election; if they filled 100 
offices, one would tend to call it appointment (or 
even patronage). But once again there is no sharp 
point of division. 

(c) There is a question of the relationship be- 
tween the choosers and the office to be filled. A 
person co-opted would be a colleague; a person 
appointed would be a subordinate, even though he 
might exercise great discretionary power; a person 
elected would hold an office of authority, which 
might include authority over those who elected 
him. 

(d) It may be said that when electing, the voters 
act independently of one another and more or less 
at the same time, whereas an appointing body acts 
in consultation, with each member sharing in the 
deliberation and expressing his point of view in 
turn until a conclusion is reached (Akzin 1960). 
This is a very important problem in the study of 
political development, but it seems to be a distinc- 
tion between voters and councilors, rather than 
between election and appointment. Deliberative pro- 
cedure in council is very widespread in human soci- 
eties at all stages; under some circumstances 
(which have nowhere been seriously studied) the 
device of voting is used to bring issues to a con- 
clusion. But election does not inevitably entail vot- 
ing; in certain societies the proper procedure for 
election is by council, in others by acclamation, 
and in yet others by voting. 


Historical development 


Elections first took a central place in politics in 
the Greek city-states of the eastern Mediterranean 
in the fifth and sixth centuries B.C. There has been
no systematic study of elections in societies inde- 
pendent of this Western tradition; certainly, traces 
are to be found elsewhere, but it does not seem that 
elections have played a central part in other so- 
cieties. In the following discussion it has been 
assumed that electoral procedures can usefully be 
studied historically in terms of the diffusion of a 
social pattern from a single source and its modifi- 
cation in a great variety of situations. Further, it 
is assumed that these procedures correspond func- 
tionally to certain general social needs, which are 
particularly marked in literate, technological, and 
mobile societies; hence they have periodically re- 
appeared, after setbacks, in new forms in new cor- 
ners of Western society. Finally, it is assumed that 
where these procedures meet no social needs they 
may be retained as forms but are filled with a new 
content. 

The heroic age. The poems of Homer reflect
a state of society in which rule was by kings whose 
position was conspicuously unlike that of the “Ori- 
ental despots” of the river valley civilizations with 
which they came in contact. The evidence of the 
mythological and epic narratives is difficult to use, 
but it suggests a situation roughly parallel to cases 
found in mobile African societies where the king, 
although drawn from a royal lineage, emerges as 
leader by a process which may include competition, 
conciliar election, and acclamation by the people. 
Clearly the leader of the war coalition, Agamem- 
non, had attained his precarious eminence among 
other kings by a process of this kind. Analogies can 
be drawn from Tacitus’ account of the Germans 
and from the world of Teutonic, Scandinavian, and 
Icelandic epics. 

The Greek democracies. The epic period of tri- 
bal mobility was succeeded by one of peasant agri- 
culture tempered by growing commercial activity 
and emigration to colonies overseas. From this situ- 
ation emerged the strife between the well-born and 
the people, which affected Greek ideas and prac- 
tice about political institutions almost everywhere. 
Where this strife was intense, Greek elections as- 
sumed new forms, either through a complete pop- 
ular victory or through attempts at compromise. 

We are primarily concerned not with voting on 
measures in popular assemblies but with the choice 
of persons to fill offices of authority. Two points are 
of general importance. First, in voting on proposi- 
tions in the assembly of the citizens the rule was 
apparently that of individual voting by show of 
hands (χειροτονεῖν). Use was also made of written
votes (in the procedure of ostracism) and of bal-
lots in the form of pebbles (ψῆφοι), hence psephol-
ogy. There was at times a leader of the assembly
who held his informal position (for example, Per- 
icles, Cleon, Demosthenes) because of fairly stable 
majority support. But holders of certain legally 
recognized offices (in particular, archons and gen- 
erals) were elected by nonlocal constituencies 
known as tribes (φυλαί), which were held to have
been instituted deliberately so as to cut across local 
divisions of interest within Attica. The number of 
voters in each tribe must have differed a good deal. 

Second, the principle of election was accepted 
somewhat grudgingly in Athenian democratic the- 
ory; it infringed the principle of equality among 
citizens, and it was dangerous because it opened 
the way to power for ambitious, attractive, and 
well-trained young men of the old families (for 
example, Alcibiades) and equally for ambitious 
men of the people who were prepared to perpetu- 
ate their electoral victory by force (the common 
pattern of Greek “tyranny”). The orthodox prin-
ciple was that citizens should hold offices of au-
thority in rotation, the order to be determined by
lot; this was the practice for the Council of 500 
and its monthly committees, which maintained 
continuity in the control of public business, and 
also for the selection of juries (methods of “bal- 
loting” for juries are described in great detail by 
Aristotle, Politeia athēnaiōn, chapters 63-66). Sim-
ilar institutions were common in early English and
American practice, and rotation in office is still
quite usual in small voluntary societies. But there
has been no modern discussion of the relation be-
tween the principle of rotation in office and that
of election by vote. It is notable, however, that in
general the Athenians used voting for elections to 
offices requiring special skills, such as military 
leadership, whereas in Western countries voting is 
now used to fill offices of a representative char- 
acter, for which the Athenians used the lot; offices 
requiring special skills are now generally filled by 
appointment from a field determined by specified 
professional qualifications. 

The Roman republic. Even under the repub- 
lic the Romans never accepted the principle of 
“one man, one vote.” Decision in legislation and in 
the choice of the principal officials was by a plural- 
ity of “centuries” or by a plurality of “tribes”: 
within each of these constituencies one man, one 
vote prevailed, but the units varied in size. It was 
tactically important that each of them had some 
local basis, but locality was not decisive in their 
composition. 
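
The effect of deciding by a plurality of units rather than of heads can be sketched as follows. This is a minimal illustration under stated assumptions: the unit names, their sizes, and the options "X" and "Y" are hypothetical and are not drawn from any Roman record; the helper names are likewise invented for the example.

```python
from collections import Counter

# Hypothetical voting units of unequal size: each maps an option
# to the number of individual votes it received inside that unit.
units = {
    "unit_1": {"X": 30, "Y": 20},
    "unit_2": {"X": 25, "Y": 15},
    "unit_3": {"X": 100, "Y": 900},
}


def unit_winner(votes):
    """Option with the most individual votes inside one unit (None on a tie)."""
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def decide_by_units(units):
    """Count one vote per unit, cast for that unit's internal winner."""
    winners = (unit_winner(votes) for votes in units.values())
    return Counter(w for w in winners if w is not None)


unit_tally = decide_by_units(units)

# For comparison: the same ballots counted strictly by head.
head_count = Counter()
for votes in units.values():
    head_count.update(votes)

print("by units:", dict(unit_tally))
print("by heads:", dict(head_count))
```

In this invented case option X carries two of the three units while option Y holds a large majority of individual votes, which is the sense in which equal voting inside each unit falls short of "one man, one vote" for the whole body.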

The medieval church. The tradition of ancient 
elections was preserved in the church rather than 
in the state. It continued unbroken in the Roman
Catholic church, but many national and noncon- 
formist churches also developed the use of elections 
as the basis of a legitimate claim to hold office. 
(It is an interesting coincidence that “election” has 
in Protestant theology a different meaning: that of 
the granting of spiritual grace to God’s elect.)

The most ancient and continuous tradition has 
been that of the election of superiors (popes, 
bishops, deans, priors, and so on) by a relatively 
small electorate consisting of those next in rank. 
Up to a point the procedure is deliberative, tending 
toward a conclusion by “sense of the meeting.” But 
there are also ancient and complex rules about 
voting procedures. These rights of election were 
defended, strongly but not always with success, 
against hierarchical and secular attempts to sub- 
stitute appointment. 

There is an undercurrent (almost Athenian in 


tone) emphasizing the electoral rights of the many 
against the few. In Presbyterian terms, the congre- 
gation will defend the position of the elders in 
appointing a minister insofar as that position is 
endangered by the lay patron, but it claims the 
right to confirm or to upset the verdict of the 
elders. Dissent sometimes accepts the authority of 
a charismatic leader; but it often tends toward the 
equal sovereignty of all true believers, which may 
be shown either by election or by rotation in office. 

Feudalism. The position of the feudal emperor, 
king, or overlord was deemed to be limited by law 
and custom and to some extent by the consent of 
his vassals. 

The relation between king and lord and between 
lord and man was in principle one of consent lead- 
ing to binding mutual obligation. The vassal chose 
to do homage, the lord chose whether or not to 
accept it. It was not a long step from this to an 
elected emperor and (in a few instances) an elected 
king. The social situation greatly limited the appli- 
cation of the principles of consent and election in 
practice; but the idea of a binding legal right of 
succession to office emerged slowly, along with the 
growth of other notions of private and heritable 
property. 

In principle, the king was independent insofar 
as he could “live of his own.” But this was a limited
independence in a period of quite rapid change, 
and in many cases its boundaries were obscure. 
Hence the need for consultation, first with a feudal 
council, then with assemblies “representing” others 
besides immediate vassals. These assemblies were 
the basis of the parliamentary tradition in Europe. 
They embodied two principles not yet wholly ob- 
solete: 

(a) The separate representation of “estates,” 
which might be more or less numerous; for in- 
stance, great lords, great clergy, lesser lords, lesser 
clergy, burghers, peasants.

(b) The representation of local communities 
but not of individuals. The classic case is that of 
the English House of Commons, based on two 
knights from every shire and two burgesses from 
every burgh. Apart from the great men of the 
realm, the “units of account” in government were 
shires and burghs, not individuals. The choice of
representatives by communities was a matter for 
each community, within the general law of the 
land. Elections thus established themselves in na- 
tional government but without any national enact- 
ment about electoral procedure. 

The seventeenth and eighteenth centuries. In 
most of Europe the assemblies of estates were dis- 


placed by autocratic, modernizing monarchies. For 
the diffusion of elections the only important sur- 
vival was in England (the parliaments of Scotland 
and Sweden survived but had little or no influence 
outside their own countries) and in colonial assem- 
blies based on the English model. During the strug- 
gle for survival certain basic principles of consent, 
franchise, and representation were hammered out; 
although these principles were never fully applied 
in practice, they were recognized as the ideological 
basis of a system of democratic elections. The clas- 
sic statements are those of English popular leaders 
in the 1640s and 1650s: their language recalls both 
that of nonconformist congregations and that of 
Athenian democracy. The principle, in brief, is that 
all governments owe their just powers to the con- 
sent of the governed and that in numerous societies 
this consent may be expressed by representatives 
freely elected on a basis of universal adult suffrage. 

This principle can readily be elaborated in insti- 
tutional form, for example, by the extension of the 
suffrage, the equalization of constituencies, pro- 
portional representation, the elimination of intimi- 
dation and corruption, and so on. These elabora- 
tions in turn lead to political situations which 
illustrate ambiguities in the principle; for instance, 
as regards the relation between elected and elec-
tors, is there a difference between a “representa-
tive” and a “delegate”? [See REPRESENTATION.]

Parties in elections. By far the most important 
of these new problems is that of parties as inter- 
mediaries between voter and assembly. Clear recog- 
nition of this situation came first in American 
presidential elections, but it spread rapidly with 
the extension of the franchise in large states in the 
nineteenth century. By the last quarter of that cen- 
tury, parties and elections had become interdepend- 
ent. Electoral parties were no longer limited to 
national politics; trade unions and large coopera- 
tive societies are obvious examples. But national 
elections are henceforth intelligible only in terms 
of parties; the traditional principles demand the 
scrutiny of procedure within parties, since they 
control the first stage of national elections. [See 
PARTIES, POLITICAL.] 

Plebiscitary democracy and “unfree” elections. 
The predominance of parties has led to a change 
in the character of national elections, even in coun- 
tries where electoral procedure is in constant use 
at subnational levels. The choice of a man to hold 
office as a member of an assembly has given place 
to a national vote between different “packages” 
consisting of leadership, party, and program. The 
election is a choice of government or even of regime, 
and voting procedure is called on to bear new
strains. In stable democracies the strains are miti- 
gated because there is an understood difference 
between governmental structure (“government”) 
and government in power (“regime”), and the 
former is not questioned by electors. But where 
this distinction is not drawn the strain may prove 
too great for the electoral system to bear, and the 
element of choice is removed (or greatly reduced) 
by various devices. The oldest of these are plebisci-
tary democracy, which dates from the time of Na-
poleon I, and the exercise of influence on elections
by officials of the government in power, without 
blatant breach of legality. The reductio ad absur- 
dum of these trends appears in “elections” such as 
those of East Germany, where a vote of 99.9 per 
cent was recorded in favor of the government in 
1964. Such a result could only be due to fraud, or 
pressure, or both. However, the fact that the regime 
deems elections necessary seems to pay tribute to 
the immense strength of the tradition that elections 
confer legitimacy. [See DEMOCRACY.]


The functions of elections 


This brief historical summary illustrates the per- 
sistence and adaptability of the use of electoral pro- 
cedure as a means of legitimating the assignment 
of a person to an office of authority. It may be said 
that electoral procedure is functionally analogous 
to procedure in a marriage ceremony: “Do you take 
this man (or woman) to be your lawfully wedded 
husband (or wife)?” “I do” (Austin 1962). The 
point in time at which “I do” is said is not psycho- 
logically a moment of choice or decision—that 
came earlier; it is the point at which an individual 
preference becomes a social commitment. The 
words and acts are “performative”; if correctly said 
and done in the right context, they establish new 
social relationships of a binding character. 

Such acts are generally associated with ritual 
which underlies the multiple relationships linking 
them to a complex system of behavior and belief. 
To continue the analogy with marriage ceremonies, 
there is a possible range of ritual complexity from 
ostentation to extreme simplicity—but even in mar- 
riage by registration in an advanced secular so- 
ciety, some elements of ritual are present. There is 
the same wide range in electoral ritual—for ex- 
ample the election of a pope and that of the direc- 
tors of a manufacturing company; but in both 
cases there is a procedure which has binding effect 
if properly followed. 

Thus, it is possible to speak of elections in gen- 
eral as a “ritual of choice”; the binding character 
of elections derives from the participation of an
individual as chooser in a social act, and legitimate 
authority is thus conferred on the person chosen. 
But such a generalization tells one little about the 
position of elections in any given society. 

Men are called by different kinds of elections to 
different offices in different societies. The historical 
sketch given above notes only a limited range of 
cases, but it may be sufficient to indicate that it is 
rash to talk of the function of elections. This may 
be illustrated by the British case. A British general 
election serves to choose a governing party and 
thus a government. But (on the one hand) that 
government, though powerful, has not a monopoly 
of legitimate authority in the political system. This 
authority is shared by many others—those profes- 
sionally qualified by education and experience, the 
leaders of organized interests, property owners of 
various kinds, and so on. On the other hand, the 
electoral system serves many other functions be- 
sides choice of a government; the party organiza- 
tion based on it serves as a market place and recon- 
ciler of interests, a ladder for the political careers 
of national and local officials, a forum of national 
discussion, and so on. It would be quite amiss to 
assume at once that the same functions were filled 
by elections in Athens, or in the medieval church, 
or even in other industrial societies today. 

Argument about the merits of different electoral 
systems is generally based on assessments of their 
efficiency in relation to one or more of their many 
possible functions. The political literature of Eng- 
land in the nineteenth and twentieth centuries con- 
tains a rich store of such arguments; and this has 
been added to in the process of “decolonization,” 
since “free elections” were assumed to be a neces- 
sary step toward independence in most of Britain’s 
dependent territories. This is, therefore, a conven- 
ient testing ground for theories about the nature of 
political argument, and in particular about the 
relation of ideology to rationality on one flank and 
to self-interest on the other.

A. H. Birch (1964) has shown how contempo- 
rary debate about elections in Britain draws in 
arguments from various historical stages, a mix- 
ture which can be logically justified only if one 
assumes that elections in England serve many 
functions that are not necessarily compatible with 
one another. If one had to ground the defense of 
elections on a single maxim it would doubtless be 
that of the Puritan revolution: “There are no laws 
that in their strictness and vigour of justice any 
man is bound to that are not made by those whom 
he doth consent to.” Parallels for this maxim could 
be found in many other political cultures. The doc- 


trine or ideology is one of great and continuing 
power; but it remains empty until expressed in
terms of institutions and interests, and its sim-
plicity is then obscured and complicated by argu-
ments drawn from other streams of political
doctrine. 


There has been no general study of choice as an 
element in the legitimation of authority. Such a 
study would present great difficulties. It is safe to 
guess that where choice is an element in determin- 
ing authority in simpler societies it is entangled 
with other factors such as seniority, lineage, and 
personal ascendancy. Isolating one factor would 
distort the situation. In complex societies elections 
appear in many different contexts, private and 
public, and electoral procedure often survives as a 
ritual although the element of choice is absent; so 
that it would be difficult, perhaps unwise, to take 
the forms of electoral procedure as a guide in 
unraveling the complexities of modern political 
structure. 

It would be of value, nevertheless, if pilot studies 
could be made of the place of elections in one or 
two cases of simple and complex societies. Very 
little work has been done on the legitimation of 
authority in contemporary societies; it seems proba- 
ble that the part played by elections is relatively 
small even in established democracies, if elections 
are considered separately as a single factor. An 
attempt to isolate this factor might therefore break 
down; but it could hardly fail to sharpen our per- 
ception of the problem, which is of central im- 
portance in political science and is now within the 
grasp of empirical inquiry. 

W. J. M. MACKENZIE 


[See also AUTHORITY; CONSTITUTIONS AND CONSTITU-
TIONALISM; DEMOCRACY; GOVERNMENT; LEGITI-
MACY; MAJORITY RULE; REPRESENTATION.]

II
ELECTORAL SYSTEMS 


Elections are institutionalized procedures for the 
choosing of officeholders by some or all of the rec- 
ognized members of an organization. Whether the 
organization is a club, a company, a party, or a 
territorial polity, an electoral institution can be 
described in a series of dimensions: the scope and 
structure of the organizational unit; the tasks and 
the authority of the offices to be filled; the types 
and levels of membership in the organization and 


the qualifications for participation in the choice of 
officers; the criteria, if any, used in differentiating 
the numerical weight of the choice of each qualified 
member; the extent and character of the subdivi- 
sions instituted within the organizations for pur- 
poses of such choice; the procedures for the setting 
of the alternatives of choice; the procedures used 
in eliciting and registering choices among these 
alternatives; and the methods used in translating 
the aggregated choices of members into authori- 
tative collective decisions on the attribution of the 
given offices. 

A club might recognize as qualified electors only 
a few senior members and require these to vote by 
a show of hands or by acclamation. This practice 
would contrast on a number of dimensions with 
the elaborate procedures of some joint-stock com- 
panies: all shareholders have the right to partici- 
pate, but the weight of their votes is a function of 
the number of shares they hold; their preferences 
are expressed under elaborate provisions of secrecy, 
and their votes are aggregated through strict rules 
of accountability. A systematic discussion of all 
such dimensions of variation, even for just the 
major types of organizations, would take us far 
afield. In this article the discussion will center on 
one distinct type of organization: the territorially 
defined units of the nation-state, namely the self-govern-
ing local community and the overarching unitary
or federal body politic.
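
The contrast between the club and the joint-stock company drawn above turns largely on the weighting of each member's choice. A minimal sketch, assuming hypothetical shareholders, holdings, and candidates (none of them taken from a real company, and with `tally` an invented helper), shows how the same ballots give different results under weighting by shares and under equal weighting.

```python
from collections import defaultdict

# Hypothetical ballots: (shareholder, shares held, candidate chosen).
ballots = [
    ("holder_1", 500, "Director A"),
    ("holder_2", 100, "Director B"),
    ("holder_3", 100, "Director B"),
    ("holder_4", 100, "Director B"),
]


def tally(ballots, weight):
    """Aggregate ballots, giving each one the weight returned by `weight(shares)`."""
    totals = defaultdict(int)
    for _holder, shares, choice in ballots:
        totals[choice] += weight(shares)
    return dict(totals)


# Company rule: the weight of a vote is the number of shares held.
by_shares = tally(ballots, weight=lambda shares: shares)

# Equal weighting: every qualified member counts once.
by_heads = tally(ballots, weight=lambda shares: 1)

print("weighted by shares:", by_shares)
print("one member, one vote:", by_heads)
```

Under the share-weighted rule the single large holder prevails; counted by heads, the three small holders do. The weighting function is passed in as a parameter so that the other dimensions of the procedure are held constant while this one varies.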

The histories of the known political systems pre- 
sent a bewildering variety of electoral arrangements 
(Braunias 1932; Meyer 1901). Any attempt to ac- 
count for these variations through the construction 
of a basic model of strategic options and structural 
restraints must start out from an analysis of the 
histories of changes in each of six dimensions of 
the local and the national electoral system. 

(1) The qualifications for franchise: how does 
a subject of the territory acquire political citizen- 
ship rights? 

(2) The weighting of influence: how many 
votes are formally attributed to each elector and on 
what grounds? what is done to ensure differentia- 
tion or equality in the actual influence of each vote? 

(3) The standardization of the voting proce- 
dures and the protection of the freedom to choose: 
what is done to ensure uniform and accountable 
practices of electoral administration, and what pro- 
visions are made to equalize the immediate cost of 
all alternatives for the elector? 

(4) The territorial levels of choice: how is the 
territory divided for purposes of election, and how 
many levels of electoral aggregation are distinguished?

(5) The stages of electoral choice: how are the
alternatives set for the electors? to what extent are 
the alternatives set in advance, and to what extent 
is the range still open for the electors? 

(6) The procedures of calculation: how are the 
votes aggregated, and how are the aggregated dis- 
tributions translated into authoritative collective 
decisions on territorial representation? 


One man, one vote, one value 


The Western developments toward equalitarian 
electoral democracy may conveniently be analyzed 
against an “ideal-type” model of five successive 
phases. 

(1) An early, prerevolutionary phase was char- 
acterized by marked provincial and local variations 
in franchise practices but implicit or explicit recog- 
nition of membership in some corporate estate (the 
nobility, the clergy, the city corporations of mer- 
chants and artisans, or, in some cases, the freehold 
peasantry) as a condition of political citizenship 
(Hintze [1941] 1962; Lousse 1943; Palmer 1959). 

(2) In the wake of the American and French 
revolutions, there was a period of increasing stand- 
ardization of franchise rules; the strict regulation 
of access to the political arena under a régime 
censitaire was accompanied by formal equality of 
influence among the citizens allowed to vote under 
the given property or income criteria (Meyer 1901;
Williamson 1960). 

(3) In the first phase of mass mobilization the 
suffrage was greatly extended, but formal inequal- 
ities of influence persisted, under arrangements for 
multiple votes or for differential ratios of votes to 
representatives. 

(4) In the next phase, manhood suffrage, all 
significant social and economic criteria of qualifi- 
cation for men over a given age were abolished. 
Although there were now no formal inequalities 
of voting rights within constituency electorates, 
marked differences in the weight of votes across 
the constituencies still existed (Zwager 1958). 

(5) Finally, in the current phase, one of con- 
tinued democratization, steps were taken toward 
the maximization of universal and equal citizenship 
rights by (a) extension of the suffrage to women, 
to younger age groups (down to 21 or even 18), 
and to short-term residents (reductions in “quaran- 
tine” periods) and (b) further equalization of 
voter-representative ratios throughout the national
or federal territory. 

Only three of the nation-states of the West 
passed through these five stages in anything like a 
regular sequence: England, Belgium, and Sweden. 
The step-by-step evolution characteristic of these 
countries contrasts violently with the abrupt and
revolutionary changes in France (Bendix 1964; 
Rokkan 1961). 

In England (Seymour 1915) the process took 
more than one hundred years, from the Reform 
Act of 1832 to the abolition of multiple votes in 
1948. In Sweden (Verney 1957) the system of 
estate representation was abolished in 1866, but the 
extreme inequalities of electoral influence were 
maintained until 1921. The Belgians (Gilissen
1958) passed from the phase of estate representation
into a régime censitaire as soon as they had
achieved independence in 1831 and went through 
an intriguing phase of multiple voting from 1893 
to 1917: all men over 25 were enfranchised, but 
additional votes were granted not only on censitaire 
criteria but also in recognition of educational 
achievement (principe capacitaire) and of respon- 
sibility for the maintenance of a family. 

By contrast, in France (Bastid 1948; Charnay 
1965) the transition from the first to the fourth 
stage took a mere four years: the Law of January 
1789 maintained a system of indirect elections 
within the recognized corporations of nobles, clergy, 
and the tiers état; the constitution of 1791 stipu- 
lated a tax-paying criterion and introduced the 
concept of the citoyen actif; and the constitution 
of 1793 went straight to the stage of manhood 
suffrage, the only remaining qualification being a 
six-month minimum residence in the canton. This 
sudden thrust toward maximal mass democracy 
proved very short-lived: the Terror intervened, and 
for decades France was torn between traditionalist 
attempts to restrict the suffrage to a narrow stratum 
of owners and high officials and radical-plebiscitarian
pressures for universal and equal elections.
The period from 1815 to 1848 was one of classic 
régime censitaire: the property qualifications lim- 
ited the franchise to less than 100,000 out of 7 mil- 
lion adult males before 1830 and to roughly 240,000 
in 1848. The Revolution of 1848 brought on the 
next sudden thrust toward maximal democracy: the 
first modern mass election took place on Easter day 
that year, and 84 per cent of the 9,360,000 electors 
went to the polling stations. 

The electoral histories of the rest of Europe fall 
at various points between these two models. In 
northwest Europe the Dutch went through the 
same sequence as the British and the Swedes, while 
Denmark and Norway came closer to the French 
model. The Dutch passed from estate representation
to régime censitaire in 1848 but did not go
through any phase of plural voting before they
opted for manhood suffrage during World War I
(Geismann 1964). Denmark, the most absolutist
of the Nordic polities, went through a brief period
of estate representation after 1831 and then moved 
straight into a system of nationwide elections under 
a very extensive manhood suffrage in 1849: the 
result, again as in France, was a half century of 
constitutional struggle between an oligarchic elite 
and a coalition of urban radicals and the mobiliz- 
ing peasantry. Under the impact of the struggle for 
independence, the dependent “colonial” territories 
of the north all proceeded rapidly to maximal suf- 
frage. Norway gave the vote to close to half her 
adult males on establishing her own parliament in 
1814 and proceeded to full manhood suffrage dur- 
ing the conflict over union with Sweden in the 
1890s. Finland stuck to the inherited Swedish sys- 
tem of four estates until 1906 and then all of a 
sudden passed from the first to the fifth phase of 
the model: not only all men but also all women 
were given the vote, and the process of mass mobili- 
zation had gone so far under the restrictive estate 
system that the turnout at the first election under 
universal suffrage reached the record height of 70.7 
per cent. Developments in the third of the Nordic 
“colonies” were less spectacular: Iceland saw the 
re-establishment of its parliament in 1874 and then 
passed through two successive phases of régime 
censitaire before the stage of near-universal suf- 
frage for men and women was reached in 1915 
(as in most other countries of the north, paupers 
receiving public assistance were kept out in the 
first round; the Icelanders did not admit them until 
1934). 

The German territories were torn among several 
competing models of the representative polity: the 
traditional notions of election through established 
estates; the altliberale ideology of unified national 
representation under a property or income suffrage; 
the Napoleonic ideas of plebiscitarian mass democ- 
racy; and the Roman Catholic models of func- 
tional representation within the corporate state, the 
Ständestaat. This electoral schizophrenia found a
number of intriguing expressions. In Prussia and 
the Bismarckian Reich, two sharply contrasted sys- 
tems of elections coexisted for half a century. In 
Prussia the “lower orders” had been given the right 
to vote in the wake of the Revolution of 1848, but 
the weight of their votes was infinitesimal in the 
three-class system introduced to protect the inter- 
ests of the landowners and the officials. By contrast, 
the Reichstag was elected on strict criteria of equal 
suffrage for all men: this principle had been laid 
down, after much debate, by the German National 
Assembly in Frankfurt in 1848 but was not en- 
forced until 1867, when Bismarck saw the impor- 
tance of general elections as a source of legitimacy
for the new Reich. The Hapsburg empire went 
through a much longer and more tortuous process 
of democratization: first, estate representation; 
from 1861, corporate-interest representation under 
a system of four curiae; in 1896, an extraordinary 
attempt to stave off equalitarian democracy, by 
adding a fifth curia, for the citizens so far without 
representation; and finally, in 1907, a unified sys- 
tem of national representation and enforcement of 
“one man, one vote, one value.” 

By the end of World War I the great majority
of European and European-settled polities had 
opted for manhood suffrage, many of them even 
for universal suffrage for women as well. Suffrage 
for women (Kraditor 1965) came first in the settler 
nations (Wyoming, 1890; all of the United States,
1920; New Zealand, 1893; South Australia, 1895) 
and in Scandinavia (Finland, 1906; Norway, 1910 
to 1913; Denmark, 1915; Sweden, 1918 to 1921). 
The British proceeded by steps: restricted suffrage 
for women in 1918; full suffrage, on a par with 
men, in 1928. The “Roman” countries took longer 
to recognize the rights of women: France, Belgium, 
and Italy waited until the end of World War II
before they admitted all women to political citizen- 
ship, and Switzerland has still, after 120 years of 
smoothly functioning manhood democracy, to reach 
agreement on the enfranchisement of her women. 

With the victory over the Axis powers in World 
War II and the subsequent dismantling of colonial
empires, the principle of “one man, one vote” gained 
ground throughout the world, even in countries at 
the lowest level of literacy and without a trace of 
the traditions of pluralist competition, which had 
been essential for the growth of effective party op- 
positions in the West. In an increasing number of 
newly independent states the enforcement of equal 
and universal rights of political citizenship was no 
longer seen as a means for the channeling of legiti- 
mate claims against the power holders but was 
regarded simply as an element in a strategy of 
national unification and the control of dissidence. 
Really serious struggles over the old cry of “one 
man, one vote, one value” only occurred in the 
ethnically most divided polities, e.g., South Africa, 
Rhodesia, and the United States. 

Resistance to electoral equalitarianism has gen- 
erally tended to be stronger at the level of local 
government than at the national or federal level. 
Payment of local property taxes remained in many 
cases a criterion of local franchise long after the 
abolition of régime censitaire at the national level. 
Residence requirements, too, were retained much 
longer for local than for national elections. In fact, 
in recent years the increased flow of labor from 
the backward to the economically advanced countries
is bringing about extensive disfranchisement
even at the national level. In the earlier phase the 
migrant workers within the one national territory 
were kept locally disfranchised; today vast numbers 
of immigrants are denied political rights in their 
host countries because of the high barriers against 
citizenship. “One man, one vote, one value” may be
upheld as a principle within a population of settled 
territorial citizens, but it breaks down at the cross- 
national level. 


Standardization of electoral practices 


The extension of the franchise to the econom- 
ically and culturally dependent strata of each 
national society increased the pressures for a stand- 
ardization of electoral practices. Before elections 
could be established as essential instruments of 
legitimation, local variations in the arrangements 
for the elicitation and recording of choices had to 
be minimized. The electoral returns constituted
claims to legitimate representation that had to be 
established through procedures acceptable to all, 
or at least to the dominant, competitors for office 
and power. The history of the democratization of 
the suffrage was paralleled in country after country 
by a history of increasing standardization of ad- 
ministrative procedures in all phases of the elec- 
toral process: the establishment of registers; the 
determination of voting rights; the maintenance of 
order at the polling stations; the casting of the 
vote; the recording of the act in the register; the 
counting of choices; the calculation of outcomes. 

Of all the issues facing the national administra- 
tions in the early phases of suffrage extension, one 
was of particular importance for the functioning 
of the electoral system: the measures taken to in- 
sure the independence of the individual electoral 
decision (Rokkan 1961). 

The defenders of the estate traditions and the 
régime censitaire had argued that economically and 
culturally dependent subjects could not be expected 
to form independent political judgments and there- 
fore the vote should be given only to citizens likely 
to withstand social or economic pressures and able 
to take public responsibility for their choices on 
election day. 

Liberal advocates of an extended suffrage, such 
as John Stuart Mill, were placed in a dilemma. 
They knew that the new voters could easily be 
swayed by their social superiors or their economic 
masters, yet they were convinced that the vote 
ought to be open, that each voter ought to be pre- 
pared to defend his decision in his day-to-day en- 
vironments. This moralist argument for the old
tradition of open voting soon had to yield to an- 
other imperative: the safeguarding of the integra- 
tive and legitimizing functions of the electoral 
ritual. To generate legitimacy, elections had to be 
dignified and without any tinge of violence. The 
maintenance of the system of open voting under 
the conditions of mass elections could lead only to 
alienation, corruption, and disrespect for the insti- 
tutions of the nation. 

The result was a widespread movement to en- 
sure the secrecy of the act of voting. To qualify as 
“democratic,” elections had to be not only universal 
and equal but also secret. The French were the 
first to introduce this principle. The electoral law 
establishing the States General in 1789 retained 
open oral voting at the level of the general elector- 
ate but called for secret ballots in the colleges of 
delegates. The electoral law establishing the first 
legislative assembly in 1791 introduced secrecy at 
all levels of the electorate, but very little was done 
to ensure regular enforcement. There was a great 
deal of opposition to the principle; the Jacobins, in 
particular, wanted open voting to control dissi- 
dence. The constitution of 1793 left it to the voters 
themselves to decide whether to vote openly or in 
secret. Subsequent laws reintroduced the principle 
of secrecy for all voters, but electoral administra- 
tion remained at a low level of standardization 
throughout the nineteenth century. The isoloir and 
the standard envelope for the ballot were intro- 
duced by law in 1913, but even these highly de- 
tailed provisions left leeway for a variety of abuses 
and manipulations, particularly in the south, in 
Corsica, and in the overseas départements (Char- 
nay 1965). 

The extension of the suffrage to vast numbers 
of illiterates made it impossible to stick to a strict 
rule of secrecy. In the economically and culturally 
backward areas of the national hinterland particu- 
larly, it proved easy to control the votes of the 
lower classes even under strict rules of secrecy. The 
secret ballot expressed an essential feature of liter- 
ate urban society: it introduced an element of 
anonymity, specificity, and abstraction in the system
of political interchange.

Significantly, the countries that retained the old 
tradition of open and oral voting longest were all 
heavily dominated by landed interests: Denmark 
did not abolish it until 1901; Iceland, until 1906; 
Prussia, until the collapse of the Reich in 1918; 
and Hungary retained it even into the 1930s. By 
contrast, Belgium, Switzerland, and Sweden had 
opted for secrecy even under the régime censitaire, 
and the English had waited only five years after
the Reform Act of 1867 to introduce the Ballot Act,
which ensured the freedom of the voters from
intimidation and bribery. In the settler nations
overseas, the principle of secrecy was recognized
quite early, but there were marked local variations
in enforcement. One of the Australian states developed
an effective procedure of secret voting as
early as 1856, and this innovation, the “Australian
ballot” (Wigmore 1889), spread very rapidly
through the United States during the 1880s and 
1890s. The open recognition of legitimate partisan- 
ship made secrecy less important in the United 
States than in Europe. Most states allow primary 
elections within each party, and participation in 
these cannot easily be hidden from the public. 

In all these countries the underlying purpose of 
the introduction of the ballot system was to take 
the act of voting out of the regular give and take of 
day-to-day life and enhance its dignity and ritual 
significance by isolating it from the sordid pres- 
sures and temptations of an unequal and divided 
society. Most histories of electoral arrangements 
emphasize the importance of secrecy, as a device 
to protect the economically dependent from the 
sanctions of their superiors. This was the essence 
of the Chartists’ early demands in England, and it 
has traditionally been a basic concern of working- 
class movements. What has often been overlooked 
is that the provisions for secrecy could as easily 
cut the voter off from his peers as from his su- 
periors. In fact, the secrecy provisions fulfill two 
distinct functions: first, they make it possible for 
the voter to keep his decision private and avoid 
sanctions from those he does not want to know; 
second, they make it impossible for the voter to 
prove how he voted to those he does want to know. 
The very rigorous rules set up in country after 
country for the invalidation of all irregularly 
marked ballots were directed to this second point.
They were devised to ensure that the citizen could 
no longer treat his vote as a commodity for sale. 
He might well be bribed, but the price per vote 
clearly would decrease as soon as it proved impos- 
sible to check whether it was actually delivered. 
The salient point here is that by ensuring the com- 
plete anonymity of the ballots it became possible 
not only to reduce bribery of the economically de- 
pendent by their superiors but also to reduce the 
pressures toward conformity and solidarity within 
the working class. 

With the secret ballot, a personal choice was 
placed before the worker that made him, at least 
temporarily, independent of his immediate environ- 
ment: was he primarily a worker or primarily a 
citizen of the broader local or national community? 
Secret voting made it possible for the inarticulate 
rank and file to escape the pressures of their organizations,
and at the same time it put the onus of
political visibility on the activists within the work- 
ing-class movement. The established national “sys- 
tem” opened up channels for the expression of 
secret loyalties, while forcing “deviants” to declare 
themselves openly. Some socialist parties tried to 
turn the tables by establishing intimate organizational
ties with the trade unions and imposing political
levies on their members, irrespective of their
actual preferences. The controversy over “contract- 
ing in” versus “contracting out” in the British labor 
movement can be interpreted as the counterpart of 
the controversy over open versus secret voting in 
the total system. The Labour party wanted to put 
the onus of visibility on its own “deviants,” the 
trade union members who did not want to vote for 
the party (contracting out), while the Conserva- 
tives and Liberals wanted the inarticulate masses 
to stay out of political commitments and to put the 
onus of visibility on the socialist militants (con- 
tracting in). 

The introduction of mass elections in the devel- 
oping countries of Africa and Asia during the final 
phases of decolonization raised a number of tech- 
nical issues (MacKenzie & Robinson 1965; Maquet 
1959; Smith 1960). In some British territories a 
system of separate ballot boxes for each candidate 
or party was introduced. These were marked by 
distinguishing symbols (a lion, an elephant, etc.) 
and the illiterate voters were asked to drop their 
ballot paper into the box of their choice. There were 
elaborate rules for the stamping of official identi- 
fication marks on the ballot papers and for the 
screening of the ballot boxes from the eyes of the 
officials, but this procedure still left a wide margin 
for interference with the choice of individual 
voters. In other British colonies with high rates of 
illiteracy the voters had to mark off on the ballot 
the candidates of their choice. This often made it 
essential to allow election officials to accept “whis- 
pering votes” from voters who could not read the 
names on the ballot. The French colonies and their 
successor states adopted the system of separate 
party ballots and the isoloir for the placing of the 
chosen list in the official envelope. This simplified 
procedures (although it wasted a lot of paper) but 
still left a great deal of leeway for the exertion of 
social pressure. Even if the village chieftains or the 
political agents could not see what the voter did 
inside the isoloir, they could either observe him 
when he chose his party ballot from the separate 
piles placed at the polling station or, what was 
more common, prevail upon him to show them 
afterward the ballots he had not used, as proof
that he had voted as instructed. There was nothing 
new about this, of course. The experiences in the 
developing countries simply confirmed what had 
been known about voting in the backward rural 
areas of Europe for decades. Any attempt to up- 
hold strict rules of secrecy in societies at low levels 
of economic differentiation is bound to run into 
difficulties. Relationships are too diffuse and the 
possibilities of observation too many to allow indi- 
viduals to escape from control through the isola- 
tion of particular acts from their daily contexts. 


From votes to seats 


The emergence of mass electorates produced a 
great literature of political engineering, not just 
on the organization of party work and the waging 
of campaigns but also on the territorial structuring 
of constituencies and on the strategic pros and 
cons of alternative procedures of translating the 
registered distributions of votes into legitimate de- 
cisions on representation. Of the extraordinary 
tangle of issues debated in this literature, only two 
interconnected issues will be discussed here: the 
delimitation of units of aggregation, and the pro- 
cedures for the allocation of seats within each unit. 
Basically, the bitter debates over the two issues re- 
flect fears and resentments generated through 
changes in the equilibrium of political power under 
the impact of mass democracy: the influx of new 
voters altered the character of the system and a 
great variety of stratagems were tried out to bring 
it back into equilibrium. 

Varieties of majority systems. The early sys- 
tems of electoral representation all rested on some 
kind of majority principle. The will of a part of 
the electorate was taken to express the will of the 
whole, and all the participants were taken to be 
bound in law and conscience by the decision 
reached through this procedure. 

Three distinct varieties of majoritarian decision- 
making procedure established themselves during 
the early phases of electoral development. The first 
of these stipulated one round of election, with de- 
cisions by simple plurality. The second and the 
third both stipulated several rounds and required 
absolute majorities in the first round. They differed, 
however, in their requirements for the final and 
decisive round. The second allowed an open field 
of candidacies and simple plurality; the third re- 
stricted the competition to the two foremost candi- 
dates and retained the absolute-majority require- 
ment to the very end. 
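
The three procedures can be set side by side in a short Python sketch. It is a simplification under stated assumptions: every contest is reduced to at most two rounds (the text speaks of "several"), each round is given as a dictionary of hypothetical vote totals, and all function names and figures are invented for illustration.

```python
def leader(tally):
    """Candidate with the most votes (first one listed wins a tie)."""
    return max(tally, key=tally.get)


def has_absolute_majority(tally, candidate):
    """True if the candidate holds more than half of all votes cast."""
    return 2 * tally[candidate] > sum(tally.values())


def single_round_plurality(ballot):
    """First variety: one round, simple plurality decides."""
    return leader(ballot)


def open_second_ballot(first, second):
    """Second variety: absolute majority needed in the first round;
    otherwise an open final round decided by simple plurality."""
    top = leader(first)
    if has_absolute_majority(first, top):
        return top
    return leader(second)


def restricted_second_ballot(first, second):
    """Third variety: absolute majority needed in the first round;
    otherwise a final round limited to the two foremost candidates,
    still requiring an absolute majority (None if it is a tie)."""
    top = leader(first)
    if has_absolute_majority(first, top):
        return top
    finalists = sorted(first, key=first.get, reverse=True)[:2]
    if set(second) != set(finalists):
        raise ValueError("final round must be limited to the two foremost candidates")
    top = leader(second)
    return top if has_absolute_majority(second, top) else None


# Hypothetical constituency: no one reaches 50 per cent in the first round.
first_round = {"A": 40, "B": 35, "C": 25}
open_final = {"A": 38, "B": 33, "C": 29}   # all three stay in the field
runoff_final = {"A": 45, "B": 55}          # C's voters mostly move to B

print(single_round_plurality(first_round))                   # A
print(open_second_ballot(first_round, open_final))           # A
print(restricted_second_ballot(first_round, runoff_final))   # B
```

With these assumed figures the same first-round distribution returns A under the first two rules and B under the third, which is the kind of difference in outcome that made the choice among the procedures a matter of strategy.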

The first of the three procedures had been estab- 
lished in England since the Middle Ages and had 
been used to ensure the election of “two knights 
from every shire and two burgesses from every
borough” to the House of Commons. It also became 
the standard method in the United States and soon 
spread to the other English-settled nations over- 
seas. The method was originally used in two-mem- 
ber constituencies, but it met with general accept- 
ance even when applied in single-member units. 

The method of repeated ballots had a long tra- 
dition in the Roman Catholic church (Moulin 
1953) and was formally instituted in the French 
ordonnance calling for elections to the States Gen- 
eral in 1789. This stipulated three successive ballots 
—the first two open, the final one restricted to just 
twice as many candidates as there were seats left 
to fill. Three-ballot systems of this type prevailed 
throughout the régime censitaire. This method was 
clearly best suited to elections restricted to the eco- 
nomically independent classes, with leisure enough 
to travel to electoral sessions and spend the day or 
more required to get through all the balloting. The 
Revolution of 1848 swept away this system and 
introduced single-ballot mass elections. The old 
habit of repeated ballots persisted, however. Only
four years after the revolution, Napoleon III devised
a system of two-ballot elections suited to the
new situation of manhood suffrage. He broke with 
the old tradition on a point of fundamental im- 
portance: he allowed an open field of candidacies 
even at the second ballot and required simple 
plurality only. This was an astute strategic move; 
in the half-free elections of the Empire, it allowed 
the officials maximal freedom of maneuver against 
the opposition. Interestingly, there was no return 
to the principle of second-ballot absolute majority 
in the Third, Fourth, or Fifth Republic: none of 
the parties or groupements wanted to be faced with 
the cruel yes-or-no alternatives of the two-way fight. 

Other polities on the European continent stuck 
to the old rule of absolute majority and restricted 
last-ballot candidacies. This was the system upheld 
in the German Reich down to its defeat in 1918; 
it was also used in Switzerland until 1900 and in
Austria, Italy, and the Netherlands until the end
of World War I.

These majoritarian electoral methods came un- 
der heavy attacks in the later phases of democra- 
tization. The extension of the suffrage made pos- 
sible the organization of strong lower-class parties 
but the electoral systems, inherited from the ages 
of estate representation and régime censitaire, set 
high barriers against the entry of such parties into 
national politics. The German rule of absolute ma- 
jority set the highest barrier. A lower-class party 
had to reach the 50 per cent mark or go without
representation. The French and the Anglo-American
systems also set high barriers against rising
movements of the hitherto disfranchised, but the 
initial levels were not frozen at 50 per cent; the 
height of the barriers varied with the strategies of 
established censitaire parties. The essential difference
between the French and the Anglo-American
systems was that the one made for much
greater local variations in such counterstrategies 
than the other: the first-ballot results offered a 
basis for bargaining among the established parties, 
and the coalition strategies would of necessity vary 
from constituency to constituency. The height of 
the barrier against new entrants depended essen- 
tially on the willingness of the established parties 
to enter into alliances. This was not always ex- 
clusively a matter of immediate payoffs but of trust 
and the openness of communication channels. 

Origins of proportional representation. Karl 
Braunias (1932) distinguished two phases in the 
spread of proportional representation: the “minority
protection” phase, before World War I; and the
“antisocialist” phase, in the years immediately after 
the armistice. It was no accident that the earliest 
moves toward proportional representation (PR) 
came in the most ethnically heterogeneous Euro- 
pean countries: Denmark in 1855; the Swiss can- 
tons in 1891; Belgium in 1899; Moravia in 1905; 
Finland in 1906. In linguistically and religiously 
divided societies majority elections could clearly 
threaten the continued existence of the political 
system. The introduction of some element of mi- 
nority representation came to be seen as an essen- 
tial step in a strategy of territorial consolidation. 

As the pressures mounted for extensions of the 
suffrage, demands for proportionality were also 
heard in the culturally more homogeneous nation-
states. In most cases the victory of the new prin-
ciple of representation came about through a con- 
vergence of pressures from below and from above. 
The rising working class wanted to lower the 
thresholds of representation in order to gain access 
to the legislatures, and the most threatened of the 
old established parties demanded PR to protect 
their position against the new waves of mobilized 
voters created by universal suffrage. In Belgium 
the introduction of graduated manhood suffrage in 
1893 brought about an increasing polarization be- 
tween the Labor party and the Catholics and threat- 
ened the continued existence of the Liberals. The 
introduction of PR restored some equilibrium to 
the system (Gilissen 1958). 

The history of the struggles over electoral pro- 
cedures in Sweden and Norway tells us a great deal
about the consequences of the lowering of one 
threshold for the bargaining over the level of the 
next. In Sweden, Liberals and Social Democrats 
fought a long fight for universal and equal suf- 
frage and at first also advocated PR, to ensure 
easier access to the legislature. The remark- 
able success of their mobilization efforts made 
them change their strategy, however, and from 
1904 onward they advocated majority elections in 
single-member constituencies (Verney 1957). This 
aroused fears among the farmers and the urban 
conservatives, who, to protect their own interests, 
made the introduction of PR a condition for accept- 
ance of manhood suffrage. Accordingly, the two 
barriers fell together. It became easier to enter the 
electorate and easier to gain representation. In 
Norway (Rokkan & Hjellum 1966) there was a 
longer lag between waves of mobilization. The fran- 
chise was much wider from the outset, and the first 
wave of peasant mobilization brought down the old 
regime as early as 1884. As a result, the suffrage 
had been extended well before the final mobiliza- 
tion of the rural proletariat and the industrial 
workers, under the impact of rapid economic 
change. The victorious radical-agrarian “left” felt 
no need to lower the threshold of representation 
and, in fact, helped to raise it through the intro- 
duction of a two-ballot system of the French type 
in 1906. There is little doubt that this contributed 
greatly to the radicalization and alienation of the 
Norwegian Labor party, which in 1915 gained 32 
per cent of all the votes cast but which was given 
barely 15 per cent of the seats. The “left” did not 
agree to lower the threshold until 1921; the decisive 
motive was clearly not just a sense of equalitarian 
justice but the fear of rapid decline, with further 
Labor advances across the majority threshold. 

In all these cases the high threshold might have 
been maintained if the parties of the property- 
owning classes had been able to make common 
cause against the rising working-class movements. 
But the inheritance of hostility and distrust was too 
strong. The Belgian Liberals could not face the pos- 
sibility of a merger with the Catholics, and the 
cleavages between the rural and the urban interests 
went too deep in the Nordic countries to make it 
possible to build up any joint antisocialist front. By 
contrast, the higher level of industrialization and 
the progressive merger of rural and urban interests 
in Britain made it possible to withstand the demand 
for a change in the system of representation: the 
Labour party was seriously underrepresented only 
during a brief initial period, and the Conservatives 
were able to establish broad enough alliances in the 
counties and the suburbs to keep their votes well 
above the critical point. 

Threshold strategies under PR. PR systems dif- 
fer markedly in their threshold levels, however, and 
the struggles over these details of electoral engi- 
neering tell us a great deal about the dynamics of 
multiparty systems. 

The variant most frequently introduced in con- 
tinental Europe was the one invented by the Bel- 
gian professor Victor d'Hondt (1878; 1882): the 
method of the “largest average.” This method favors 
the largest party and, in fact, lowers the threshold 
very little in constituencies electing few members 
and choosing among few competing party lists. If 
the total number of votes cast is designated as 
V, the total number of mandates as M, and the 
total number of parties as P, the threshold for- 
mula for the d'Hondt procedure will read 


This means that the smallest number of votes 
(T) required for representation will be a function 
not only of the size of the constituency and its 
share of seats but also of the number of parties. A 
fragmented party system lowers the threshold but, 
by implication, also increases the overrepresenta- 
tion of the largest of the parties (particularly if 
P > M, since the votes for a number of the small 
parties must of necessity go unrepresented). 
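
The largest-average rule and its threshold can be illustrated with a minimal Python sketch. The vote totals and party labels are hypothetical, and the threshold is computed under the commonly cited form T = V/(M + P - 1), which is an assumption here rather than a formula quoted from the text.

```python
def dhondt(votes, seats):
    """Allocate seats by the 'largest average' rule (divisors 1, 2, 3, ...)."""
    won = {party: 0 for party in votes}
    for _ in range(seats):
        # Each seat goes to the list with the highest votes / (seats won + 1).
        leader = max(votes, key=lambda p: votes[p] / (won[p] + 1))
        won[leader] += 1
    return won


def dhondt_threshold(total_votes, seats, parties):
    """Assumed threshold of representation: T = V / (M + P - 1)."""
    return total_votes / (seats + parties - 1)


# Hypothetical constituency: V = 10,000 votes, M = 5 seats, P = 4 lists.
votes = {"A": 4800, "B": 2200, "C": 1800, "D": 1200}
result = dhondt(votes, 5)
threshold = dhondt_threshold(sum(votes.values()), 5, len(votes))

print(result)     # {'A': 3, 'B': 1, 'C': 1, 'D': 0}
print(threshold)  # 1250.0
```

In this invented case list A takes 60 per cent of the seats on 48 per cent of the votes, while list D, at 1,200 votes, falls below the assumed threshold of 1,250 and goes unrepresented.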

Thus, the debates and bargains over electoral 
arrangements in a great number of PR countries 
have centered on the questions, Should there be 
some gentle overrepresentation of the largest party? 
and Should the threshold for the first seat be set 
high enough to discourage new parties and splinter 
movements? These concerns have been particularly 
prominent in the Scandinavian countries: the typi- 
cal constellation there has been one party, Labor, 
in the range just below the 50 per cent mark; three 
or four parties, all nonsocialist, in the 5-20 per 
cent range; and one or two very small parties, with 
only minimal chances of representation. In such 
constellations the d'Hondt procedure would give the 
largest party more seats than its votes justi- 
fied. In fact it often gave the Labor parties clear 
majorities in parliament without majorities among 
the voters. This overrepresentation was essentially 
achieved at the expense of the very smallest of the 
parties, such as the Communist, but often hurt the 
efforts of the one-seat parties to gain additional 
representation. A variety of remedies were sug- 
gested. The Danes retained the high d'Hondt 
threshold but ensured greater proportionality 
among the already represented parties through a 
two-level procedure: any underrepresentation pro- 
duced at the constituency level was corrected 
through the allocation of additional seats at the re- 
gional level. In Sweden and in Norway the non- 
socialist parties opted for another strategy: they 
found it impossible to join forces under one single 
list, but they were anxious to increase their repre- 
sentation through provisions for electoral cartels. 
In Sweden such cartels were allowed after 1921 
and cost the Social Democrats a substantial num- 
ber of seats. The system placed the Agrarians in a 
very difficult position. To avoid underrepresenta- 
tion, they were tempted to join in cartels with the 
other nonsocialist parties; but to advance the inter- 
ests of the farmers they found it best to support 
the Social Democratic government. In the end the 
provisions for cartels were abolished and the
Sainte-Laguë method (1910) of calculation was intro- 
duced. 

By a curious coincidence this alternative was 
adopted in all the three Scandinavian countries 
during 1952 and 1953. The Norwegian Labor party 
had gained a majority in parliament in 1945 and 
by 1947 had abolished the cartels. As a result, the 
party received 45.7 per cent of the votes but 56.7 
per cent of the seats in the 1949 election. This 
caused a great deal of recrimination, and the party 
finally accepted the new method of allocation in 
1952. A similar lowering of threshold was also 
brought about in Denmark, through provisions in 
the constitution of 1953. 

The Sainte-Laguë method was once described as 
a “miracle formula” by the leader of the Swedish 
Agrarian party. In the typical Scandinavian situ- 
ation it had a threefold effect: it strengthened the 
middle-sized nonsocialist parties by reducing the 
overrepresentation of the Social Democrats; it was 
nevertheless of strategic advantage to the governing 
parties because it reduced the pay-offs of mergers 
within the opposition; and finally, it helped all the 
established parties by discouraging splinters and 
new parties. How could all this be achieved in one 
formula? To explain this, we have to go into some 
technicalities of electoral mathematics (Janson 
1961; Rokkan & Hjellum 1966). 

Two procedures were frequently suggested as 
alternatives to d'Hondt in the discussions in the 
Scandinavian countries: the method of the “great- 
est remainder” and the Sainte-Laguë system of 
successive division by odd integers. 

The method of the “greatest remainder” lowers 
the threshold of representation to a minimum: the 
threshold formula is T = V/(MP). This is a direct 
invitation to party fragmentation, since the thresh- 
old decreases rapidly with increases in the number 
of parties. The simple Sainte-Laguë formula does 
not go quite that far. The threshold formula is 
T = (V - 1)/(2M + P - 2). Its crucial contribution is 
the progressive increase in the cost of new seats. 
The greater the number of seats already won by a 
party in a given constituency, the more votes it 
will take to add yet another. The d'Hondt formula 
makes no distinction between first and later seats. 
The total votes cast for each party are divided suc- 
cessively by 1, 2, 3, .... The Sainte-Laguë method 
is to divide by 1, 3, 5, .... Thus, if the first seat 
costs each party 1,000 votes, the second seat will 
cost the party (1,000 × 3)/2 = 1,500 votes and the 
third seat (1,000 × 5)/3 = 1,667 votes, and so on. 
This is definitely the optimal formula for small 
parties: it is easy to gain representation but hard 
to reach a majority in parliament. At the same 
time, it discourages mergers and cartels. Two 
parties polling just beyond the threshold for their 
first seats will, in fact, lose out if they merge. 
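
The rising cost of seats and the penalty on mergers can be checked with a short Python sketch. The function names and the five-seat constituency (lists L, X, Y) are hypothetical illustrations, not data from any actual election.

```python
def average_seat_cost(first_seat_cost, divisors):
    """Votes per seat needed to hold n seats, for n = 1, 2, 3, ..."""
    return [first_seat_cost * d / n for n, d in enumerate(divisors, start=1)]


def sainte_lague(votes, seats):
    """Allocate seats by successive division by the odd integers 1, 3, 5, ..."""
    won = {party: 0 for party in votes}
    for _ in range(seats):
        # Each seat goes to the list with the highest votes / (2 * seats won + 1).
        leader = max(votes, key=lambda p: votes[p] / (2 * won[p] + 1))
        won[leader] += 1
    return won


dhondt_costs = average_seat_cost(1000, [1, 2, 3])        # flat cost per seat
sainte_lague_costs = average_seat_cost(1000, [1, 3, 5])  # rising cost per seat

# Hypothetical five-seat constituency: two small lists, separate and merged.
separate = sainte_lague({"L": 5000, "X": 1050, "Y": 1050}, 5)
merged = sainte_lague({"L": 5000, "XY": 2100}, 5)

print([round(c) for c in dhondt_costs])        # [1000, 1000, 1000]
print([round(c) for c in sainte_lague_costs])  # [1000, 1500, 1667]
print(separate)  # {'L': 3, 'X': 1, 'Y': 1}
print(merged)    # {'L': 4, 'XY': 1}
```

In this invented case the two small lists hold one seat each when they run separately but only one seat between them after merging, because the merged list's second quotient (2,100/3 = 700) falls below the large list's fourth (5,000/7, about 714).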

This procedure appealed both to the nonsocialist 
parties, typically at the one-seat level in most con- 
stituencies, and to the governing Social Democrats. 
The nonsocialists were anxious to reduce the “gov- 
ernment bonus” built into the d'Hondt procedure, 
and the Social Democrats wanted to make sure 
that their opponents did not find it profitable to 
merge into one broad competitive party. 

But this was not all. The electoral strategists 
went even further to ensure the perpetuation of 
the established party constellations. They wanted 
lower thresholds, but they wanted them set just 
below the typical voting levels of the smallest of 
the established parties. If the threshold were to be 
set much lower, it would increase the chances of 
even smaller, “antisystem” parties and encourage 
splinter movements. The solution proved very 
simple: the first divisor was set, not at 1, but at 
1.4. In the example already used, this would mean 
that the first seat would cost 1,400 votes as against 
1,500 for the second and 1,667 for the third. It 
cost more to gain entry, but once a party was in, 
the steps toward further representation were no 
longer so steep. 
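
The effect of raising the first divisor can be sketched in Python as follows. The eight-seat constituency and its five lists are hypothetical, chosen only so that the smallest list sits just above the pure Sainte-Laguë entry level.

```python
def divisor_allocation(votes, seats, first_divisor=1.0):
    """Sainte-Lague allocation with an adjustable first divisor.

    Divisors are first_divisor, 3, 5, 7, ...; first_divisor=1.0 gives the
    pure method and first_divisor=1.4 the modified one described above.
    """
    won = {party: 0 for party in votes}

    def quotient(party):
        n = won[party]
        divisor = first_divisor if n == 0 else 2 * n + 1
        return votes[party] / divisor

    for _ in range(seats):
        leader = max(votes, key=quotient)
        won[leader] += 1
    return won


# Average cost per seat when the quotient needed for a seat is 1,000 votes.
entry_costs = [round(1000 * d / n) for n, d in enumerate([1.4, 3, 5], start=1)]

# Hypothetical eight-seat constituency with one very small list, K.
votes = {"S": 4500, "A": 1800, "B": 1600, "C": 1400, "K": 620}
pure = divisor_allocation(votes, 8)
modified = divisor_allocation(votes, 8, first_divisor=1.4)

print(entry_costs)  # [1400, 1500, 1667]
print(pure)         # {'S': 4, 'A': 1, 'B': 1, 'C': 1, 'K': 1}
print(modified)     # {'S': 4, 'A': 2, 'B': 1, 'C': 1, 'K': 0}
```

With the first divisor at 1.4 the smallest list loses its seat to an established middle-sized list, while the largest list's share is unchanged.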

This formula fitted the established power con- 
stellations as closely as any procedure at this level 
of simplicity could ever be expected to. It had all 
the appearance of a universal rule, but in fact it 
was essentially designed to stabilize the party sys- 
tem at the point of equilibrium reached by the 
early 1950s. 

Developments in the late 1950s and the early 
1960s showed that even this formula would not 
protect the system against change. The Social 
Democrats began to regret that they had given up 
so much of their “government bonus.” Their parlia- 
mentary majorities had become very small and 
highly vulnerable, and they were reluctant to con- 
template long-term alliances with one of the op- 
position parties. In Sweden the Royal Commission 
on Constitutional Reform proposed in 1963 a two- 
tier system of representation. They wanted to use 
the Sainte-Laguë procedure at the level of the old 
constituencies and the d'Hondt procedure to elect 
regional members at large. This was a deliberate 
attempt to bring back into a system some measure 
of overrepresentation for the largest party. The 
motive was explicitly stated to be the need for some 
stabilization of the majority basis for the cabinet. 
This proposal is still under debate, and there is no 
basis for any final prediction of the outcome of the 
complex bargaining currently under way. Similar 
discussions got under way in Norway in the wake 
of the defeat of the Labor party in the election of 
1961. Some Labor strategists have guardedly sug- 
gested a return to d'Hondt or even a switch to 
simple majority elections of the British type, while 
their rivals in the nonsocialist camp have put forth 
a diametrically opposed solution: the lowering of 
the Sainte-Laguë threshold to 1.3 or even 1.2. Para- 
doxically, the old Labor party, once the champion 
of PR, now wants to increase the threshold of rep- 
resentation, while their opponents, once the de- 
fenders of the old majority threshold, now advocate 
a radical lowering. Their aim is clearly to encour- 
age the Left Socialist splinters from Labor, but this 
gain can be bought only at the cost of all future 
mergers of the nonsocialist parties (Rokkan & 
Hjellum 1966). 

PR without party lists. By 1920 PR systems of 
one sort or another had won out throughout Eu- 
rope. Even the French gave up their two-ballot, 
single-member procedure from 1919 to 1927 and 
introduced a curious mixture of d'Hondt propor- 
tionality and majoritarianism. These systems all 
required the voter to elect several representatives 
at the same time and to choose among a number 
of lists of candidates. These lists were normally 
set up by competing political parties. The voters 
might have some influence on the fate of indi- 
vidual candidates on such lists, but it was nearly 
impossible to elect anyone not appearing on the 
initial lists. The Continental PR was a product of 
party bargaining. The parties wanted to survive 
and saw that they rated the best chances under a 
system that would allow them not only to control 
nominations but also to gain representation even 
when in minority. 

In the Anglo-Saxon countries this type of PR 
never caught on. There were strong party organ- 
izations, but there was also a strong tradition of 
direct territorial representation through individual 
representatives. The early English advocates of pro- 
portionality were profoundly indifferent to the sur- 
vival of organized parties; they wanted to equalize 
the influence of individual voters. The great inno- 
vation of these electoral reformers was the intro- 
duction of a procedure for the aggregation of indi- 
vidual rank-order choices. The election was not to 
be decided through the counting of so many choices 
for X and so many for Y but through the compari- 
son of schedules of preference. The possibilities of 
such aggregations of rank orders had been analyzed 
with great ingenuity by Charles de Borda and the 
Marquis de Condorcet in the eighteenth century 
(Black 1958; Ross 1955), but these theoretical dis- 
cussions had been confined to decision making in 
committees and assemblies. The French method of 
repeated ballots, in fact, developed out of decision- 
making situations in assemblies and entailed a 
rank ordering of preferences; the voters for the 
candidates at the bottom had to decide on their 
next-order preferences. The Australian system of 
the “alternative vote” is another approximation; the 
voter indicates his second and third choices, as well 
as his first, and knows that these lower preferences 
will be brought into the count if his first preference 
should not receive enough support. These methods, 
however, aim at the maximizing of support behind 
each candidate. First preferences count to the end, 
even when there are many more than needed to 
elect the given candidate. The great strength of the 
movement for the “single transferable vote” lay pre- 
cisely in the insistence on the effective use of all 
the preference schedules, not only the ones given to 
the candidates with the smallest followings but also 
of those “wasted” through overconcentration on a 
single candidate. 

This required the setting of a quota—the small- 
est number of preferences required for election. 
The inventors of the system, a Dane, Andrae 
(1855), and an Englishman, Hare (1857), set it 
at Votes ÷ Seats, but this was quickly shown to be 
too high. H. R. Droop (1868) had no difficulty in 
demonstrating that the correct quota would be 
[Votes ÷ (Seats + 1)] + 1. This would be just 
enough to beat competitors for the last of the seats. 
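
The two quotas can be compared in a few lines of Python. The figures (10,000 votes, 4 seats) are hypothetical, and taking the whole-number part of the division in the Droop quota is an assumption of this sketch.

```python
def hare_quota(votes, seats):
    """Original quota: Votes / Seats."""
    return votes / seats


def droop_quota(votes, seats):
    """Droop quota: [Votes / (Seats + 1)] + 1, using the whole-number part."""
    return votes // (seats + 1) + 1


votes, seats = 10_000, 4
hare = hare_quota(votes, seats)
droop = droop_quota(votes, seats)

print(hare)   # 2500.0
print(droop)  # 2001

# No more than `seats` candidates can reach the Droop quota at once:
# five candidates at 2,001 would need 10,005 votes, more than were cast.
print((seats + 1) * droop > votes)  # True
```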

Once the quota had been set, the procedure was 
in itself straightforward, if time consuming. The 
wasted first preferences at the top of the poll were 
treated just like the wasted ones at the bottom— 
the lower preferences were entered when the first 
ones could no longer help. There was one differ- 
ence. At the bottom all the first preferences were 
wasted and had to be examined for lower prefer- 
ences, while at the top it was impossible to say 
which ones were wasted—which ones were in the 
quota and which ones were beyond. The solu- 
tion was to work out proportional shares of lower 
preferences. If an elected candidate had received 
10,000 first preferences but needed only 9,000, the 
below-quota candidates would get 
(10,000 - 9,000) ÷ 10,000 = 1/10 of the second preferences 
given each of them by those 10,000 voters (Lake- 
man & Lambert 1955). 
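
The proportional transfer of a surplus can be sketched in Python with exact fractions. The split of second preferences between candidates B and C is a hypothetical illustration; only the 10,000 and 9,000 figures come from the example above.

```python
from fractions import Fraction


def surplus_transfers(first_preferences, quota, second_preferences):
    """Share out an elected candidate's surplus in proportion to second choices.

    Returns the transfer share (surplus / first preferences) and the number
    of votes credited to each remaining candidate.
    """
    share = Fraction(first_preferences - quota, first_preferences)
    credited = {name: count * share for name, count in second_preferences.items()}
    return share, credited


# Hypothetical second preferences on the elected candidate's 10,000 ballots.
share, transfers = surplus_transfers(10_000, 9_000, {"B": 6_000, "C": 4_000})

print(share)                # 1/10
print(int(transfers["B"]))  # 600
print(int(transfers["C"]))  # 400
```

The credited votes add up to the 1,000-vote surplus, so nothing beyond the quota is lost.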

The Andrae variant of this system was used in 
the election of some of the members of the Danish 
Rigsraad from 1855 to 1866 and in the electoral 
colleges for the Upper House from 1866 to 1915, 
but the method has otherwise found acceptance 
only in Britain and the British-settled areas: Tas- 
mania since 1907, the two Irelands since 1920 
(quickly abolished in Northern Ireland), Malta 
since 1921, New South Wales since 1932, and the 
Australian Senate since 1949. The “single trans- 
ferable vote” was ardently advocated by British 
Liberals but never gained much of a foothold in 
England. Most Conservatives were against it, and 
the Labour party found it less and less interesting 
as they grew in strength. There was a strong move 
toward proportionalism in 1931. Labour promoted 
an “alternative-vote” bill and was supported by the 
Liberals in the House of Commons, but the gov- 
ernment fell and the law was never enacted (But- 
ler 1953). 

On the European continent there has always 
been a great deal of resistance to the dominance 
of the organized parties in the determination of the 
lists of candidates, and a variety of devices has 
been invented to ensure some measure of voter influ- 
ence on the fate of individual candidates. Denmark 
was the only country on the Continent to go as far 
as to opt for PR without party lists. Under the cur- 
rent system they provide three levels of electoral 
aggregation. At the level of the nomination dis- 
trict the voters choose among individual candi- 
dates; at the level of the constituency their votes 
are aggregated by party to determine the allocation 
of direct seats; while at the level of the region 
there is a further round of aggregation to decide 
the attribution of additional seats designed to 
maximize proportionality (Pedersen 1966). An- 
other multilevel solution has been devised in the 
German Federal Republic. There the voters are 
allowed two votes, one for a simple plurality elec- 
tion in single-member constituencies, the other for 
a PR election among party lists. A high degree of 
candidate orientation can also be achieved in a 
single-level PR system. The Finnish system pro- 
vides the most interesting example. The parties do 
not present multicandidate lists or indicate a pre- 
ferred order among them but submit a number of 
separate candidacies. The voters then choose indi- 
vidual candidates only, but the votes are aggre- 
gated by party within each constituency to deter- 
mine the allocation of seats. 

Single-member versus multimember constituen- 
cies. On the Continent the conflict between the 
majority principle and the proportionality principle 
was, at the same time, a conflict over conceptions 
of the territoriality of elections. Majority elections 
were typically tied to single-member constituencies 
and posited close interaction between the elected 
representative and the entire local electorate. Pro- 
portional elections were held in larger constitu- 
encies and posited interaction between organized 
parties and functionally defined core sectors of the 
population. The Single Transferable Vote made 
little sense in single-member constituencies but 
offered an alternative to party dominance in multi- 
member units. The voter was free to establish his 
own list of candidates and did not have to abide 
by any party nominations. Thomas Hare (1857) 
and John Stuart Mill went so far as to propose that 
all of Britain be turned into one single constitu- 
ency, but this clearly would make for enormously 
laborious computations of transfers. The Govern- 
ment of Ireland Act of 1920 stipulated constitu- 
encies of three to eight members each, and most 
advocates of the Hare system now give five seats 
as the ideal (O'Leary 1961; Ross 1959). 

PR-list systems have allowed wide variations in 
the size of constituencies. Several countries have, 
in fact, made the entire national territory one con- 
stituency. This was the system of the Weimar Re- 
public, and it is still the system in use in the 
Netherlands; it has also been used in Israel since 
1949. This does not necessarily mean that the 
same set of candidates is presented throughout the 
national territory. There may be primary constitu- 
encies for the presentation of local party lists, but 
the fate of these lists is not determined within that 
constituency alone but by the success of the party 
in the total national territory. The electoral ar- 
rangements in Denmark and in the German Fed- 
eral Republic are of this type. Some of the seats are 
allocated directly by constituency (in Denmark by 
the Sainte-Laguë formula, in Germany by plu- 
rality); others are allocated on the basis of the 
nationwide result, to ensure proportionality. 

Such large constituencies obviously favor the 
formation of splinter parties: direct PR thresholds 
are functions of both the number of seats and the 
number of parties. To guard against party frag- 
mentation, many systems have introduced higher 
barriers, either on the basis of the percentage share 
of the total national vote or on the basis of the 
number of direct seats already won. Danish law 
requires as a condition for the allocation of “pro- 
portionalized seats” that the party has (a) gained 
one direct seat or (b) received 2 per cent or more 
of the vote across the nation or (c) received in 
two of the three regions a total number of votes 
higher than the average regional cost of direct 
seats. The German threshold is 5 per cent of the 
federal vote or three direct seats. 
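
The two barrier rules can be written as simple predicates. This is a sketch only: the function and parameter names are hypothetical, and the Danish condition (c) is reduced to a count of regions in which the party's vote exceeds the average regional cost of a direct seat.

```python
def qualifies_denmark(direct_seats, national_share, regions_above_seat_cost):
    """Danish rule: (a) one direct seat, or (b) 2 per cent of the national
    vote, or (c) votes above the average regional cost of a direct seat in
    two of the three regions."""
    return (
        direct_seats >= 1
        or national_share >= 0.02
        or regions_above_seat_cost >= 2
    )


def qualifies_germany(direct_seats, federal_share):
    """German rule: 5 per cent of the federal vote or three direct seats."""
    return federal_share >= 0.05 or direct_seats >= 3


# Hypothetical small party: no direct seats, 3 per cent of the vote.
print(qualifies_denmark(0, 0.03, 0))  # True
print(qualifies_germany(0, 0.03))     # False
print(qualifies_germany(3, 0.03))     # True
```

The same hypothetical party clears the Danish barrier but not the German one unless it also wins three direct seats.
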

Cross-constituency equality. The demand for 
equality of representation was at first met at the 
constituency level only. “One man, one vote, one 
value” was enforced within the local unit of aggre- 
gation but not throughout the national territory. 
There were everywhere highly vocal movements for 
the equalization of electoral districts, but these de- 
mands met with greater resistance than the claims 
for equality of influence within each unit. Under 
the inherited systems of estate representation, elec- 
tions were taken to express the will, not of indi- 
vidual citizens, but of the corporate units of the 
nation. A shire or borough might have declined in 
population or in number of enfranchised citizens, 
but it still constituted a unit of government worthy 
of representation on a par with larger units. Even 
after the Reform Act of 1832 in England, differ- 
ences between the lowest and the highest numbers 
of constituents per representative were of the order 
of 1 to 60. The radical redistribution carried out in 
1885 brought the ratio down to 1 to 7, but further 
progress was slow. Even after the reorganization of 
1948 there are still constituencies with electorates 
only one-third the size of the largest in the country. 

Great variations in the ratios of representatives 
to electorates were the rule throughout Europe and 
the West until well into the twentieth century. On 
the European continent the early systems of repre- 
sentation generally gave great advantages to the 
cities; the centers of commerce and industry were 
still small in population but had major stakes in 
the building of the nations. The continuing growth 
of the national economies brought about changes 
in this urban-rural balance. As the populations of 
the cities grew and the franchise was widened, the 
rural areas gradually gained in their 
electorate-representative ratios and became heavily overrep- 
resented. This inequality of representation proved 
highly resistant to protest movements. The more 
conservative voters in the cities had found im- 
portant allies in the countryside and preferred to 
stay underrepresented at home as long as their 
allies could help them in their fight against urban 
radicals (Cotteret et al. 1960; David & Eisenberg 
1961-1962; De Grazia 1963). 

In some countries this urban-rural conflict was 
reinforced through conflicts between the central 
districts and the peripheries. The constituencies 
farthest away from the capital and the economically 
most advanced areas of the nation claimed a right 
to numerical overrepresentation to offset the diffi- 
culties of communication with the decision makers 
and the officials at the center. In Denmark the con- 
stitution of 1953 even goes so far as to stipulate 
that constituencies be allotted seats based not only 
on their population but also on the size of their 
territories. A representative speaks not just for a 
given number of citizens but also for a unit of 
physical territory. Even in the most “proportional- 
ized” of democracies, the electoral arrangements 
still reflect tensions between three conceptions of 
representation: the numerical, the functional, and 
the territorial. 


Priorities for comparative research 


The development in so many countries of stand- 
ardized arrangements for the conduct of elections 
at several levels of the polity sets a wide variety of 
challenging tasks for comparative social research. 
The comparative studies carried out thus far leave 
great gaps in our knowledge. It is, in fact, much 
easier to pinpoint lacunae and lost opportunities 
than to describe positive achievements. 

Given the crucial importance of the organization 
of legitimate elections in the development of the 
mass democracies of the twentieth century, it is 
indeed astounding to discover how little serious 
effort has been invested in the comparative study 
of the wealth of information available. There is no 
dearth of literature, but exceedingly little of it 
stands up to scrutiny in the light of current stand- 
ards of social science methodology. The great bulk 
of the items bear on technicalities and controversies 
within a single national or regional tradition, and 
the few wider-ranging ones tend to take the form 
of vehement polemics against competing systems, 
even when couched in the terms of academic 
discourse. 

The polemical writers tend to fall into two cate- 
gories: the violent majoritarians or the impassioned 
single-vote proportionalists. It is hard to trace any 
distinctive school of list-system proportionalists. 
The party lists have certainly had their defenders, 
but these have tended to be pragmatic and con- 
textual in their argumentation and have not been 
inclined to advertise their solutions as panaceas for 
all countries of the world. 

The majoritarians have been particularly articu- 
late in the three European countries with the un- 
happiest records of mass politics: Germany, Italy, 
and France. In all these deeply divided countries 
there has been widespread nostalgia for the sim- 
plicity of the Anglo-Saxon system of plurality elec- 
tions. A great number of publicists had hoped for 
the development of unified national political cul- 
tures that would foster the kind of trust in terri- 
torial representatives they could observe in Eng- 
land and had somehow come to the conclusion 
that this could be brought about through straight- 
forward electoral engineering. 

In its academic guise this argument was devel- 
oped into a scheme of purportedly universal prop- 
ositions about the consequences of electoral sys- 
tems for the health of the body politic (Hermens 
1941; 1951). This proved a very difficult enter- 
prise. A great deal of information for a wide range 
of countries was processed, but the results were 
meager. The universal propositions gave way to 
complex statements about concrete sequences of 
change, and a bewildering multiplicity of condi- 
tioning variables had to be brought into the anal- 
ysis. It turned out to be simply impossible to 
formulate any single-variable statements about the 
political consequences of plurality as opposed to 
those of PR. A variety of contextual conditions had 
to be brought into the analysis: the character of 
the national cleavage system; the cultural condi- 
tions for the legitimation of representatives; the 
burdens of government and the leeway for legis- 
lative versus executive action (Duverger 1950; 
1951; Epstein 1964; Grumm 1958). 

This did not reduce appreciably the ardor of the 
majoritarians. They stuck to their guns in discuss- 
ing the three major countries of the western Euro- 
pean continent, but they admitted that PR might 
not hurt the functioning of democracy in the 
smaller nations (Unkelbach 1956). A good case 
could be made for plurality elections in Germany, 
within a reasoned analysis of the strategic options 
for the one country (Sternberger 1964; Scheuch 
& Wildenmann 1965), but the academic enterprise 
broke down as soon as attempts were made to 
argue this move for all full-suffrage democracies, 
whatever their structure and whatever their experi- 
ences in consensus building. 

In Anglo-Saxon circles the polemics against 
plurality elections have not been quite as vehement. 
Advocates of PR could not blame the inherited 
electoral system for major national disasters, such 
as Fascism in Italy, National Socialism in Ger- 
many, or the 1940 debacle in France. The single- 
vote proportionalists (Lakeman & Lambert 1955; 
Ross 1955) do have something in common with the 
majoritarians. They tend to express the same naive 
belief in the possibilities of electoral engineering, 
and they show little awareness of the cultural and 
the organizational conditions for the acceptance of 
different systems of representation. 

The majoritarian-proportionalist polemic has re- 
cently been given a new dimension through the 
discussion of the consequences of electoral ar- 
rangements for the achievement and/or survival 
of democracy in the developing countries. A lead- 
ing analyst of the conditions of economic growth, 
W. Arthur Lewis, has formulated a strong indict- 
ment of the Anglo-French majority systems which 
the new African states inherited from their colonial 
masters. He argues that the Anglo-French systems 
had been developed and had found widespread ac- 
ceptance in “class societies” and cannot work in 
the same way in the African “plural” societies— 
territorial polities seeking to integrate within their 
boundaries populations historically hostile to each 
other: 


The surest way to kill the idea of democracy in a 
plural society is to adopt the Anglo-American electoral 
system of first-past-the-post. . . . First-past-the-post does 
not even require 51 per cent of the votes in each con- 
stituency to give one party all the seats. If there are 
three parties it can be done theoretically, with only 
34 per cent; or if there are four parties, with only 
26 per cent. Governments can get away with this in 
secure democracies without destroying faith. But if you 
belong to a minority in a new state, and are being 
asked to accept parliamentary democracy, you can 
hardly build much faith in the system if you win 
30 per cent of those votes and get only 20 per cent of 
the seats, or even no seats at all. If minorities are to 
accept Parliament, they must be adequately represented 
in Parliament. (Lewis 1965, pp. 71-72) 


These, of course, are exactly the arguments used 
in the “plural societies” of Europe for the introduc- 
tion of PR. The entrenched linguistic, religious, or 
ethnic minorities had no faith in the majority rep- 
resentatives and threatened to disrupt the system. 
The introduction of PR was essentially part of a 
strategy of national integration, an alternative to 
monopolization of influence or civil war. But the 
extent of minority entrenchment varied greatly 
from country to country, and the pressures for pro- 
portionalization were nowhere exactly the same. 
This is a high-priority area for comparative re- 
search. To bring about some understanding of the 
great variations in electoral arrangements both in 
the West and in the postcolonial polities, it will be 
essential to study the crucial decisions on the suf- 
frage, on privacy versus secrecy, on plurality versus 
PR, in the context of the process of nation building 
(Bendix 1964; Rokkan 1961; 1966b). 
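
The arithmetic in the Lewis quotation above can be checked with a short sketch. The function names, the party labels, and the five-constituency electorate are hypothetical and appear only for illustration; they are not drawn from Lewis or from any cited study. The first function returns the smallest whole-number percentage that exceeds an even split among the competing parties; the second awards each constituency to its plurality winner and reports the resulting seat percentages.

```python
from collections import Counter


def min_winning_share(n_parties: int) -> int:
    """Smallest whole-number vote percentage that can win a seat
    when n_parties compete: the first integer above an even split."""
    if n_parties < 2:
        raise ValueError("need at least two parties")
    return 100 // n_parties + 1


def fptp_seat_shares(constituencies: list[dict[str, int]]) -> dict[str, float]:
    """Percentage of seats per party when every constituency goes to the
    party with the most votes there (ties are not handled in this sketch)."""
    winners = Counter(max(votes, key=votes.get) for votes in constituencies)
    parties = {party for votes in constituencies for party in votes}
    return {p: 100 * winners[p] / len(constituencies) for p in sorted(parties)}


# Hypothetical electorate: a minority polls 30 per cent in each of five
# equal-sized constituencies and therefore wins none of them.
example = [{"Majority": 70, "Minority": 30} for _ in range(5)]

if __name__ == "__main__":
    print([min_winning_share(n) for n in (2, 3, 4)])  # [51, 34, 26]
    print(fptp_seat_shares(example))  # Majority 100.0, Minority 0.0
```

Under these assumptions the thresholds for two, three, and four parties are 51, 34, and 26 per cent, matching the figures in the quotation, and an evenly spread minority with 30 per cent of the vote obtains no seats at all.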


Electoral systems have not changed in vacuo. 
They function within culturally given contexts of 
legitimacy, and they are changed under the strains 
of critical “growing pains” in the development of 
the over-all constellations of national institutions. 
The comparative study of electoral developments 
can contribute a great deal to the understanding 
of processes and strategies of national integration, 
but the contributions will be meager and unreliable 
as long as the principal motivation for new re- 
search is a concern with the pros and cons of dif- 
ferent schemes of electoral engineering. 

The conditions for a real advance in compara- 
tive electoral research are present. An increasing 
number of dispassionate analyses of national elec- 
toral histories have been forthcoming in recent 
years, and steps are being taken to facilitate the 
conduct of statistical investigations through the 
development of “data archives” for computer analy- 
ses of time-series records (Rokkan 1966a; Rokkan 
& Meyriat 1967). What has been lacking so far 
has been an international forum for the advance- 
ment of detailed comparative studies. A beginning 
has been made, however, and it is hoped that the 
next decades will see a breakthrough in the com- 
parative study of electoral systems. 

STEIN ROKKAN 


[See also LEGISLATION; PARTIES, POLITICAL; REPRESEN- 
TATION, article on REPRESENTATIONAL SYSTEMS; VOT- 
ING.] 
ELECTROCONVULSIVE SHOCK 


It has been known for many years that electric 
current applied to an animal’s head can elicit a 
convulsion. The procedure of eliciting seizures by 
electrical stimulation is termed “electroconvulsive 
shock” (ECS). The term “electroshock therapy” 
(EST) is used in medical references. In the early 
1930s drug-induced convulsions (using insulin or 
Metrazol, for example) were adopted as a treat- 
ment for patients with severe mental disorders. In 
1938 Cerletti and Bini demonstrated that ECS pro- 
vided a more highly controlled and reliable means 
of eliciting convulsions. ECS gradually replaced 
drugs in convulsive therapy treatments and is still
used to some extent as a treatment for mental
disorders. It is considered to be particularly effective
in the treatment of severely depressed patients
(Ulett et al. 1962). Since Cerletti and Bini
introduced ECS there have been numerous extensive
clinical and experimental studies of the nature and
bases of its behavioral effects.

The ECS convulsion. Rats or other rodents
have been used as subjects in the majority of
experimental studies of ECS. In general, the pattern
of the convulsion obtained with rats is similar to
that obtained with larger mammals, including
humans. Convulsions are usually elicited by
stimulating the rats with 25 to 100 milliamperes of
current (AC) for approximately 0.2 to 1 second. The
current is delivered through electrodes that are
either attached to the animal's ear or applied
directly to the rat's corneas. The maximal seizure
(grand mal) consists of a highly stereotyped
sequence of movements. The animal's hind legs are
first drawn up and then extended. This tonic
extension lasts for several seconds and is followed by a
brief phase of whole-body clonus. The animal then
remains in an immobile state for several minutes.
A maximal convulsion can again be elicited in
approximately ten minutes. For about 15 minutes
following the convulsion the animals are usually
hyperirritable. Current below that necessary to
elicit a maximal seizure may produce violent
running, temporary immobilization, or clonic
convulsions. Maximal seizures can be prevented by
administering to the animals such depressant drugs
as ether, phenobarbital, or diphenylhydantoin prior
to the ECS stimulation (Toman et al. 1946).
Maximal seizure thresholds increase with the age of the
animals and, in different laboratory species, vary
directly with the weight of the animals.


Behavioral effects of multiple ECS 


In the clinical use of ECS, patients are usually 
given a series of treatments. Most experimental 
studies with laboratory animals have adopted this 
procedure. This procedure is, of course, appropriate 
for research viewed as an experimental analogue 
of the clinical treatments. However, recent evi- 
dence indicates that ECS has numerous behavioral 
effects and that the effects of a single ECS treat- 
ment are quite different from those produced by 
a series of treatments. 

Activity, sexual and maternal behavior. When 
rats are given a series of ECS treatments (one or 
more per day for 7 to 25 days) their behavior 
is markedly affected. Spontaneous activity is de- 
pressed, male sexual arousal is impaired, and 
maternal behavior (for example, nest building and 
care of the young) is disrupted (Munn 1950; 
Beach et al. 1955). The animals’ behavior usually 
returns to normal within a few weeks after the 
treatments are discontinued. 

Learning and retention. Multiple ECS treat- 
ments have also been found to impair rats’ maze 
learning and retention. The degree of impairment 
is directly related to the complexity of the maze 
task. However, the impairment appears to be tem- 
porary. Little impairment is found if a month 
elapses between the last ECS treatment and the 
maze learning or retention tests. There is no im- 
pairment of performance if convulsions are pre- 
vented by the delivering of the ECS stimulation 
while the animals are anesthetized with ether 
(Munn 1950; Russell 1948). 

Studies of memory in human patients given a 
series of ECS treatments yield findings similar to 
those of the rat studies. Patients typically experi- 
ence impaired memory for several weeks following 
the termination of the treatments. This deficiency 
is fairly general and typically involves difficulty 
remembering well-learned life-history data as well 
as recently experienced events. Although the defi- 
cits generally disappear within a few weeks, deficits 
have been observed over a period of months in 
some patients. Some investigators have suggested 
that temporary memory impairment may contrib- 
ute in some way to the therapeutic effectiveness of 
ECS treatments (Janis & Astrachan 1951). 

Conditioned emotional response. A number of 
studies have shown that a series of ECS treatments
is particularly effective in attenuating a 
learned emotional response (Hunt 1965). In these 
studies rats were first trained to press a lever for 
a water reward. They then were presented with a 
series of trials in which a clicking noise was fol- 
lowed by a painful shock delivered to their feet. 
Within a few trials the clicking noise elicited a 
conditioned emotional response (CER) consisting 
of crouching, urination, defecation, and depressed 
rate of lever pressing. The rats were then given a 
series of 21 ECS treatments, three per day for 
seven days. On subsequent tests the clicking noise 
failed to elicit the CER. Less of an attenuating 
effect was found if fewer ECS treatments were 
given or if several weeks elapsed between the CER 
training and the ECS treatments. The differential 
effect of ECS on the lever-pressing response and 
the CER is not due to the fact that the CER was 
the last response learned. Similar results are ob- 
tained when the CER training is given prior to the 
learning of the lever-pressing response. The effects 
of multiple ECS on the CER, like those found with 


maze learning and retention as well as sexual and 
maternal behavior, are transient. The CER typi- 
cally reappears within a month following the treat- 
ments. The convulsions appear to play a critical 
role in attenuating the CER. Multiple ECS treat- 
ments do not attenuate the CER if the convulsions 
are prevented by etherizing the rats prior to each 
ECS treatment. 

It is clear from these studies that a series of 
ECS treatments markedly affects rats’ behavior. 
The findings indicate, however, that the effects 
are for the most part temporary. Effects on sexual 
arousal and learning and retention generally last 
for only a few weeks following the treatments. 
Further, it seems clear that the convulsions are 
essential for the effects observed. There is little 
evidence of behavioral effects of multiple ECS 
treatments when convulsions are prevented by 
anesthetization of the animals prior to the treat- 
ments. 


Behavioral effects of a single ECS 


Shortly after the introduction of ECS clinical 
observations indicated memory loss as a common 
consequence of such treatments. In addition to a 
general loss of ability to remember names, events, 
and personal life history, patients seemed to have 
amnesia for events that had occurred shortly be- 
fore each treatment. This phenomenon has been 
termed “retrograde amnesia.” Systematic studies 
of memory in patients treated with ECS have
confirmed this clinical observation. In an early study,
for example, patients were shown a series of
pictures prior to receiving an ECS treatment. In tests
given the next day they showed poorest retention
of the last pictures seen before convulsion
(Mayer-Gross 1943).

Evidence that ECS produces retrograde amnesia 
has been obtained in a large number of experi- 
ments with rats. In the first two of such studies 
(Duncan 1949; Gerard 1955) rats and hamsters, 
respectively, were given a single ECS treatment 
after each trial in a learning task. The animals were 
arranged in different groups and each group was 
given the ECS treatment at a different time interval 
after each trial. With the intervals shorter than one 
hour, the rate of learning increased directly with 
increases in the length of the interval between the 
trial and the ECS treatment. Learning was not 
affected by the treatments given one hour or longer 
after each trial. Inasmuch as all of the experimen- 
tal animals were given a series of ECS treatments, 
it is not possible to discount the possibility that at 
least some of the effects observed in these studies 
were due to the cumulative effects of repeated 


ECS treatments. It is important to note, however, 
that treatments did not affect behavior unless they 
were administered shortly after each training trial. 
Thus, these findings contrast with findings of a 
general learning and retention impairment in rats 
given a series of ECS treatments prior to learning 
or retention tests. 

Evidence of the retrograde amnesic effect of ECS 
has also been obtained in a large number of studies 
in which the animals were given only a single ECS 
treatment. In a series of experiments Thompson 
and his associates (for example, Thompson & Dean 
1955) administered an ECS to different groups of 
rats at different intervals following massed training 
trials on a visual discrimination problem. The find- 
ings were similar to those obtained by Duncan and 
Gerard. On retraining trials, 48 hours later, animals 
given an ECS treatment four hours after the train- 
ing did not react differently from the controls. In 
animals treated within one hour after the training, 
efficiency of relearning varied directly with the 
interval between the training and the ECS treat- 
ments. 

In other research Thompson and other investi- 
gators have shown that the degree of the retrograde 
amnesia found with ECS depends upon numerous 
conditions, including the age and strain of the 
subjects, degree or strength of original learning, 
and complexity or difficulty of the learning task. 
The greatest effects are found when the rats are 
young or brain damaged and when the learning 
task is difficult. The duration of the retrograde 
amnesic effect of ECS is typically limited to a few 
minutes when a relatively simple learning task is 
used (Glickman 1961). 


Basis of retrograde amnesic effect 


During recent years most of the interest in ECS 
has centered on the problem of the basis of the 
retrograde amnesic effect. Various hypotheses have 
been proposed. 

Brain damage. A number of investigators have 
suggested that the amnesic effect of the ECS may 
be due to brain damage produced by the current. 
Studies of the brains of experimental animals sub- 
jected to ECS treatments indicate that there is some 
evidence that ECS produces some changes in the 
brains—particularly small hemorrhages. However, 
the changes are generally minor and reversible 
(Madow 1956). It could be that the confusion and 
general impairment of learning and retention found 
with a series of ECS treatments is due in part to 
reversible vascular damage. Such effects could not, 
however, account for the retrograde amnesic effects 
of ECS. A single ECS treatment has little or no 
amnesic effect if the treatment is given several 
hours after training has been terminated. Further, 
a single ECS does not impair rats’ subsequent ability 
to learn a new task or perform previously well- 
learned tasks. Thus, although ECS may produce 
brain damage, it would be difficult to explain the 
differential effects of ECS on recent and older mem- 
ories in terms of brain damage. 

Interference with memory consolidation. The 
most generally accepted interpretation of the retro- 
grade amnesic effect of ECS is that the ECS inter- 
feres with the neurophysiological processes involved 
in storage or consolidation of memory traces 
(Glickman 1961). Evidence from a variety of clin- 
ical and experimental studies has provided strong 
support for the hypothesis that memory trace 
consolidation is based upon the perseveration of 
neurophysiological processes initiated by an experi- 
ence. In humans retrograde amnesia is a common 
consequence of head injuries. In experimental 
studies with laboratory animals retrograde amnesia 
comparable to that found with ECS treatments has 
been produced by such treatments as hypoxia, 
drugs, and audiogenic seizures. The treatments 
seem to prevent the storage of information acquired 
during the training trials just prior to the treat- 
ment. According to this interpretation the memory 
loss should be permanent. Available evidence is 
consistent with this hypothesis. In a study by 
Chevalier (1965) rats were trained for a task and 
then given a single ECS. Clear evidence of retro- 
grade amnesia as long as 60 days later was ob- 
tained. 

Some investigators have assumed that the time 
required for consolidation to occur after training 
is indicated by the minimum interval between train- 
ing and ECS treatment within which no memory 
impairment is found. However, since different 
kinds of post-training treatments produce varying 
degrees of retrograde amnesia, it is more likely 
that memory storage involves a sequence of proc- 
esses and that the different treatments are capable 
of interfering with different processes. For exam- 
ple, with ECS treatments retrograde amnesia is 
usually obtained only when a few minutes or at 
most an hour elapses between training trials and 
treatments. With drugs (for example, Metrazol and 
puromycin) retrograde amnesia has been found 
with training-treatment intervals as long as several 
days (McGaugh & Petrinovich 1965). As indicated 
above, the deficits in learning and memory ob- 
served following a series of ECS treatments tend to 
disappear within a few weeks. 

Most of the experimental studies of ECS effects 
on memory have used learning tasks employing 




aversive motivation—usually punishing shock. Con- 
sequently, it has been suggested that the ECS 
treatments may produce only a selective amnesia 
for the aversive stimulation. This intriguing hy- 
pothesis, however, is not supported by the data. 
Clear evidence of retrograde amnesia has been ob- 
tained in studies using food and water rewards. 
Further, in studies using shock motivation in dis- 
crimination learning tasks, the ECS seems to have 
its primary effects on the memory of the correct 
cue rather than on the motivation for responding. 

Punishing effects of ECS. The findings of sev- 
eral studies suggest that impairment of perform- 
ance found following repeated ECS treatments may 
be due in part to aversive effects of the treatments. 
For example, Friedman (1953) found that rats’ 
performance of a lever-pressing response for food 
reward was depressed after the animals were given 
a series of ECS treatments in the apparatus. Be- 
havior of the animals suggested that they had 
learned to fear the apparatus. The animals uri- 
nated, defecated, trembled, approached the lever 
hesitantly, and then ran away from it. The depres- 
sion of responses and emotional behavior was 
considerably less marked in animals given the ECS 
treatments in a dissimilar apparatus. No effects 
were observed, however, in subjects treated while 
under ether anesthesia. The behavioral effects ap- 
peared to be due to the convulsions rather than to 
the ECS current. 

Other studies have shown that rats tend to stop 
performing responses that are repeatedly followed 
by ECS treatments. These findings have led some 
investigators to suggest that the learned fear rather 
than amnesia may be the cause of the impaired 
performance of subjects given ECS shortly after 
training. Recent evidence indicates that, paradoxi- 
cally, ECS treatments have both amnesic and aver- 
sive effects; the amnesic effects, however, cannot 
be explained in terms of the aversive effects. In 
one study rats were given a single foot shock as 
they stepped from a small platform to the floor of 
a table; half of the rats were given an ECS within 
a few seconds. The next day the rats were placed 
on a platform again; those that had received the 
foot shock but no ECS tended to remain on the 
platform; those that had been given the ECS treat- 
ment displayed little evidence of remembering the 
previous foot shock and most of them readily 
stepped off the platform. In a subsequent study 
using a similar procedure rats were given an ECS 
in the apparatus each day for eight days
immediately after they had stepped from the platform. 
Latencies of stepping from the platform increased 
gradually over the eight days. Other rats were given 


only a foot shock, and most of these animals 
learned to stay on the platform within two trials. 
The performance of rats given foot shocks followed 
by ECS was similar to that of rats given only ECS 
treatments. Neither amnesic nor punitive effects 
were found when the ECS treatments were given 
one hour after each trial (McGaugh 1965). The 
results of these studies indicate that the aversive 
effects of ECS are found with repeated treatments, 
whereas retrograde amnesia can be obtained with 
a single ECS treatment. Thus, the amnesic effects 
of a single ECS cannot be explained in terms of 
the punishing effects of the treatment. 

The paradox remains, however. What is the basis 
of the punishment if the treatment produces am- 
nesia for events just prior to the ECS stimulation? 
One possibility is that the subjects learn gradually 
to associate the apparatus cues with the aftereffects 
of the convulsion. Patients given a series of ECS 
treatments tend to develop a fear of the treatments 
(Gallinek 1956). The patients readily admit that 
they do not experience any discomfort during the 
treatment. The fear seems to be based on the 
severe disorientation experienced while recovering 
from each treatment. The finding that the aversive 
effects in rats are eliminated by administering the 
ECS while the animals are anesthetized lends sup- 
port to the interpretation that the aversion is based 
on the aftereffects of the convulsions. A more com- 
plete understanding of the basis of the aversive 
effects of ECS treatments requires additional re- 
search [see LEARNING, articles on REINFORCEMENT 
and AVOIDANCE LEARNING]. 

Competing response hypothesis. Lewis and 
Maher (1965) have proposed still another inter- 
pretation of the amnesic effects of ECS. These in- 
vestigators suggested that through conditioning, 
the cues in the apparatus elicit behavioral inhibi- 
tion, that is, a general muscular relaxation and 
lowered level of activity that competes with and 
thus interferes with the performance of the previ- 
ously learned response. Most of the findings of 
ECS studies, however, are clearly inconsistent with 
this hypothesis. First, as indicated above, amnesia 
is found with a single treatment while other
behavioral effects, including changes in response latency 
and freezing and crouching, appear only after a 
series of ECS treatments. Second, retrograde am- 
nesia is obtained with a single treatment even when 
the treatment is administered outside of the
training apparatus (this, in fact, is the typical
procedure). Third, rats given a series of ECS treatments 
in an apparatus do not appear to be relaxed. They 
typically urinate, defecate, and tremble. Further, 
rats will actively avoid a place in an apparatus 


where they are given repeated ECS treatments 
(McGaugh 1965). These findings are clearly incon- 
sistent with the conditioned inhibition interpreta- 
tion of ECS effects. 

Contribution of the convulsions. There is clear 
evidence that the effects of repeated ECS treat- 
ments are different from those produced by a single 
ECS. The effects of repeated ECS treatments seem 
to be due to the convulsions rather than to the 
current. Impaired maternal behavior, learning and 
retention deficits, suppression of a CER, and pun- 
ishing effects are not found if the convulsions are 
prevented by the administration of the ECS while 
the subjects are anesthetized. Recent work indi- 
cates, however, that the convulsions are not essen- 
tial for the production of retrograde amnesia with 
a single ECS treatment. In unpublished research 
McGaugh and Alpern obtained clear evidence of 
retrograde amnesia in animals anesthetized with 
ether just prior to receiving a single ECS treatment. 
These findings suggest that the retrograde amnesic 
effect of ECS is produced by the current rather 
than the convulsion and are consistent with other 
recent evidence that retrograde amnesia can be 
produced by restricted subcortical (and subconvul- 
sive) electrical stimulation of the brain (Williston 
et al. 1964). 


Although many of the effects of ECS remain to 
be explained, it is clear that ECS has a variety of 
behavioral effects and that the varied effects are 
not readily explained in terms of any single hy- 
pothesis. In particular, the retrograde amnesic 
effects of ECS must be considered separately from 
the diverse and complex effects found when ECS 
is repeatedly administered. The results of experi- 
mental studies of ECS have, as yet, shed little light 
on the basis of the therapeutic effectiveness of ECS 
in human patients. The findings have, however, 
had considerable influence on theories and re- 
search concerned with learning and memory. It 
may be that the therapeutic effectiveness is due 
to the learning and memory effects; an evaluation 
of this hypothesis must await further research. 


JAMES L. McGAUGH 


[Other relevant material may be found in DEPRESSIVE 
DISORDERS; MENTAL DISORDERS, TREATMENT OF, arti- 
cle on SOMATIC TREATMENT; NERVOUS SYSTEM, 
article on BRAIN STIMULATION.] 
ELITES 


The concept of elites is used to describe cer- 
tain fundamental features of organized social life. 
All societies—simple and complex, agricultural and 
industrial—need authorities within and spokesmen 
and agents without who are also symbols of the 
common life and embodiments of the values that 
maintain it. Inequalities in performance and re- 
ward support this arrangement, and the inequality 
in the distribution of deference acknowledges the 
differences in authority, achievement, and reward. 
Elites are those minorities which are set apart 
from the rest of society by their pre-eminence in 
one or more of these various distributions. We shall 
concentrate here on the elites of industrial society. 

In modern societies of the West, there is no 
single comprehensive elite but rather a complex 
system of specialized elites linked to the social 
order and to each other in a variety of ways. In- 
deed, so numerous and varied are they that they 
seldom possess enough common features and af- 
finities to avoid marked differences and tensions. 
Leading artists, business magnates, politicians, 
screen stars, and scientists are all influential, but 
in separate spheres and with quite different re- 
sponsibilities, sources of power, and patterns of 
selection and reward. This plurality of elites
reflects and promotes the pluralism characteristic of 
modern societies in general. 

For virtually every activity and every correspond- 
ing sphere of social life, there is an elite: there 
are elites of soldiers and of artists, as well as of 
bankers and of gamblers. This is the sense in 
which Pareto (1902-1903) used the term. There 
is, however, an important factor that differentiates 
these various elites, apart from their different skills 
and talents: some of them have more social weight 
than others because their activities have greater 
social significance. It is these elites—variously re- 
ferred to as the ruling elite, the top influentials, 
or the power elite—which arouse particular inter- 
est, because they are the prime movers and models 
for the entire society. We shall use the term 
strategic elites to refer to those elites which claim 
or are assigned responsibilities for and influence 
over their society as a whole, in contrast with seg- 
mental elites, which have major responsibilities in 
subdomains of the society. 

Strategic elites are those which have the largest, 
most comprehensive scope and impact. The bound- 
aries that separate strategic and segmental elites 
are not sharply defined because of the gradations 
of authority and the vagueness of the perceptions 
that assign positions to individuals. The more 


highly organized elites are, the easier it is to esti- 
mate their boundaries and membership. Thus, the 
more readily identifiable elites in Western societies 
are those of business, politics, diplomacy, and the 
higher civil and armed services. Elites in the arts, 
in religion, and in moral and intellectual life are 
more vaguely delimited and hence also more con- 
troversial. 

The differentiation of elites. Even the earliest- 
known human societies had leading minorities of 
elders, priests, or warrior kings, who performed 
elite social functions. A chief in a primitive so- 
ciety, for example, enacted one complex social role 
in which were fused several major social functions, 
expressed through the following activities: organi- 
zation of productive work; propitiation of, and 
communication with, supernatural powers; judg- 
ment and punishment of lawbreakers; coordination 
of communal activities; defense of the community 
from enemy attack; discovery of new resources and 
of new solutions to the problems of collective sur- 
vival; and encouragement or inspiration of artistic 
expression. As societies expand in size and in the 
diversity of their activities, such activities also ex- 
pand, and more elaborate, specialized leadership 
roles emerge. Following are some of the major 
forms of societal leadership. 

(1) Ruling caste. One stratum performs the 
most important social tasks, obtains its personnel 
through biological reproduction, and is set apart 
by religion, kinship, language, residence, economic 
standing, occupational activities, and prestige. Re- 
ligious ritual is the main force that supports the 
position of this ruling stratum [see CASTE]. 

(2) Aristocracy. A single stratum monopolizes 
the exercise of the key social functions. The stra- 
tum consists of families bound by blood, wealth, 
and a special style of life and supported by income 
from landed property. 

(3) Ruling class. A single social stratum is as- 
sociated with various key social functions, and its 
members are recruited into its various segments 
on the basis of wealth and property rather than of 
blood or religion. Historically, ruling classes have 
held economic rather than political power, but 
their influence tends to extend to all important 
segments and activities of society. Although various 
differentiated and specialized sectors may be dis- 
tinguished, they are bound together by a common 
culture and by interaction across segmental bound- 
aries. 

(4) Strategic elites. No single social stratum 
exercises all key social functions; instead, these 
functions and the elites associated with them are 
specialized and differentiated. The predominant 


justification for holding elite status is not blood 
or wealth as such but, rather, merit and particular 
skills. Accordingly, these elites are recruited in 
various ways adapted to their differentiated tasks 
and are marked by diversity as well as by im- 
permanence. 

In general it appears that where the society as 
a whole is relatively undifferentiated, elites are few 
in number and comprehensive in their powers; 
where social differentiation is extensive, elites are 
many and specialized. The principal social forces 
underlying the change from societal leadership 
based on aristocracy or ruling class to that based 
on strategic elites are population growth, occupa- 
tional differentiation, moral heterogeneity, and in- 
creased bureaucratization. In a large, industrialized 
mass society, marked by innumerable ethnic, re- 
gional, and occupational differences and stratified 
as to work, wealth, prestige, style of life, and 
power, leadership cannot be entrusted to a single 
ruler, be he chief, warrior, or priest, or to a single 
stratum marked by hereditary exclusiveness and 
traditionalism. Instead, the elites of this society 
will tend to be varied, specialized, and differenti- 
ated as to skill, style, background, and rewards. 
In this way the characteristic attributes of the 
larger society are mirrored in the strategic elites 
through whom that society tries to realize its main 
goals and projects. The division of a society into 
many groups and strata is therefore paralleled by 
its reunification around a symbolic center, or core, 
that signifies the common and enduring character- 
istics of the differentiated whole. The shape of this 
center is determined by the complexity and variety 
of the whole. In this way a society, consisting of 
a multitude of individuals and groups, can act in 
concert despite its moral, occupational, and tech- 
nological diversity and can maintain the sense of 
unity necessary for collective achievements. 

The functions of strategic elites. In every dif- 
ferentiated society, there are patterns of beliefs 
and values, shared means of communication, major 
social institutions, and leading individuals or groups 
concerned with the maintenance and development 
of the society and its culture. These leading ele- 
ments, by focusing attention and coordinating ac- 
tion, help keep the society in working order, so that 
it is able to manage recurrent collective crises. 

The best efforts at classifying elites are still 
those of Saint-Simon (1807) and Mannheim 
(1935), whose approaches, although separated by 
a century, have much in common. Saint-Simon di- 
vided elites into scientists, economic organizers, 
and cultural-religious leaders. This classification 
parallels Mannheim’s distinction between the
organizing and directing elites, which deal with
concrete goals and programs, and the more diffuse 
and informally organized elites, which deal with 
spiritual and moral problems. 

Elites may also be classified according to the 
four functional problems which every society must 
resolve: goal attainment, adaptation, integration, 
and pattern maintenance and tension management. 
Goal attainment refers to the setting and realiza- 
tion of collective goals; adaptation refers to the 
use and development of effective means of achiev- 
ing these goals; integration involves the mainte- 
nance of appropriate moral consensus and social 
cohesion within the system; and pattern mainte- 
nance and tension management involve the morale 
of the system’s units—individuals, groups, and 
organizations. 

Accordingly, four types of strategic elites, which 
may include a far larger number of elites, may be 
identified: (1) the current political elite (elites of 
goal attainment); (2) the economic, military, dip- 
lomatic, and scientific elites (elites of adaptation); 
(3) elites exercising moral authority—priests, phi- 
losophers, educators, and first families (elites of 
integration); and (4) elites that keep the society 
knit together emotionally and psychologically, con- 
sisting of such celebrities as outstanding artists, 
writers, theater and film stars, and top figures in 
sports and recreation (pattern-maintenance elites). 

Thus, the general functions of elites appear to be 
similar everywhere: to symbolize the moral unity 
of a collectivity by emphasizing common purposes 
and interests; to coordinate and harmonize diversi- 
fied activities, combat factionalism, and resolve 
group conflicts; and to protect the collectivity from 
external danger. 

Societies differ, however, in the way they in- 
corporate these functions into living institutions. 
In some societies, usually at simpler stages of de- 
velopment, one agent assumes responsibility for 
all four system functions; in others, several spe- 
cialized agents emerge. In advanced industrial 
societies the tendency is clearly toward several 
elites whose functional specialization is accom- 
panied by a growing moral and organizational 
autonomy among them. At the same time, how- 
ever, the overriding goals of these elites are, as 
they have always been, the preservation of the 
ideals and practices of the societies at whose apex 
they stand. 

Recruitment of strategic elites. Elite replace- 
ment, which occurs in all societies, involves both 
the attraction of suitable candidates and their ac- 
tual selection. What is considered suitable depends 
on the structure of the elite groups and on whether
these elites assume comprehensive or specialized
functional responsibilities. Recruitment mechan- 
isms, however varied in practice, reflect only two 
fundamental principles: recruitment on the basis 
of biological (and, implicitly, social) inheritance 
and recruitment on the basis of personal talents 
and achievements. Although these two systems are 
not mutually exclusive, one or the other tends to 
prevail, depending on the system of social stratifi- 
cation, on the values placed on ascription and 
achievement, and on the magnitude of demand 
for elite candidates in relation to the supply. 
Broadly stated, these principles reflect the general 
tendencies within a social system toward expan- 
sion or toward consolidation. Under conditions of 
expansion, recruitment on the basis of personal 
achievement is likely to be the rule; under con- 
solidation, recruitment based on inheritance of 
status. Each principle, moreover, has profound so- 
cial repercussions on social mobility, on the stimu- 
lation of individual ambitions and talents, and on 
levels of discontent among different social strata. 
Each, furthermore, affects not only the composi- 
tion of the elites but also their spiritual and moral 
outlook. 

In modern industrial societies recruitment and 
selection patterns reflect the changes toward dif- 
ferentiation and autonomy among the elites. Ac- 
cording to available evidence from a number of 
such societies, recruitment based on social inheri- 
tance is giving way to recruitment based on in- 
dividual achievement. This is true for England 
(Cole 1955; Guttsman 1963; Thomas 1959), Ger- 
many (Deutsch & Edinger 1959; Stammer 1951; 
Dreitzel 1962), France (Aron 1950), the United 
States (Warner & Abegglen 1955; Mills 1956; 
Matthews 1960; Keller 1963), and the Soviet 
Union (Fainsod 1953; Crankshaw 1959), among 
others. Nonetheless, taking the elite groups as a 
whole, we note the simultaneous operation of several
recruitment and selection principles. Some
elites stress ancestry; others, educational attain- 
ments; still others, long experience and training. 
Some elites are elected by the public, others are 
appointed by their predecessors, and still others 
are born to their positions. The members of some 
elites have relatively short tenure, while that of 
others is lifelong. This is a dramatic contrast to 
other types of societies with relatively small lead- 
ership groups that have diffuse and comprehensive 
functional responsibilities and comprise individ- 
uals trained for their status from birth on. 

Of course, looking at modern developments at 
a single point in time, we note that the hold of 
the past, with its emphasis on property or birth,
is still very strong among some elites. Conspicuous
achievements are still often facilitated, if not de- 
termined, by high social and economic position, 
since wealth and high social standing open many 
doors to aspiring candidates and instill in them 
great expectations for worldly success. From a 
long-range perspective, however, it is clear that the 
link between high social class and strategic elite 
status has, in many modern societies, become in- 
direct and informal. Ascribed attributes, such as 
birth, sex, and race, although they play a greater 
role in some elites than in others, have decreased 
in importance in comparison with achieved attri- 
butes. This is in line with the general modern 
trend toward technological and scientific speciali- 
zation, in which individual skill and knowledge 
count more than does a gentlemanly upbringing 
in the traditions and standards of illustrious fore- 
bears. 

Rewards of strategic elites. The process of se- 
lection or allocation is facilitated by the system of 
rewards offered to individuals assuming leader- 
ship positions in society. Some rewards are tangible 
material benefits, such as land, money, cattle, or 
slaves, and others are intangible, such as social 
honor and influence. The specific rewards used to 
attract potential recruits to elite positions depend 
on the social definition of scarce and desirable 
values and the distribution of these values. 

Rewards play a twofold role in the recruitment 
of elites: they motivate individuals to assume the 
responsibilities of elite positions, and they maintain
the high value placed on these positions. They
thus serve as inducements to individuals, as well 
as indicators of rank. 

Rewards, too, have become specialized in mod- 
ern industrial societies. Some elites enjoy large 
earnings; others, popularity or fame; and still 
others, authority and power. Not all elites are 
equally wealthy, not all have equal prestige; only 
some have much more power than others, and 
none have influence in all spheres. The assumption 
of elite positions thus also involves the acceptance 
of specific rewards associated with them. Respon- 
sibilities and rewards form parts of a whole and 
may be discussed jointly. And each is linked to 
recruitment, for rewards are the spur to the ex- 
penditure of effort that the duties of strategic 
positions demand. 

The process of recruiting elites and the manner 
of rewarding them must not be confused with their 
purposes and status. For although recruitment and 
rewards affect the composition and performance 
of elites, they do not alter their functions. As 
Mosca (1896) clearly demonstrated, democratically
and hereditarily recruited elites differ in
many important ways, but they nonetheless func- 
tion as elites. 

The tendency toward a pluralization of elites 
is likely to conflict with the older tendency toward 
the monolithic exercise of power and leadership. 
This is a problem in totalitarian as well as in lib- 
eral societies. In totalitarian societies, the problem 
is how to permit the desired flexibility and variety 
without corroding social stability. Conversely, in 
liberal pluralist systems, the problem is how to 
achieve the necessary degree of social cohesion and 
moral consensus among partly autonomous, highly 
specialized, yet functionally interdependent elites. 
The cohesion and consensus are necessary if the 
society is to pursue common goals and is to be 
unified in more than name only. 

These recent tendencies and trends are neither 
absolute nor inevitable. They are clearly mani- 
fested today in a wide variety of contexts and 
reflect the tempo of social change in a technolog- 
ically expanding world. Should this tempo slow 
down markedly or cease altogether, the impulses 
toward rigidity and ascription may well come to 
the fore once again, albeit within a social structure 
shaped by centuries of industrialism. Some secu- 
rity and stability will be gained, but at the price 
of adventure and novelty—a familiar exchange in 
the annals of history and one bound to be reflected 
in the character and stamp of the strategic elites. 


SUZANNE KELLER 


[See also BUREAUCRACY; COMMUNITY, article on THE
STUDY OF COMMUNITY POWER; POLITICAL SOCIOLOGY;
and the biographies of ARISTOTLE; MANNHEIM;
MICHELS; MILLS; MOSCA; PARETO; PLATO; SAINT-SIMON.]


ELLIS, HAVELOCK 


Havelock Ellis was born on February 2, 1859, in 
Croydon, England, the son of an English sea cap- 
tain, Edward Peppen Ellis. His mother, Susannah 
Wheatley Ellis, was a highly energetic and viva- 
cious woman. Ellis felt, however, that he owed 
much to the mediocrity of his father’s family, most 
of whose males, “whatever their occupation, have 
all the qualities of trustworthy bank clerks”; and 
their temperate and cheerful acceptance of the 
world, according to Ellis, helped to modify his own 
literary-aesthetic temperament and prevent him
from adopting a one-sided, excessive, or eccentric 
view of life. 

Ellis went to a boarding school, the Poplars, at 
Tooting, where he was well grounded in French, 
German, and Italian. It was here that one of his 
masters, Angus Mackay, revealed to him the de- 
lights of nineteenth-century English literature and 
helped arouse his vital interest in philosophic and 
politico-economic questions of the day. At 16 he
began an undistinguished career as a teacher and
later headmaster in Australia. At the age of 19, 
however, he came under the influence of and was 
in effect converted by the writings of the philosopher-surgeon
James Hinton. Hinton’s book Life
in Nature made such a profound impression on 
young Ellis that he decided to undertake the study 
of medicine in order to do research and writing in 
the field of sex. Ellis received his medical training 
at St. Thomas’ Hospital in London, and as a med- 
ical assistant he attended a number of patients, 
many of them women in labor. He obtained his 
M.D. in 1889 but did not practice medicine; instead, 
he devoted the rest of his life to editing—for many 
years his main source of income—and writing. 

During the 1880s Ellis wrote on literary and 
social subjects for first-rate English journals and 
edited the Mermaid Series of Elizabethan drama- 
tists (a series of scholarly reprints), the Con- 
temporary Science Series, and other works. He 
made a name for himself in the field of belles- 
lettres with such books as The Soul of Spain, 1908, 
and Impressions and Comments, 1914-1924, and 
in the field of science and its social implications 
with such writings as A Study of British Genius 
(1904), The World of Dreams (1911), and The 
Dance of Life (1923). 

Havelock Ellis is known best as a researcher and 
philosopher in the field of sex and love. Beginning
his studies of human sexuality with a fact-packed 
book, Man and Woman (1894), he went on to 
write his monumental seven-volume Studies in the 
Psychology of Sex (1897-1928). Although origi- 
nally banned in his native England, the Studies 
became widely read and cited in all other parts of 
the world and were without question the most 
influential and precedent-shattering volumes on 
human sexuality ever written, up to the time of 
the publication of the Kinsey reports. They were 
followed by several other important books on sex-love
relations from Ellis’ pen, including The Task
of Social Hygiene (1912), Little Essays of Love 
and Virtue (1922-1931), Psychology of Sex 
(1933), My Life (1939), and Sex and Marriage 
(1951). 

It is difficult to spotlight the most important and 
influential of Ellis’ contributions to the subject of 
sex. He produced the first notable scientific book 
on homosexuality; he pioneered in the presentation 
of full case histories, diaries, and letters on sexual 
subjects; he was the first important popularizer of 
the subject of sex-love relations; he was an outstanding
crusader against sex censorship; he convincingly
showed the interrelationships between
human sexuality and the love emotions; he did
some original research on masturbation, using him- 
self as a subject; and he presented many original 
and well-formulated ideas on sexual modesty, the 
biology and psychology of the sexual impulse, 
sexual periodicity, erotic symbolism, transvestitism, 
and several other sexual-amative aspects of life.
He and Sigmund Freud did more to make sex a re- 
spectable word than any other writers of their day. 
Although Ellis was largely a devotee of the library 
rather than a clinician or a laboratory scientist, his 
careful sex research has inspired much clinical and 
laboratory investigation. 

The remarkable thing about Havelock Ellis’ sex 
writings is that while they are factual, objective, 
and coolly analytical, they are often pervaded with 
a thoroughly humane, love-centered (rather than 
sex-centered), and at times aesthetic-mystic qual- 
ity that makes his views acutely personal as well 
as dispassionately scientific. He himself was a 
mild undinist, suffering from sexual shyness and 
inadequacy during his youth. Ellis was married 
for 25 years to a basically lesbian woman, Edith 
Lees Ellis, with whom he nonetheless had a re- 
markably intense love relationship. He achieved 
real sexual fulfillment, however, during the last 
twenty years of his life in his extramarital relation- 
ship with Françoise Delisle (he died in 1939 in 
Suffolk). It seems clear that Ellis’ own sex experiences,
as well as his personal naturist-humanist
philosophy of life (which he carried to almost religious
extremes), combined to enable him to view
human sexuality in a uniquely realistic yet essen- 
tially poetic way and to make him the best and 
most effective antipuritan of the late nineteenth 
and early twentieth centuries. 

ALBERT ELLIS 


[For discussion of the subsequent development of Ellis’ 
work, see SEXUAL BEHAVIOR.]


ELLWOOD, CHARLES A.


Charles Abram Ellwood (1873-1946), known 
for his efforts to establish a scientific psychological 
sociology in the United States, was born near Og- 
densburg, New York. He entered Cornell University 
in 1892, initially intending to study law. However, 
at Cornell he met Edward A. Ross, later a famous 
sociologist and then, early in his career, teaching 
economics. Ross induced Ellwood to abandon his 
plans for a legal career and to turn to the social 
sciences. Ellwood specialized in sociology and eco- 
nomics. His studies in statistics and demography 
were mainly directed by Walter F. Willcox; those in 
political science and economic research by Jere- 
miah W. Jenks. His instructors were oriented 
toward social reform, and by the time he was gradu- 
ated, in 1896, he had become convinced that the 
main objective of social science should be to im- 
prove public well-being. This became the leitmotiv 
of his sociological writings throughout his life. 

Ellwood went to the University of Chicago to 
pursue graduate work in sociology, being guided by 
W. I. Thomas and George H. Mead in social psy- 
chology, John Dewey in psychology and pragmatic 
philosophy, and Albion W. Small in systematic 
sociology and social-reform doctrine. Small advised 
him to spend a year at the University of Berlin, 
which he did in 1897/1898, studying historical and 
reformist economics, mainly under Gustav Schmol- 
ler, and philosophy and ethics, under Friedrich 
Paulsen.
He returned to Chicago in 1898 to complete his
doctorate, producing as his dissertation, in 1899, 
Some Prolegomena to Social Psychology. This was 
the first presentation of social psychology to be 
firmly based on the principles of academic psy- 
chology. The concepts laid down here were amplified 
and revised in his later systematic works in this 
field, chiefly under the influence of Charles H. 
Cooley. Some fifteen years later, in 1914 and 1915, 
Ellwood studied in England under Leonard T. 
Hobhouse and Robert R. Marett, leaders in cul- 
tural sociology and anthropology, and was led 
thereby to place psychological sociology within the 
larger framework of a cultural interpretation of the 
social process. 

Ellwood traveled extensively in Europe in 1927 
and 1928 and in Latin America in 1937, thereby 
developing a deep interest in international rela- 
tions, which he interpreted from the standpoint of 
practical pacifism, holding that world peace is es- 
sential to any successful program of social amelio- 
ration. Ellwood’s travels, especially in Europe, led 
him to form many contacts with foreign sociolo- 
gists. He developed considerable prestige among 
them and served as president of the International 
Institute of Sociology in 1935/1936. 

In 1900 Ellwood accepted the newly established 
chair in sociology at the University of Missouri, and 
he remained there for three decades, turning out 
students who became distinguished sociologists, 
such as E. B. Reuter, Luther L. Bernard, and Her- 
bert Blumer. In 1930 he was called to Duke Uni- 
versity to establish a new department of sociology, 
and he remained there until his death. Among his 
better-known students at Duke were Paul E. Root, 
Guy V. Price, Austin L. Porterfield, and Leonard 
Broom. 

Ellwood’s most important work in the field of 
psychological sociology, and the one for which he 
will also be best remembered as a sociologist, is his 
Sociology in Its Psychological Aspects (1912), 
which, as a comprehensive psychological interpre- 
tation of human behavior, was far ahead of any 
other work in the field at this time. This synthesis
combined contributions from the evolutionary per- 
spective of Darwin; the biological approach of 
Lloyd Morgan and E. L. Thorndike; the neurology 
and comparative psychology of Jacques Loeb, as 
passed on to Ellwood by W. I. Thomas; Thomas’ 
own views of folk psychology; William James's 
pragmatic and dynamic instrumentalism, espe- 
cially his emphasis on the importance of habit; 
the functional psychology of J. R. Angell and John 
Dewey; the social psychology of G. H. Mead and
C. H. Cooley; and Lester Ward's contention that
psychic factors exert dominant control over human 
and social behavior. Later on, Ellwood’s work was 
far surpassed by that of specialists like L. L. Ber- 
nard. Ellwood’s Introduction to Social Psychology 
(1917) and Psychology of Human Society (1925a), 
while broader in perspective than his previous 
works, were less successful as psychological soci- 
ology because he tried to weave into them the cul- 
tural concepts that had begun to influence him 
deeply soon after he finished his masterpiece in 
1912. 

The cultural approach to the social process domi- 
nated Ellwood's work in formal sociology during 
the two decades before his death. He had received 
some suggestions here from W. I. Thomas during 
his student days at Chicago, but the main impetus 
to this shift in emphasis came from his work with 
Hobhouse and Marett. The cultural interpretation 
was set forth in his Cultural Evolution: A Study of 
Social Origin and Development (1927a). Primarily 
because of his contact with Marett, Ellwood was 
one of the first Americans to cut loose from the 
unilateral evolutionism of Herbert Spencer, Lewis 
Henry Morgan, Charles Letourneau, and their associates,
which had dominated the historical sociology
of the Ward-Giddings-Howard era. Ellwood
had a drastically revised and expanded version of 
his work ready for publication at the time of his 
death, and it is a serious loss to sociological litera- 
ture that it was never published. 

Ellwood’s comprehensive knowledge of the fields 
and methods of sociology was best and most con- 
structively exhibited in his Methods in Sociology: 
A Critical Study (1933). Ellwood cautioned in a 
reasonable manner against what he deemed to be 
danger signs in the sociological trends of the mid- 
1930s: the attempt to recast sociology in the terms 
and techniques of natural science; increasing frag- 
mentation; excessive emphasis on quantitative 
methods; and the repudiation of value judgments 
and of proper recognition of the ultimate role of 
social amelioration. 

Ellwood’s interest in practical sociology was re- 
flected in his 1910 textbook, the first textbook in 
sociology that appealed to college students. Over 
300,000 copies were marketed before it came to be 
supplanted in the mid-1930s by more substantial 
and sophisticated textbooks on social problems. 
Ellwood produced a number of books (see, for ex- 
ample, his 1915 book) that presented his general 
solutions to social problems, with increasing em- 
phasis on the responsibility of religion. He had 
planned to expand his introductory treatment of 
social ethics as a guide to social reconstruction into 


a comprehensive and systematic work on social 
ethics, but the strong impulses from the deep- 
seated religious experience of his younger days 
eventually led him to regard a modernized Christi- 
anity as the best stimulus and guide to needed 
social reform. Hence, he revamped his presentation 
and published The Reconstruction of Religion 
(1922), which became his most widely read book 
outside college classrooms. This was supplemented
by Christianity and Social Science (1923). These 
books gained for Ellwood a large and powerful fol- 
lowing among liberally inclined and social-reformist 
clergymen. Although Ellwood constantly stressed 
the fact that social change must be guided by scien- 
tific and rational principles, he attributed more sig- 
nificance and potency to religious views and values 
than any other leading American sociologist of his 
generation. He especially evaded any attempt to 
apply rational interpretations to sexual problems. 

In seeking to summarize Ellwood’s place in the 
development of American sociology, one may safely 
say that he will be remembered first and foremost 
for the fact that he executed far and away the most 
successful of the early attempts to link up scien- 
tific psychology with systematic sociology. Other 
sociologists, such as Tarde, Le Bon, Durkheim, 
Sighele, Giddings, Ross, and Cooley, had produced 
more striking interpretations of social behavior 
from the psychological point of view, but most of 
them selected some special psychological factor, 
such as invention, imitation, impression, sugges- 
tion, crowd psychological impulses, creativeness, 
sympathy, and the like, rather than having a com- 
prehensive psychological approach to the subject. 
Moreover, most of them, save for Durkheim and 
Cooley, had little technical knowledge of formal 
psychology and based their analysis and generaliza- 
tion on common-sense and rule-of-thumb psycho- 
logical concepts. 

Influenced by Comte, Ward, and Hobhouse, 
Ellwood shared with Small the mantle of Ward in
presenting social telesis, expertly planned social 
guidance, as the main role and justification of 
social science in general and of sociology in par- 
ticular. Ellwood assigned to modern religion a 
more important role in social telesis than any other 
leading sociologist of his time. 

In our era, which may have settled down to ac- 
cepting a pattern of “perpetual war for perpetual 
peace,” Ellwood’s views on international relations 
are especially wholesome and pertinent. While 
primarily concerned with social amelioration, Ell- 
wood, inspired by his reading of Kant, was con- 
vinced that there is no likelihood of establishing a 
social utopia or of perpetuating democratic society
unless world peace can be attained, and he believed
that this was possible only in connection
with a strong world organization. 


HARRY ELMER BARNES


[For the historical context of Ellwood's work, see SOCIAL
PROBLEMS and the biographies of DEWEY;
HOBHOUSE; MARETT; MEAD; MORGAN, C. LLOYD;
SCHMOLLER; SMALL; THOMAS; WILLCOX; for discussion
of the subsequent development of Ellwood's
ideas, see EVOLUTION; PACIFISM; and the biography
of BERNARD.]


Richard Theodore Ely (1854-1943), American
economist, probably exerted a greater influence 
upon American economics during its vital forma- 
tive period than any other individual. Although 
Ely’s writings were prolific, timely, and vigorous, 
he made a more lasting impact on his discipline 
through his achievements as a founder and organ- 
izer of scholarly associations, institutes, and re- 
search projects. 

Ely’s career began when the influence of Ger- 
man scholarship upon the United States was at its 
height. Born in Ripley, New York, of pious Congre- 
gationalist stock, he graduated from Columbia 
College in 1876 and spent the next four years in 
Germany, primarily at Heidelberg, where he was 
strongly influenced by Karl Knies, one of the lead- 
ing historical economists. From 1881 to 1892 Ely 
taught economics at the then new Johns Hopkins 
University and produced several books and in- 
numerable articles for scholarly journals, maga- 
zines, and newspapers, including pioneer studies of 
socialism, organized labor, and state taxation. He 
was an impulsive, outspoken, and contentious man, 
whose academic friends and foes alike complained 
of his emotionalism and carelessness. His eager 
participation in contemporary reform movements 
brought him both lavish praise and severe con- 
demnation. For example, his sympathetic study 
The Labor Movement in America (1886) provoked 
his Johns Hopkins colleague Simon Newcomb to 
declare him unfit to hold a university chair; and 
in 1894, when he was at the University of Wis- 
consin, Ely was publicly denounced for preaching 
socialism and encouraging strikes. In fact, however,
he was a moderate reformer, an optimist, and
a progressive who favored a mean between indi- 
vidualism and socialism. After a widely publicized 
“trial,” the regents of the university exonerated him 
and issued a classic declaration in favor of 
academic freedom. 

Ely figured prominently in the controversy be- 
tween the “old” and “new” schools of American 
economics during the 1880s. His main contribu- 
tion to the debate was a polemical monograph en- 
titled The Past and the Present of Political Economy 
(1884), in which he attacked the old school ortho- 
doxy based on Ricardo and Mill and advocated a 
closer link between economics and ethics and an 
increased use of a crudely inductive “look and see” 
method. However, he never rejected Ricardo in toto 
and specifically exempted Ricardian rent doctrine 
from his general criticism of Ricardian economics. 
The following year he and several other new school 
rebels founded the American Economic Association 
to propagate their ideas and promote the scientific 
study of economic problems. Ely became the as- 
sociation’s first secretary and its most active pro- 
ponent, but his sentimentalism and reforming zeal 
at first discouraged more conservative economists 
from participating. However, even before 1892, 
when Ely resigned his secretaryship and moved to 
the Middle West, the organization was turning 
from social reform to a more neutral scholarly ap- 
proach. Ely was president of the association from 
1900 to 1902. During the 1880s and 1890s, he was 
prominent in such religious reform organizations 
as the Christian Social Union and the American In- 
stitute for Christian Sociology and was sometimes 
regarded rather as a preacher than an economist. 
An Introduction to Political Economy (1889a), 
which Ely prepared for use in connection with his 
teaching at the Chautauqua Methodist summer 
school, sold 30,000 copies in a decade, and he sub- 
sequently published an even more successful aca- 
demic textbook, Outlines of Economics (1893), 
which eventually sold more than 350,000 copies. 

On his move to Wisconsin in 1892, Ely inaugu- 
rated a school of economics, political science, and 
history. The school, staffed by such scholars as 
Frederick J. Turner, Edward A. Ross, and John R. 
Commons, all of whom had been Ely’s pupils at 
Johns Hopkins, became internationally famous be- 
cause of its collaboration with the Wisconsin 
government, led by the Progressive politician 
Robert La Follette. Ely’s new school teaching con- 
stituted a direct link between German historical 
economics and twentieth-century institutional eco- 
nomics. His major contribution to this economic 
tradition was Property and Contract in Their Relations
to the Distribution of Wealth (1914), and
his interest in this field eventually led him to estab- 
lish in 1920 the Institute for Research in Land 
Economics and Public Utilities and the associated 
Journal of Land and Public Utility Economics 
(later called Land Economics). Also at Wisconsin 
Ely helped to launch the American Association for 
Labor Legislation (of which he became president)
and obtained private resources to finance Com- 
mons’ massive 11-volume Documentary History of 
American Industrial Society, 1910-1911.

Ely was neither an original theorist nor a seminal 
thinker; he was, however, a stimulating teacher 
who exerted a profoundly liberating influence on 
his students, many of whom became distinguished 
scholars or public figures. Until his death at the age 
of 89, he remained remarkably active, writing on a 
variety of topical issues and eventually editing 
more than a hundred volumes. During his later 
years he abandoned his earlier defense of the 
Ricardian rent doctrine and emphasized the paral- 
lels between land and capital; one indication of his 
increasing conservatism is the fact that his Insti- 
tute for Research was attacked in 1926 as a tool 
of the public utilities. In a sense, this was a sign 
of the change in the tone of American economics 
since Ely’s “trial” in 1894. 

A. W. Coats 


[For the historical context of Ely’s work, see ECONOMIC
THOUGHT, articles on THE HISTORICAL SCHOOL and
THE INSTITUTIONAL SCHOOL; and the biographies of
KNIES; MILL; RICARDO.]


EMOTION


It is virtually impossible to give a definition of 
emotion that all psychologists will accept, although 
there is fair agreement that such phenomena as 
fear, anger, joy, disgust, and affection should be 
classified as emotions. Nearly all theorists relate 
emotion in some way to motivation, and all assign 
important roles in emotion to the functioning of 
the autonomic nervous system. All, except the most 
rigidly behavioristic, classify emotions as affective 
phenomena. 

One difficulty in defining emotion is that emo- 
tional phenomena are exceedingly complex and 
must be observed and analyzed from different 
points of view. An emotional episode can be ob- 
served and studied as a conscious experience from 
the point of view of the experiencing individual. It 
can be analyzed from the point of view of a be- 
havioral scientist, a physiologist, a social scientist, 
or a psychiatrist. 


Emotion as a conscious experience 


Emotions and other affective processes. The term 
“emotion” is sometimes used to include the whole 
gamut of affective experiences, but this usage is 
too broad. Traditionally, the term applies to a single 
variety of the affective process. The term “affect,”
in psychiatry, designates a class of experiences 
including, among others, emotions, moods, and 
guilt feelings. 

The main varieties of affective processes can be 
classified as follows: (1) A simple feeling of pleas- 
antness is associated with such sensory stimula- 
tions as the odor of a perfume, a sweet taste, or a 
musical harmony, and a feeling of unpleasantness 
with a painful burn, a bitter taste, or a bad odor. 
(2) Pleasant organic feelings are associated with 
good health, buoyancy, or sexual satisfaction, and 
unpleasant feelings with hunger, thirst, fatigue, 
cramps, or headaches. (3) Interests are mild feel- 
ings of pleasantness associated with games, sports, 
plays, and other activities. Aversions are unpleas- 
ant affects associated with the rejection of foods, 
persons, and activities. (4) Sentiments are feelings 
associated with something valued or held sacred; 
they are based upon past experience and training. 
There are patriotic, moral, religious, aesthetic, and 
intellectual sentiments. The term “sentiment” also 
refers to a stable disposition to react with feeling 
to a class of objects or situations. (5) Emotions are 
acute affective disturbances arising from the psy- 
chological situation and expressing themselves in 
conscious experience, behavior, and physiological 
processes. (6) Moods are typically less intense and 
more chronic than emotions but are similar in 
affective tone and underlying dynamic mech- 
anisms. (7) Temperament designates the affective 
aspect of personality as a whole. Temperaments 
are said to be apathetic, moody, phlegmatic, cheer- 
ful, vivacious, depressed, sanguine, etc. Although 
temperaments are stable, they are known to change 
with age, health, and environmental conditions. 

In this classification, it will be noted that emo- 
tions and moods are distinguished, but they are 
closely related. An emotion may calm down into a 
mood or a mood build up into an emotion. Thus a 
fright may taper off into a mood of anxiety; anger 
may subside into a mood of hostility or resentment; 
laughter may become a mood of cheerfulness; 
weeping, a mood of sorrow or grief. Depression is a 
mood characterized by the decrease of an indi- 
vidual’s vitality, hopes, aspirations, and self-esteem. 
The mood may be a mild feeling of tiredness or 
sadness. In psychopathic states a depression may 
become a profound apathy with psychotic disregard 
for reality and with suicidal tendencies. Moods and 
emotions cannot be sharply distinguished; any line 
of distinction is arbitrary. 

“Emotion” is a substantive term; the adjective 
“emotional” would better characterize the process. 
Emotional activities are commonly contrasted with
rational, intellectual, or even mental processes, as
well as with motivational processes.

Emotions and cognition. Emotions are elicited 
by the awareness of a situation in which an indi- 
vidual finds himself. Magda B. Arnold (1960) 
argued that emotional behavior follows the intuitive 
appraisal of a situation. She defined emotion as a 
felt tendency to move toward anything intuitively 
appraised as good (beneficial) or away from any- 
thing intuitively appraised as bad (harmful). Thus, 
the feelings of a male enticing a female are emo- 
tional; the feelings of a man running a race for his 
life are emotional. 

The cognitive basis of emotion becomes clear 
when we consider conditions that elicit emotions 
in different societies. For example, among the 
Negroes of the Niger delta, it is a rule that if a 
woman gives birth to twins, she and the twins are 
put to death. If the mother is allowed to live, her 
life is little better than a living death, for she be- 
comes an outcast and must live the rest of her 
days in the forest. But among the Bankundo of the 
Congo valley, the mother of twins becomes an 
object of veneration. She is entitled to wear a 
special badge and her name is changed to “Mother- 
of-Twins.” Obviously, the type of affective arousal 
by such an event as the birth of twins depends 
upon the beliefs, attitudes, and practices of a 
group. 

Emotion and motivation. Although it is gen- 
erally agreed that emotions bear an important rela- 
tion to motivation, there is disagreement concern- 
ing the exact nature of the relationship. In general, 
there are two main views: First, it is claimed that 
emotion is a conscious experience associated with 
purposive, organized activity. Second, it is claimed 
that emotion is a disorganized experience due to 
conflict, frustration, thwarted expectation, tension, 
or the release of tension. 

Emotion as organized experience. The first view 
of emotion is illustrated in the writings of William 
McDougall (1908), who defined emotion as the 
consciously felt aspect of instinctive activity. He 
paired instincts with emotions: the instinct to flee 
from danger was paired with the emotion of fear; 
the instinct of pugnacity was paired with the emo- 
tion of anger; parental instinct, with tender emo- 
tion; sexual instinct, with lust; self-abasement, with 
the emotion of subjection; self-assertion, with the 
emotion of elation. McDougall regarded instinctive 
behavior as always purposive, goal-directed, and 
integrated, and emotion as the felt equivalent of 
instinctive behavior. 

Other psychologists, e.g., Carl R. Rogers, have 
emphasized that “emotion” facilitates goal-directed
behavior. Feelings of success, self-confidence, and
cheerfulness do, in fact, facilitate performance. A 
question can be raised, however, whether these 
feelings are properly classified as emotions. 

Emotion as disorganized experience. The sec- 
ond view—that emotion is a disorganized experi- 
ence dependent upon a dynamic disturbance—is 
widely held by psychiatrists, clinical psychologists, 
and others who are concerned with health, counsel- 
ing, and human adjustment. Thus, the psychologist 
Edouard Claparède argued that emotion occurs precisely
when adaptation is hindered for any reason
whatever: the man who can run away does not 
have the emotion of fear; fear occurs only when 
flight is impossible. Anger is experienced only when 
one cannot strike his enemy. The uselessness, or 
even the harmfulness, of emotion is known to 
everyone, said Claparède. 

There has been considerable controversy over 
these basic concepts. Robert W. Leeper (1948) re- 
gards disorganization as a concept inadequate to 
define emotion and prefers a “motivational” defini- 
tion. Paul T. Young (1949), in a reply to Leeper, 
pointed out that the problem is one of definition and 
emphasis. Some affective reactions are organized 
and organizing; some facilitate performance. But 
the term “emotion” has been used traditionally to 
define a special class of affective processes char- 
acterized by disturbance, upset, and disorganiza- 
tion. Affective disturbances, both pleasant and 
unpleasant, assuredly exist. If there were no dis- 
turbances, the term “emotion” could be dropped 
from the psychological vocabulary, because exist- 
ing motivational and affective terms and concepts 
are fully adequate for the descriptive analysis of 
organized, adaptive activity. 


Emotional behavior 


The radical behaviorist does not recognize con- 
scious feelings as such but restricts the science of 
psychology to the phenomena of behavior and 
associated bodily processes that can be objectively 
observed. It was John B. Watson, the founder of 
American behaviorism, who defined emotion as 
“an hereditary pattern-reaction involving profound 
changes of the bodily mechanism as a whole, but 
particularly of the visceral and glandular systems” 
(1919, p. 165). He described the stimulating situa- 
tions and the pattern-reactions for three basic emo- 
tions in the infant: fear, rage, and love. 

The pattern-reaction theory of emotion has been 
popular with physiologists and physiological psychologists
for obvious reasons: The patterns of reaction
appear reflexively under specified conditions
of stimulation. They resemble simple reflexes but
are more complex. They are well integrated. The
emotional patterns can be conditioned and extin- 
guished. The neural mechanisms that regulate 
many of the emotional patterns have been de- 
scribed and localized within subcortical regions of 
the brain. 

Among the patterns of reaction that have been 
described and analyzed are the following: There is 
the rage pattern in cats, dogs, and other animals, 
as well as a similar pattern called “sham rage” in 
decorticate animals. There are patterns of escape, 
including impulses to run or fly or dart away when 
startled and patterns of defense that differ from 
species to species. There are male and female pat- 
terns of sexual response, the startle pattern in man 
and other animals, and the disgust pattern. There 
are internal patterns of visceral and glandular re- 
sponse that differ in hostility and fear. There are 
human patterns of facial expression—smiling, 
laughing, crying, and weeping. There is no doubt 
about the objective existence and functional importance
of these patterns of reaction.

Critique of the pattern-reaction theory. Despite 
the obvious advantages of a pattern-reaction theory 
of emotion, there are certain difficulties. The theory 
does not distinguish between emotional and nonemotional
patterns. Coughing, sneezing, hiccoughing,
sucking, swallowing, and blinking are well-integrated
reflexive patterns that are frequently
accompanied by changes regulated through the 
autonomic nervous system. No one regards these 
reflexes as emotions. Again, the startle pattern, de- 
scribed in detail by Carney Landis and William A. 
Hunt, was regarded by them as a general skeletal 
reflex rather than as a true emotional pattern, be- 
cause startle is completed in the fraction of a sec- 
ond before visceral responses can get under way. 

Further, it is difficult to specify the grouping of 
elements that constitute an emotional pattern. For 
example, Watson claimed that “fear” is an innate 
emotional pattern in infants; but his description of 
“fear” included patterns known to be more ele- 
mentary: crying, catching the breath, the startle 
response, possibly the Moro reflex, and an impulse 
to crawl away. Watson’s “fear” is thus a complex of 
more elementary patterns. 

The pattern-reaction concept disregards the acute 
affective disorganization that is characteristic of 
emotion. And, further, the observed patterns do not 
correspond to the “emotions” of everyday life. What 
patterns, for example, correspond to mother love, 
pride, embarrassment? It would be wiser, we be- 
lieve, to describe the patterns of reaction that occur 
in or during emotion for their own sake than to 
define emotion as a pattern of reaction. 
The expressions of emotion. The objective expressions
of emotion have been observed and
studied since the earliest times. Charles Darwin 
(1872) made detailed observations on emotional 
behavior in man and other animals. After studying 
the data, he formulated three principles of emo- 
tional expression. 

First, Darwin believed that many expressions of 
emotion are reduced segments of biologically ser- 
viceable acts or acts that once were serviceable in 
an earlier stage of evolution. Thus an angry man 
raises the lips involuntarily and shows the canine 
teeth although he does not intend to bite. The com- 
plete expression would be biting and hostile attack. 

When a dog is about to attack, it approaches its 
enemy with a stiff gait and tail erect; the head is 
slightly raised; the hair, especially along the neck 
and back, bristles; the ears are pricked up and 
directed forward; the eyes are wide open and have 
a fixed stare; the animal shows its teeth and growls. 
No one is likely to misinterpret the significance of 
this emotional behavior. Even a small part of the 
total reaction, e.g., showing the teeth and growling, 
expresses hostility. 

Second, Darwin pointed out that some emotional 
expressions are directly antithetical to biologically 
serviceable behavior. To illustrate, suppose the 
hostile dog suddenly perceives that a man it is 
approaching is not an enemy but its beloved master. 
The bearing of the animal instantly changes. In- 
stead of walking upright with a stiff gait, the body 
sinks downward or even crouches; the animal’s 
movements are flexuous and supple; its tail, instead 
of being stiff and upright, is lowered and wagging 
from side to side; the hair is smooth; the ears are 
depressed and relaxed backwards; its lips hang 
loosely, and it salivates; the eyelids become elon- 
gated, and the eyes no longer appear round and 
staring. The behavior of the friendly dog is directly 
antithetical to that of the hostile animal. 

Third, Darwin recognized that the above two 
principles do not explain all expressions of emotion. 
He formulated a third principle: Some emotional 
expressions can be explained only in terms of the 
constitution of the nervous system and associated 
bodily mechanisms. For example, the writhing of 
an animal during the birth of young can be ex- 
plained only in terms of bodily constitution. The 
excessive activity is neither biologically serviceable 
nor antithetical to a serviceable act. 

Emotional and social expressions. In an experi- 
ment on the facial expressions of emotion, Carney 
Landis (1924) drew an important distinction be- 
tween emotional and social expressions. The emo- 
tional expressions, he said, are involuntary and 
reflexive. They involve changes in the skeletal musculature,
glands, and smooth muscles. They are
regulated by neural mechanisms that include proc- 
esses within the autonomic nervous system. The 
social expressions are voluntary and learned. 

Otto Klineberg (1938) studied expressive be- 
havior as recorded in the Chinese novel and drama. 
He found that many phrases in the Chinese lan- 
guage describe involuntary changes that anyone 
will recognize as emotional. For example, fear is 
indicated by such expressions as “every one of his 
hairs stood on end, and the pimples came out on 
the skin all over his body” and “they were so 
frightened that their waters and wastes burst out 
of them.” The meaning of other expressions, how- 
ever, would not be recognized by persons in West- 
ern society. For example, the phrase “they stretched 
out their tongues” indicates surprise; “he made his 
two eyes round and stared at him” means anger; 
“he scratched his ears and cheeks” (in the novel 
Dream of the Red Chamber) means happiness; “he 
clapped his hands” is likely to mean worry or 
disappointment. 

These conventional expressions are culture- 
bound. They serve to communicate feelings within 
a group as spoken words convey meanings. We 
have all learned, of course, to express joy, sorrow, 
concern, amusement, and other feelings sympa- 
thetically, as actors do on the stage. The voluntary 
and conventional expressions are not true reflexive 
patterns of emotion. [See EXPRESSIVE BEHAVIOR.]


The physiology of emotion 


Walter B. Cannon (1915) supplemented and ex- 
tended the Darwinian doctrine of biological utility 
by carrying the principle of adaptation to the in- 
terior of the body. In a series of experiments, he 
showed how the bodily changes in pain, fear, and 
rage are serviceable and adaptive in a struggle for
existence. During a biological crisis, widespread 
organic changes mobilize the energy reserves of 
the body for a prolonged fight or flight. 

During an emergency there is a diffuse discharge 
across the sympathetic nervous network and in- 
creased secretion of the adrenal glands. This neural 
and glandular discharge produces widespread bodily
changes: (1) cessation of processes in the alimentary
canal, thus freeing the energy supply for
muscles and brain; (2) shifting of blood from
abdominal organs to the organs immediately essential
to muscular exertion; (3) increased vigor of
contraction of the heart; (4) discharge of extra
blood corpuscles from the spleen, thus facilitating
the process of oxygenation; (5) dilation of the
bronchioles, along with deeper respiration; (6)
quick abolition of the effects of muscular fatigue
through adrenal discharge; and (7) mobilization 
of sugar in the circulation. All of these changes, 
Cannon claimed, are directly serviceable in making 
the organism more effective in the violent display 
of energy that fear, rage, or pain may involve. 

Critique of the emergency theory. Physiological 
studies (for example, Arnold 1960) have pointed 
to weaknesses in Cannon’s emergency theory of 
emotion. Critics agree on the following points: 
(1) Emotional processes are a function of the 
entire autonomic nervous system, not of the sym- 
pathetic division alone. The sympathetic and para- 
sympathetic divisions of the autonomic system 
function simultaneously and reciprocally in fear, 
rage, “sham rage,” pain, general excitement, and 
sexual and other emotions, producing patterns of 
visceral response that differ from one emotional 
state to another. (2) Whereas Cannon thought
that epinephrine was the main hormone in the
defensive fight-flight reactions, it is now known
that two chemical factors, norepinephrine and
epinephrine, are involved in this reaction. These
two hormones are secreted
independently and have different physiological 
effects: norepinephrine appears to be concerned 
with hostile states, and epinephrine with fear and 
anxiety. Hormones from the pancreas and pituitary 
body also are involved in emotional reactions. 
(3) The autonomic nervous system is on continu- 
ous duty 24 hours a day, and the bodily changes 
produced during a biological crisis correspond to 
departures from normal conditions. The autonomic 
system has two main functions: first, it prepares 
the body to respond defensively to danger; second, 
it plays its major role in maintaining homeostasis. 
(4) Cannon emphasized the utility of bodily 
changes in emotion, but many changes are disrup- 
tive, disturbing, and disintegrating rather than an 
aid in adaptive behavior. 

Cannon’s work should be brought into relation 
with that of Hans Selye (1956) upon the general
adaptation syndrome and the adjustments of the 
organism to stress [see STRESS]. 

The neural basis of emotion. The phylogenet- 
ically older structures, collectively, and the limbic 
system and hypothalamus, in particular, are actively 
involved in pleasurable and painful experiences, in 
emotional behavior associated with fight, flight, 
food, and sex. We are just beginning to understand 
the central dynamics of the emotional reactions. 

The hypothalamus is of critical importance in 
the regulation of emotional behavior. It has long 
been recognized as a center of endocrine and
autonomic-nervous-system control. It forms a critical
juncture in the circular feedback system that regulates
neural impulses concerned with emotions
and neuroendocrine activity. The hypothalamus 
influences and is influenced by the reticular acti- 
vating system, the limbic system, secretions of the 
pituitary, and other endocrine glands, as well as by 
the neocortex. 

The explorations of the reticular activating system 
by Magoun and associates (Magoun 1958) have 
altered neurophysiological thinking about motiva- 
tion and emotion. It is now known that every 
sensory stimulation has two kinds of effects upon 
the cerebral cortex: (1) impulses discharged 
through thalamic nuclei are relayed to the cortex, 
where they provide sensory information; (2) sen- 
sory stimulation also sends impulses through 
collaterals into the reticular activating system. 
These impulses are conducted over multisynaptic 
pathways to the cortex, where they have a non- 
specific activating influence. The degree of activa- 
tion varies with intensity of stimulation. In emo- 
tional excitement there is a high level of cortical 
activation, as Donald B. Lindsley has shown in 
studies with the electroencephalograph. 

The limbic system has been called the “visceral 
brain” by MacLean (1949), who considers that it
mediates visceral needs rather than ideational proc- 
esses; it is concerned with feelings rather than 
with symbolic activities. The frontotemporal limbic 
activities may be concerned with self-preservative 
behavior; the more posterior regions, with sexual 
behavior and sexual hormones. 

The limbic system is involved in positive and 
negative affective arousals. James Olds (1955) im- 
planted bipolar needle electrodes within the limbic 
system of rats’ brains and demonstrated that elec- 
trical stimulation of subcortical points could be 
either rewarding or punishing. When stimulated 
within the septal area, the rats acted as if they 
were pleased; but when stimulated within the 
medial lemniscus, the animals acted as if the stim- 
ulation had hurt them and had been unpleasant. 
Clearly the neural locus of affective arousals, physi- 
ological drives, and emotions is being penetrated. 
[See NERVOUS SYSTEM, article on BRAIN STIMULATION.]


The dynamics of emotion 


A dispositional approach. According to David 
Rapaport (1942), a good deal of confusion con- 
cerning the definition of emotion has been the 
result of a failure of investigators to distinguish 
between the phenomena of emotion and the under- 
lying dynamic mechanisms. The phenomena of 
emotion are complex but can be analyzed from 
several points of view. The phenomena include
(1) the consciously experienced affect, (2) the 
emotional behavior, and (3) the physiological 
processes occurring during emotional upheavals. 
The dynamic mechanisms, in contrast, are always 
inferred or assumed. 

Psychologists assume persistent dispositions that 
were originally formed by emotional experiences. 
For example, an intense fright may produce a 
phobia for high places, enclosures, blood, or some 
other thing. The grounds of a phobia may seem 
unreasonable to the subject, but nevertheless the 
fear persists. Children normally develop fears of 
thunder, darkness, death, insects, ghosts, and other 
things on the basis of some fright. The emotion 
clearly leaves a disposition to fear [see PHOBIAS]. 

Attitudes and motives are formed on the basis of 
emotional experiences. The story is told that when 
Abraham Lincoln saw slaves being sold on the 
New Orleans market, he was so disturbed emotion- 
ally that he resolved: “If ever I have the chance, 
I will hit that thing hard.” The emotional disturb- 
ance left upon him an indelible imprint that later 
may have influenced his decisions and actions. 

A dispositional approach to the study of emotion 
implies a temporal dimension. Whenever we speak 
of emotional development, emotional maturity, or 
emotional stability, we imply a persisting individual 
with persisting dispositions. 

Dispositions include memory traces, attitudes, 
beliefs, specific motives, expectancies, hopes, and 
desires, as well as conflicts and unsolved problems. 
All of these are residues from the past. Among the 
dynamic conditions that produce emotional upsets, 
moods of anxiety, depression, and the like are con- 
flicts, frustrations, thwarted expectations, successes, 
failures, tensions and the release of tensions, pain- 
ful stimulations, and other factors of stress. A 
dispositional approach to the complex phenomena 
of emotion can bring unity out of the diverse data. 

Emotion and mental health. In an address to 
the American Neurological Association in New 
York City in 1876, George M. Beard maintained 
that disease might appear and disappear without 
the influence of any other agent than some form 
of emotion. Fear, terror, anxiety, grief, anger, won- 
der, and a definite expectation he regarded as 
mental conditions likely to produce disease [see 
Lewis 1959, p. 8]. Beard argued that certain emo- 
tional states could neutralize therapeutics and 
increase the effects of drugs. At the time, his ideas 
were new and startling; later the ideas were recog- 
nized in a movement known as psychosomatic 
medicine. 

Today, it is widely accepted that persistent emotional
disturbances constitute an important factor in
certain disorders, such as peptic ulcer, essential hy- 
pertension, rheumatoid arthritis, ulcerative colitis, 
bronchial asthma, hyperthyroidism, and neuroder- 
matitis. The health of a patient is strongly influ- 
enced by stressful conditions of living that produce 
emotional traumata, such as financial failure, be- 
reavement, insult, injury, unrequited love, threat- 
ened divorce, and loss of self-esteem. These several 
factors are well recognized by clinical psychologists 
and psychiatrists. 

Emotion plays an important role in psychother- 
apy. The psychiatric examination is concerned 
with the whole personality but places special em- 
phasis upon emotions as related to thought proc- 
esses. During an interview the psychiatrist observes 
emotional reactions as the patient talks and seeks 
to elicit thoughts that accompany the emotional 
reactions. A dominant emotion is associated with 
something important to the patient, something that 
affects him deeply. Why it is important can be 
learned only by getting the patient to tell his 
thoughts. 

Psychoanalysts have long recognized the impor- 
tance of affects, especially repressed emotional 
experiences, in the etiology and treatment of neu- 
roses. Some repressed hostility or an unsolved emo- 
tional conflict, possibly unknown to the patient, 
may underlie neurotic symptoms. Free association, 
aided recall, and the interpretation of dreams, 
along with free emotional expression, may reveal 
unconscious motivations and alleviate the mental 
disorder. 

According to Robert W. White (1948), psycho- 
therapy is not an intellectual process. It has wrongly 
been said that the way to bring about readjustment 
is to help an individual understand his problems. 
Awareness of motivations and frustrations on the 
cognitive level is helpful but not enough to effect a 
cure. Psychotherapy operates in the sphere of emo- 
tion. The main aim of psychotherapy is to provide 
corrective emotional experience by relaxing the 
subject’s defenses and permitting him to reappraise 
his anxieties. In the major methods of psychotherapy,
the subject is encouraged to feel, to express
his emotions.

It should be pointed out that emotional upsets 
are only one manifestation of neurosis. There are
other aspects, such as dissociation, delusion, amnesia,
tics, and functional paralyses. Emotional
upsets appear also in psychoses and disorders that
have a definite organic basis as well as in normal
everyday living.


In the light of the above discussion, an emotion 
may be defined as an acute affective disturbance
originating within the psychological situation and
expressing itself in conscious experience (affect), 
emotional behavior, and physiological processes. 
The dynamic determinants of emotion include con- 
flict, frustration, thwarted (or satisfied) expecta- 
tion, tension or its release, painful stimulation, 
threat, insult, and similar conditions of stress and 
relief. Clinically viewed, an emotion is a persisting 
dynamic disturbance within the individual that 
may influence his health, happiness, and well-being. 


PAUL THOMAS YOUNG 


[Other relevant material may be found in AFFECTION; 
AGGRESSION; CONFLICT; DRIVES; MOTIVATION; SYMPATHY
AND EMPATHY.]
The term “empire” has normally been used to
designate a political system encompassing wide, 
relatively highly centralized territories, in which 
the center, as embodied both in the person of the 
emperor and in the central political institutions, 
constituted an autonomous entity. Further, al- 
though empires have usually been based on tradi- 
tional legitimation, they have often embraced some 
wider, potentially universal political and cultural 
orientation that went beyond that of any of their 
component parts. Such “imperial” designation has 
been attached to a great variety of sociopolitical 
systems, from relatively ephemeral frameworks like 
that of the Mongol empires of Genghis Khan and 
Kublai Khan to the various more modern “colonial” 
empires, which did not usually evince territorial 
continuity. 

However, the fullest and most succinct develop- 
ment of the major characteristics of empires as dis- 
tinct political systems can be found in what may 
be called the “historical bureaucratic empires.” In 
order to explain the meaning of this term, a num- 
ber of further distinctions need to be made (for a 
fuller treatment, see Eisenstadt 1963). 

Types of imperial systems. The connotation of 
the term “imperial,” as it evolved within political 
and social consciousness in the history of mankind, 
evinced throughout its development some common 
characteristics. At the same time, it also changed 
greatly between historical, premodern, and modern 
times. Its basic connotation, as manifest in the 
Latin imperium, is the existence of relatively con- 
centrated authority and rule, focused in a relatively 
strong center and diffusing its authority over broad 
territorial contours. In premodern times this desig- 
nation commonly referred to an authority that 
extended over territorially contiguous units so that 
the latter attained some symbols of common politi- 
cal identity. This authority and the concomitant 
political identity certainly did not connote national 
sovereignty, but rather the existence of a center of 
authority that was accepted and hallowed beyond
the confines of narrow territorial, kinship, or city 
limits. 

The authority that was enacted in these systems 
consisted of a special mixture of Weber’s three
types: the charismatic, the traditional, and the
legal-rational. Such authority was usually rooted
in a charismatic personality or group whose major 
orientations were traditional in the sense of uphold- 
ing a “given” order hallowed by tradition, but not, 
as we shall see, in the sense of accepting the 
traditional organizational confines of this order. At 
the same time, it often contained important ele- 
ments of a more legal-rational type of authority. 

The connotation of imperial has greatly changed 
in modern times, when it has come to denote 
rather a type of political system through which 
one political community or nation has extended 
its rule over other political units, mostly territorially 
noncontiguous ones, without fully incorporating 
them into a framework of common political sym- 
bols and identity. 

In this article we shall deal only with the first 
type of imperial system, focusing our analysis on 
those historical systems within which the basic 
characteristics of this type of system became most 
fully developed and institutionalized. Thus we shall 
not deal with those “conquest” empires, such as 
those of the Mongols, where the conquering rulers 
attempted to establish such authority but in which, 
for a variety of reasons, no such common symbols 
of identity became accepted and in which the con- 
querors retained their separate ethnic and political 
identities. Nor shall we deal with the colonial em- 
pires of the nineteenth and twentieth centuries, 
although this does not preclude examination of 
their origins. 


Formation of bureaucratic empires 


Examples of centralized bureaucratic empires 
are to be found throughout history; the principal 
ones, which comprise the major historical societies, 
are as follows: 


(a) The ancient empires, especially the Egyp- 
tian, Babylonian, and, possibly, the Inca and Aztec. 

(b) The Chinese Empire from the Han period 
to the Ch’ing. 

(c) The various Iranian empires, especially the 
Sassanid and, to a smaller extent, the Parthian and 
Achaemenid. 

(d) The Roman Empire and the various Hel- 
lenistic empires. 

(e) The Byzantine Empire. 

(f) Several ancient Hindu states (especially the 
Maurya and Gupta) and the Mogul empires. 

(g) The Arab Caliphate (especially from the 
reign of the Abbassides and Fatimides), the Arab 
Muslim states in the Mediterranean and Iran, and 
the Ottoman Empire. 

(h) European states during the age of absolut- 
ism, and to some extent their initial colonial em- 
pires, especially insofar as they were built with the 
idea of the direct extension of the patrimony and 
its central authority and not as merchant colonies 
or purely colonial settlements of small groups. Of 
these, the Spanish American Empire is probably 
the nearest to the ideal type of a historical bureau- 
cratic empire. 


The majority of these empires developed from 
one of the following types of political systems: 
(a) from patrimonial empires such as the Egyptian 
and the Sassanid empires; (b) from dualistic 
nomad-sedentary empires, necessarily sharing 
many characteristics in common with the patri- 
monial type; (c) from feudal systems, as did the 
European absolutist states; and (d) from city- 
states, as did the Roman and Hellenistic empires. 

Despite the great variety of historical and cul- 
tural settings, some common features in the first 
stages of the establishment of such polities may be 
found. The initiative for the establishment of these 
polities came, in all cases, from the rulers— 
emperors, kings, or some members of a patrician 
ruling elite (like the more active and dynamic 
element of the patrician ruling elite in republican 
Rome). In most cases these rulers either came from 
established patrician, patrimonial, tribal, or feudal 
families, or they were usurpers, coming from lower- 
class families, who attempted to establish new dy- 
nasties or to conquer new territories. In some cases 
they were conquerors who attempted to establish 
their rule over various territories. 

In most cases such rulers arose in periods of 
unrest, turmoil, acute strife, or dismemberment 
of the existing political system. Usually their aim 
was the re-establishment of peace and order. They 
did not, however, attempt to restore the old order 
in its entirety, although for propagandist and op- 
portunistic reasons they sometimes upheld such 
restoration as a political ideology or slogan. They 
always had some vision of the distinctly political 
goals of a unified polity. They aimed to establish a 
more centralized, unified polity in which they could 
monopolize political decision making and the set- 
ting of political goals, without being bound by 
various traditional aristocratic, tribal, or patrician 
groups. Even when they were conquerors, as in the 
case of the Roman, Islamic, and Spanish American 
empires, they also had some such vision and at- 
tempted to transmit it to at least part of the con- 
quered population. 

Of crucial importance in shaping the activities 
of these rulers was the geopolitical situation of the 
polity that they tried to organize—as, for instance, 
the specific geopolitical situation of Byzantium at 
the crossroads of Europe and Asia or the vast 
hydraulic arrangement of China and its special 
relation with the steppe frontiers. These geopoliti- 
cal factors indicated, in a sense, the nature of the 
specific international system to which these em- 
pires had to respond, as well as the range of prob- 
lems to which the rulers were willing and able to 
address themselves. 

The aims of the rulers were very often oriented 
against, and encountered the opposition of, various 
social and political groups. However great the tur- 
moil, unrest, and internal strife may have been, 
there were always some groups that either bene- 
fited from this state of affairs—or hoped to do so— 
or aimed to re-establish the old order, in which 
they held positions of power and influence. These 
hostile elements, usually consisting of some aris- 
tocratic groups or some of the more traditional 
urban and cultural elites, usually felt themselves 
menaced by the new aims and activities of the 
rulers. They felt that their position was threatened 
by the trend toward political centralization, and 
they were therefore unwilling to help in the imple- 
mentation of this trend. Accordingly, they often 
attempted to deny resources and support to the 
rulers, plotting and working against them either 
in open political warfare or by infiltration and 
intrigue. 

The rulers had to find allies, whether passive or 
active, in order to be able to implement their aims 
in the face of these various aristocratic forces. Thus 
they had to forge various instruments of power and 
policy with which to mobilize the various resources 
needed by them—whether economic resources, 
manpower, or political support. Naturally, the rulers 
tried to find such allies among the groups and strata 
whose interests were opposed to those of the more 
traditional and aristocratic elements and who thus 
could benefit by the weakening of the latter and 
by the establishment of a more unified polity. The 
rulers’ allies were therefore of two principal kinds: 
the more active (mostly urban) economic, cultural, 
and professional groups who, whether by origin or 
by their social interests and orientations, were op- 
posed to the aristocratic-traditional groups; and 
the wider and politically and socially more passive 
strata, especially the peasants and (to a smaller 
extent) the urban lower classes. 

It was from these various groups and strata that 
the rulers hoped to mobilize the various resources 
they needed. But in order to do this they also had 
to forge some instruments of political and admin- 
istrative action on which they could rely and 
through which they could provide various services 
to their potential allies. Most rulers were able to 
form an entourage by recruiting from established 
administrative and political bodies; however, even 
when such organs of administration were available, 
they had to be adapted to the rulers’ particular 
purpose. Insofar as the existing personnel were 
related to the aristocratic forces, the rulers had in 
many cases to find replacements. Nor was this 
enough; loyalty to the ruler had to be secured 
against bids from opposing forces. Moreover, the 
rulers had to make sure that these administrative 
bodies would be effectively organized for their 
tasks. To this end, the rulers attempted to concen- 
trate the nominations to these positions in their 
own hands. They tried, as far as possible, to ap- 
point persons who were loyal and who had the 
necessary administrative qualifications. The rulers 
also attempted to control the administrative budget, 
making sure that it was adequate for official sala- 
ries as well as other running expenses. This en- 
abled them to lay emphasis on the dependent 
position of the officials: they were to be “servants,” 
either of the individual ruler or of the polity that 
he wanted to establish. Accordingly, representation 
by officials of group or class interests was hence- 
forth to be eliminated. 

Thus, in general, the rulers attempted to make 
these administrative bodies, as far as possible, inde- 
pendent of the more traditional and aristocratic 
strata and groups and to give them some power 
and prestige vis-à-vis these strata. Here the rulers 
had, necessarily, to allow these bodies some meas- 
ure of autonomy and independence and had to 
enable them to perform some services to the popu- 
lation. True, the rulers very often wanted to use 
these administrative bodies only, or mainly, for 
exploitative purposes. But if the rulers wanted to 
perpetuate their rule, they had to allow these serv- 
ices to take into account at least some of the needs 
of some of the social groups, if only to provide 
them with peace, security, and certain minimal 
services. 

Thus, the development of an imperial system 
(in the sense of a historical bureaucratic empire) 
was dependent on two conditions. One condition 
was the existence, within the preceding social 
structure, of a relatively high level of societal dif- 
ferentiation, which limited the place of basic ascrip- 
tive units—such as family, kinship, or traditional 
status groups—in the social division of labor and 
created many forces cutting across them. On the 
one hand, this differentiation created problems of 
integration that called for new solutions, while on 
the other hand it provided the resources needed for 
new organizations that could attempt to deal with 
some of these problems. The second condition was 
the development of a new type of political leader 
and elite with wider aims and perceptions of politi- 
cal authority and the ability to serve as a focus of 
the new imperial authority and symbols and to 
articulate new, more differentiated, and broader 
political goals. 

The existence of only one of these conditions 
was not sufficient for the institutionalization of an 
imperial system. Thus, for instance, in the city- 
states of Greece, as compared with republican 
Rome (within which there was a similar level of 
differentiation), there did not develop such an in- 
ternally new leadership. Contrariwise, in the Caro- 
lingian and Mongol states (or empires), while 
there did develop rulers with such new styles of 
leadership, there did not exist an appropriate level 
of differentiation; thus the imperial system could 
not become institutionalized, and these polities re- 
mained at the level of loosely integrated “conquest” 
empires, in which the different regions or groups 
(conquerors and conquered) were not integrated 
into a polity bound by common symbols of identity. 

Although the general existence of a certain level 
of differentiation was a necessary precondition of 
the institutionalization of the political systems of 
such empires, their concrete structures could range 
from that of the city-state or feudal system to that 
of the patrimonial empire. Similarly, the social ori- 
gins, composition, and internal cohesion of the 
new ruling groups could vary greatly, and the 
combination of these variations could greatly in- 
fluence the concrete contours of the developing 
imperial systems. This tendency was manifest, 
first, in the tendency toward political centraliza- 
tion; second, in the development by the rulers of 
autonomous political goals; and third, in the rela- 
tively high extent of organizational autonomy of 
executive and administrative activities. 

But the extent of differentiation of political activ- 
ities, organization, and goals was, in these political 
systems, still limited by several important factors. 
First, the legitimation of the rulers was, in these 
regimes, usually couched in basically traditional-religious 
terms, even if the rulers tended to stress 
their own ultimate monopoly of such traditional 
values and tried to deny that other (traditional) 
groups could also share in this monopoly. Second, 
the basic political role of the subject was not fully 
distinguished from other basic societal roles, such 
as, for instance, membership in local communities; 
it was often embedded in such groups, and the 
citizen or subject did not exercise any direct politi- 
cal rights through a system of voting or franchise. 
Third, many traditional ascriptive units, such as 
aristocratic lineages or territorial communities, still 
performed many crucial political functions and 
continued to serve as units of political representa- 
tion. As a consequence, the scope of political activ- 
ity and participation was far narrower than in most 
modern and contemporary political systems. 

The existence of both traditional and differenti- 
ated political orientations, activities, and organiza- 
tions created within these empires a complex inter- 
relation between the political institutions and other 
parts of the social structure. The rulers were in 
need of both traditional and more complex, differ- 
entiated political support and were dependent on 
both. The rulers’ traditional dependence on other 
parts of the social structure was manifest in their 
need to uphold their traditional legitimation and 
the traditional, unconditional political attitudes 
and identifications of many groups. On the other 
hand, however, the rulers’ tendency to political 
independence and autonomy made them dependent 
on types of resources that were not available 
through various traditional ascriptive commitments 
and relations. In order to implement their various 
political goals as they pleased, the rulers were in 
need of more flexible support and resources, which 
would not be embedded in traditional ascriptive 
groups or committed for more or less fixed goals 
(Eisenstadt 1963, chapter 6; Altheim 1955). 

Among these flexible resources, the most impor- 
tant were economic and political ones. In the eco- 
nomic field, the rulers needed manpower and goods 
that could be available not through the fixed com- 
mitments of ascriptive kinship and status groups 
but that could be allocated directly. Among such 
economic resources, the most important were man- 
power (military and administrative) for services 
and for relatively free and flexible occupational 
choices and various goods and commodities for 
direct spending or for payment of services. 

In principle, such resources could have been 
the same as those used within the various ascrip- 
tive groups and in their fixed interrelations. But 
the very emphasis on their flexibility entailed the 


which they could flow continuously. Similarly, it 
was very important to maintain conditions and 
frameworks in which possibilities of relatively free 
occupational choices and avenues of mobility could 
be realized. 

A similar situation developed in the field of 
political support and organization. The rulers were 
in need of commitments and loyalties that could 
be made available without the restrictions of such 
ascriptive groups, and this necessarily entailed the 
organization of new types of political organizations 
and leadership that could mobilize such support. 
Parallel needs could also be found in the cultural, 
social, and religious fields. 

The political demands made on the rulers by 
the various groups in the society were of both the 
traditional and the more complex, differentiated 
types. On the one hand, the rulers were expected 
to uphold traditional ascriptive rights and benefits; 
on the other, they were faced with demands for 
participation in the formulation of the political 
balance of power—or even in the process of legiti- 
mating their own authority. Thus the authority of 
the rulers, “traditional” though it may have been, 
was no longer automatic; merely to raise the ques- 
tion of the rulers’ accountability was to deny them 
fixed support. 

These different types of political activities and 
orientations did not coexist in these political sys- 
tems in separate “compartments,” bound together 
only in some loose and unstable way. They were 
bound together within the same institutions, and 
the continuity of each type of political activity was 
dependent on the existence of both types of politi- 
cal orientation. Because of this, the activities of 
the rulers were, paradoxically, oriented to main- 
taining basic traditional legitimation through ma- 
nipulation not only of traditional but also of non- 
traditional support and to the mobilization also of 
traditional resources for politically autonomous 
goals and through nontraditional channels. 

Hence, the political system of these empires 
could subsist only insofar as it was possible to 
maintain simultaneously and continuously, within 
the framework of the same political institutions, 
both the traditional and the more differentiated 
levels of legitimation, support, and political organi- 
zation. The continuity of these systems hinged on 
the continuous existence of a certain balance be- 
tween political activity and involvement on the 
part of some segments of the population and of 
political noninvolvement or apathy toward central 
political issues by most segments of the population. 
The limited political involvement could assure some 
of the more flexible political support, while the 
apathy, in its turn, was necessary for maintenance 
of the traditional legitimation of the rulers. 


Contradiction and conflict 


It was the interplay between these varied orien- 
tations of the rulers—their dependence on both 
traditional and differentiated resources—that 
greatly influenced their concrete policies and gave 
rise to some of the basic contradictions that devel- 
oped within those policies. The rulers of these 
empires tended to develop three major types of 
basic political orientations. First, they were inter- 
ested in the limited promotion of free resources 
and in freeing them from commitments to tradi- 
tional aristocratic groups. Second, the rulers were 
interested in controlling these resources and com- 
mitting them to their own use. Third, the rulers 
tended also to engage in various goals—military 
expansions, for example—which alone could ex- 
haust many of the available free resources. Be- 
tween these various tendencies of the rulers, serious 
contradictions easily developed. These contradic- 
tions, although not always consciously grasped by 
the rulers, were nevertheless implicit in their struc- 
tural position, in the problems and exigencies with 
which they dealt, and in the concrete policies they 
employed in order to solve their problems. 

These contradictions were exhibited mainly in 
the sphere of legitimation and stratification. As we 
have seen, the rulers often attempted to limit the 
aristocracy’s power and to create new status groups. 
But these attempts faced several obstacles. Regard- 
less of the extent of the monarchs’ independent 
activities in this field, the number of new titles 
created, and the degree of encouragement of new 
strata, the symbols of status used by the rulers 
were usually very similar to those borne by the 
landed hereditary aristocracy or by some religious 
elites. The creation of an entirely new secular and 
rational type of legitimation, in which the social 
groups or universalistic principles would be the 
foci of legitimation, was either beyond their hori- 
zon or against their basic interest. It would neces- 
sarily involve extending the sphere of political 
participation and consequently increasing the in- 
fluence of various strata in the political institutions. 

Therefore, the rulers were usually unable to 
transcend the symbols of stratification and legiti- 
mation borne by the very strata whose influence 
they wanted to limit. Consequently, the ability of 
the rulers to appeal to the lower strata of the 
population was obviously limited. Even more im- 
portant, because of this emphasis on the superiority 
and worth of aristocratic symbols and values, many 
middle or new strata and groups tended to identify 
with them and consequently to “aristocratize” 
themselves. 

The contradiction in the rulers’ policies and goals 
could develop also in a different direction. However 
tradition-bound the ruling elites may have been, 
their policies required the creation and propagation 
of more flexible “free” resources in various institu- 
tional fields. Here again, the major types of free 
resources were, in the economic field, money and 
easily exchangeable goods, free manpower in gen- 
eral, and free professional manpower in particular; 
and, in the political and social fields, relatively free 
commitments and possibilities of support. The 
propagation of such free resources either gave rise 
to many religious, intellectual, and legal groups 
whose value orientations were much more flexible 
than the traditional ones, or else it promoted such 
groups. Moreover, the orientations and values of 
the broader middle strata of the society sometimes 
were similar to those propagated by these more 
active elite groups. Although in many cases all 
these elements were very weak and succumbed to 
the influence of the more conservative groups and 
policies of the ruling elite, in other cases—as in 
Europe—they developed into relatively independ- 
ent centers of power, whose opposition to the rulers 
was stimulated only by the rulers’ more conserva- 
tive policies. 

The rulers’ activities in the economic field were 
similarly inconsistent. Their main economic aims 
posed a series of dilemmas that could be extremely 
acute in relatively undifferentiated economic sys- 
tems and that could give rise to intensive contra- 
dictions between long-term and short-term eco- 
nomic policies. Thus, the continuous necessity to 
mobilize extensive resources at any given moment 
could often exhaust the available free resources 
on which the rulers’ economic independence rested. 
The big landowners and merchants, who consti- 
tuted important centers of economic power, quite 
often tried to intensify this contradiction by pro- 
viding the government with short-term loans or 
allocations of manpower for very limited periods 
and purposes; this increased the dependency of 
the rulers at the same time that it buttressed their 
own position. Although these allocations were 
usually insufficient to take care of their long-term 
needs, the rulers had to pay dearly in terms of the 
various free resources at their disposal. The rulers 
had to avail themselves of the various services and 
other resources of these more tradition-minded 
groups, giving them in return various concessions 
that often tended to undermine the long-run avail- 
ability of free resources. In this way, the position 
of the rulers gradually became weaker (Eisenstadt 
1963, chapter 12). 

A similar contradiction existed between the long- 
range and short-range policies dealing with prob- 
lems of administrative manpower. In many cases 
there was not enough manpower available for the 
execution of various administrative and political 
tasks, or because of inadequate communication and 
technical facilities it was very difficult to supervise 
such personnel effectively. It then became neces- 
sary to farm out various functions and positions 
either to local gentry and landowners or to officials 
who gradually became “aristocratized.” 

The best example of how the social groups cre- 
ated by the ruling elite became partially opposed 
to its aims and basic political premises is the 
development of the system of sale of offices, which 
was closely connected, in these empires, with the 
entire process of recruitment into the bureaucracy 
(Swart 1949). At first this system was usually 
introduced by the rulers as a means of solving their 
financial problems and admitting new (nonaristo- 
cratic) elements into their service. But in time, in 
most of these societies, the bureaucracy came to 
regard its offices as possessions and either trans- 
mitted them in the family or sold them in the 
market; in this way the rulers, despite many efforts 
to the contrary, slowly lost control over these offices. 

This trend was connected, in general, with the 
tendency by the bureaucracy itself—the very in- 
strument of power of the rulers—to aristocratize 
itself, to acquire symbols of aristocratic status, and 
to ally itself with aristocratic forces. In such cases 
the bureaucracy very often displaced its goals of 
service to the rulers for those of self-aggrandize- 
ment. Its members used their positions for enrich- 
ing themselves and their families, thus becoming a 
growing burden on the economy and losing their 
efficiency. 

This development necessarily affected the nature 
and extent of political activity and the scope of 
mobilization of political leadership. Insofar as these 
processes of aristocratization became intensified, 
they usually depleted the supply of political leaders 
to the central political institutions. The more active 
elements became alienated from the regime, 
whether their alienation took the form of succumb- 
ing to the aristocratic forces, falling into complete 
political apathy, or becoming centers of social and 
political upheaval and change. 

Politics and social class. Similar contradictions 
tended also to develop in the political attitudes and 
activities of the major strata in these societies. 
Several basic attitudes of various strata and groups 
toward the basic premises of the political systems 
of these empires and toward the basic aims of their 
rulers can be distinguished. The first attitude, 
evinced chiefly by the aristocracy, was one of oppo- 
sition to these premises—an opposition that was 
often shared by the peasantry and sometimes also 
by other groups that were interested only in main- 
taining their own limited local autonomy and their 
immediate economic interests. 

The second attitude consisted of basic identifica- 
tion with the political premises of the imperial 
system, combined with a willingness to fight for 
one’s own interests within the framework of exist- 
ing political institutions. This attitude was to be 
found mostly among the bureaucracy and among 
various elements of the urbanized professional and 
cultural elites. 

The third attitude, developed mainly by the more 
differentiated urban groups and professional and 
intellectual elites, favored changes in the extension 
of the scope of political systems. This attitude, 
which was most clearly evinced by the European 
middle class and intellectual groups at the end of 
the eighteenth century (Beloff 1954), was mani- 
fested in various attempts to change the basic value 
premises of the political system, to widen the pat- 
terns of political participation within it, and to find 
referents of political orientation that transcended 
the given political system. 

These attitudes often overlapped in concrete in- 
stances and varied by group and stratum in differ- 
ent societies and periods. Moreover, the attitudes 
of any one group were never homogeneous and 
stable, and they could change greatly according 
to political conditions. The various political atti- 
tudes of the major social groups greatly influenced 
the extent of their political participation and the 
scope and the nature of the political leadership 
that tended to develop from within them. Here 
again the most significant factor, from the point 
of view of the continuity of the imperial system, 
was the bureaucracy’s tendency to aristocratize 
itself and thus to undermine the very conditions of 
such continuity. No less important was the possi- 
bility that the very administrative organs created 
for the implementation of the rulers’ policies could 
develop autonomous orientations and activities that 
might become opposed to the basic premises of the 
imperial system. 


Imperial decline 


It was the interplay between the policies of the 
rulers and the political orientations and activities 
of the major social groups that constituted the crux 
of the processes of change within the empires and 
also brought about the development of conditions 
that could facilitate their downfall. These processes 
were rooted in the basic characteristics of the social 
and political structures of all the historical bureau- 
cratic empires. However, the exact ways in which 
these conditions for change developed, as well as 
the exact processes that caused them, varied in 
different empires according to the specific constel- 
lation of their structural characteristics, the various 
external processes that impinged on them, and 
their unique historical circumstances. 

Among the internal aspects of the social struc- 
ture of these empires which influenced the processes 
of change was, first, the nature of the goals of the 
rulers. Whether chiefly military and expansionist, 
or more oriented to the maintenance of a cultural 
order, or concerned mainly with economic advance- 
ment, each kind of goal made different kinds of 
demands on the various types of resources available 
in the society. The processes of change and disinte- 
gration were also set in motion by (a) the policies 
that the rulers developed for the implementation of 
their goals and the repercussions of these policies 
on the relative strength of different social strata; 
(b) changes in the relative strength of such strata 
as a result of internal economic, religious, or polit- 
ical developments; and (c) the development of 
various internal and external crises and the ways 
in which the policies developed to deal with them 
influenced the strength of different groups. 

The direction taken by an empire’s decline and 
the rate at which it declined were also greatly in- 
fluenced by two residual factors: the initial level 
of social and economic differentiation in the society 
and the initial strength of its various social groups 
in relation to one another. Within this context, of 
special importance was the extent to which there 
existed common cultural and political bonds en- 
compassing these major social groups and the 
rulers, as for instance in the case of the Confucian 
order in China (see Eisenstadt 1963, chapter 12; 
Balázs 1964, chapters 1-2). 

Among the more “accidental” or external factors 
that influenced the processes of change, we should 
mention different extents of external pressure, 
major movements of population, conquests of 
nomads, international economic fluctuations, and 
the degree to which there existed from the begin- 
ning ethnic heterogeneity in a given society. Of 
equally crucial importance was the specific geo- 
political situation of any polity: for instance, the 
special geopolitical situation of Byzantium at the 
crossroads of Europe and Asia. In general, it was 
some combination of external and internal pres- 
sures and exigencies that precipitated change in the 
political systems of these empires. The greater 
the intensity of these internal contradictions, and 
the more intractable the external crises by which 
the empires were faced, the quicker and more 
intensive was the onslaught of change. 

Thus, to give only some very preliminary ex- 
amples (for a fuller exposition, see Eisenstadt 
1963, chapter 12), the fact that in China various 
invasions, rebellions, and the famous “dynastic 
cycles” did not undermine for a very long period 
of time the basic institutional structure of the 
Chinese Empire (from the Han to the Ch’ing) can 
be understood if one remembers its geopolitical 
position, which made it relatively immune to the 
heavy impact of external forces. Furthermore, in 
China the relative weakness of the aristocracy and 
the predominance of the gentry tended to enhance 
the position of the centralized rulers; and the Con- 
fucian literati and bureaucracy, who constituted 
the backbone of the social and political structure, 
intervened between the central government and the 
major social strata and provided an indispensable 
framework of continuity and unity for the empire. 
By contrast, the geopolitical exposure of the Byzan- 
tine Empire, with its strong sensitivity to invasions 
and continuous internal struggle between the aris- 
tocracy and the free peasantry, led to its complete 
downfall. Similarly, the Roman and Arabic em- 
pires, with their extended boundaries that em- 
braced a great variety of autonomous religions and 
cultural groups, were unable to contain internal 
dissension at the same time that they were forced 
to deal with external pressures. 


The foci of change 


The policies of the rulers and the political orien- 
tations and activities of the major social groups 
within the empires were greatly influenced by two 
major sources (both external and internal) of pres- 
sure and change. As we have seen above, the ex- 
ternal, geopolitical factors, in the broadest sense, 
provided not only the general setting for these 
polities but also constituted sources of many con- 
crete pressures, such as external pressures of popu- 
lation and problems of military security or of 
adjustment to international trade. These geopolitical
settings indicated, as we have seen, the nature
of the international system within which the rulers 
of the empires worked and the types of problems to 
which they were especially sensitive. It has been 
rightly claimed that in many of these empires there 
existed, because of their basic structural character- 
istics, what has been called Primat der Aussen- 
politik (Altheim 1955), or the priority of foreign 
policy; this implied a much greater sensitivity to a
variety of such external pressures than in many 
other types of political systems. These external 
pressures were very often connected with internal 
problems. For instance, close relations obviously 
developed between problems of international trade 
and the situations and activities of merchant groups 
or between military problems and problems of 
manpower recruitment. Thus it was the combina- 
tion of external and internal pressures that con- 
stituted the major foci of change in the empires. 

In more concrete terms, the main factors gen- 
erating processes of change in these empires were 
(a) the continuous needs of the rulers for different 
types of resources and especially their great de- 
pendence on various flexible resources; (b) the 
rulers’ attempts to maintain their own positions of
control, in terms of both traditional legitimation 
and of effective political control over the more 
flexible forces in the society; (c) the great and con- 
tinuous sensitivity of the internal structure of these 
societies to various external pressures and to political
and economic developments in the international
field; (d) the consequent needs of the rulers
to intensify the mobilization of various resources 
in order to deal with problems arising out of 
changes in military, diplomatic, and economic in- 
ternational situations; and (e) the development of 
various autonomous orientations and goals among 
the major strata and their respective demands on 
the rulers.

Insofar as there developed strong contradictions 
between these different factors and especially inso- 
far as the rulers emphasized very expensive goals, 
which exhausted the available economic and man- 
power resources, the rulers found themselves in 
various dilemmas. In such situations, the special 
sensitivities of these political systems were brought
out, and forces were generated that could under- 
mine the delicate balance between political par- 


more flexible, differentiated strata, and either depleting
or alienating the more “free” groups and


conservative, aristocratic-patrimonial (or feudal)
elements, and depletion or flight of mobile capital 
(Eisenstadt 1963, chapter 12). 


These processes were usually closely connected 
with the aristocratization or ossification of the 
bureaucracy, with its growing parasitic exploita- 
tion of the economy, and with the depletion of 
active political leadership identified with the re- 
gime. Such parasitic exploitation of the economy 
by the rulers was in a way an intensification of the 
usual economic activities of the rulers of the em- 
pires. The special parasitic nature of their activities 
in periods of decline—or of the setting in of de- 
cline—was evident not so much in the mere ex- 
tension of the demands for taxes or for manpower 
as in the fact that the resources mobilized by the 
rulers were used for the creation of new ascriptive 
positions and groups. Instead of promoting condi- 
tions that could have encouraged the extension of 
greater resources, through trade or the facilities for 
training professional manpower, the rulers de- 
pleted their resources by adding to their already 
overdeveloped bureaucracies. 

Thus there often developed a continuous flux of 
foreign elements into the centers of the realms. 
Initially mere merchants, hirelings, and personal 
helpers of the rulers, these foreign military groups 
gradually succeeded in infiltrating some of the 
most important political posts and finally in totally 
usurping the supreme political power. This was 
made possible by the depletion of native strata, to- 
gether with the mounting internal and external 
crises. Similar developments could take place with 
regard to foreign merchants who sometimes, as in 
Byzantium or the Ottoman Empire, finally man- 
aged to monopolize all the trading posts abandoned 
by the indigenous merchants (Ostrogorski 1940). 
In those cases, as in Europe, in which these eco- 
nomically and socially more active strata were de- 
pleted, they became alienated from the rulers and 
their policies, as well as from the political insti- 
tutions of the society, and turned to fomenting 
change and revolt. 

SHMUEL N. EISENSTADT 


[See also BUREAUCRACY; CIVIL SERVICE; COLONIALISM.]
EMPLOYMENT AND UNEMPLOYMENT


The commercialization and industrialization of 
a large part of the world during the past several 
centuries have involved a radical change in the 
way in which human labor is directed toward pro- 
ductive ends. Two centuries ago, or more, most of 
the world’s work force tilled the soil, as is still 
largely true except in North America and Europe. 
This work was performed in good part under conditions
of status, in which a man was tied to a particular
place and particular job by institutional
arrangements which offered an individual and his 
family a sense of stability and security. 

Today, in the more developed countries and in 
the commercial-industrial sectors of the less developed
economies, command over labor is acquired
in the market place. The interaction of demand 
and supply in the labor market determines, within 
limits, the level of wages and wage differentials, 
the volume of employment—and the volume of un- 
employment. As the relative importance of agri- 
culture and other forms of self-employment has 
declined, the interplay of demand and supply, in 
the economy as a whole and in labor markets in 
particular, has come to be crucial in determining 
both long-term trends and short-run fluctuations in 
the volume of employment and unemployment. 

The personal insecurity and widespread hard- 
ship that frequently resulted from the impersonal 
market determination of the level of employment 
began to influence government policies in the most 
industrialized countries in the latter part of the 
nineteenth century and became increasingly influ- 
ential in the twentieth (Beveridge 1909). But in- 
tense world-wide concern with the welfare implica- 
tions of widespread unemployment did not come 
until the great depression of the 1930s. Out of the 
shock of that experience and the economic plan- 
ning that followed came the modern concern with 
maintaining the level of employment. This devel- 
opment was greatly stimulated by the “Keynesian 
revolution” in economic thinking, which dates from 
the mid-1930s. 

Today “full employment” is an almost universally 
espoused goal. Virtually every advanced country 
that depends to a significant degree on private mar- 
kets consciously formulates and tries to implement 
an employment policy. This was not true before the 
1930s. The Beveridge Report of 1944, Full Em- 
ployment in a Free Society, represents a landmark 
in this development. For the United States a com- 
parable landmark is the Employment Act of 1946. 
The international concern at the end of World 
War II with the need to maintain a high and stable
level of employment is suggested by the pledge, 
made by the governments subscribing to the United 
Nations Charter, to take action to promote “higher 
standards of living, full employment, and condi- 
tions of economic and social progress and develop- 
ment” (United Nations . . . 1949, p. 5). 

The goal of full employment may conflict with 
other economic goals. Thus, full employment may 
not be compatible with the desired degree of sta- 
bility in the price level, and at times governments 
have felt compelled to sacrifice some employment 
in order to correct a disequilibrium in the balance 
of payments. The question of how to reconcile these
goals was being debated vigorously in the Western 
world in the mid-1960s. 

Table 1 offers some historical background on the 
changing pattern of employment in broad indus- 
trial sectors. In the nineteenth century the shift 
away from agriculture had been most marked in 
the United Kingdom, and in the first decade of the 
twentieth century agriculture already accounted for 
less than 15 per cent of the British labor force. In 
the other leading industrial countries the agricul- 
tural sector of the labor force had already fallen 
below 50 per cent by the beginning of the twentieth 
century. 

The shift of labor out of farming continued 
steadily during the first half of the twentieth century
and has, indeed, accelerated since the end of
World War II (Twentieth Century Fund 1961, pp.
72-74). The movement has been not only toward 
what is loosely called “industry” (manufacturing, 
mining, construction, and electric power and gas) 
but also into the trade and service sectors. This 
latter movement has been particularly marked in 
the United States. It is also noteworthy that the 
relative contribution of industry to total employ- 


1950 to 1962. Involved here is not only the chang- 
ing pattern of demand but also a more rapid rise 


Table 1 — Percentage distribution of labor force by
industrial sector in selected countries, 1910-1962

                              About   About   About
                      1962    1950    1930    1910
France
  Agriculture (a)       21      34      36      43
  Industry (b)          40      33      36      32
  Service (c)           39      33      27      25
Germany
  Agriculture           14      23      29      34
  Industry              49      44      41      40
  Service               38      33      30      26
United Kingdom
  Agriculture            4       5       6      12
  Industry              48      47      43      43
  Service               48      48      51      45
United States
  Agriculture            9      12      22      31
  Industry              33      35      31      31
  Service               59      53      47      38

a. Agriculture includes forestry and fishing.
b. Industry includes manufacturing, mining, construction, gas, electricity, and
water.
c. Service includes trade, banking and finance, transport and communication,
and all other services (including government).

Sources: 1910-1950 data from Kuznets 1957, appendix, table 4; 1962 data from
Organization for Economic Cooperation and Development 1963.

Table 2 — Percentage distribution of total employment
by occupational classification in the United
States, 1900-1963

Occupational group        1963    1950    1940    1930    1920      1910    1900
White-collar workers      43.9    36.6    31.1    29.4    24.9      21.4    17.6
Manual workers            36.3    41.1    39.9    39.6    40.2      38.2    35.8
Service workers           13.1    10.5    11.7     9.8     7.8  [illeg.]     9.1
Farm workers*              6.7    11.8    17.4    21.2    27.0      31.1    37.5

* Includes farmers and farm laborers.

Sources: For 1900-1950, U.S. Bureau of the Census
1960; for 1963, U.S. President 1964.


in labor productivity in manufacturing than in the 
trade and service sectors.

Another change in the pattern of employment 
involves a gradual shift to white-collar occupations. 
This change has been particularly notable in the 
United States, as indicated in Table 2. 

Other changes in the composition of employment 


an upward trend in the
age at which children leave school and enter the 
labor force, and earlier retirement for older workers 
[see LABOR FORCE]. These trends are expected to 
continue into the future. 


Unemployment 




Table 3 — Unemployment rates in selected countries, 1913-1962 (a)
(per cent of the labor force)

                         Germany
                          (later                          United              United
Year      France         F.R.G.)     Italy    Sweden     Kingdom    Canada    States
1913         1.0        [illeg.]       1.7  [illeg.]         1.2       3.0       4.3
1920           b             1.7         b       1.3         1.8         b       5.2
1921         2.5             1.2         b       6.4         9.6       5.8       1.7
1922           b             0.7         b       5.5         8.1       4.4       6.7
1923           b             4.5         b       2.9         6.6       3.2       2.4
1924           b             5.8         b       2.4         5.8       4.5       5.0
1925           b             3.0         b       2.6         6.4       4.4       3.2
1926         1.1             8.0         b       2.9         7.1       3.0       1.8
1927           b             3.9         b       2.9         5.5       1.8       3.3
1928           b             3.8         b       2.4         6.1       1.7       4.2
1929           b             5.9       1.7       2.4         5.9       2.9       3.2
1930           b             9.5       2.5       3.3         9.3       9.1       8.9
1931         2.1            13.9       4.3       4.8        12.6      11.6      16.3
1932           b            17.2       5.8       6.8        13.1      17.6      24.1
1933           b            14.8       5.9       7.3        11.7      19.3      25.2
1934           b             8.3       5.6       6.4         9.9      14.5      22.0
1935           b             6.5         b       6.2         9.2      14.2      20.3
1936         4.2             4.8         b       5.3         7.9      12.8      17.0
1937           b             2.7       5.0       5.1         6.7       9.1      14.3
1938         3.6             1.3       4.6       5.1         8.1      11.4      19.1

1950         1.4             7.2       8.7       1.7         2.5       3.6       5.3
1951         1.3             6.4       9.2       1.6         2.2       2.4       3.3
1952         1.3             6.1       9.8       1.7         2.9       2.9       3.1
1953         1.6             5.5      10.2       1.9         2.6       2.9       2.9
1954         1.6             5.2       8.7       1.8         2.3       4.5       5.6
1955         1.5             3.8       7.5       1.8         2.1       4.3       4.4
1956         1.2             3.1       9.3       1.6         2.2       3.3       4.2
1957         1.0             2.7       8.1       1.7         2.4       4.5       4.3
1958         1.1             2.7       6.4       2.0         3.0       6.9       6.8
1959         1.8             1.9       5.4       1.8         3.1       5.9       5.5
1960         1.3             0.9       4.0       1.6         2.5       6.9       5.6
1961         1.2             0.6       3.4       1.5         2.4       7.1       6.7
1962         1.3             0.5       2.9       1.3         2.9       5.8       5.6

a. The unemployment rates shown for the various countries do not all reflect
precisely the same coverage and definitions, although some limited standardization
has been applied to the figures. Thus, the figures shown here do not agree
completely with those in Table 4, where, for a more limited period, complete
comparability with American definitions and coverage has been attempted.

b. Not available.


not had pockets of serious unemployment with 
which to deal. Various countries continue to have 
their depressed regions—the Italian Mezzogiorno 
and Northern Ireland, for example—and differen- 
tially high unemployment rates may exist in par- 
ticular countries for teen-agers, the older age 
groups, or the least skilled. 

The relatively high unemployment that persisted 
in the United States after the mid-1950s gave rise 
to a vigorous debate as to its causes and appropri- 
ate cures (Ross 1964). On the one hand have been 
those who argued that the high level of unemploy- 
ment reflected primarily a deficiency of aggregate 
demand. The argument has run that a higher and 
more rapidly rising level of total demand, sup- 


Sources: For all countries except United States, 1913-1960 from Maddison 1964, 
table E-1; 1961-1962 adapted from Organization for Economic Coop- 
eration and Development 1963; for United States, 1913-1960 from 
Lebergott 1964, p. 512; for 1961-1962, from U.S. President 1964, p. 195. 


ported by appropriate monetary-fiscal policies, 
could increase employment to the point where un- 
employment would again be brought down to a rate 
of 4 per cent or less. 

Opposing this view has been the structuralist 
argument that the main problem has been not a 
lack of jobs in the aggregate but a failure of the 
labor supply to adjust sufficiently to the rapidly 
changing pattern of employment opportunities 
(U.S. Congress, Senate . . . 1963, part 5, pp. 1461-1499).
The changing pattern of the demand for
labor—resulting from automation and other tech- 
nological change and from the shift in the com- 
position of output toward services—has greatly 
increased the demand for highly educated and 
white-collar workers and reduced the demand for
blue-collar, unskilled, and poorly educated workers. 
And, the argument runs, it has not been possible 
to make the necessary adjustments in labor supply. 
As a result, the structuralist position has main- 
tained, merely expanding aggregate demand would 
lead to labor shortages in occupations in which 
unemployment rates were already relatively low 
without materially reducing the rates among the 
groups that had the highest unemployment. 

In the early 1960s the consensus among those 
who had studied the problem was that the high 
level of unemployment in the United States was 
primarily the result of the failure of aggregate de- 
mand to expand rapidly enough. There was gen- 
eral agreement also, however, that the heterogene- 
ous character of the American labor force created 
a persistent structural problem of poor employ- 
ment opportunities for the underprivileged parts of 
the labor force (Ross 1964; Gordon 1964). There 
was increasing recognition on all sides that there 
was need for a vigorous and comprehensive man- 
power policy to expedite the absorption into gain- 
ful employment of the least qualified groups in the 
labor force (U.S. Senate . . . 1964). 

When is a person unemployed? There is no 
single unambiguous definition of unemployment. 
Different purposes call for different definitions. 
Further, the definitions that are implicit in the 
official figures published in different countries may 
vary because of the way in which unemployment 
statistics are compiled.

The difficulties involved in arriving at a gen- 
erally accepted definition of unemployment can be 
illustrated by the household survey utilized in the 
United States. Information is secured for each per- 
son in the household aged 14 and over. For those 
for whom it is reported that they did not work for 
pay outside the home at least one hour during the 
reference week, the question is then asked: “Was 
. . . looking for work?” An affirmative reply settles 
the matter; the person is listed as unemployed. A 
change in this procedure was under consideration 
in 1966. 

This procedure raises the following definitional 
questions, among others: 

(1) Should unemployment be measured in hours 
or persons? If a person wanting to work full-time 
can find only a half-time job, should he be counted 
as half unemployed? Similarly, if an unemployed 
person seeks only a part-time job, should he be 
counted as wholly unemployed? In all countries, 
the official figures refer to persons wholly unemployed,
that is, persons who are not working at all
and are seeking full-time or part-time work. But
some data on part-time unemployment are avail- 
able, and the American government now publishes 
a monthly figure on “percent of labor force time 
lost” through unemployment. 

(2) What should a person have done to look for 
work, and on what terms should he be willing to 
accept a job, if he is to be counted as unemployed? 
What sorts of overt action—such as registering at 
an employment office or applying personally at a 
factory or office—should he have taken? And how 
recently should he have taken such action? Further, 
if he is willing to work only at his former wage 
while a job is available to him at a lower wage, 
should he be counted as unemployed? These are 
questions that have no single and obvious answer. 

(3) A question frequently asked is whether one 
should include among the unemployed those who 
are only marginally in the labor force—for exam- 
ple, boys and girls in school seeking part-time jobs 
or housewives who wish to work to supplement the 
family income or enjoy a more varied life. Those 
who would exclude such persons are in effect sug- 
gesting that hardship be the primary criterion in 
defining who is unemployed. Quite clearly, unem- 
ployment does not imply the same degree of hard- 
ship to all persons seeking jobs at a particular 
time. The answer here seems fairly clear-cut. An 
inclusive definition of unemployment should be 
used by those who compile the data, but the latter 
should be published in sufficient detail—for ex- 
ample, by age, sex, and marital status—so that the 
user can combine the subtotals in whatever way 
best fits the particular definition he wants to use. 

(4) The question of marginal attachment to 
the labor force can cut two ways. The official count 
of the unemployed may include some persons who 
are only marginally attached to the labor force. But
it also excludes an unknown number who would 
like to work and would look for work under more 
favorable conditions. This is not a serious issue if 
jobs are plentiful, but it does assume importance 
when unemployment is at a high level. Such a 
situation of more or less forced withdrawal from 
the labor force is frequently characterized as one 
of disguised unemployment. 

Disguised unemployment was certainly of some 
importance in the United States after 1957, when 
the national unemployment rate remained above 
5 per cent. This is an area in which it is difficult to 
obtain reliable figures. But there has been a grow- 
ing body of evidence that the lack of jobs created 
disguised unemployment among women, older per- 
sons, the unskilled, those with the least education, 
and nonwhites (U.S. President 1964, pp. 30-31).
Or to put the matter in other terms, labor-force 
participation rates among these groups were de- 
pressed by relatively high unemployment rates. 

(5) Some other questions that arise in defining 
and measuring unemployment can only be men- 
tioned. What should be done about those who are 
presumably unemployable because of physical or 
psychological handicaps? Should any attempt be 
made to measure “underemployment,” that is, the 
extent to which persons are employed at jobs that 
call for less than their highest current level of 
skill? While it would certainly be desirable to have 
even partial measures of such underemployment, 
the obstacles in the way of developing such data 
are virtually insuperable. 

The problem of underemployment is most fre- 
quently discussed in another connection—the very 
low average productivity and virtually zero mar- 
ginal productivity of the agricultural populations 
in some of the least developed parts of the world. 
Here the pressure of population on natural re- 
sources is such that, given the state of production 
techniques, a moderate decline in the agricultural 
work force would lead to little if any reduction in 
total output. This condition is sometimes said to 
create “disguised unemployment,” a term that we 
have used to mean withdrawal from the labor force 
because of the scarcity of jobs. 
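
Question (1) above, whether unemployment should be counted in persons or in hours, can be made concrete with a small sketch. The ten-person labor force and the field names below are hypothetical, and the hours-based measure is only an illustration of the idea; it is not claimed to reproduce the official American "percent of labor force time lost" series.

```python
from dataclasses import dataclass


@dataclass
class Person:
    hours_worked: float  # hours actually worked in the reference week
    hours_wanted: float  # hours the person would like to work


def persons_rate(people):
    """Per cent of the labor force wholly unemployed (a count of persons)."""
    labor_force = [p for p in people if p.hours_wanted > 0]
    if not labor_force:
        raise ValueError("empty labor force")
    wholly_unemployed = [p for p in labor_force if p.hours_worked == 0]
    return 100.0 * len(wholly_unemployed) / len(labor_force)


def time_lost_rate(people):
    """Per cent of desired working hours that were not worked."""
    labor_force = [p for p in people if p.hours_wanted > 0]
    wanted = sum(p.hours_wanted for p in labor_force)
    if wanted == 0:
        raise ValueError("empty labor force")
    lost = sum(max(p.hours_wanted - p.hours_worked, 0.0) for p in labor_force)
    return 100.0 * lost / wanted


# Hypothetical ten-person labor force (illustrative values only).
people = (
    [Person(40, 40)] * 7   # fully employed
    + [Person(0, 40)]      # wholly unemployed, seeks full-time work
    + [Person(0, 20)]      # wholly unemployed, seeks part-time work
    + [Person(20, 40)]     # holds a half-time job, wants full-time work
)

print(f"persons basis: {persons_rate(people):.1f}")
print(f"hours basis:   {time_lost_rate(people):.1f}")
```

In this made-up case the persons basis gives 20.0 per cent and the hours basis about 21.1 per cent. The half-time worker raises the hours figure, while the part-time job seeker weighs less in it than in the count of persons.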

Methods and problems of measurement. It has 
already been noted that unemployment is measured 
in different ways in different countries. Systems of 
compiling unemployment data can be grouped 
under five headings: (1) sample surveys of the 
labor force; (2) compulsory unemployment-insur- 
ance statistics; (3) unemployment-relief data; 
(4) trade union records of unemployment; and 
(5) registrations at government employment offices. 

Of these, the first and the last are the most com- 
mon (U.S. President's Committee . . . 1962, appen- 
dix A). In 1964 sample surveys were the source of 
the official data on unemployment in Canada, 
Japan, and the United States; and such surveys 
have been used as a supplementary source of in- 
formation in a number of other countries—for 
example, France, Germany, Italy, and Sweden. 
Employment-office registrations were the primary 
official source in France, Germany, Great Britain, 
Italy, Sweden, and elsewhere. 

Section A of Table 4 presents the official unem- 
ployment rates for a number of countries for the 
years 1960-1962. Differences in definition and 
method of measurement, however, mean that a 
number of adjustments are necessary to make these 


Table 4 — Unemployment rates in selected countries,
as published and after adjustment to United
States definitions, 1960-1962 (per cent of
the labor force)

                      A. OFFICIAL FIGURES          B. ADJUSTED TO U.S.
                         AS PUBLISHED                 DEFINITIONS
                     1960    1961   1962 (a)     1960   1961 (a)  1962 (a)
United States         5.6     6.7     5.4         5.6     6.7       5.6
Canada                7.0     7.2     6.0         7.0     7.2       6.0
France                1.0     0.9     0.9         1.9     1.7       1.8
Germany (F.R.G.)      1.2     0.8     0.7         1.0     0.5         b
Great Britain         1.6     1.4     1.9         2.4     2.2       2.8
Italy                 8.2     7.6     6.3         4.3     3.7       3.2
Japan                 1.0     0.9     0.9         1.1     1.0       1.0
Sweden                1.4     1.2     1.3           b     1.5       1.5

a. Some figures for these years are preliminary.
b. Not available.
Source: Myers 1964, p. 174.


figures fully comparable. Among the points of dif- 
ference are the following: 

(1) Treatment of unpaid family workers. Japan, 
for example, includes in the labor force unpaid 
family workers who worked as much as one hour 
during the survey week. The United States includes 
them only if they worked 15 hours. 

(2) The treatment of persons on temporary lay- 
off and of those waiting to start a new job. Before 
1957 these groups were counted as employed in 
the United States; they are now treated as unem- 
ployed. In Japan, they are considered to be 
employed. 

(3) The age at which children are included in 
the labor force. 

(4) The period to which the data apply. In the 
United States, for example, the period is a week, 
and anyone working even one hour during the 
week is counted as employed. In Germany and 
Great Britain, on the other hand, the reference 
period is a day. A person not working on that day 
is counted as unemployed, although he may have 
worked on one or more other days during that week. 

(5) The various unemployed groups that may be 
excluded from the unemployed. It is fairly com- 
mon to exclude the self-employed, who are included 
in the United States. If the data come from unem- 
ployment-insurance records, those ineligible for 
coverage would be excluded. If the figures are based 
on registrations at employment offices, persons who 
are seeking jobs but do not register will be excluded. 

(6) The computation of the unemployment rate. 
To compute an unemployment rate it is necessary 
to divide the number unemployed by some figure 
that includes the employed. In the United States, 
the denominator is taken to be the total civilian 
labor force. The self-employed are included. In a 
number of countries, the unemployment rate represents
only the percentage of wage-and-salary
workers who are unemployed. 
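
Point (6) above can be illustrated with a minimal sketch of how the choice of denominator alone changes the published rate. The counts are hypothetical (in thousands) and are not taken from any table in this article; the function name is likewise only illustrative.

```python
def unemployment_rate(unemployed, denominator):
    """Return the unemployment rate as a percentage of the chosen base."""
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return 100.0 * unemployed / denominator


# Hypothetical counts, in thousands (illustrative values only).
unemployed = 400
wage_and_salary_employed = 7_600
self_employed = 2_000

# U.S.-style base: total civilian labor force, self-employed included.
civilian_labor_force = unemployed + wage_and_salary_employed + self_employed
rate_total = unemployment_rate(unemployed, civilian_labor_force)

# Narrower base used in a number of countries: wage-and-salary workers only.
wage_salary_labor_force = unemployed + wage_and_salary_employed
rate_wage_salary = unemployment_rate(unemployed, wage_salary_labor_force)

print(f"total civilian labor force base: {rate_total:.1f}")
print(f"wage-and-salary base:            {rate_wage_salary:.1f}")
```

With the same 400 thousand unemployed, the rate is 4.0 per cent on the broader base and 5.0 per cent on the narrower one, which is why figures computed on different bases cannot be compared directly.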

In 1962 the U.S. Bureau of Labor Statistics made 
the first intensive attempt to standardize the un- 
employment figures in a number of countries, the 
basis of comparison being the definitions in use in 
the United States. The results are shown in Section 
B of Table 4. Of the countries listed, the largest 
relative adjustments were in the British, French, 
and Italian figures—the first two upward and the 
last downward. Even after these adjustments the 
unemployment rates of all the other countries ex- 
cept Canada were below that of the United States 
for each of the years 1960-1962. 
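
The size of these adjustments can be checked with a short calculation. The sketch below uses the 1960 figures as read from Table 4; the helper name and the choice of a simple percentage change as the measure of "relative adjustment" are assumptions made for illustration.

```python
# 1960 rates as read from Table 4: (official as published, adjusted to U.S. definitions).
RATES_1960 = {
    "Canada": (7.0, 7.0),
    "France": (1.0, 1.9),
    "Great Britain": (1.6, 2.4),
    "Italy": (8.2, 4.3),
    "Japan": (1.0, 1.1),
}


def relative_adjustment(official, adjusted):
    """Percentage change from the official rate to the adjusted rate."""
    return 100.0 * (adjusted - official) / official


changes = {
    country: relative_adjustment(official, adjusted)
    for country, (official, adjusted) in RATES_1960.items()
}

# List countries from the largest to the smallest adjustment in absolute terms.
ranked = sorted(changes, key=lambda country: abs(changes[country]), reverse=True)
for country in ranked:
    print(f"{country:14s} {changes[country]:+.0f}%")
```

On these figures the French and British rates rise by roughly 90 and 50 per cent, and the Italian rate falls by nearly half. This is consistent with the statement that these three countries show the largest relative adjustments.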

The kinds of unemployment. For both ana- 
lytical and policy reasons, it would be extremely 
useful to be able to distinguish among various 
kinds of unemployment. In the classifications that 
have been attempted, the categories that one most 
frequently encounters bear the titles “frictional,” 
“seasonal,” “cyclical” (or “deficiency-of-demand”), 
and “structural” unemployment (U.S. Congress, 
Joint Economic Committee 1961). 

Virtually all attempts at classification seek to 
take account of three sets of factors that tend to 


aggregate demand.

Seasonal unemployment creates a variety of 
problems, not the least of which is the problem 
of measurement. Considerable research has been
aimed at improving existing methods of adjusting 


unemployment figures for seasonal variation (U.S, 
President's Committee . . . 1962, chapter 6), And 
various measures by both government and private 
employers have had some effect in dampening 
the amplitude of seasonal movements in unem- 
ployment in particular industries. An interesting 
example was the effect of Schlechtwettergeld (bad- 
weather compensation) in reducing recorded sea- 
sonal unemployment in the German construction 
industry (Germany [Federal Republic] . . . 1962, 
pp. 56-57). 

In Sweden in the early 1960s unemployment in 
midwinter tended to run three to four times as high 
as in midsummer. It is not Surprising that such 
countries as Sweden and Norway have devoted 
considerable effort to reducing these wide seasonal 
swings. In the United States in 1964 the official 
seasonal index of the unemployment rate for ex- 
perienced wage and salaried workers ranged from 
84.1 in October to 122.9 in February (see “Rates 
of Unemployment” 1964). The seasonal amplitude 
in Canada was still wider.
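
A seasonal index of this kind is commonly applied by dividing the observed rate by the index (taken on a base of 100). The sketch below uses the two index values quoted above; the observed rates are hypothetical, and simple division is a textbook simplification, not necessarily the official adjustment procedure.

```python
def seasonally_adjust(raw_rate, seasonal_index):
    """Divide an observed rate by its seasonal index (base 100)."""
    if seasonal_index <= 0:
        raise ValueError("seasonal index must be positive")
    return raw_rate / (seasonal_index / 100.0)


# Index values quoted in the text for the United States in 1964.
SEASONAL_INDEX = {"October": 84.1, "February": 122.9}

# Hypothetical observed (unadjusted) rates, per cent.
raw_rates = {"October": 4.0, "February": 6.0}

adjusted = {
    month: seasonally_adjust(raw_rates[month], SEASONAL_INDEX[month])
    for month in raw_rates
}

# Ratio of the seasonal high to the seasonal low implied by the index alone.
amplitude = SEASONAL_INDEX["February"] / SEASONAL_INDEX["October"]

for month, rate in adjusted.items():
    print(f"{month}: raw {raw_rates[month]:.1f}, adjusted {rate:.2f}")
print(f"peak-to-trough ratio: {amplitude:.2f}")
```

Under these assumptions, observed rates of 4.0 in October and 6.0 in February correspond to nearly the same adjusted rate, so most of the apparent winter rise is seasonal.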

2. Cyclical unemployment. Next is the set of 
factors implied by what formerly was called “cycli- 
cal” and is now frequently termed “deficiency-of- 
demand” unemployment. What is implied here is 
that the total demand for goods and services, given 
existent wage rates and labor productivity, is not 
sufficient to generate jobs for all those who want 
to work (after appropriate adjustment for mini- 
mum frictional unemployment). The number of 
job vacancies open in the economy as a whole is 
significantly less than the total number of people 
seeking work. This is the kind of unemployment 
that was so predominant during the great depres- 
sion, and it is to the prevention of this kind of 
unemployment in particular that so-called full- 
employment policies are addressed. 

3. Structural unemployment. The term “struc- 
tural” unemployment is most frequently associated 
with the third set of factors. While this term has 
been used in different ways, structural unemployment
almost always implies the following:

(a) There are particular sectors of the labor 
force from which workers cannot easily and quickly 
move into other sectors in search of jobs.

(b) In some or all of these sectors with impaired
mobility, unemployment significantly exceeds
available vacancies. And because of inadequate
mobility, labor supply does not easily adjust
to the inadequate level of demand. Hence, unemployment
rates are higher in these sectors than in
(c) There may be insufficient demand for particular
types of labor for a number of reasons.
Three in particular might be cited. First, the de- 
mand for particular skills may be reduced because 
of technological change or a shift in the pattern of 
demand. Discussions of structural unemployment 
in the United States in the early 1960s emphasized 
this factor particularly. Second, there may be a 
shift of economic activity out of a geographical 
region not matched by a comparable exodus of 
workers. This leads to the problem of “depressed 
areas.” And third, there may be an influx of work- 
ers—of a particular type or into a particular region 
—at such a rate that they cannot be quickly ab- 
sorbed into jobs. The influx of refugees into West 
Germany after World War II led to considerable
debate regarding the size of the structural unem- 
ployment problem in that country (Germany [Fed- 
eral Republic] . . . 1952). 

The three kinds of causes of structural unem- 
ployment cited in the preceding paragraph all have 
to do with identifiable shifts in the pattern of de- 
mand for or supply of labor, to which the economy 
finds it difficult to adjust because of some degree 
of labor immobility. It is also possible that demand 
and supply for a particular kind of labor may have 
been out of balance as far back as our records go. 
Immobility, however it came to exist, may perpetu- 
ate such sectoral imbalances and create differen- 
tially high unemployment rates that continue in- 
definitely. Such more or less permanent immobility 
may be associated with a variety of institutional 
factors. Thus, we expect to find that unemployment 
rates are higher among the least skilled and least
educated. We should also expect that unemploy-
ment rates would be higher at the extremes of the
age distribution than in the prime working ages. 
In the United States unemployment rates have been 
higher among Negroes than among whites since 
unemployment data were first recorded [see
DISCRIMINATION, ECONOMIC].

These structural differentials will be the more 
marked the more heterogeneous is the labor force 
—and the measure of heterogeneity in this context 
is intersectoral immobility. These unemployment 
differentials are particularly marked, and the labor 
force is particularly heterogeneous, in the United 
States. 

These differentially high unemployment rates 
may contain a frictional as well as a deficient- 
demand element. Among construction workers, for 
example, there may be ample vacancies, but the 
element of weather plus the need to move from 
job to job will cause frictional unemployment to be 
higher than in most other industries. Over and 
above such differentially high frictional unemploy- 
ment, there may also be a shortage of jobs in rela- 
tion to supply of this particular kind of labor. 

It might be added that it is not easy in fact to 
differentiate between unemployment due to an 
economy-wide deficiency of demand and that due 
to structural factors in the sense defined above. 
For one thing, variations in aggregate demand 
affect different groups differently. For another, even 
the most persistent structural reasons for differen- 
tial-employment opportunities tend to weaken in 
very tight labor markets; the extreme labor tight-
ness during World War II illustrates this. Or, to cite
another example, the structural unemployment re-
sulting from the influx of refugees into West Ger-
many largely disappeared in the very tight labor
market of the late 1950s and early 1960s.


Table 5 — Unemployment rates for different sectors of the labor force, United States, 1948-1965
(per cent of sectoral labor force)

Sector                             1965   1963   1959   1956   1953   1948
All civilian workers”               4.6    5.7    5.5    4.2    2.9    3.8
By age and sex"
  Men 20 years and over             3.2    4.5    4.7    3.4    2.5      c
  Women 20 years and over           4.5    5.4    5.2    4.2    2.9      c
  Both sexes, 14-19 years          13.6   15.6   13.2   10.4    7.1      c
  Married men*                      2.4    3.4    3.6    2.3      c      c
By occupation”
  White collar                      2.3    2.8    2.6     ie    1.4    2.1
  Professional, technical only      1.5    1.8    1.7    1.0    0.9    1.7
  Blue collar                       5.3    7.2    7.5    5.1    3.5    4.3
  Unskilled laborers only           8.4   12.1   12.4    8.2    6.1    7.5
By color®
  White                             4.1    5.1    4.9    3.3    2.3    3.2
  Nonwhite                          8.3   10.9   10.7    7.5    4.1    5.2

a. The figures for 1948-1956 in these categories have been adjusted to account for a moderate change in definitions beginning in 1957.
b. The figures for 1948-1956 in these categories have not been so adjusted.
c. Not available.

Sources: “Rates of Unemployment” 1964, pp. 34-35; “Unemployment
Rate” 1966; U.S. President 1964, p. 201; 1966, pp. 166-170.

Table 5 gives some notion of the heterogeneity of 
the American labor force and of the resulting wide 
spread in unemployment rates for different parts 
of the working population. Thus, the unemploy- 
ment rate for teen-agers has tended to run about 
2.5 times the national rate. The rate for white-collar 
workers has tended to be about half the national 
rate, while the incidence of unemployment among 
unskilled laborers is more than twice as heavy as 
for the labor force as a whole. Similarly, the un- 
employment rate for nonwhites is about twice the 
national average. It is hardly necessary to elaborate
upon the social and political, as well as economic, 
consequences of these high unemployment rates. 
This is particularly so with respect to what is re- 
ferred to as “the Negro problem” in the United States. 
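
The multiples quoted in this paragraph can be checked against Table 5. What follows is a minimal Python sketch, not part of the original article: the function names are illustrative, the rates are read from Table 5, and cells that are unavailable or illegible there are entered as None. Because some of the pre-1957 sectoral figures rest on slightly different definitions from the national rate (see the table notes), the ratios for the early years are approximate.

```python
# Unemployment rates (per cent) read from Table 5; None marks a cell
# that is not available or not legible in the table.
YEARS = [1965, 1963, 1959, 1956, 1953, 1948]
NATIONAL = [4.6, 5.7, 5.5, 4.2, 2.9, 3.8]  # all civilian workers
SECTORS = {
    "Both sexes, 14-19 years": [13.6, 15.6, 13.2, 10.4, 7.1, None],
    "White collar": [2.3, 2.8, 2.6, None, 1.4, 2.1],
    "Unskilled laborers": [8.4, 12.1, 12.4, 8.2, 6.1, 7.5],
    "Nonwhite": [8.3, 10.9, 10.7, 7.5, 4.1, 5.2],
}


def ratios_to_national(rates):
    """Return {year: sector rate / national rate}, skipping missing cells."""
    return {
        year: round(rate / national, 2)
        for year, rate, national in zip(YEARS, rates, NATIONAL)
        if rate is not None
    }


def mean_ratio(rates):
    """Average of the available yearly ratios for one sector."""
    values = list(ratios_to_national(rates).values())
    return sum(values) / len(values)


if __name__ == "__main__":
    for name, rates in SECTORS.items():
        print(f"{name}: mean ratio {mean_ratio(rates):.2f}")
        print(f"  by year: {ratios_to_national(rates)}")
```

Averaging over the available years is only one way to summarize "has tended to run about"; a median or a single benchmark year would serve equally well.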

In European countries the labor force is more 
homogeneous. Also, far fewer data are available 
on differential unemployment rates by age, occu-
pation, etc., than is the case for the United States.
As noted previously, however, every country has 


breakdown of the unemployed is available for a 
number of countries. Thus, in the United Kingdom
in 1962 regional unemployment rates varied from 
1.3 per cent in London and southeastern England 
to 3.7 per cent in Scotland and to no less than 7.5 
per cent in Northern Ireland. 


The goal of full employment 


A high level of employment—or “full employ- 
ment”—is now an accepted goal of national policy 
in virtually all of the economically advanced coun- 


proximate equality between vacancies and unem- 
ployment. 

In recent years, with growing sensitivity to the
potential conflict between the goals of full employ-
ment and price stability, there has been a tendency
to define the employment objective in an alternative
way: as the lowest level to which unemployment 
can be pushed by an expansion of aggregate de- 
mand without bringing about an unacceptable rise 
in the price level. Here, the definition makes it 
explicit that the employment target depends on the 
nature of the trade-off between a fall in unemploy- 
ment and the associated rise (if any) in the price 
level. 

Most definitions of full employment have sought 
to describe “a situation in which unemployment 
does not exceed the minimum allowances that must 
be made for the effects of frictional and seasonal 
factors” (United Nations . . . 1949, p. 13). The con-
tinued high level of unemployment in the United
States between 1957 and 1965, however, raised the 
question as to how full employment should be de- 
fined if a significant amount of structural unem-
ployment exists.

The problem here can be illustrated by Figures 
1 and 2. In Figure 1 the job-vacancy rate is meas- 
ured on the vertical axis and the unemployment 
rate on the horizontal. The 45° line portrays all
possible situations that correspond to an equality of
vacancies and unemployed. Curve AA’ represents
one “structural situation.” Vacancies rise as unem- 
ployment falls (assuming general expansion with 
structural rigidities) and vice versa; point C repre-
sents a situation of “full” employment, in which
vacancies and unemployment are equal—at an 
over-all unemployment rate of 3 per cent in the 
diagram.

Assume now that, because of structural changes,
the curve shifts to BB’. Vacancies now equal un-
employment at an unemployment rate of 6 per
cent. If the attempt were made to force unemploy- 
ment much below this by expansionary measures, 
labor shortages would mount while unemployment 
would decline only slowly and painfully, with sub- 
stantial upward pressure on wages and prices. Is 
6 per cent to be considered the full-employment 
target, including not only frictional but also struc- 
tural unemployment? 
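
The geometry of Figure 1 can be sketched numerically. The Python fragment below is an illustration only: the article does not give the curves a functional form, so the hyperbola v = k / u and the two constants are assumptions chosen solely so that the curves cross the 45° line at 3 and 6 per cent, as in the text.

```python
import math


def vacancy_rate(unemployment_rate, k):
    """Hypothetical structural curve: vacancies rise as unemployment falls."""
    return k / unemployment_rate


def full_employment_rate(k):
    """Unemployment rate at which the curve meets the 45-degree line (v == u)."""
    return math.sqrt(k)


K_AA = 9.0   # assumed so that AA' crosses the 45-degree line at 3 per cent
K_BB = 36.0  # assumed so that BB' crosses the 45-degree line at 6 per cent

if __name__ == "__main__":
    print("AA' full-employment rate:", full_employment_rate(K_AA))
    print("BB' full-employment rate:", full_employment_rate(K_BB))
    # Forcing unemployment on BB' down to 4 per cent leaves vacancies
    # well above unemployment, i.e. widespread labor shortages.
    print("BB' vacancies at 4 per cent unemployment:", vacancy_rate(4.0, K_BB))
```

Under these assumed curves, pushing unemployment on BB’ from 6 to 4 per cent raises the vacancy rate from 6 to 9 per cent, which is the mounting labor shortage the paragraph describes.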

The second definition cited suggests the relation- 
ships illustrated in Figure 2. Here, again, unem- 
ployment is measured along the horizontal axis, but 
now the rate of change in the price level is repre- 
sented on the vertical axis. This is a variant of 
what has come to be called the “Phillips curve,” 
which relates wage changes rather than price
changes to the level of unemployment (see Phillips 
1958). The curves XX’ and YY’ describe two pos- 
sible relationships between price-level changes and 
unemployment. The curve XX’ implies that prices 
can be kept stable at an unemployment rate of 
3 per cent. Lower unemployment leads to price 
increases at an accelerating rate. 

Let the situation change to that portrayed by 
YY’. This upward shift is most commonly associ- 
ated with labor’s increased bargaining strength; but 
the curve might also shift because of the sort of 
structural change previously described, with the 
result that labor shortages in some sectors of the 
economy are associated with unemployment in ex- 
cess of vacancies in other sectors. (Not only im- 
mobility between sectors but also downward wage 
rigidity is assumed.) As a result, in the second 
situation, price stability can be maintained only at 
an unemployment rate of 6 per cent. Is this full 
employment? 

Since World War II it has been clear that all
countries have been willing to settle for something 
less than absolute price stability. Probably no gov-
ernment would be prepared to pay a significant
price in terms of higher unemployment in order to 
keep the rise in the price level below the rate of, 
say, 2 per cent per year. This upper limit on the rate 
of inflation that would be tolerated can be repre- 
sented by the horizontal line CD in Figure 2. Then 
OH and OK represent the minimum levels to which 
unemployment could be reduced in the two cases. 
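
The role of the tolerance line CD can be illustrated in the same spirit. The sketch below is hypothetical throughout: the functional form k * (1/u - 1/u_stable) and the constant k are assumptions, picked only because they give price stability at 3 per cent (XX’) and 6 per cent (YY’) and price increases that accelerate as unemployment falls, as the text requires. The variable names OH and OK follow the labels used for Figure 2.

```python
def price_change(unemployment_rate, stable_rate, k=12.0):
    """Hypothetical annual price change (per cent) at a given unemployment rate.

    Zero at stable_rate; rises at an accelerating pace as unemployment falls.
    """
    return k * (1.0 / unemployment_rate - 1.0 / stable_rate)


def minimum_unemployment(stable_rate, tolerated_inflation, k=12.0):
    """Lowest unemployment rate whose price change stays within the tolerated limit.

    Obtained by solving k * (1/u - 1/stable_rate) = tolerated_inflation for u.
    """
    return 1.0 / (tolerated_inflation / k + 1.0 / stable_rate)


# Tolerated price rise of 2 per cent per year (the line CD).
OH = minimum_unemployment(stable_rate=3.0, tolerated_inflation=2.0)  # curve XX'
OK = minimum_unemployment(stable_rate=6.0, tolerated_inflation=2.0)  # curve YY'

if __name__ == "__main__":
    print("OH (curve XX'):", round(OH, 2))
    print("OK (curve YY'):", round(OK, 2))
```

With these assumed parameters the unemployment floor is about 2 per cent on XX’ and about 3 per cent on YY’. The figures carry no empirical weight; the point is only that the same inflation tolerance implies a higher floor once the curve has shifted.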

This discussion suggests that the definition of 
full employment should be approached in two 
stages. First, we may define “aggregative full em- 
ployment” as corresponding to that unemployment 
rate that meets the over-all criterion: total vacan- 
cies equal to total unemployment or the smallest 
amount of unemployment consistent with the de- 
sired degree of price stability. 

Given the goal of aggregative full employment, 
which can best be implemented through the use of 
monetary-fiscal policy, we then need to ask the 
following question. By means of the various instru- 
ments of manpower policy (training and retrain- 
ing programs, relocation of workers and industry, 
reduction of discrimination in hiring, and so on), 
how far is a government prepared to go in reducing 
the amount of structural unemployment that exists 
at aggregative full employment? Or, in terms of 
the diagrams, how far is it prepared to go in seek- 
ing to shift the BB’ and YY’ curves to the left? The 
answer to this question, when costs and benefits 
are weighed on the scales of the policy makers’ 
value system, yields what we may call aggregative 
and structural full employment. In the United 
States, aggregative full employment in the early 
1960s was assumed to correspond to an unemploy- 
ment rate of about 4 per cent, and it was believed 
by many that unemployment might, by means of 
an intensive manpower policy, be brought down to 
3 per cent—at a cost that was worth incurring in 
terms of the benefits to be achieved (U.S. Senate 
... 1964).

It is one thing to conceptualize the goal of full 
employment; it is another thing to quantify it. Most 
governments have been reluctant to announce the 
precise unemployment rate they consider to be 
equivalent to full employment. One can, however, 
draw some inferences from recent policy actions 
and statements. Thus, in 1964-1965 it was reason- 
able to assume that the full-employment targets in 
the leading European countries ranged between 
about 1.5 and 3 per cent unemployment, using 
American definitions (Gordon 1965). For the United 
States the corresponding target figure would be 
about 4 per cent. 

The targets cited here correspond to what has 
been called “aggregative full employment.” In addi-
tion, governments have various kinds of manpower
policies aimed at reducing structural and frictional
unemployment. 

Other aggregative goals. The potential conflict 
between full employment and price stability has 
already been mentioned. There are also at least 
three other “aggregative” goals that need to be taken 
into account when a government seeks to pursue a 
full-employment policy. They are balance-of-pay- 
ments equilibrium, growth of total output, and 
what has come to be called “incomes policy” (chiefly 
keeping the rise in wages in line with the increase 
in productivity).

There is normally no conflict between the goals 
of full employment and rapid economic growth. But 
there is some conflict between full employment and 
the other goals. Involved here is a set of inter- 
relationships that can conveniently be discussed in 
two parts. 

First, relative price stability is a goal that is 
desired to some extent for its own sake. The more
rapidly prices rise or fall—apart from balance-of- 
payments considerations—the greater is the dam- 
age to economic welfare as seen by those who frame 
economic policy. In recent years Germany and the 
United States have been the two large countries 
that have shown the greatest sensitivity to infla- 
tionary pressures. In most European countries, 
however, the trade-off between unemployment and 
price stability seems to be such that the govern- 
ments concerned would not be willing to sacrifice 
much, if any, employment in order to reduce the 
rate of price increase—provided that (1) prices in 
other countries rose also, so that serious balance- 
of-payments problems did not arise, and (2) the 
rise in prices did not degenerate into runaway 
inflation. 

This brings us to the second aspect of the poten- 
tial conflict between full employment and the other 
aggregative goals. The strongest threat to pursuit 
of a vigorous full-employment policy comes from 
the possibility of balance-of-payments disequilib- 
rium. If expansionary policies cause spending and 
prices to rise faster at home than abroad, a country
may incur a deficit in its balance of payments 
which, if continued long enough, will force it to 
devalue its currency. This is an especially serious 
problem for countries for which international trade 
is particularly important. Such countries will, from 
time to time, be forced to pursue restrictive mone-
tary and fiscal policies, sacrificing some employ-
ment in order to protect international monetary
reserves. In the early 1960s the United States was
apparently prepared to put up with unemployment 
considerably in excess of what was widely consid-
ered to be the full-employment target, in part be-
cause of the need to reduce the persistent deficit in
its balance of payments. In 1966, the British gov- 
ernment felt compelled to impose strong deflation- 
ary measures to protect the balance of payments. 

It is in this connection that price stability plays 
its most important role. Price stability is a means 
of achieving balance-of-payments equilibrium. It is
primarily a means, rather than an end in itself.
It might be noted that what is important here for
the balance of payments is not the behavior of the 
domestic price level by itself but the behavior of 
domestic prices relative to that of foreign prices. 

As the 1960s began, there was increasing recog- 
nition on both sides of the Atlantic of the need for 
an incomes policy that would hold the rise in wages 
roughly in line with the rise in labor productivity 
(Organization for Economic Cooperation and De-
velopment . . . 1962; U.S. President 1962, pp. 185-
190). The Netherlands has had an official, but
only partially successful, incomes policy since the 
end of the war. Here again there is a potential 
conflict with a vigorous full-employment policy. An 
incomes policy is extremely difficult to apply unless 
a government is prepared either to exercise com- 
pulsion or to accept a higher level of unemploy- 
ment than would otherwise be necessary. It is not 
surprising, therefore, that efforts to implement an 
incomes policy have not so far been very suc- 
cessful. 

The events of the decade from the mid-1950s to 
the mid-1960s suggest a strong contrast between 
the ways in which western Europe and the United 
States reconciled the full-employment and other 
aggregative goals during this period. The United 
States was prepared to sacrifice a significant amount 
of employment in order to keep a relatively stable 
price level, partly for its own sake but chiefly for 
the sake of its balance-of-payments position. The 
relative emphasis put on a fairly strict interpreta- 
tion of the full-employment goal was considerably 
stronger in Europe. (For further discussion of the 
issues in this section see Gordon 1965; Ross 1964, 
pp. 155-171; Gordon & Gordon 1966.)

Development of manpower programs. The in- 
creasing recognition of the need to formulate the 
goal of full employment in structural as well as
aggregative terms calls for not only the appropriate
use of monetary-fiscal measures but also the devel-
opment of a comprehensive manpower program.

Manpower policy is concerned with the structural 
aspects of employment and unemployment—with 
facilitating the adjustment of the composition of 
labor supply to the changing composition of labor
demand. A well-developed manpower program
would include measures of the following types: 
(1) an active and comprehensive employment serv- 
ice to assist in placement, to provide vocational 
guidance, and possibly to collect data on vacancies; 
(2) an adequate system of unemployment insur- 
ance, integrated with a system of subsistence allow- 
ances for retraining and aid in the relocation of 
workers; (3) programs to facilitate the geographi- 
cal mobility of workers; (4) heavy emphasis on 
training and retraining to assist workers in acquir- 
ing new skills; (5) special programs for the physi- 
cally and mentally handicapped and for older 
workers; (6) a well-developed program of project- 
ing the future demand for labor and prospective 
shortages and surpluses of particular kinds of 
labor, and wide dissemination of such information; 
(7) cooperation with the schools in the improve- 
ment of vocational education; and (8) special em- 
ployment projects to assist particular regions or 
groups of workers in making difficult adjustments 
to new patterns of demand. 

Sweden has a particularly well-developed man- 
power program that aims to be “so varied, so indi- 
vidualised as—in time—to fit every single person 
on the employment market” (Olsson 1963, p. 411). 
Most of the other western European countries have 
significant manpower programs, which vary in the 
aspects that have been particularly emphasized. 
Thus, training and retraining have been stressed 
in France and Italy; Sweden has emphasized re- 
gional labor mobility as well as retraining; regional 
industrial development has played a key role in 
manpower policy in the United Kingdom. All of 
these countries—as well as the other countries of 
western Europe—have also developed, to different 
degrees, various of the other dimensions of man- 
power policy previously listed. It is interesting to 
note also that elements of an international man- 
power policy have been developed within the Euro- 
pean Common Market. 

Spurred by high over-all unemployment and con- 
cerned with the especially high unemployment 
rates in particular segments of the labor force, the 
United States began to move toward a comprehen- 
sive manpower policy at the beginning of the 
1960s. A decentralized employment service had 
existed since the 1930s, as had a system of unem- 
ployment insurance. In 1961 the Area Redevelop- 
ment Act was passed, followed in 1962 by the 
much more comprehensive Manpower Development 
and Training Act. The emphasis in the latter was 
almost entirely on retraining. Other measures have 
involved acceleration of public works, federal sup-
port of education, special provisions for the training
and rehabilitation of underprivileged youth, and 
special measures to counter discrimination against 
Negroes (U.S. President 1963; 1964). 

By 1965 no country had yet developed a man- 
power program “so varied, so individualised as to 
fit every single person on the employment market.” 
But movement toward this goal, more rapid and 
systematic in some countries than in others, was 
the trend. Avowed full-employment goals and sys- 
tematic manpower policies did not exist in the 
Western world when the great depression struck. 
Considerable progress toward the formulation of 
such goals and the implementation of such policies 
has been made since then. 

R. A. GORDON 


[See also MONETARY POLICY; FISCAL POLICY; INFLA- 
TION AND DEFLATION; and the biography of 
BEVERIDGE.]


ENCLAVES AND EXCLAVES


Enclaves and exclaves are discontiguous terri- 
tories of states which are located within the terri- 
tory of other states. Seen from the state within 
which the outlier is located, it is an enclave; seen 
from the state to which the outlier belongs, it is an 
exclave. A typical example is the Spanish town of 
Llivia in the eastern Pyrenees of France: this is an 
exclave of Spain, entirely surrounded by French 
territory and located about four miles from the 
main Spanish territory; it is also a Spanish enclave 
within French territory. 

Enclaves (exclaves) may be accessible to the 
main territory of the state to which they belong by 
land, through the territory of other states, as well 
as by sea. A typical example is the former German 
exclave of East Prussia, which from 1919 to 1939, 
as well as in previous centuries, was separated 
from the main German territory by Polish territory 
and the Baltic Sea. However, if discontiguous ter- 
ritories are accessible only by sea, they are usually 
described as islands; if they are located on other 
continents, as colonies or associated territories. Thus 
Hong Kong is regarded as a colony of Great Britain 
in eastern Asia, although, from the Chinese point 
of view, it is sometimes described as a British en- 
clave surrounded, on the landward side, by Chinese 
territory. Gibraltar can similarly be regarded as a 
British colony on the continent of Europe and as 
an enclave within Spain. In these cases the nomen- 
clature is necessarily vague; however, this does not 
detract from the political, economic, and historical 
significance of enclaves and exclaves.

Territorial waters have the same sovereign at- 
tributes as land, and enclaves may therefore exist 
within territorial waters. A typical example is the 
islands of Chisamula and Likoma in Lake Nyasa; 
they are located within the territorial waters of Por- 
tuguese Mozambique but are exclaves of Malawi. 

States entirely surrounded by territory of one 
other state, for example, the Vatican, share the 
characteristics of enclaves and are sometimes re-
ferred to as virtual exclaves. Contiguous territories
of states which for all regular commercial and ad- 
ministrative purposes can be reached only through 
the territory of other states are called pene-enclaves 
(pene-exclaves). These have virtually the same
characteristics as complete enclaves (exclaves). A 
typical example is the Drumully area of the Repub- 
lic of Ireland, which can be reached by road and 
rail only through territory of Great Britain (North- 
ern Ireland). 


Enclaves (exclaves) should be distinguished
from neutral territories that sometimes separate
states, for example, those on the northeastern bor- 
der of Saudi Arabia. Sovereignty in neutral terri- 
tories is shared between two or more states, and 
territorial discontiguity is not an essential charac- 
teristic. However, for convenience of administration 
some governmental functions, for example, cus- 
toms and excise, may be relinquished by the sov- 
ereign of the enclave (exclave) to the sovereign of 
the surrounding territory, and thus an impression 
of shared sovereignty can result. 

Except for the unique cases of East Pakistan 
and West Berlin, enclaves are today relatively un- 
important economically and cover only small areas, 
usually less than 20 square kilometers. Their politi- 
cal and military value is probably also very limited. 
Some enclaves (exclaves), for example, West Berlin 
(including Steinstücken, Papenberger Wiesen, etc.),
consist of several parcels of territory, which makes 
accurate statistical accounting virtually impossible. 

Origin of the terms. Use of the terms originated 
during the late Middle Ages. The first diplomatic 
document to contain the word “enclave” was the 
Treaty of Madrid, signed in 1526. Since then the 
terms have also been used outside political geogra- 
phy; parks, for example, have been described as 
rural enclaves within cities. Similarly, areas of 
low prices located within higher price regions, for 
example, areas along major highways within a 
high price region where low gasoline prices are 
posted, are described as low-price enclaves. 

Geographical distribution. Most enclaves to- 
day are relics of the feudal era, and western Eu- 
rope contains the largest number, about fifteen. 
Formerly there were many more; for example, 
prior to 1866 Prussia alone consisted of more than 
270 discontiguous pieces of territory. India also 
contained large numbers of enclaves prior to inde- 
pendence. During and after the crusades, enclaves 
were a fairly widespread phenomenon in the east- 
ern Mediterranean region, and presumably they 
also existed elsewhere during periods of feudal 
rule. Through accidents of discovery and difficul- 
ties of inland penetration, enclaves were also fre- 
quently established during the colonial era in or 
near the coasts of Africa and Asia. Today only a 
few survive, for example, the Portuguese Cabinda 
enclave north of the mouth of the Congo. When 
considered as enclaves, British Hong Kong and 
Gibraltar fall into this category. Similar Portu- 
guese (Goa, for instance, but also complete ex- 
claves entirely surrounded by non-Portuguese ter- 
ritory, such as Nagar Aveli) and French enclaves 
(such as Pondicherry, but also complete exclaves,
such as Chandernagore) in India have disappeared
from the map since India gained independence. 

Throughout history, enclaves were sometimes 
deliberately established to assure control of neigh- 
boring areas, for example, by the settlement of 
Roman veterans in classical times; a modern ex- 
ample is that of the British enclaves on Cyprus 
created in 1960 to control the eastern Mediterra- 
nean and the Middle East. Errors of definition with 
respect to straight-line boundaries have caused the 
establishment of two enclaves in North America 
near the 49th parallel (Point Roberts, Washington, 
and Lake of the Woods County, Minnesota, both of 
which are in Canada). Where rivers form bound- 
aries between states, changes of river courses have 
frequently created enclaves on opposite shores, as 
on the Mississippi River; where such changes re- 
sulted in international boundary problems, these 
enclaves were usually eliminated by exchange of 
territory and other compensation, as occurred be- 
tween the United States and Mexico on the Rio 
Grande. 

The enclaves of West Berlin and of the Hebrew 
University campus (part of Israel within Jordanian 
territory) were the result of recent armistice agree- 
ments. Similar temporary arrangements have been 
recorded frequently, particularly in feudal times. 
The enclaves of East Prussia and of East Pakistan 
were created in this century to accommodate na- 
tional or religious aspirations. For the same national 
reasons the internal spatial organization of the 
Soviet Union exhibits several enclaves. Parts of 
former German East Prussia now constitute an 
enclave of the Russian Soviet Federated Socialist 
Republic. To what extent the enclaves shown on 
maps of the Soviet Union are economically or po- 
litically significant is not known. Although several 
enclaves have survived since the early Middle 
Ages, most have disappeared with the growth of 
centralized states, and many appear to be only 
temporary features of modern political geography. 

Theories of origin. Most settlements were vir- 
tually self-sufficient before the advent of modern 
transportation, and generally only goods of high 
value and little bulk were exchanged over dis- 
tances of more than a few miles. Because of high 
costs of transportation, most settlements and their 
environs were surrounded by undeveloped lands 
separating them from neighboring settlements 
(Thünen 1826-1863). In the feudal era each of
these settlements became dominated by a local 
ruler who, by war, marriage, or purchase, tried to 
enlarge his territories. Messengers and the small 
armies of the era could move through the unde-
veloped lands between the settlements, and since
there was also little trade, there was no need for 
feudal territories to be contiguous. Thus the pat- 
tern of enclaves which is so characteristic of this 
period gradually evolved. Territorial discontiguity 
was found not only at the lowest but also at the 
highest level of the feudal hierarchy. Even self- 
governing cities contained both enclaves and ex- 
claves. Territorial discontiguity was most marked 
in Germany but also existed in much more cen- 
tralized Great Britain, where a few enclaves sur- 
vive to this day among county areas, for example, 
in Flintshire. 

The modern era. As a result of the growth of 
the modern state, enclaves were gradually ab- 
sorbed into the new national territories. Ratzel 
(1896) and other German geographers, particu- 
larly the later German school of “geopolitics,” re- 
garded this absorption as an “organic” phenomenon 
in the evolution of so-called natural boundaries of 
states. These are frequently related to physical fea- 
tures of geography, such as seacoasts, watersheds, 
or mountain crests, but usually are poorly definable 
except in the context of nationalistic ambition. 
Arguments of this nature were, for example, used 
by India to defend its action in occupying the 
Portuguese enclaves and exclaves on the Indian 
subcontinent. At the time of the French Revolution 
the process of absorption was already virtually 
complete in the Iberian Peninsula, Great Britain, 
and Scandinavia. As a result of the French Revolu- 
tion the many vestiges of enclaves (particularly 
with respect to internal tariffs) disappeared from 
France, and the number of enclaves was consider- 
ably reduced in central Europe and Italy. Because 
of the lesser significance of feudalism in eastern 
Europe, enclaves were far less important, and they 
completely disappeared during the nineteenth 
century. 

The final reduction in the number of enclaves to 
the present low and insignificant figure took place 
during the period of railway construction. Enclaves 
came to be regarded as economically and politically 
absurd, as revealed in the discussion of the prob- 
lem of German East Prussia during the period be- 
tween the two world wars. The introduction of 
motorcars also increased the possibility of smug-
gling from enclaves (always a serious matter) and
assisted public demand for the abolition of en- 
claves, as in India and Switzerland. Within sov- 
ereign states, administrative enclaves, usually relics 
of the process of state amalgamation following the 
feudal era, were also greatly reduced in number by 
the redefinition of local boundaries, as in India and
Germany. However, many enclaves have survived
in the cantonal system of the Swiss Confederation. 

Since World War II the development of air
transportation and the building of limited-access 
highways and railways with systems of overpasses 
and underpasses have presented new opportunities 
for connecting enclaves to other territory of the 
same state. The air lanes, highways, and railways 
connecting West Berlin to West Germany are an 
example, although these connections have not 
worked very satisfactorily. The United Nations par- 
tition scheme in 1947 for the former British man- 
date of Palestine called for a series of discontig- 
uous Arab and Jewish territories, each consisting 
of a main territory and two exclaves. These ex- 
claves were to touch the main territories at two 
junction points, where short overpasses and under- 
passes of limited-access transportation avenues 
were to connect them with the main territories. 
While the violent rejection of this partition scheme 
was the result of other more serious national con- 
siderations, there appears to be little doubt that 
these connecting links between exclaves and main 
territories were also regarded as highly unsatisfac- 
tory. For all these reasons it is unlikely that the 
new means of transportation will bring about an 
increase in the number of enclaves.


ENGEL, ERNST


Ernst Engel (1821-1896) was a German statistician.
After his early training at a mining academy
in Germany, Engel went to the École des
Mines in Paris, where he came under the influence 
of Frédéric Le Play, a pioneer in the study of fam- 
ily budgets, who was a professor there. During a 
subsequent stay in Belgium he became acquainted 
with Adolphe Quetelet, well-known for his faith 
in the possibility of discovering quantitative social 
laws. Upon his return to Germany Engel took up 
statistics as a profession and became director of 
the statistical bureaus of Saxony, 1850-1858, and 
Prussia, 1861-1882. As a vigorous administrator 
and inspiring teacher he did much to establish 
the modern tradition of official statistics; in partic- 
ular, he insisted on making statistical data acces- 
sible and intelligible to the general public. He was 
also active in the social reform movement, which 
he supplied not only with figures but also with 
various practical proposals. In 1881 he published 
a pseudonymous attack on Bismarck’s agricultural 
protectionism and was promptly relieved of his 
position on grounds of ill-health. Of the great work 
on “demology” that he then set out to write only 
a fragment was completed. 

His greatest contribution to the social sciences 
soon became known as “Engel’s law.” It states that 
the proportion of a consumer's budget spent on 
food tends to decline as the consumer's income 
goes up, or, in more technical terms, that the in- 
come elasticity of demand for food is less than 
one. Engel derived this generalization from the
budgets collected in different countries by his
teacher Le Play and from Belgian data compiled
by Édouard Ducpétiaux; it has since been invariably
confirmed in hundreds of more elaborate
surveys in all parts of the world. 
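
The equivalence of the two statements of the law can be checked in a line of calculus. As a sketch only, write Y for income, F(Y) for food expenditure, w for the food share of the budget, and \eta for the income elasticity of demand for food; these symbols are introduced here for illustration and are not Engel's own notation.

```latex
w(Y) = \frac{F(Y)}{Y}, \qquad
\eta = \frac{dF}{dY}\,\frac{Y}{F}, \qquad
\frac{dw}{dY} = \frac{1}{Y}\,\frac{dF}{dY} - \frac{F}{Y^{2}}
             = \frac{w}{Y}\,(\eta - 1).
```

Since w and Y are both positive, the food share falls as income rises exactly when \eta is less than one.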

Engel published his discovery in the context of 
a study of the economic development of Saxony 
(1857), in which he demonstrated that a general 
rise in productivity requires a shift of population 
from agriculture to manufacturing because of the 
effect of an increase in income on the pattern of 
demand. He also advanced his observations as an 
argument against Malthus’ fears of overpopulation. 

From a methodological point of view Engel’s law 
is important as the first significant quantitative 
law ever established by empirical economic data.
Formulated when economic theory proceeded largely
by deduction from a priori assumptions, the law 
pointed the way to a more reliable approach. It 
also drew attention to the study of demand, which 
had been neglected by the classical economists, 
and suggested that economic development is not 
merely a matter of capital accumulation. Engel, 
however, was not much interested in economic 
theory and considered himself primarily a statis- 
tician, as did his numerous followers in the em- 
pirical study of family budgets, at least until 
recently. 

Attempts were soon made to find similar gen- 
eralizations for other categories of expenditure, 
such as housing and clothing. But here the pattern 
of demand was not so simple, at least when ana- 
lyzed in terms of increasing or decreasing budget 
shares. Thus the “laws” of Schwabe for housing 
and Schiff for clothing, which state that the share 
of these categories respectively falls and rises as 
income increases, are not as universally valid as 
Engel’s law, although they probably hold in a
majority of budget surveys. For these two cate- 
gories the income elasticity is normally so close 
to one that in a particular survey it may be on 
either side of unity. From a review of the evidence 
accumulated during the century following the 
enunciation of Engel’s law, Houthakker (1957) 
concluded that the typical elasticity with respect 
to total expenditure is 0.6 for food, 0.8 for hous- 
ing, 1.2 for clothing, and 1.6 for all other expendi- 
tures combined. 
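
The link between these elasticities and the movement of budget shares can be illustrated numerically. The sketch below assumes constant-elasticity Engel curves and uses invented starting shares; only the four elasticities come from the figures quoted above, and the function names are hypothetical.

```python
# Typical elasticities with respect to total expenditure, as quoted
# in the text from Houthakker (1957).
ELASTICITY = {"food": 0.6, "housing": 0.8, "clothing": 1.2, "other": 1.6}

# Hypothetical starting budget shares (an assumption, not survey data).
START_SHARE = {"food": 0.45, "housing": 0.15, "clothing": 0.075, "other": 0.325}


def share_after(share, elasticity, growth):
    """Budget share after total expenditure grows by `growth` (0.10 = 10%).

    Assumes a constant-elasticity Engel curve: spending on the item scales
    by (1 + growth) ** elasticity, so its share of the total scales by
    (1 + growth) ** (elasticity - 1).
    """
    return share * (1.0 + growth) ** (elasticity - 1.0)


def shares_after(growth):
    """Apply share_after to every category in the illustrative budget."""
    return {
        item: share_after(START_SHARE[item], ELASTICITY[item], growth)
        for item in ELASTICITY
    }


if __name__ == "__main__":
    for item, new_share in shares_after(0.10).items():
        print(f"{item:9s} {START_SHARE[item]:.3f} -> {new_share:.3f}")
```

Categories with an elasticity below one (food, housing) lose budget share as total expenditure rises, and those above one gain. Because constant elasticities cannot satisfy the budget constraint exactly at every level of expenditure, the new shares sum to one only approximately.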

The relation between the expenditure on a par- 
ticular item or group of items on the one hand 
and income or total expenditure on the other is 
now generally known as an “Engel curve.” Such 
curves have been extensively studied, especially 
after the development of regression analysis around 
the turn of the century. Engel could not avail himself
of this technique and had to fit his food-total
expenditure relationship by graphical methods. 
Perhaps the most important instance of an Engel 
curve is Keynes's consumption function (1936), 
which relates total expenditure to income and has 
become the cornerstone of macroeconomic anal- 
ysis. There is a clear line of descent from Engel’s 
law to Keynes's “fundamental psychological law,” 
according to which the proportion of total con- 
sumption to income falls as income goes up. 

Engel refined his analysis of family budgets by 
taking household composition into account. In his 
little book Der Kostenwerth des Menschen (“The 
Cost of Man”), published in 1883, he proposed 
what has since become known as an equivalent
adult scale to give appropriate weights to persons
of different ages and sexes; Engel’s unit, however, 
was not the consumption of an adult male, as in 
most later scales, but the consumption of an in- 
fant under one year. He called this unit “quet” in 
honor of Quetelet. In this book there are also in- 
teresting calculations on some economic aspects 
of education. 

With a little more perseverance Engel could 
have established another claim to fame, namely, 
to have initiated not only the cross-section analysis 
but also the time-series analysis of demand. In a 
paper of 1861 on the Prussian grain market he 
derived an empirical demand curve for rye with 
a price elasticity of about one-half, but he did not 
consider the results sufficiently trustworthy to pur- 
sue the subject, although he did claim to have 
refuted Gregory King’s “law” on the demand for 
grain. The lack of suitable curve-fitting techniques 
may have held back his work in this area, and the 
estimation of empirical demand curves had to wait 
until the twentieth century. 

Engel wrote on many other subjects, including 
labor and industry, taxation, insurance, banking, 
and war. His influence extended into many other 
countries; in the United States, Carroll Wright, 
founder of the Bureau of Labor Statistics, was his 
principal follower. Engel was also active in the 
International Statistical Congresses, the forerun- 
ners of the International Statistical Institute of 
which he was a founder in 1886. 


H. S. HOUTHAKKER


[For the historical context of Engel’s work, see the
biographies of KING; LE PLAY; MALTHUS; QUETELET.
For discussion of the subsequent development of his
ideas, see CONSUMERS, article on CONSUMPTION
LEVELS AND STANDARDS; CONSUMPTION FUNCTION;
DEMAND AND SUPPLY, article on ECONOMETRIC
STUDIES.]


Friedrich Engels was born in Barmen, Germany,
in 1820 and died in London in 1895. He was the 
closest friend of Karl Marx and cofounder of
so-called scientific socialism.

Engels was the oldest son in a family of eight 
children. His father owned a cotton-spinning mill 
in Barmen and was a partner in another spinning 
mill, Ermen and Engels, of Manchester and Engels- 
kirchen. Engels attended the Realschule in Barmen 
and the Gymnasium in Elberfeld. To prepare him- 
self for the study of jurisprudence, he studied 
mercantile problems, first in his father’s firm, and
then, from 1838 to 1841, in Bremen. Engels then
did a year’s military service. 

He was greatly impressed in his youth with the 
contrast between the pietistic Calvinism of the 
middle class in his native city and the misery of 
the working class, demoralized by drink. Using the 
pseudonym of Oswald he wrote about this situa- 
tion in his “Briefe aus dem Wuppertal” (1839), a 
vivid description of these conditions that shows 
how early his literary and journalistic ability de- 
veloped. Little by little he turned away from pie- 
tism and religion, strongly influenced by David 
Friedrich Strauss’s critique of the Gospels in Das 
Leben Jesu and, above all, by Ludwig Feuerbach’s 
Wesen des Christentums. He affirmed his radical 
break with religion in his Schelling und die Offen- 
barung (1842a) and in his spirited, witty satire, 
Schelling, der Philosoph in Christo (1842b). These 
were writings in part aimed at the king of Prussia, 
who had appointed Schelling to the University of 
Berlin as Hegel’s successor, hoping that he would
become the intellectual exponent of his “Christian 
state.” Engels read political criticisms by the 
members of the “Junge Deutschland,” particularly 
those of Ludwig Börne, and he began to write as
a Young Hegelian, one of that group of left-wing 
intellectuals in Berlin that included the brothers 
Edgar and Bruno Bauer and the anarchist Max 
Stirner. Karl Marx also belonged to this group 
while he was a student in Berlin. Engels’ satirical 
Christian epic, “Die frech bedräute, jedoch wunderbar
befreite Bibel: Oder Triumph des Glaubens”
(1842c), which was published anonymously, re- 
flected the views of this group. Engels wrote this 
work when the king of Prussia withdrew Bruno 
Bauer’s venia legendi at the University of Bonn 
because of his criticism of the Gospels. It was 
Moses Hess who guided Engels to the work of the 
early French socialists such as Saint-Simon and 
Charles Fourier and boasted that he had trans- 
formed this “revolutionary of the year 1” (i.e., 
1793) into a communist. 

Engels moved to Manchester in October 1842 
to complete his commercial training. He criticized 
English political and social conditions in the Rhei- 
nische Zeitung, to which he had contributed since 
March 1842. At the same time he also wrote for 
the periodicals of the Chartists and Owenites, de- 
scribing the development of socialism on the Conti- 
nent. He supplemented his empirical observations 
by studying the English economists. In his brilliant 
“Umrisse zu einer Kritik der Nationalökonomie” 
(1844), which appeared in the Deutsch-Französische
Jahrbücher, edited by Marx and the Young
Hegelian Arnold Ruge, there is, according to Marx,
an early formulation of the general principles of
scientific socialism. Engels here demonstrated the 
contradictions in liberal economic doctrine and 
showed that all economic phenomena are, in the 
end, based on private property. He demanded that 
private property be abolished. After the publication 
of this essay, Marx and Engels began to corre- 
spond. When they met in Paris in the autumn of 
1844, they found, according to Engels, that they 
were in “complete agreement in all theoretical 
areas.” 

There are similarities in the early development 
of Marx and Engels that seem, on the surface, 
astonishing: a common literary and poetic bent, 
a common concern with religious problems, and a 
similar involvement in juridical-political discussion
that led both of them from left-wing Hegelianism
to communism. But it was, of course, the
situation in Germany during that time that largely 
determined their common concerns, especially 
the authoritarianism of Friedrich Wilhelm IV and
the political radicalization of the German intellectuals
that was a reaction to it. Marx and Engels
went beyond bourgeois-liberal criticism, however,
thus parting from their former friends, whom they 
subjected to merciless criticism in The Holy Fam- 
ily (1845a), mainly written by Marx, and The Ger- 
man Ideology (1845-1846), published posthu- 
mously. Their reasons for becoming communists 
were not identical, to be sure. Engels was brought 
to the point of revolutionary outrage by his ob- 
servation of the workers’ misery, while Marx, the 
rationalist, identified the proletariat as the only 
class that might be fitted by its extreme misery to 
effect a revolutionary transformation. More subject 
as Engels was to intense personal and emotional 
impressions, he always remained second in achieve- 
ment to Marx. He phrased this aptly himself: “I 
could never have achieved what Marx did. Marx 
stood higher, saw farther, and had a broader and 
quicker grasp of a situation than all the rest of 
us. Marx was a genius; we others were at best 
talented” ([1886] 1941, p. 292). 

This estimate, however, should not let us over- 
look the fact that at the beginning of their collab- 
oration it was Engels who was the leading partner, 
precisely because he assimilated knowledge readily 
and because his occupation brought him much 
closer to economics. This is illustrated by The Con- 
dition of the Working Class in England (1845b), 
which was planned as a chapter of a comprehen- 
sive social history of England. This work by the 
“doubtless most talented and well-informed of all 
German writers on sociology,” as Hildebrand called 
him ([1848] 1922, p. 170), is, according to Lorenz
von Stein, “a picture of the deepest poverty, taken
from the dirtiest district of the dirtiest factory 
town in England, full of incontrovertible facts 
from this most miserable sphere of the industrial 
world, beyond any question the best invective ever 
written in Germany against industrial society and 
its conditions, a partisan book like no other. That 
is why the book had so powerful an effect, much 
more convincing by its exaggerations and errors 
than by its truths, as is the fate of all such works” 
(1852, p. 538). 

After he and Marx began to work together, 
Engels seems intentionally to have left to Marx 
theoretical work, thus acknowledging (and later 
expressly emphasizing) that the elaboration of the 
materialist conception of history was essentially 
Marx’s work. He himself took credit for only a “very 
small part” thereof, asserting that most of the basic 
ideas, especially in the economic and historical 
fields, and more particularly their final acute for- 
mulations, were due to Marx. While Marx was 
writing the important theory sections of The Ger- 
man Ideology and was criticizing Proudhon in The 
Poverty of Philosophy (1847a), Engels was en- 
gaged as an agitator. As early as 1843 he had 
established contact in London with the leaders of 
a German secret society, the League of the Just. 
At the end of 1844 he and Moses Hess held com- 
munist meetings in the Rhineland, until such 
meetings were suppressed. In Brussels, where he
moved in April 1845 in order to be close to Marx, 
he was mainly active in the International Com- 
munist Committee of Correspondence, founded by 
Marx and himself. In August 1846 he went to
Paris, where after prolonged discussions he suc- 
ceeded in weaning the local group of the League 
of the Just from the influence of Wilhelm Weitling, 


stylistic stamp of Marx. 

After the outbreak of the Revolution of 1848, 
Engels and Marx went to Cologne. There Engels 
worked as editor of the Neue Rheinische Zeitung 


istic activity. He also participated in the Elberfeld 
civil insurrection, but its leaders repudiated him 
as a “Red.” After the suspension of the Neue Rhei- 
nische Zeitung in May 1849, he vainly endeavored, 
together with Marx, to expand the armed uprising 
in Baden and the Palatinate into a general German 
revolution. After the uprising collapsed he left for 
Switzerland and, in October 1849, for London. He 
portrayed these events in a series of articles published
in 1849 and 1850.

In London Engels wrote The Peasant War in 
Germany (1850). He hoped that the revolutionary 
traditions of the peasants might pave the way for 
a new German revolution, and he was convinced 
that the success of such a revolution depended on 
supporting the proletarian revolution by some second
edition of the peasants’ war of the sixteenth
century. Engels’ characterization of Thomas Münzer,
one of the leaders of that revolt, reveals his
insight into his own position. He believed that
Münzer had been in a hopeless position: he was
the leader of an extremist party, forced to take 
over a government prematurely, before the domi- 
nance of the class he represented could be accepted, 
and before the measures that the rule of his party 
implied could successfully be carried out ([1850] 
1956, pp. 138-139).

Engels, nevertheless, made a very intensive study 
of the wars of the French Revolution after 1792 
to prepare himself for the military leadership of 
the future revolution. He expounded principles of
revolutionary action in a series of articles, Ger- 
many: Revolution and Counter-revolution (1851- 
1852), the authorship of which was wrongly at- 
tributed to Marx. Engels’ maxims were never to 
play with insurrection and always to act with the 
greatest resoluteness and to seize the offensive. 


The defensive is the death of every armed rising; sur- 
prise your antagonists while their forces are scattering, 
prepare new successes, however small, but daily; keep 
up the moral ascendancy which the first successful 
rising has given you; rally those vacillating elements 
to your side which always follow the strongest impulse,


From 1854 until 1870, when Engels was an 
employee and then a partner in his father’s firm 


“General” among his friends. In the future prole- 
tarian revolution he was presumably cast for a role 


comparable to that of Carnot, the organizer of the 
levée en masse and the army of the French Revo- 
lution. He contributed articles to the New Ameri- 
can Encyclopedia on the army, cavalry, fortifica- 
tions, infantry, the navy, and so on. As a journalist 
he repeatedly described the armies of the various 
countries and evaluated their combat effectiveness. 
However, his writings went beyond topical journal- 
ism. He also concerned himself, in detail, with 
armaments, with the radical changes in warfare 
produced by the new industrial technology, and 
with the political implications of these developments
(1859-1861). Thus he advocated expansion
of the war of France and Italy against Austria into
a German war against Napoleon III, whom he saw
as the main adversary of the revolution. His Die 
preussische Militärfrage und die deutsche Arbeiter- 
partei (1865) was also primarily political: in the 
Prussian constitutional conflict over expansion of 
the army, he opposed Ferdinand Lassalle’s com- 
plete dissociation of the interests of the workers 
from those of the middle class. 

The most important evidence for Engels’ activi- 
ties in this period is his correspondence with Marx, 
which shows Engels’ continuing financial assist- 
ance to Marx as well as his encouragement of 
Marx’s writings. Engels’ loyal friendship even led 
him to assume the paternity of Marx’s illegitimate 
son, although this was also an expression of his 
unconventional conception of relations between 
the sexes. He himself lived for nearly twenty years 
with Mary Burns, an Irish working girl, and after 
her death in 1863 he lived with her sister Lizzy, 
marrying her only on her deathbed in 1878. 

After he moved from Manchester to London in 
the autumn of 1870, Engels became a member of 
the General Council of the First International 
(founded in 1864) and its corresponding secretary 
for Spain, Portugal, Italy, Ireland, and for a time, 
Belgium. He played a prominent part in the dispute 
with Bakunin. He wrote L’alliance de la démocratie 
socialiste et l’association internationale des travailleurs
(1873b) together with Marx and “Die Bakunisten
an der Arbeit” (see 1873a) by himself.
Anti-Dühring: Herr Eugen Dühring’s Revolution in
Science (1878) was aimed at Dühring’s strong
influence on German social democracy. Notwith- 
standing its polemics, it was the first popular ex- 
position widely to promote the teachings of Marx. 
His Socialism: Utopian and Scientific (1880), a 
summary of the first three chapters of the
Anti-Dühring, was even more popular.

Up to the death of Marx on March 14, 1883, 
Engels’ major effort was devoted to the incomplete 
Dialectics of Nature (1873-1883). Conceived initially
as an argument with Ludwig Büchner, it
became an attack on the kind of metaphysics in 
the natural sciences that regarded natural phenom- 
ena as immutable, instead of considering them as 
ever-flowing, changing facts. By including the nat- 
ural sciences, Engels made dialectics a “science of 
all interrelationships” and went beyond Marx's 
materialist conception of history to the Weltan- 
schauung of “dialectical materialism.” Engels ex- 
tended the original concept yet further to include 
prehistory in The Origin of the Family, Private 
Property, and the State (1884a). This work, writ- 
ten after Marx’s death, was based on L. H. Mor- 
gan’s Ancient Society of 1877 and on Marx's synop- 
sis of Morgan’s book—in fulfillment of a legacy, 
so to speak. 

After Marx's death Engels edited Volume 2 of 
Capital and prepared Volume 3 for the press, using 
the outlines left by Marx and making some studies 
of his own. No critical analysis of the extent of 
Engels’ contributions to Capital has yet been made. 

Engels took care of Marx's works in general, 
arranging new editions, providing them with his 
own prefaces, and defending Marx against learned 
attacks. This greatly overshadowed his own literary 
activity. Yet he did make important contributions 
to the history of Marxism: “Marx und die Neue 
Rheinische Zeitung” (1884b); “History of the Com- 
munist League” (1885); and Ludwig Feuerbach 
and the Outcome of Classical German Philosophy 
(1886). In the last years of his life, Engels wrote 
an important series of letters to Conrad Schmidt,
Bloch, Mehring, and Borgius in which he admitted 
the possibility that the superstructure could affect 
the economic infrastructure and thus opposed an 
overemphasis upon the regularity of historical 
events. This significant theoretical clarification had 
a considerable influence upon developments in the 
Soviet Union. 

Engels was Marx’s political as well as his literary 
heir. He played a very important part in the gene- 
sis of the Second International in 1889. He con- 
tinued to act as an adviser to the various socialist 
parties, through Marx’s son-in-law Lafarge in 
France and through Liebknecht, Bebel, and Bern- 
stein in Germany. His forcing the party to publish 
Marx’s Critique of the Gotha Programme (1875- 
1891), which condemned the theoretical conces- 
sions of the Lassalleans, led to violent intraparty 
disputes. In his “Introduction” to Marx’s The Class 
Struggles in France (1895), Engels still defended 
revolutionary tactics despite the great successes 
achieved by social democracy after the establish- 
ment of universal suffrage. To be sure, he did 
stress that they must be different from earlier
tactics: “The time is past for a revolutionary surprise
attack carried out by small conscious minorities
at the head of the uncomprehending masses.”
He attributed special importance to the problem of 
disarmament (1893) and foresaw the possibility
of a future world war. He was stricken with cancer 
of the esophagus in the midst of his work. 

In the influence exerted by Marxism, Engels’ 
work continues to play its part. The extent of the 
effect of his military-scientific writings on, say,
the Russian or the Chinese revolutions cannot yet 
be assessed, nor has scholarly work distinguishing 
between his specific views and those of Marx pro- 
ceeded very far; in view of the decisive influence 
of Marx and the way in which these thinkers stim- 
ulated each other, this distinction is extremely 
difficult to draw. However, in the intellectual his- 
tory of Marxism one might say that Engels repre- 
sented its descent from early socialism, especially 
early French socialism, while Marx gave it the im- 
print of classical German philosophy. 

THILO RAMM 


[For the historical context of Engels’ work, see MARXISM;
REVOLUTION; SOCIALISM; and the biographies
of BAKUNIN; FOURIER; MARX; MORGAN, LEWIS
HENRY; PROUDHON; SAINT-SIMON. For discussion of
the subsequent development of his ideas, see the
biographies of BERNSTEIN; KAUTSKY; LENIN; TROTSKY.]


Engineering is a relatively new profession compared
with the professions of law, medicine, and
the ministry. Like other professions, engineering 
struggles with such problems as redesigning the 
curricula of its professional schools and raising 
standards for entry to the field. In addition, engi- 
neering has some distinctive occupational problems. 
Can it increase the commitment of its members to 
the profession in the face of mounting pressures 
not to pursue it as a lifelong career? Can it assume 
the responsibility for the social effects of techno- 
logical change? As the progenitors of new tech- 
nologies that are transforming modern society, 
engineers are among the most important agents of 
social change. This article will consider some of
the problems and potentialities of the profession,
that is, some of the factors favoring or inhibiting 
the further professionalization of engineering as an 
occupation. 

Professionalism and professionalization. In
identifying various attributes of a profession, soci- 
ologists have often taken as their model the older 
professions of law, medicine, and the ministry. 
Although there is no consensus as to the definition 
of a profession, there is a growing awareness that 
professionalism is a multidimensional phenomenon 
and that occupations differ in their degree of pro- 
fessionalism. 

An index of professionalism may be based on 
ratings of such attributes as (a) the possession of 
a body of technical and systematic knowledge that
guides professional practice; (b) an orientation of
service to society rather than self-interest; (c) autonomy
in rendering professional service; and (d)
societal sanction of professional authority. To de- 
velop and transmit the body of technical and sys- 
tematic knowledge, professional schools and train- 
ing programs are established. To contribute to the 
fund of professional knowledge, to promote a serv- 
ice orientation, and to increase autonomy in pro- 
fessional practice, professional associations are 
formed and codes of ethics are developed. To pro- 
tect professional authority and enhance occupa- 
tional prestige, societal sanction is sought in various 
forms, such as the licensing of graduates and the 
exercising of control over the curricula of profes- 
sional schools. 

The process of professionalization involves the 
transformation of an occupation in accordance 
with the ideal-typical components of professional- 
ism. Although the dynamics of professionalization 
may differ for different occupations, and possibly 
for the same occupation in different countries, one 
study suggests that it entails the following sequence 
of stages: (1) full-time performance of the occu- 
pational function; (2) establishment of a school 
that is not connected with a university; (3) estab- 



it may provide a useful description and prediction 
of the process of professionalization.


Whatever stages of professionalization are postu-


tion is irreversible. It is frequently assumed, how- 
ever, that newer professions will follow the pattern 
of development established by the older professions 
and that this pattern is irreversible. The question 
of the conditions under which an occupation may 
be deprofessionalized has hardly been raised.

Before examining some current problems of the 
professionalization of engineering, we shall briefly 
review the factors that led to the emergence of this 
profession. 

Emergence of the engineering profession. Engi- 
neering as art long antedates engineering as a pro- 
fession. The invention of the stone ax in the 
Paleolithic age was among man’s first engineering 
achievements. In the civilizations of antiquity considerable
technological progress was made, as evidenced
by such accomplishments as pyramids,
aqueducts, canals, bridges, and lighthouses. Direct- 
ing these engineering feats were highly gifted in- 
dividuals, some of whom we would today consider 
engineers. But despite the outstanding work of in- 
dividual engineers, no professional group came 
into being for many centuries. Several factors de- 
layed the formation of a profession of engineering. 
The economies of ancient civilizations did not re- 
quire the organized development and application of 
technology, for which an engineering profession 
was necessary. The prevailing technology was a 
product of trial and error, intuition, artistry, and 
the gross synthesis of experience, unsupported by 
science. In fact, there was pronounced contempt 
for technology in ancient times. Finally, the tradition
of “craft mystery” interfered with the codification
and public transmission of technical knowledge.
This pattern persisted through the Middle
Ages. 

During the Renaissance the demand for engineering
skills increased. The urgent and recurrent
demands of war stimulated the development of 
many engines of battle, whence came the term 




1662, the Académie des Sciences, in 1666, and 
several decades later, the Berlin Academy of Sci- 
ences and the Academy of St. Petersburg, reflected 
and promoted the growing influence of science on 


Chaussées, established by the government in 1716, 
was considered as necessary as the Corps des Ingé- 


nieurs de Génie Militaire. In Great Britain engineers 
were commissioned to drain mines, build roads and 
canals, and perfect navigational techniques. Thus, 
the initial stage of the development of engineer- 
ing as a profession—the need for trained, full-time 
engineers—comes into view. 

Engineering schools were gradually established 
in place of traditional methods of apprenticeship 
and pupilage. Among the earliest engineering 
schools to be founded were the École des Ponts et
Chaussées in 1747 and the École Polytechnique in
1795. During the American Revolution, George 
Washington, deploring the shortage of engineers, 
asked the Continental Congress to provide facilities 
for the training of a corps of engineers; in 1802 his 
proposal was implemented with the establishment 
of the military academy at West Point, modeled 
after the École Polytechnique. Efforts were made
to provide professional training, not only through 
special technical schools but also at institutions of 
higher learning. In England a first chair in civil 
engineering was established in 1841. 

Local professional associations had been estab- 
lished in various parts of England and Scotland 
during the latter part of the eighteenth century. 
One was the Society of Civil Engineers, founded in 
1771 by John Smeaton, a member of the Royal 
Society, who is alleged to have been the first Eng- 
lishman to describe himself as a “civil engineer.” 
In 1818 the first national professional association 
for engineers was established in England. Similar 
organizations came into existence in the United 
States in 1852 and in Canada in 1887. 

Thus, the sequence of early developments lead- 
ing to the emergence of the engineering profession 
conforms approximately to the first five stages of 
professionalization mentioned earlier. As engineer- 
ing became a full-time occupation, in the seven- 
teenth and eighteenth centuries, schools not con- 
nected with a university were established, local 
professional societies were formed, followed by 
national associations, and engineering was gradu- 
ally introduced into the curriculum of universities. 

Social structure and engineering. The emer- 
gence of the engineering profession in western 
Europe and in North America points up the impact 
of industrialization on this occupation. Economic 
development requires and generates technological 
development, for which engineers and other profes- 
sionals are essential. Since societies differ markedly 
in their level of economic development, we would 
also expect them to differ in their technological 
capabilities and in the nature and role of their 
engineering professions. In effect, we are hypothe- 
sizing that the relationship between the economic 
development and technological development of a 
society is partly mediated by its engineering pro- 
fession. Some data bearing on the economic and 
technological development of various countries and 
the size of their engineering professions are pre- 
sented in Table 1. 

For present purposes, economic development is 
indexed by per capita share of gross national prod- 
uct (GNP), and technological development is in- 
dexed by the number of patents issued in a year 
and the percentage of GNP expended for research 
and development (R&D) in any given year. In 
Table 1, countries for which data are available are 
ranked according to the GNP per capita variable. 
The higher the ranking of a country on GNP per 
capita, the more likely it is that it ranks higher on 
the size of its engineering profession and the num- 
ber of patents issued, and on the percentage of 
GNP expended for research and development. These 
data suggest one possible systemic pattern of rela- 
tionships between these variables. A highly indus- 
trialized society has the resources to educate a 
sizable number of engineers, some of whom, to- 
gether with scientists, engage in research and 
development, for which a substantial proportion of 
such a society's resources is expended; this organ- 
ized approach to scientific discovery and invention 
results in increasing numbers of patents; and in 
turn, the process of invention—in which engineers 
play a prominent role (Gilfillan 1935, pp. 52, 82- 
91)—stimulates economic development, which then 
confronts the engineering profession with new 
technical problems. 
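
The association described above can be checked roughly with a rank
correlation. The following Python sketch is offered only as an
illustration: it takes the nine countries in Table 1 that report both
GNP per capita and the percentage of GNP expended for R&D and computes
Spearman's coefficient. The choice of Spearman's coefficient is an
assumption (the text speaks only of countries "ranking higher"), and
the names TABLE_1, average_ranks, and spearman are hypothetical helpers
introduced for this example.

```python
from statistics import mean

# (GNP per capita in 1957 U.S. dollars, per cent of GNP expended for R&D)
# for the nine countries in Table 1 that report both figures.
TABLE_1 = {
    "United States": (2577, 2.8),
    "Canada": (1947, 1.2),
    "Sweden": (1380, 1.8),
    "United Kingdom": (1189, 2.5),
    "Norway": (1130, 0.7),
    "France": (943, 1.3),
    "West Germany": (927, 1.4),
    "Soviet Union": (600, 2.3),
    "Yugoslavia": (265, 0.7),
}


def average_ranks(values):
    """Rank values from 1 (smallest) upward; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


gnp = [g for g, _ in TABLE_1.values()]
rnd = [r for _, r in TABLE_1.values()]
rho = spearman(gnp, rnd)
print(f"Spearman rank correlation, GNP per capita vs. R&D share: {rho:.2f}")
```

For these nine countries the coefficient is positive but modest. The
Soviet Union (low GNP per capita, high R&D share) and Canada (the
reverse) are the most visible departures from the pattern.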

Another feature of the social structure (apart 
from the economy and technology) that influences 
the engineering profession is the political system. 
In highly centralized and relatively unindustrialized 
societies, such as some under communist regimes, 
the engineering profession may be disproportion- 
ately large because of the government's concern 
with accelerating the process of industrialization. 
Under such conditions, the autonomy of the pro- 
fession may be circumscribed and it may be called 
upon to perform functions other than those of a 
technological nature. 

The level of industrialization of a society has 
another effect on the engineering profession, which 
is not reflected in the statistics in Table 1. As 
industrialization increases, there is a concomitant 
increase in the number of specialties in the 
occupation. From the relatively undifferentiated field 
of engineering existing in the seventeenth and 
eighteenth centuries, civil engineering emerged and 
itself gave rise to mechanical engineering, mining 
and metallurgical engineering, and electrical and 
chemical engineering, which were followed by 
automotive and aeronautical engineering. In the 
decades since World War II various economic, 
scientific, and political developments have stimulated 
the rise of new specialties, such as nuclear 
engineering, computer technology, astronautical 
engineering, and systems engineering. 

Table 1 — Gross national product (GNP) per capita, number of engineers, number of patents issued, and percentage 
of GNP expended for research and development (R&D) in selected countries 

                 GNP per capita  Number of engineers                           Patents issued  Per cent of GNP for R&D
                 in 1957                          Per 10,000   Year of         in 1964 (a)                 Year of
                 (U.S. dollars)  Number           population   information     Number          Percentage  information
United States    2,577           783,000 (b)      44           1959            47,378          2.8         1960/1961
Canada           1,947           60,400 (b)       3.2          1963            24,589          1.2         1960
Switzerland      1,428           14,440 (b)       2.9          1955            10,350
Sweden           1,380           52,400 (b)       68           1965            6,125           1.8         1959
Belgium          1,196           30,492 (c)       3.3          1964
United Kingdom   1,189           156,000 (b)      2.9          1963            34,060          2.5         1958/1959
Norway           1,130           32,020 (c)       8.6          1965            1,985           0.7         1960
Denmark          1,057           20,900 (c)       45           1963            2,833
France           943             112,000 (b)      2.3          1963            33,850          1.3         1961
West Germany     927             226,200 (c, d)   45           1956            20,150          1.4         1959
Netherlands      836             36,139           2.9          1959; 1965 (e)  13,900
Austria          670             36,559           5.2          1961            6,850
Soviet Union     600             1,325,000 (b)    6.0          1962            6,850           2.3         1960
Ireland          550             2,800 (b)        1.0          1962            434
Italy            516             190,000 (c)      3.9          1957
Argentina        490             15,400 (c)       8            1960
Greece           340             5,756 (c)        7            1959
Spain            293             13,035 (b)       4            1960
Yugoslavia       265             27,429 (c)       1.4          1965                            0.7         1960
Turkey           220             9,106 (c)        3            1960
Iran             108             7,510 (c)        4            1960
Communist China  73              175,000 (f)      3            1957

a. Calculated by subtracting the number of patents on hand in January 1964 from the number on hand in January 1965 and adjusting the difference to put it on an annual basis. 
b. Combined total of engineers with university training and those with formal training just below the university level. 
c. University-trained engineers. 
d. Includes chemists. 
e. Count of university-trained engineers in 1965, other count in 1959; number per 10,000 population is based on 1965 population data and probably is an underestimate. 
f. Engineering and technical personnel. 

Sources: GNP from Russett et al. 1964, pp. 155-157. Number of engineers from Organization for Economic Cooperation and 
Development 1963, pp. 225-229, tables 13 and 14; Korol 1965, p. 244, table A-1; Emerson 1965, p. 138, table 7; 
Horowitz 1965, p. 7, table 1.4; Baldwin 1965, p. 155, table 5.2; Demographic Yearbook 1963. Patents from U.S. 
Patent Office, Official Gazette 1964, p. 777, and 1965, p. 2. Per cent of GNP for R&D from Dedijer 1962. 

As the engineering profession becomes increasingly 
differentiated and heterogeneous, a recurrent 
question arises regarding the identity of engineers. 
Periodically, engineering educators, officials of 
professional societies, and census officials discuss the 
question, Who is an engineer? In the United States 
the census definition relies, in effect, on the 
respondent’s decision as to whether he is an engineer. 
Professional engineering societies emphasize formal 
training in an engineering school and/or a 
minimum number of years of engineering experience. 
This concern with clarifying the definition 
of an engineer reflects a changing social and 
technical environment of the occupation, which is due 
to the accelerating rate of growth of scientific 
knowledge and increasing levels of industrialization. The 
changing environment confronts the occupation 
with dilemmas as to the meaning of professionalism 
and the direction of further professionalization. 
These dilemmas arise in connection with the 
process of recruitment to the profession, the education 
of engineers, the career decisions of engineers, the 
functions of professional societies, the problem of 
responsibility for the social impact of technological 
change, and the prestige of the profession. 

The recruitment process. In highly industrial- 
ized societies engineering is one of the fastest- 
growing occupations. The average annual rate of 
growth of the profession in some countries tends to 
exceed the rates of growth of the economy and 
of the population. In Communist China the engi- 
neering profession had an average annual growth 
rate of 67.1 per cent during the years 1955 through 
1962 (computed on the basis of data in Chéng 
1964, pp. 111-113); and in the Soviet Union the 
comparable rate for these years was 32.2 per cent 
(based on data in Korol 1965, pp. 242-244, table 
A-1). In comparison, the annual rate of growth for 
the engineering profession in Great Britain during 
the years 1959 through 1963 was 5.5 per cent; in 
France it was 3.2 per cent for the same period; and 
in Sweden it was only 2 per cent for the period 
1955 through 1960 (Organization for Economic 
Cooperation and Development 1963, pp. 136-175). 
Maintaining a balance between the demand for and 
the supply of engineers has been a problem in the 
post-World War II period and will probably continue 
to be a problem in the future (e.g., U.S. Department 
of Commerce 1963). 

The engineering profession, probably more than 
the older, established professions, tends to recruit 
its members—at least in highly industrialized soci- 
eties—from heterogeneous social origins. Accord- 
ing to one study, a substantial proportion of grad- 
uate engineers in Great Britain have middle-class 
or working-class backgrounds: 36 per cent of their 
fathers’ occupations are white-collar and 22 per 
cent are blue-collar (Gerstl 1963, p. 19; see also 
Jahoda 1963, p. 54). Several studies of engineering 
students in the United States also indicate a sub- 
stantial degree of recruitment from middle-class 
and working-class backgrounds: 44 per cent of the 
fathers of engineering students at Northwestern 
University pursue manual or white-collar occupa- 
tions (Krulee 1963, p. 20); and 50 per cent of 
engineering students at the University of California 
at Berkeley come from working-class or middle- 
class backgrounds (Trow 1959, p. 68). In the 
Netherlands, on the other hand, opportunities for 
entry into the engineering profession appear to be 
more limited than in the United States and Great 
Britain: approximately 28 per cent are recruited 
from working-class and middle-class backgrounds 
(Kuiper 1956, p. 233, table 2). 

The more heterogeneous the social origins of 
engineers, the more diverse, in all likelihood, are 
the motivations and values involved in their choice 
of occupation. In a study of students at 11 Amer- 
ican universities who chose engineering as a career, 
38 per cent stressed the “chance to earn a great 
deal of money”; 52 per cent, the opportunity to be 
creative and original; and 28 per cent, the oppor- 
tunity to be helpful to others (Goldsen et al. 1960, 
pp. 43-44). A study of American students from 
135 colleges and universities who chose engineer- 
ing as a career in the freshman year found that 
25 per cent mentioned money as a factor, 26 per 
cent mentioned opportunity to be original, and 7 
per cent gave “people” as a reason (Davis 1965, 
p. 188). In Great Britain a study of sixth-form boys 
found that, of those interested in engineering, 32 
per cent gave “money” or “good prospects” as their 
reason; 19 per cent mentioned that it affords an 
opportunity to be creative; and 13 per cent said 
that they were interested because it combines theory 
and practice (Oxford University 1963, pp. 37-38). 
At two London polytechnics a survey of evening 
students which inquired into their motivations for 
attendance found that 36 per cent hoped for a 
better-paid job, 10 per cent for more job security, 
16 per cent for a more interesting job, and 9 per 
cent for a job with a higher social standing (Cot- 
grove 1958, pp. 102-103). In short, the values of 
money, prestige, security, creativity, integration of 
theory and practice, and helping people are but a 
few of the values affecting the choice of engineer- 
ing as a career. The old hypothesis of a relationship 
between social-class heterogeneity and occupational 
attrition has recently found some support in a study 
of the occupational structure of the United States: 
“The more heterogeneous in social origins the 
young men entering an occupation are, . . . the 
greater is their tendency to leave it later for a 
variety of other occupations. This finding suggests 
that homogeneity in background fosters social sol- 
idarity, which lessens the inclination of its mem- 
bers to leave an occupational group” (Blau 1965, 
p. 490). Thus, we may infer that social-class het- 
erogeneity among engineers very likely contributes 
to occupational attrition, because of reduced social 
solidarity. 

The diversity of motives prompting students to 
enter the field of engineering also creates various 
difficulties for the profession. First, the very exis- 
tence of a diversity of values regarding engineering 
acts to lower the feeling of solidarity among engi- 
neers as an occupational group. Second, the preva- 
lent “extrinsic” values, such as money, prestige, 
and security, contrast with such “intrinsic” work 
values as the opportunity to be creative or to link 
theory with practice. Intrinsic values are probably 
more associated with commitment to a profession 
than are extrinsic values. Finally, the socially het- 
erogeneous recruits to engineering impose an even 
greater demand for professional socialization dur- 
ing and after the period of formal education than 
would socially homogeneous recruits. 

Educational patterns. The educational resources 
of a society, which vary with the level of indus- 
trialization, greatly affect not only the number of 
engineers recruited but also their quality and, in 
turn, their capability to contribute to technological 
development. The feedback effect of an adequate 
supply of well-trained engineers and scientists on 
economic growth has stimulated widespread inter- 
est in developing educational institutions and en- 
larging enrollments, as an investment in “human 
capital.” That facilities for educating engineers 
vary in large measure with the level of industrial- 
ization of a society can be shown by examining the 
relationship between the number of enrolled 
engineering students in various countries and the 
GNP per capita for these same countries. The 
higher the degree of industrialization, as measured 
by GNP per capita, the greater is the number of 
engineering students enrolled (unpublished research by the 
author, based on data in Russett et al. 1964, pp. 
155-157; UNESCO, Statistical Yearbook 1963, pp. 
226-249, table 16; DeWitt 1961, p. 318). It is note- 
worthy that, in their effort to accelerate economic 
growth, communist countries have greatly expanded 
their facilities for the education of engineers. In 
China engineering enrollment increased from 
30,300 in 1949, when the new regime was estab- 
lished, to 177,600 in 1957 (Orleans 1961, pp. 68- 
69); in the same year, 40 per cent of all students 
enrolled in Chinese institutions of higher learning 
were majoring in engineering (ibid.). In the Soviet 
Union the comparable percentage was 39 per cent 
in 1958 (DeWitt 1961, p. 318), and in the United 
States it was 5 per cent in 1962 (computed from 
statistics in U.S. Office of Education 1965, p. 81, 
table 58, and in U.S. Bureau of the Census 1963, 
p. 136, table 177). 
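
The enrollment figures quoted above for China imply an average annual
rate of increase that can be worked out directly. The sketch below
assumes compound (geometric) growth between the two end years. The
text does not say how its own growth rates were computed, so both the
formula and the function name annual_growth_rate are assumptions made
for illustration.

```python
def annual_growth_rate(start_value, end_value, years):
    """Constant yearly rate that carries start_value to end_value in `years` years.

    Assumes compound growth: end = start * (1 + rate) ** years.
    """
    if start_value <= 0 or end_value <= 0 or years <= 0:
        raise ValueError("values and number of years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


# Engineering enrollment in China: 30,300 in 1949 and 177,600 in 1957.
rate = annual_growth_rate(30_300, 177_600, 1957 - 1949)
print(f"Implied average annual growth in enrollment: {rate:.1%}")
```

On this assumption, enrollment grew by roughly 25 per cent a year over
the eight years from 1949 to 1957.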

Critical as is the quantity of engineers educated 
for the economic and technological development of 
a society, the principal problems of professional- 
ism and professionalization revolve around the 
quality of their education. Engineering curricula 
are periodically reviewed by engineering educators 
and professional societies. This, to be sure, is necessary 
because the rapid rate of growth of science 
and technology requires that engineering schools 
continually revise their curricula to insure that they 
are transmitting the new state of the art. The task 
of reducing the time lag between the development 
of new knowledge and its incorporation into the 
curriculum is often fraught with difficulty. A case 
in point is the time lag involved in introducing 
courses on computers in engineering schools. 

Designing and redesigning engineering curricula 
in response to technological change is beset by 
many problems other than recruiting a competent 
and adaptable faculty. One of the problems is that 
the practice of engineering is generally based on 
an undergraduate level of education. The fact that 
relatively high proportions of engineers in some 
countries have not received even this minimal level 
of training highlights the unsolved problems of 
professionalizing this occupation. For example, in 
China only 35 per cent of engineers had under- 
graduate engineering degrees in 1955 (Chéng 
1964, p. 35), and in the United States only 56 per 
cent had such degrees in 1960 (Organization for 
Economic Cooperation and Development 1966). 
The length of full-time training in engineering 
schools ranges from three to five and one-half years, 
with many European countries and the Soviet 
Union at the high end of this scale (Conference of 
Representatives 1960, vol. 2, p. 42). 

Associated with the time limitation of an under- 
graduate level of engineering education is the prob- 
lem of determining how much of the curriculum 
should be devoted to fundamental sciences, to en- 
gineering sciences, to engineering applications, to 
specialization in the various fields of engineering, 
and to nonengineering subjects (American Society 
for Engineering Education 1955, pp. 11-23; Wood 
1961). This problem is closely related to another, 
which is bound to receive more attention in the 
future, namely, whether engineers should be trained 
in a specific branch of engineering or in the funda- 
mentals of engineering. The fact that the main 
branches of engineering—civil, mechanical, elec- 
trical, chemical, and aeronautical—are becoming 
increasingly interrelated in new technologies makes 
this problem increasingly significant. 

As might be expected, countries differ in the 
degree to which engineering education is oriented 
to the acquisition of knowledge in a particular 
specialty. In the United Kingdom, where about 
one-half the engineers are trained in part-time, 
“sandwich,” or cooperative programs in technical 
colleges, and in the United States, where the varia- 
tion in quality in the more than 250 engineering 
schools is considerable, there is probably a greater 
degree of specialization than in some countries in 
continental Europe. In the Soviet Union and Com- 
munist China the degree of specialization appears 
to be greater still (Korol 1957, pp. 252-253; Chéng 
1964, p. 98). In underdeveloped countries, which 
tend to emulate the educational systems of devel- 
oped countries, an argument has been advanced for 
training general engineers, rather than specialists, 
in order to help initiate the process of 
industrialization (Hunt 1960). 

The role of nonengineering subjects in the cur- 
riculum—what types and how many courses should 
be offered in the humanities, in problems of man- 
agement, or in social sciences—is also an open 
question. There is notable variation between coun- 
tries in this respect, as shown by a recent study 
of some systems of engineering education (Conference 
of Representatives 1960, vol. 2, p. 44). In 
part the variation is due to differences in the quality 
of secondary school education; in part it reflects 
variation in the assessment of the kinds of knowl- 
edge and skills required of a practicing engineer. 

Rarely considered in the recurrent reappraisals 
of engineering education is the question of the 
inculcation of basic values of professionalism in the 
training of engineers (see National Society of 
Professional Engineers 1963). Among the basic 
professional values to which engineering students might 
be socialized are: (a) the importance of contrib- 
uting to technological innovation, rather than ac- 
cepting the technological status quo; (b) the 
awareness of and concern for the social impact of 
technological innovations; and (c) the conception 
of professional education as a lifelong activity that 
is not confined to the years of formal training in 
engineering schools. Whether and how to inculcate 
these and other professional values are still frontier 
problems in professionalizing engineering. For ex- 
ample, the novel and presumably controversial 
practice, at a French institution of higher learning 
at Saclay, of awarding a degree in reactor engineering 
for a limited period—subject to revalidation 
after five years by means of attendance at refresher 
courses and success at future examinations—is 
based on a conception of engineering education as 
a lifelong process (King 1965). 

Problems of redesigning engineering curricula 
in a quickly changing technological and social en- 
vironment defy easy and durable solutions. They 
are even more resistant to solution without full 
cognizance of the types of careers pursued by 
engineers following graduation from an engineer- 
ing school. 

Career patterns. The process of professional 
socialization obviously does not end upon gradua- 
tion from an engineering school. The organiza- 
tional context in which an engineer works and the 
type of function he performs affect not only the 
course of his career in engineering but also his 
career orientation and his degree of commitment 
to the profession. 

Unlike the members of some of the older profes- 
sions, engineers are predominantly salaried em- 
ployees, with the exception in some countries of a 
small subgroup of engineers who are self-employed 
and engage in consulting work (see, for example, 
Engineers Joint Council 1965, p. 17). Typically, 
engineers are employed in manufacturing organizations 
and in various construction operations of 
a governmental nature. Within the past several 
decades, as the number of research-and-develop- 
ment laboratories has rapidly increased, new work 
contexts have opened up for engineers. In the less 
industrialized countries most engineers still per- 
form various production functions, whereas in the 
more industrially developed countries a rising pro- 
portion are engaged in research-and-development 
activities. This variation in function is suggested 
by the data in Table 2. In the United States and 
the Soviet Union, two of the more highly indus- 
trialized countries, approximately one-third and 
one-fifth, respectively, of the engineers work in 
research and development. 

As several studies of occupations other than en- 
gineering have shown, the first job after graduation 
usually has more effect on career opportunities 
than do subsequent jobs. Engineering is no excep- 
tion to this. Among the factors affecting the engi- 
neer’s first career decision is the quality of the 
engineering school he attended. A graduate of an 
elite school often has the opportunity to begin his 
career in an organization which is in the mainstream 
of technological development. He also has 
the opportunity—as is true, for example, of the 
graduate of the Ecole Polytechnique—to orient his 
career toward top management (Granick 1962, 
pp. 26-30). 

If the engineer begins his career in a production 
setting, he is unlikely to subsequently enter a 
research-and-development organization or engage in 
teaching and research in an academic environment. 
The only two probable career lines open to 
him are management of a technical or a nontechnical 
function and the pursuit of an occupation 
other than engineering. The career path of an 
engineer in a research-and-development organization 
or in a university is probably quite different 
from that of an engineer employed in a production 
organization. In either case, the probability is 
higher that he will not leave engineering for another 
occupation; on the other hand, the likelihood 
is that, after some years in research-and-development 
work, the engineer may transfer to a production 
or a management function, especially management 
of a technical operation. The relatively small 
percentage of engineers who enter teaching and 
research in an academic environment, as shown in 
Table 2, in all likelihood continue in this function; 
if they leave the academic environment, their 
career paths are likely to be in research rather than 
in production (LeBold et al. 1960; Gerstl & Hutton 
1966). 

Table 2 — Distribution of engineers by type of work, for selected countries: per cent 

                                    Management and   Research and
                Year   Production   administration   development    Teaching   Other
Austria         1961   51           33.5             3              3.5        9
Canada          1959   30           28               23             3          15
Greece          1959   80           7                3              8          1
Soviet Union    1964   33           10               22             7          28
United States   1959   40           8                30             ...

a. Includes scientists. 
b. Includes engineers engaged in teaching. 

Source: Organization for Economic Cooperation and Development 1963, 
pp. 134 ff. Data for the Soviet Union are estimates computed for 
the author by Alexander G. Korol. 

As a salaried employee, the engineer experi- 
ences organizational constraints that he finds diffi- 
cult to reconcile with his expectations as a pro- 
fessional (Kornhauser 1962). The type of function 
he performs as an engineer affects his role con- 
ception, as well as the length of his career in engi- 
neering. In a production function his role relation- 
ships involve interaction with production workers 
and engineering technicians, on the one hand, and 
with managers, on the other. As a staff engineer, 
he lacks the authority of the manager, and he tends 
to be treated, in some organizational contexts, in 
the same manner that an engineering technician 
or a production worker is treated. The norm of 
obedience is more characteristic of the relationship 
he has with his superiors and subordinates than 
the norm of service, which is typical of a profes- 
sional, or the norm of autonomy, which is typical 
of a scientist (Evan 1962, p. 352). 

In a research-and-development organization the 
engineer's role tends to subject him to the typical 
dilemmas of a marginal man (Shepard 1957). The 
scientist regards him as a “nuts-and-bolts” engineer; 
the manager, as someone who is insufficiently 
sensitive to cost factors in engineering. In a 
research-and-development setting, however, there 
is less likelihood for his role to be confused with 
the role of an engineering technician or a produc- 
tion worker. (For an analysis of the occupational 
marginality of engineering technicians, see Evan 
1964.) Probably only the small proportion of engi- 
neers engaged in basic or applied research escape 
some of the problems of marginality: their work is 
governed, not by a norm of service, but rather by 
a norm of autonomy. 

As a consequence of the rapid rate of techno- 
logical change, there is a growing tendency for the 
careers of engineers to be abbreviated. The knowledge 
and skills of engineers obsolesce so quickly 
that engineers, especially in highly industrialized 
countries, find it necessary to shift into management 
work or nonengineering occupations in the 
middle of their careers (see, for example, Evan 
1963). No longer can the new graduate engineer 
assume, as some of his predecessors did years ago, 
that he will spend his entire working career in 
engineering. 

To cope with the growing problem of technical 
obsolescence, programs of continuing education 
are being established in the United States, France, 
Germany, and some other countries. The theory 
and methodology required for the retraining of 
engineers in the middle of their careers remain to 
be developed. As yet there is scant evidence as to 
the effectiveness of continuing-education programs 
in helping engineers to cope with their technical- 
updating problems. Another career problem is the 
flattening of the salary curve with age, which may 
be related to the declining market value of older 
engineers undergoing technical obsolescence (see, 
for example, Kornhauser 1962, pp. 128-130). 
These and other career problems of engineers are 
solved in some countries, not by changing em- 
ployers, labor markets, or occupations, but by 
means of emigration. The limited statistics on the 
migration of engineers make it difficult to ascertain 
the countries of origin and destination of 
engineers who emigrate. However, we do know 
that this mode of adaptation to career problems 
has created concern in the countries of emigration. 
In the face of a shortage of technically trained 
manpower in most countries of the world, emigra- 
tion of engineers is looked upon as a “brain drain.” 
This is particularly true in the case of a relatively 
underdeveloped country, such as Argentina, where 
engineers have emigrated in substantial numbers 
to the United States (Oteiza 1965). 

In short, the organizational contexts in which 
engineers are employed, the types of functions they 
perform, and the types of role relationships in 
which they are involved have not been conducive 
to an effective process of professional socialization. 
After graduation from engineering school, the 
engineer's work experiences often do not imbue 
him with a dedication to the occupation 
(Wilensky 1964, pp. 150-155) or an increasing 
awareness of the social consequences of techno- 
logical change. Professional associations have a 
significant function to perform in making up for 
the deficiencies of the work context as an agent of 
professional socialization. 

Professional associations. Only some of the ca- 
reer problems encountered by engineers have thus 
far received attention by professional associations 
of engineers. Economic problems have largely been 
ignored, although licensing regulations may have 
had an indirect beneficial effect on the earnings of 
some engineers. Sporadic efforts in some countries 
to organize trade unions of engineers to promote 
their economic interests have not been successful 
(see, for example, Goldstein 1954; Walton 1961). 
Unlike the medical profession, engineering has not 
acted in unison to enhance the economic position 
of its members. One factor that has hindered the 
professional societies in performing an economic 
function has been the lack of organizational unity 
within the profession. Instead of there being a 
single professional association of all engineers in 
a country, there has been a tendency toward the 
“Balkanization” of the occupation. As new special- 
ties in engineering arise, new professional associa- 
tions come into being, and the proliferation of such 
societies makes for ever greater difficulties in unifying 
the profession. This tendency has occurred 
in the United States and in the United Kingdom, 
as well as in western Europe. In the more 
industrialized societies, where the profession is 
further developed, there are a greater number of 
professional associations and correspondingly more 
difficulty in unifying them. 

The principal function that professional societies 
appear to perform is that of a learned society. In 
other words, they see themselves principally as an 
instrument for advancing and disseminating engi- 
neering knowledge, thus supplementing the func- 
tions performed by universities and research insti- 
tutes. It follows, therefore, that they can contribute 
significantly to the engineer's need for continuing 
education. In the future, even more than in the 
present, they are likely to help engineers continue 
their professional development by means of semi- 
nars, abstracting services, and special conferences. 
Thus, participation by engineers in the activities 
of professional associations may increasingly re- 
flect the degree of their professional commitment. 
In the more industrialized countries, where the 
educational services of professional societies are 
apt to be in greater demand, memberships of en- 
gineering societies are likely to be larger than in the 
less industrialized societies. 

Another noteworthy feature about professional 
associations of engineers is the relatively modest 
progress they have made to date in organizing in- 
ternational professional associations. Since World 
War II several regional associations of engineering 
societies have come into being, notably the Confer- 
ence of Engineering Societies of Western Europe 
and the United States of America, the European 
Federation of National Associations of Engineers, 
and the Pan-American Federation of Engineering 
Societies. Another significant development was the 
founding of the Union of International Engineering 
Organizations, under the aegis of UNESCO, in 
1950. These are in the nature of nongovernmental 
organizations, whose unit of membership is the 
national society of engineers, not the individual 
engineer. The strength of such international bodies 
depends very much upon the strength of the constituent 
societies. In their relatively brief history 
these organizations have not yet contributed notice- 
ably to new modes of international cooperation be- 
tween engineers, to new media for dissemination 
of technological knowledge, or to an awareness of 
membership in a world-wide profession. 

In addition to performing economic, educational, 
and knowledge-advancing functions, professional 
associations seek to regulate their members’ con- 
duct by establishing codes of ethics. To the extent 
that professional engineering associations have 
concerned themselves with ethical issues, they have 
attended mostly to the relations of the engineer 
with his fellow engineer and his employer. Ethical 
codes have set forth very general guidelines regu- 
lating conduct in these spheres. Only in most gen- 
eral terms do canons of ethics touch upon the rela- 
tions of the engineer to the public or to society as 
a whole (National Society of Professional Engi- 
neers 1962). Thus far, ethical codes have scarcely 
concerned themselves with the complex and diffuse 
ethical question of the responsibility of the engi- 
neering profession for the social consequences of 
technological change. 

Dilemmas of social responsibility. One reason 
for the widespread neglect on the part of engineers 
of the problem of social responsibility for techno- 
logical change is the difficulty of accepting re- 
sponsibility for events over which they exercise 
virtually no control. As salaried employees, per- 
forming in the main a staff function, engineers are 
rarely in a position to make policy decisions con- 
cerning the wisdom of developing or not develop- 
ing a new engineering product or concerning what, 
if any, action might be taken to counteract its po- 
tential or actual negative social effects. This is par- 
ticularly true for the overwhelming proportion of 
engineers engaged in production or in development 
research, where the norms governing their conduct 
emphasize obedience to directives from manage- 
ment (Evan 1962). Since management makes the 
decision to produce or not to produce a particular 
engineering product or service, the salaried engi- 
neer probably feels that he scarcely has an occa- 
sion for any ethical decision concerning the pos- 
sible adverse effects of a technological innovation. 

In the past the staff function of the engineer 
has in fact absolved him from actively concerning 
himself with the question of responsibility for 
adverse social consequences of technological innovations 
(Merton [1949] 1957, p. 568). However, 
it is unlikely that this absolution of responsibility 
will be acceptable to engineers in the future, as 
technological advances generate problems of un- 
employment, environmental pollution, invasion of 
privacy, and an increasing threat of accidental or 
deliberate nuclear war. Pressures from within and 
without the profession will probably stimulate engi- 
neers to come to grips with the social ramifications 
of the technological changes they help develop. 

Some of these ethical dilemmas may be solved 
by new innovative technology; others may require 
innovative social changes. Although it is unlikely 
that the individual engineer will succeed in coping 
with the many complex ethical dilemmas that arise 
in the process of technological innovation, collec- 
tive action by professional associations might prove 
effective. In other words, if the engineering pro- 
fession assumes a social responsibility for the prob- 
lems of negative effects of technological change, 
it may contribute significantly to their solution. 

Prestige of the profession. The prestige of the 
engineering profession may be affected by, among 
other things, the attitudes of the public toward the 
engineer’s role in generating positive or negative 
social consequences of technological innovations. 
The increasing prominence of the role of tech- 
nology in society has probably elevated the prestige 
of engineering in recent years. On the other hand, 
the fact that engineering does not require a formal 
education as prolonged as some other professions 
and the fact that the members are recruited from 
heterogeneous social origins may contribute to a 
lowering of its prestige, relative to other professions, 
in some countries. 

The prestige of engineering has received some 
attention from sociologists in several countries. As 
a result of the interest among sociologists in studying 
systems of social stratification in different societies, 
several parallel studies of the prestige of 
various occupations, including engineering, have 
been undertaken. The methodological differences 
between these studies make a comparison of the 
findings hazardous. Nevertheless, on the basis of 
these studies, it is clear that engineering does not 
have the same prestige in all countries. For example, 
in the Soviet Union engineers ranked second 
in prestige as compared with other occupations 
(Inkeles & Rossi 1956, pp. 336-337); in the Philippines 
engineers ranked fourth (Tiryakian 1958, 
p. 394); in Great Britain they were in eighth place 
(Hutton & Gerstl 1964, p. 13); in West Germany 
they were in tenth place (Inkeles & Rossi 1956, 
pp. 336-337); and in the United States their prestige 
rank was 21.5 (Hodge et al. 1964, p. 290). 
Moreover, the data suggest that the prestige of the 
engineering profession varies inversely with the 
degree of industrialization as indicated by GNP 
per capita. Presumably, as the division of labor be- 
comes more specialized in more industrialized so- 
cieties and as the proportion of professionals in 
the labor force increases, engineering faces more 
competition from other occupations for rewards, 
monetary and other. In addition, as a society be- 
comes more industrialized, the engineering pro- 
fession tends to increase in size, which may also 
become a factor in lowering its prestige. Changes 
in the internal structure of the engineering pro- 
fession and in its social role are likely to affect its 
prestige in the future. 

Potential social roles. What types of roles en- 
gineers will play in the future depends in part on 
the course of professionalization of the occupation 
and in part on the course of political and economic 
development. If the occupation becomes increas- 
ingly professionalized, we may observe a threefold 
division. 

The appreciable segment of the occupation that 
has received limited or low-quality training in engineering 
schools and whose knowledge is based 
largely on practical experience will tend to 
coalesce with engineering technicians (Evan 1964, 
p. 108). This tendency will be encouraged by the 
progressive application of automation to some of 
the production and design functions performed by 
engineers, thus, in effect, de-professionalizing some 
members of the occupation. At the opposite end of 
the expertise continuum within the profession, 
there is a relatively small but probably increasing 
proportion of engineers working at the frontiers of 
engineering knowledge, who will tend to merge 
with applied scientists. The intermediate and by 
far the largest segment of the occupation will continue 
to perform a high caliber of technical engineering 
work. This group may be impelled in one 
of two directions in the future: toward the acquisi- 
tion of power at organizational levels or at the 
national level or toward a new conception of pro- 
fessional service. 

In his manifesto to engineers in the 1920s, Veblen 
([1919] 1921, pp. 138-169) urged them to 
replace “absentee owners” and to run industry 
rationally, in accordance with the “instinct of 
workmanship” rather than the principles of the 
“price system.” Unlike Marx and Engels in their 
manifesto to the proletariat, Veblen expressed no 
hope that his technocratic vision would be realized. 
In the decades since Veblen’s essay was published, 
the rise of highly centralized political systems 
has increased the need for engineers to assist 
in the planning and decision-making process. In 
communist countries the political and economic 
exigencies in domestic and foreign affairs may re- 
quire, in the decades ahead, an even greater reli- 
ance on engineers to perform a technocratic role. 
In France, as we have seen, the recruitment of 
engineering graduates from the Ecole Polytech- 
nique and several other grandes écoles to command- 
ing positions in industry and in the civil service is 
another example of a trend for engineers to perform 
a technocratic role (Granick 1962, pp. 60-72). 
The engineer imbued with the technocratic vision 
believes, on the one hand, in the capacity of tech- 
nology to solve all social problems without recourse 
to value considerations and, on the other hand, in 
the importance of integrating engineers into the 
political power structure of society. 

An alternative role for engineers is that of a 
professionally self-conscious agent of the techno- 
logical and economic development of a society. 
Although performing principally a staff function, 
engineers would explicitly concern themselves with 
developing technology for human welfare and, 
more specifically, with the predictable social rami- 
fications of any new engineering design, product, or 
service (see, for example, Boguslaw 1965, pp. 23- 
29, 181-204). To distinguish this type of social 
role from both the technocratic role and the pre- 
vailing amoral professional role, we might desig- 
nate it a “professional-technologist” role. In accord- 
ance with this role model, an engineer would be 
guided by an explicit orientation of professional 
service in his relations with the technological sys- 
tem of a society. 

If the new role of professional technologist is to 
become institutionalized, at least two developments 
would have to occur. First, if the tempo of develop- 
ment in science and technology stimulates a large 
proportion of practicing engineers to acquire a post- 
graduate degree, say at the master’s level, such a 
trend would increase the exposure of engineers to 
professional socialization in the context of engi- 
neering schools. Second, if a “technological com- 
munity” transcending national boundaries (and 
parallel to the prevailing “scientific community”) 
comes into being, it would provide the normative 
foundations for the professional-technologist role. 
Such a community would be guided by a set of 
norms and values concerning technical as well as 
social facets of engineering, not unlike some of the 
norms current in the scientific community (Merton 
[1949] 1957, pp. 550-561). The implementation 
of past proposals for an international institute of 
science and technology (Killian 1962) and the 
emergence of transnational professional societies 
of engineers—whose unit of membership is the 
individual engineer—in addition to the present 
international societies, would probably contribute 
to the emergence of a technological community. 
Which of these two new potential social roles— 
that of the technocrat or that of the professional 
technologist—will predominate in the years ahead 
or whether both roles will become institutionalized, 
albeit in different societies, is obviously difficult to 
predict. A conditional prediction, however, may be 
ventured: the professional-technologist role is likely 
to become institutionalized in societies where a 
democratic and antielitist ethos predominates; con- 
versely, in societies with a nondemocratic and 
elitist ethos, the technocratic role of the engineer 
is likely to become institutionalized. A political- 
ecological factor that may affect this prediction is 
the relationship between international conflict and 
the course of technological development. If inter- 
national conflict in the next decades comes under 
effective international regulation—thus reducing 
the chances of nuclear war—technology, and in 
turn the engineering profession, will be able to 
continue its development largely independent of 
international political and military conflicts. Such 
an international political environment would be 
conducive to the institutionalization of the role of 
the professional technologist, particularly in indus- 
trialized societies, and to the growth of a techno- 
logical community, both of which would usher in 
a new level of professionalization of engineering. 


WILLIAM M. EVAN 


[See also AUTOMATION; LICENSING, OCCUPATIONAL; 
OCCUPATIONS AND CAREERS; PROFESSIONS; TECHNOLOGY.] 


ENGINEERING PSYCHOLOGY 


Engineering psychology is a branch of applied 
psychology specifically concerned with the discovery 
and application of information about human be- 
havior and its relation to machines, tools, and jobs 
so that their design may best match the abilities 
and limitations of their human users. The field is 
also referred to, from time to time, as psychotech- 
nology or applied experimental psychology, but 
these two names appear to be gradually dropping 
out of use. 

Engineering psychology can be properly viewed 
as part of industrial psychology. The latter includes 
such additional topics as personnel procurement, 
selection, training, classification, and promotion; 
labor relations; morale and human relations; organizational 
management; and consumer behavior. 
The field of engineering psychology can also be 
identified as a subarea of human factors engineer- 
ing, or human engineering, as it is generally known 
in the United States, or of ergonomics, as it is 
usually called in the United Kingdom and Europe. 
The broader field of human factors engineering 
includes, in addition to engineering psychology, 
portions of such human sciences as anatomy, an- 
thropometry, applied physiology, environmental 
medicine, and toxicology. 

These distinctions between engineering psychol- 
ogy, industrial psychology, and human factors en- 
gineering are more academic than real. In his 
practical work, the engineering psychologist needs 
to know enough about all of these disciplines so 
that he can make use of them in arriving at sensi- 
ble and informed design decisions. Rather than 
calling engineering psychology a distinct entity, it 
would be more correct to say that this name is 
more a convenient focus around which training is 
offered in many universities. 


Historical development 


In a certain sense it is correct to say that people 
have been concerned with engineering psychology 
of a sort ever since man began fashioning imple- 
ments for his own use. Nonetheless, engineering 
psychology has emerged as a separate discipline 
only within the past few decades. It was not until 
the end of the nineteenth century that the first 
systematic investigations were conducted on man’s 
capacity to work as it is influenced by his job and 
his tools. Frederick W. Taylor (1898) made empirical 
studies of the best design of shovels and of 
the optimum weight of material per shovelful for 
handling different products, such as sand, slag, 
rice coal, and iron ore. Taylor’s interests, however, 
were primarily in rates of doing work and in the 
effects of incentives and worker motivation on rates 
of working. It remained for Frank B. Gilbreth to 
set a firm foundation for this field with his classic 
study of bricklaying (1909). Among other things, 
Gilbreth invented a scaffolding which could be 
quickly adjusted so that the bricklayer could work 
at the most convenient level at all times. A shelf 
held the bricks and mortar at their most convenient 
positions. By further changes of a similar nature 
Gilbreth was able to increase the number of bricks 
laid from 120 to 350 per man per hour. This pio- 
neering work of Taylor and Gilbreth was the begin- 
ning of that branch of industrial engineering now 
known as time and motion study. 

In the years that followed, time and motion en- 
gineers developed a number of principles of motion 
economy, of the arrangement of work, and of work 
design that have been widely applied throughout 
modern industry. Insofar as they have focused on 
human capacities and limitations and have used 
this information to redesign the machine, the task, 
or the work environment, it is correct to say that 
time and motion engineers are predecessors of the 
modern engineering psychologist. 

Still, the primary emphasis in time and motion 
engineering has been on man as a worker, that is, 
as a source of mechanical power. During the two 
world wars there appeared a new class of machines 
—machines that made demands upon the operator 
not in terms of his muscular power but rather in 
terms of his sensory, perceptual, judgmental, and 
decision-making abilities. The job of a sonar oper- 
ator, for example, requires virtually no muscular 
effort, but it makes severe demands on his sensory 
capacity, his attentiveness, and his decision-making 
ability. Problems of this type could no longer be 
dealt with by common sense or by the time and 
motion engineer’s principles of motion economy. 

World War I. When the United States entered 
World War I in 1917, a group of psychologists 
under Robert M. Yerkes was organized as the Psychology 
Committee of the National Research Council. 
In volunteering their services to the military 
cil. In volunteering their services-to the military 
establishment, they were met at first with consider- 
able skepticism about what they could do of any 
value in the hard business of war. Gradually these 
psychologists were able to make some substantial 
contributions and eventually win the enthusiastic 
endorsement of the military services. 

By and large the psychologists in World War I 
were concerned with such things as the selection, 
classification, and training of recruits, and with 
morale, military discipline, recreation, and problems 
of emotional stability in soldiers and sailors. 
A few of them, however, notably Raymond Dodge, 
Knight Dunlap, and Carl E. Seashore, encountered 
problems of a different sort: those in which the 
design of machines and equipment had to be related 
to the user. These early problems were found 
in gas masks, in binoculars and monoculars for 
spotters, in listening devices for locating submarines, 
and in aircraft. Questions were more numerous 
than answers, and the war ended before many 
solid accomplishments had been made. 

World War II. After the armistice in 1918 this 
pioneering work in engineering psychology was almost 
entirely abandoned. A few scattered studies 
appeared between the two world wars under the 
auspices of the Industrial Health Board and the 
Industrial Fatigue Research Board of the Medical 
Research Council (Great Britain), but the field was 
largely neglected until World War II. At that time 
the machines and problems foreshadowed by World 
War I reappeared in profusion. Radar, sonar, high 
altitude and high speed aircraft, naval combat information 
centers, and air traffic control centers 
placed demands upon their human operators that 
were often far beyond the capabilities of human 
senses, brains, and muscles. Operators sometimes 
had to look for targets which were all but invisible, 
understand speech against backgrounds of deafen- 
ing noise, track targets simultaneously in the three 
dimensions of space with both hands, and absorb 
large amounts of information to reach life-and- 
death decisions within seconds. As a result, bombs 
and bullets often missed their mark, planes crashed, 
friendly ships were sunk, and whales were depth- 
charged. The response to the need was so vigorous 
and dramatic that only a few highlights can be 
mentioned here. 

Having entered the war before the United States, 
Great Britain faced these problems first and estab- 
lished a pattern that, in broad outlines, was fol- 
lowed later in the United States. The Medical 
Research Council was responsible for sponsoring 
much research on man-machine problems in several 
large universities and in the military services 
through the Flying Personnel Research Com- 
mittee, the Royal Naval Personnel Research 
Committee, and the Military Personnel Research 
Committee. In the military services, important 
work was done at such laboratories as the Royal 
Aircraft Establishment, Farnborough; the Admi- 
ralty Naval Motion Study Unit, London; and the 
Admiralty Research Laboratory, Teddington, Mid- 
dlesex. 

Although entering the conflict later, the United 
States met problems equally urgent and dealt with 
them in substantially the same way. The National 
Defense Research Committee through the Office of 
Scientific Research and Development set up numer- 
ous research contracts in universities and industries 
to study these problems. All three military services 
incorporated civilian and military scientist-psychol- 
ogists into their research and development labora- 
tories in order that research findings would be put 
to immediate use. Some of the pioneering work in 
this area was carried out by the Aero Medical 
Laboratory of the (then) Army Air Forces Air Mate- 
riel Command, Wright Field, Dayton, Ohio; the 
Army Air Forces School of Aviation Medicine, Ran- 
dolph Field, Texas; the U.S. Navy Electronics 
Laboratory, San Diego, California; the Naval Re- 
search Laboratory, Washington, D.C.; and the 
Armored Medical Research Laboratory, Fort Knox, 
Kentucky. 

Present status of the field. Since World War II 
the growth of engineering psychology has been 
very rapid. The Society of Engineering Psycholo- 
gists, Division 21 of the American Psychological 
Association, had 360 members in 1965. About 770 
psychologists are members of the Human Factors 
Society and, indeed, make up over 60 per cent of 
the membership of that organization. Psychologists 
also figure prominently in the Ergonomics Research 
Society (centered in Great Britain), the Société 
d'Ergonomie de Langue Française (centered in 
France), the Nederlandse Vereniging voor Ergono- 
mie, and the Japanese Ergonomics Research Soci- 
ety. Engineering psychologists are employed in every 
branch of the military service, in many independ- 
ent research and consulting organizations, and in 
the aviation, automotive, electronics, communica- 
tions, and home appliance industries. 

At the present time, engineering psychology is 
most fully developed and exploited as a specialty 
in the United States. Other countries which give 
training in this area or make use of it in practical 
affairs to some degree or other are Australia, Bel- 
gium, France, Germany, Great Britain, Israel, 
Japan, the Netherlands, Sweden, Switzerland, and 
the U.S.S.R. 


Methodology 

Engineering psychologists aim to discover prin- 
ciples that can be cast into the form of recom- 
mendations for machine design. Unfortunately, 
they can usually find specific answers for only a 
small proportion of the questions they face. Part 
of the difficulty is that man-machine interactions 
occur in an almost infinite variety. Moreover, the 
range of these problems is diverging rather than 
converging. Engineers are busy designing new and 
complex machines, destined to operate not only in 
the prosaic world of our everyday lives but in hos- 
tile and exotic environments where man has never 
lived—from the crushing Stygian abysses of our 
oceans to the infinite voids of deep space. Literally 
millions of people are actively engaged in the busi- 
ness of designing and constructing machines and 
machine systems, but there are scarcely a thousand 
people who make it their primary business to study 
man in his intimate relations with these machines. 
For reasons such as these, man-machine problems 
appear to be multiplying faster than we can do 
research on them. 

The practicing engineering psychologist finds 
that he spends a considerable portion of his time 
“trying things out.” In some industries and in some 
laboratories, experimentation of one kind or an- 
other may well take up the major part of the 
engineering psychologist’s working time. As one 
might suppose from the historical development of 
the field and the nature of the work, methodologies 
in engineering psychology are diverse and adapted 
from several disciplines (see Chapanis 1959). The 
discovery of new methodologies is also a topic of 
continuing and active interest in the literature 
of engineering psychology. For this reason the tech- 
niques mentioned below should be regarded as a 
sample of the ways in which problems have been 
answered in the past rather than as an exhaustive 
list of the tools available to the practitioner in this 
field. 

Whenever he can, the engineering psychologist 
uses full-scale experimentation with the same rigor 
and sophistication that one expects of the best 
tradition of experimentation. Because of the com- 
plexity of his problems and the many variables that 
normally influence human behavior in machine 
situations, the engineering psychologist typically 
employs experimental designs using several vari- 
ables, deriving, for example, from methods of anal- 
ysis of variance. In addition, the psychophysical 
methods are widely used for obtaining useful data 
on sensory capacities, as are articulation test 
methods for measuring the effectiveness of speech 
communication devices and systems. In preliminary 
exploratory work on complex systems, the study of 
critical incidents, accidents, and near accidents has 
proven widely useful in locating potential sources 
of man-machine conflict. 

From the industrial engineer, the engineering 
psychologist has borrowed and adapted a number 
of techniques for directly observing systems in 
operation. Some of these are activity sampling pro- 
cedures, process analysis, and micromotion meth- 
ods. Finally, from more conventional techniques of 
industrial psychology the engineering psychologist 
has adapted to his own needs methods of job 
analysis, task analysis, personnel requirement in- 
ventories, questionnaires, tests, and rating scales. 


Subject matter 

Almost everything that is known about man as a 
living, feeling, behaving organism is relevant to 
the engineering psychologist. Current thinking 
even extends to man as a social organism: a num- 
ber of recent research findings show clearly that 
the effectiveness of complex man-machine systems 
is determined to a considerable extent by the com- 
patibility of the team of men who work in the 
system. Examples are nuclear submarines and ad- 
vanced space vehicles, where men are forced to 
live and work together for extended periods of time 
in cramped quarters while under unusual stress. 

For all that, the amount of information about 
human behavior that may be required for particular 
applications varies enormously. In the design of a 
space vehicle, the engineering psychologist may be 
faced with problems that cover the full range of 
human psychology. He needs to consider the sensory 
capacities of operators as they relate to instrument 
displays and the sensing of information from 
inside and outside the vehicle. Knowledge about 
man’s ability to make rapid and correct decisions 
is vital. Working hours and work-rest cycles are 
certain to be different from those on earth, and it 
is important to know how well man can perform 
under these altered working conditions. The engi- 
neering psychologist also needs to consider the 
human ability to make correct control actions of a 
great variety. Human reactions to exotic environ- 
ments, the ability to learn new and complex skills, 
emotional reactions and personality problems that 
might arise from the stresses of space flight, social 
behavior—all of these are relevant for the engineer- 
ing psychologist. [See SPACE, OUTER.] 

By contrast with the complexity of a space ve- 
hicle, there are many problems in engineering 
psychology that are much simpler and more cir- 
cumscribed. The engineering psychologist who 
works with common consumer items may be faced 
with questions like: How should the controls be 
placed on a stove so that housewives turn 
the correct control to activate a burner? What size 
and spacing of letters, numbers, and symbols 
should be used on the labels of household appli- 
ances so that they will be easily legible? How 
should the numbers and letters be arranged on 
push-button telephones? All of these are examples 
of relatively restricted but genuine problems en- 
countered in the practical business of engineering 
psychology. 

The man-machine model. In their work engineering 
psychologists regard man as an element in 
a man-machine system. Basically, a person who 
uses or operates a piece of equipment has to do 
three things. He has first to sense something and 
to perceive what this something means. The thing 
the human operator senses is termed a machine 
display. It may be any of a thousand different 
things: the position of a pointer on a dial, the 
print-out of a digital computer, a voice coming 
over a loudspeaker, a red light flashing on a control 
panel, a highway sign along a speedway, or 
the resistance felt in a certain kind of control. 

Having sensed a machine output, the man next 
has to interpret what the display means, under- 
stand it, perhaps do some mental computation, and 
reach a decision of some sort. In so doing, the 
human operator often uses other important human 
functions—the ability to remember and recall, to 
compare what he now perceives against past ex- 
periences, to recall operating rules he may have 
learned during training, or to put what he now 
experiences into the context of strategies he may 
have formed for handling events such as this. A 
man is not necessarily aware that he is doing any 
or all of these things, of course. His behavior may 
be so well practiced and routine that the decision 
to do one thing or another may be made almost by 
reflex, just as the experienced driver may decide 
almost unconsciously whether or not he should stop 
when he sees a green traffic light change to yellow. 
All of the functions discussed in this paragraph 
are ordinarily subsumed under the heading of 
higher mental processes in textbooks of psychology. 
Engineering psychologists often use machine terminology 
instead of more familiar psychological terms 
and, in keeping with this trend, refer to all these 
higher mental processes collectively as data processing. 

Having reached a decision about the information 
he has received through his sense organs and dealt 
with in his nervous system, the human operator 
then normally takes some action. The action is nor- 
mally exercised on some sort of a control—a push 
button, lever, crank, pedal, switch, or handle. Man’s 
action upon one or more of these controls exerts 
in turn an influence on the behavior of the ma- 
chine, its output and displays. Many times, of 
course, a machine operator monitoring a process 
may decide to do nothing. This is still regarded as 
an important human output. 
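
The three steps just described (sensing a display, data processing, and acting on a control) can be read as a closed loop. What follows is only a minimal sketch of that loop under stated assumptions: the names Machine, process, and run_cycle, the numeric setpoint, the tolerance, and the halving correction rule are all hypothetical illustrations and do not come from the literature cited in this article.

```python
from dataclasses import dataclass


@dataclass
class Machine:
    """Hypothetical machine with one displayed quantity and one control."""
    reading: float   # value currently shown on the display
    setpoint: float  # value the operator is trying to hold

    def display(self) -> float:
        # The machine communicates with its operator through a display.
        return self.reading

    def apply_control(self, adjustment: float) -> None:
        # Operator action on a control changes the machine's state.
        self.reading += adjustment


def process(displayed: float, setpoint: float, tolerance: float = 1.0) -> float:
    """'Data processing': interpret the display and decide on an action.

    Returning 0.0 means the operator decides to do nothing, which the
    model still counts as a human output.
    """
    error = setpoint - displayed
    if abs(error) <= tolerance:
        return 0.0
    return 0.5 * error  # illustrative correction rule (an assumption)


def run_cycle(machine: Machine, steps: int) -> list:
    """Run sense -> decide -> act for a fixed number of cycles."""
    history = []
    for _ in range(steps):
        seen = machine.display()                  # 1. sense the display
        action = process(seen, machine.setpoint)  # 2. data processing
        machine.apply_control(action)             # 3. act on a control
        history.append(machine.reading)
    return history


if __name__ == "__main__":
    print(run_cycle(Machine(reading=0.0, setpoint=8.0), steps=5))
```

In this sketch the operator's output feeds back into the next display reading, which is the sense in which man's action "exerts in turn an influence on the behavior of the machine, its output and displays." Once the reading is within tolerance, the decision function returns zero, corresponding to the monitoring operator who decides to do nothing.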

A man-machine system does not exist in isola- 
tion but in an environment. The character of this 
environment influences man’s efficiency and per- 
formance, and the engineering psychologist is often 
vitally concerned with these factors. Among the 
more important environmental influences are such 
commonplace ones as temperature, humidity, ven- 
tilation, lighting, noise, and movement. Some less 
common but still important ones are vibration and 
a whole host of noxious gases and contaminants.
In more exotic systems the engineering psychologist 
may also have to be concerned with the effects of 
increased acceleration, weightlessness (zero gravity 
conditions), anoxia caused by reduced oxygen at 
high altitudes, radiation, and the effects of reduced 
barometric pressures on the body.

The display of information. The man—machine 
model described above provides a convenient framework
for summarizing the main content areas of
engineering psychology. Machine displays, in a
manner of speaking, represent the starting point of
the man-machine cycle, for it is through such displays
that the machine communicates to its human
operator. For this reason, a considerable amount
of work has been devoted to studies of displays and
the ways in which they should be selected and 


designed. Although man has available a dozen or 
so sense channels that could conceivably be used 
to receive information from machine systems, only 
three—vision, hearing, and the sense of touch or 
vibration—have been exploited to any great extent. 

In the area of visual displays, research has been 
done on such problems as the design of mechanical 
indicators; scales; cathode-ray tubes (radar scopes);
charts, tables, and graphs; warning lights and sig- 
nals; abstract visual dimensions (symbols varying 
in color, shape, brightness, or size) for coding in- 
formation; and general and specialized lighting 
systems (for ready rooms and radar rooms). 

Problems of auditory displays can be grouped 
into two broad classes: those dealing with tonal 
or noise signals (sirens, diaphones, horns, buzzers, 
bells, gongs, and so on), and those dealing with 
speech communication systems. Research on the 
former class of problems has been generally con- 
cerned with signal processing and control: the 
selection of signals and signal characteristics, the 
filtering of signals to eliminate unwanted or inter- 
fering noise, and the use of signals for coding 
information. Research on speech communication 
systems has been aimed at the design of special or 
efficient languages, the design of the components 
of speech communication systems (microphones, 
amplifiers, and so on), and the design of speech 
communication systems as a whole. [See HEAR- 
ING; PERCEPTION, article on SPEECH PERCEPTION; 
VISION.]

Machine displays for senses other than vision 
and hearing have not been used very much, and 
perhaps for this reason, research on such displays 
is relatively meager. Within recent years, however, 
it has been shown that the vibratory sense can be 
used for an efficient communication system. Using 
a special kind of vibratory code applied to a man’s 
chest, Geldard (1957) trained one subject to re- 
ceive up to 38 words per minute. However, because 
of the awkwardness of the equipment and the pos- 
sibility of interference from other sources of vibra- 
tion, it is doubtful whether that particular system 
will be put into any operational situation. This 
research shows, however, that there may be un- 
explored possibilities for communication through 
these other senses. 

Data processing. One important function which 
man serves in many man—machine systems is that 
of data processing. He may be required to perceive 
things, assimilate large masses of data, evaluate or 
assess a situation, do computations, and make decisions.
Despite much research on these higher
mental processes, our understanding of the mechanisms
by which people do these things is still imperfect.
As a result, it is in this area of engineering
psychology that one finds the fewest principles and 
concrete recommendations about the ways in which 
man can be best integrated into man-machine sys- 
tems. As more and more systems become auto- 
mated, however, man’s role in the system becomes 
more and more that of a monitor and decision 
maker. One may expect therefore that some of the 
greatest research gains are yet to be realized in this 
area. 

Machine controls. Research on the design of 
machine controls has yielded a substantial number 
of useful and practical principles. These are con- 
cerned with such things as the factors involved in 
selecting the correct control for a job, control— 
display ratios, direction-of-movement relationships, 
control resistance, ways of preventing accidental 
activation, and control coding. Among the more 
complex kinds of controls are those involved in 
what are called closed-loop tracking systems. (Driving
a car along a winding road is a simple example
of a closed-loop tracking task.) Research on the 
last kind of problem involves considerations of the 
mathematical relationships between the movements 
of the control and the dynamics of the system. 

Environmental problems. Although the study 
of environmental problems might seem to fall ex- 
clusively within the province of the applied physi- 
ologist, the fact is that psychologists have studied 
the effects of a wide range of environmental factors 
on gross behavior. These studies have been con- 
cerned with problems of illumination, noise, anoxia 
(lack of oxygen at high altitudes), certain kinds of 
noxious gases and contaminants, heat and cold, 
vibration, and most recently, weightlessness. 

The aim of research. In common with research 
in most other areas of science, research in engi- 
neering psychology has as its first aim under- 
standing. Beyond this, however, the engineering 
psychologist hopes that his researches will yield 
principles which can be put into the form of defi- 
nite recommendations for machine design. A large 
number of these are now available, and they can 
be found in textbooks and guides on this subject 
(see, for example, Chapanis 1965; McCormick 
1957; Morgan et al. 1963). The following example 
from research on control design will illustrate one 
of the concepts that originated in engineering psy- 
chology and some of the design recommendations 
that followed from it. 

The results of control movements are often 
shown on a display. The tuning knob on a radio 
is a familiar example. As you turn the tuning knob, 
you change a variable condenser inside the radio. 
At the same time, a pointer moves along a linear
or circular scale to show you what frequency or
wave length the radio has been tuned to. Examples 
of linked controls and displays are common in the 
world of machines. The controls may be knobs, 
cranks, levers, or translatory controls. Sometimes a 
control may be in one plane (for example, on the 
horizontal working surface in front of an operator)
and the display in another plane (for example, 
directly in front of the operator's eyes and at right 
angles to the surface on which the control is 
mounted). The number of permutations of con- 
trols, displays, and orientations is, of course, very 
great. 

In the case of many control—display combina- 
tions it turns out that most people have consistent 
expectations about the way in which a control 
should move in order to produce a change in the 
display. When these expectations are strong and 
found universally, they are called population stereo- 
types. Controls that conform to these population 
stereotypes are responded to much more quickly
and with far fewer errors than are controls that 
do not. Human beings are remarkably adaptable 
and, given sufficient training, can learn to use con- 
trols and displays that do not agree with population 
stereotypes. The interesting thing, however, is that
if such an operator is subjected to great stress or 
to an emergency situation, he frequently regresses 
or reverts to his natural expectancies. Many acci- 
dents in aircraft have been traced to this single 
factor alone. The design recommendation which
follows from this research is clear: Whenever 
strong population stereotypes exist, control and display
movements should agree with them.


Man-machine system design 


Although, as was remarked earlier, the scope of 
engineering psychology covers a range from rel- 
atively simple devices to enormously complex ma- 
chine systems, it is the latter which are the most 
challenging, most complex, and most difficult to 
deal with. The design of an air traffic control sys- 
tem, an automated mail handling system, a new 
guided missile system, or a deep space vehicle sys- 
tem is a problem of gigantic proportions. Thou- 
sands of technical and professional experts of a 
hundred or more different varieties may work for 
years to bring such a large system into being. 
Engineering psychologists are generally recognized 
as important members of such design teams. 

In the preceding section the subject matter of 
engineering psychology has been presented from 
the standpoint of the way it is organized by the 
psychologist himself. When we look at the field 
from the standpoint of what the engineering psy- 


chologist contributes to the design of man—machine 
systems, a somewhat different order of topics 
results. 

Design and planning. The first step in the cre- 
ation of any large system is generally called a study 
phase. It is at this time that engineers study in 
detail the specifications and requirements of the 
system. Alternative ways of designing the system 
to meet the requirements are thought up, tested, 
tried out, and discarded, modified, or accepted. Con- 
trary to what many lay people think, there is much 
trial and error involved in this stage of the process, 
and the final conceptualization of the system may 
be quite different from initial ideas. During this 
study phase, engineering psychologists are called 
upon to study and decide about man’s role in the 
system. They usually assist in making decisions 
about precisely which functions of the system 
should be allocated to humans and which to ma- 
chine components. They prepare estimates of the 
number and types of people that will be required 
to man the system when it is completed, the so- 
called QQPRI (Qualitative and Quantitative Per- 
sonnel Requirements Information). If personnel 
with specialized training will be needed, engineer- 
ing psychologists plan and design training programs 
and curricula to ensure that qualified people are 
available to operate the system when it is com- 
pleted. Finally, they may try to anticipate the social 
consequences of the system on its human operators 
or society in general. 

Project engineering. After preliminary designs 
and plans have been completed, the system goes 
into actual construction. The first model of any
large system is seldom built as originally planned. 
Difficulties appear at this point which had not been 
anticipated in the largely paper-and-pencil study 
phase, and the changes and modifications that must 
be made often number in the thousands. During 
the production phase of system design the engi- 
neering psychologist usually makes substantial 
contributions to the man-machine combination, 
although ultimate responsibility for design usually 
rests with an engineer. The engineering psycholo- 
gist studies, tests, and makes recommendations 
about specific workplace arrangements, solves spe- 
cific problems of display and control, and makes 
design recommendations about the solution of en- 
vironmental problems. 

Another important task of the engineering psy- 
chologist is that of studying the system from the 
standpoint of its reliability and maintenance and 
of anticipating special problems of maintenance. 
With the increase in the number of highly auto- 
mated systems, problems of faultfinding and main- 


tenance are becoming increasingly important. For 
one thing, the cost of a highly automatic system 
is so great that the user cannot afford to have it 
idle for long periods of time. At the same time, 
such systems are so complex that it is becoming 
increasingly difficult for repairmen to diagnose 
what is wrong with them and decide how they can 
be most quickly repaired. As one illustration of the 
magnitude of the problem, the U.S. Air Force esti- 
mates that to repair and maintain a typical system 
(for example, an aircraft) during its normal life- 
time may cost up to ten times the original purchase 
price of the system. Further, of the total time spent
in actively repairing most large systems, as much as
80 per cent is spent merely in discovering
what is wrong with them. The
engineering psychologist’s contributions here are 
in planning effective faultfinding strategies and in 
seeing to it that the equipment is so designed (with 
sufficient test points, accesses, and so on) that it 
can be easily maintained, that it is installed where 
it can be readily reached, that appropriate, well- 
written maintenance manuals are ready as soon as 
the equipment is completed, that special tools and 
test equipment are properly designed and con- 
structed, that maintenance men are trained, that 
adequate work and storage facilities are provided, 
and that an adequate supply of spare parts and 
replacements is provided. 

The end result of this part of system develop- 
ment is what is usually termed a prototype, a first 
full-scale working model of the system. 

Test and evaluation and operational use. Once 
a prototype of a system is constructed, it usually goes
through a series of tests, often termed operational 
suitability tests or evaluations, to discover if the 
system really does what it is supposed to do. It is 
generally recognized that a man—machine system 
has to be tested as a complete entity and that the 
human components in a system may make or break 
it. The problems of testing systems that contain peo- 
ple are far more difficult than conducting simple 
engineering or physical tests. Because of their spe- 
cial training in experimental methodology and in 
the problems of conducting studies on people, en- 
gineering psychologists are often given major 
responsibility for the design and conduct of such 
tests. 

When a large system has been tested, accepted, 
and put into operational use, it may undergo still 
further modifications as experience with it accumu- 
lates. These are usually far fewer in number and 
less sweeping than those which occur earlier in the 
design. Although the engineering psychologist may 
still play some role in this stage of the lifetime of
the system, it is usually a much less important one
than in earlier design phases. It is at this time that 
the training and personnel specialist replaces the 
engineering psychologist in terms of importance. 


ALPHONSE CHAPANIS 


[See also INDUSTRIAL RELATIONS, article on INDUSTRIAL 
AND BUSINESS PSYCHOLOGY. Other relevant material 
may be found in MILITARY PSYCHOLOGY; PSYCHOPHYSICS.]


ENTERTAINMENT

See COMMUNICATION, MASS; DRAMA; FILM;
LEISURE; LITERATURE.


ENTREPRENEURSHIP


There are some unresolved differences in the 
definitions of entrepreneurship, but there is agree- 
ment that the term includes at least a part of the 
administrative function of making decisions for the 
conduct of some type of organization. One group 
of scholars would restrict the term to strategic or 
innovating decisions, and an overlapping group 
would apply it only to business organizations. The 
basis for these differences can be understood from 
the history of the concept. 

The word entrepreneur appeared in the French 
language long before there was any general con- 
cept of an entrepreneurial function. By the early 
sixteenth century, men engaged in leading military 
expeditions were referred to as entrepreneurs. From
this usage, it was easy to move to applying entre- 
preneur to other types of adventurers. After about 
1700, entrepreneur was frequently applied by the 
French to government road, bridge, harbor, and 
fortification contractors and, somewhat later, to 
architects (Hoselitz 1951, p. 195). Seeing such 
activities as the entrepreneurial function, the mid-eighteenth-century
French writer Bernard F. de
Belidor further defined it as buying labor and materials
at uncertain prices and selling the resultant
product at a contracted price (Hoselitz 1951, 
pp. 198-199). 

Richard Cantillon’s Essai sur la nature du com- 
merce en général (1755), probably written a gen- 
eration before its publication date, drew attention 
to entrepreneur as a technical term. The essence 
of the function of the entrepreneur was to bear 
uncertainty. Conversely to the Belidor emphasis,
Cantillon saw the entrepreneur as anyone who 
bought and sold at uncertain prices. Obviously, 
there is no contradiction between Belidor and 
Cantillon in theory, but merely in the type of ex- 
amples chosen. Except for princes, landowners, 
and salaried workers, Cantillon regarded everyone 
engaged in economic activity as an entrepreneur. 

The physiocratic economists of the later eighteenth
century, such as François Quesnay and
Nicolas Baudeau, called the agricultural cultivator 
an entrepreneur. Since the physiocrats also thought 
that only the land was a source of social product, 
this put the entrepreneur in a key position. In the 
sphere of agriculture, Baudeau credited the entre- 
preneur with all the essential characteristics of 
risk taking and innovation that were to be elabo- 
rated in later definitions (in his Première introduc- 
tion à la philosophie économique). At about the 
same time A. R. J. Turgot, in his Réflexions sur la 
formation et la distribution des richesses, spoke of 
the entrepreneur in manufacturing as one who 
risked capital (Hoselitz 1951, pp. 205-212). 

Thus, by 1800, many French economists had 
given special meanings to entrepreneur and entre- 
preneurship, with differences arising largely from 
the characteristics of the sector of the economy 
that chiefly attracted their attention. Those econo- 
mists interested in government saw the entrepre- 
neur as a contractor, the specialists on agriculture 
as a farmer, and the proponents of industry as a 
risk-taking capitalist. The “classic” definition, which 
was to survive until the twentieth century, was 
written by an aristocratic industrialist who had had 
unpleasant practical experience, Jean Baptiste Say. 

In the Catechism of Political Economy, Say 
wrote of the entrepreneur as the agent who “unites 
all means of production and who finds in the value 
of the products . . . the re-establishment of the 
entire capital he employs, and the value of the 
wages, the interest, and the rent which he pays, as
well as the profits belonging to himself” ([1815]
1816, pp. 28-29). This idea appeared earlier in
A Treatise on Political Economy (1803), which 
was not translated into English until 1827. Say’s


entrepreneur commonly, but not necessarily, sup- 
plies either his own or borrowed capital. To suc- 
ceed, he must have “judgment, perseverance, and 
a knowledge of the world as well as of business. 
He must possess the art of superintendence and 
administration” (1803, p. 295 in the 1827 edition). 
Say does not, however, discuss the entrepreneur 
in relation to innovation or capital creation. As in 
the case of the British classicists, he was unable to 
make entrepreneurship a ponderable factor in his 
general economic theory. 

In contrast to the tolerably consistent and ex- 
panding French definitions of entrepreneur, the 
English appear to have made rather less use of 
three terms: adventurer, undertaker, and pro- 
jector. While the Merchant Adventurers of the six- 
teenth century were the equivalent of French 
entrepreneurs, adventurer did not come into gen- 
eral use. The seventeenth-century French use of 
entrepreneur for government contractor had its 
English counterpart in the term undertaker. Near 
the close of the century, the third term, projector, 
came into use. Daniel Defoe, in An Essay Upon 
Projects, equates the term projector with inventor, 
but also with fraud or swindler. In Malachy Postlethwayt’s
Universal Dictionary of Trade and Commerce
the words adventurer and undertaker are
referred to but these terms are not given precise 
definitions (Redlich 1949, p. 9). Bert F. Hoselitz 
(1951) finds that by the time of Postlethwayt, 
undertaker could be applied to businessmen in 
general but that the term was, in fact, becoming 
obsolete. [See the biographies of CANTILLON; QUESNAY;
SAY; TURGOT.]

Early treatment in economic theory. As eco- 
nomic theory became more carefully formulated in 
all the western European nations, no operative 
place was found for the entrepreneur. This was 
particularly evident in English classic theory from 
Smith to Marshall, where many writers made no 
effort either to define or include entrepreneurship. 

The difficulty was that English theory was based 
upon a normal state of equilibrium, established by 
the multiple reactions of businessmen, consumers, 
investors, and workers to the prices of goods and 
services. Individual variations in behavior were 
seen either as canceled out in the aggregate or 
suppressed by competition. In this highly aggrega- 
tive system, any unknown element was to be de- 
rived from the relations of theoretically measurable 
quantities. Such a system could obviously not 
utilize unmeasurable social or cultural factors such 
as entrepreneurship. To say that the entrepreneur 
was rewarded for risk taking, that is, for uncer- 


tainty, was the negation of a proper theoretical 
explanation. 

In addition to the inhospitability of classic theory,
the mid-nineteenth-century business structure
of small-sized to medium-sized family firms, or 
closely held firms, obscured the distinctive char- 
acter of the entrepreneurial function. With few 
exceptions, the men performing this function were 
also capitalist owners. Their rewards could be seen 
as a return on capital rather than as special com- 
pensations for entrepreneurial ability per se. In his 
Principles of Economics, Alfred Marshall perceived 
the changed situation inherent in big, managerially 
run business: “Those general faculties, which are 
characteristic of the modern businessman,” he 
wrote, “increase in importance as the scale of busi- 
ness increases” (1891, p. 644). Yet Marshall did 
not elaborate upon the theoretical significance of 
this increasing factor, and in discussing the growth 
of English manufacturing he used capitalist and 
undertaker interchangeably (1891, pp. 40-43). 

Perhaps the relatively early development of big 
corporations in the United States led American 
economists to think of entrepreneurship as a func- 
tion separate from either ownership or the supply 
of capital. In the late 1870s, Francis A. Walker 
emphasized the distinction between capitalists and 
entrepreneurs and called the latter the engineers 
of industrial progress and the chief agents of pro- 
duction (Dorfman 1946-1959, vol. 3, p. 109). 
Frederick B. Hawley, writing in 1882, saw risk 
taking as the distinguishing attribute of the entre- 
preneur, and ranked this as a factor in production 
on a par with land, labor, and capital (ibid., 
p. .32). At the end of the century, the unorthodox 
John R. Commons gave an explanation of risk- 
taking entrepreneurship and profit that anticipated 
some of the more fully elaborated ideas of Joseph A. 
Schumpeter (ibid., p. 283). According to Com- 
mons, one type of profits arose from the ability and 
risk taking of the entrepreneur and was temporary 
and contingent on changes in the economic situa- 
tion. But Schumpeter himself, in 1912, gave John 
Bates Clark credit for being the first to connect 
“entrepreneurial profits considered as a surplus 
over interest (and rent), with successful introduc- 
tion into the economic process of technological, 
commercial, or organizational improvements” (ibid., 
vol. 4, p. 166n). The basic problem of finding an 
operative role for entrepreneurship in economic 
theory, however, remained unsolved; and economic 
theorists, in general, well aware of the incongruity 
of a nonmeasurable human element in a theoret- 
ical structure based on quantifiable assumptions, 




moved in other directions, such as the study of 
business cycles, income, saving, and investment. 
[See the biographies of CLARK, JOHN BATES; COMMONS;
MARSHALL; WALKER.]

Schumpeter-Cole views. Schumpeter’s acceptance
of a chair at Harvard, the translation of The
Theory of Economic Development into English in 
1934, and the great depression all called new 
attention to his position regarding the essential role 
of the entrepreneur in creating profits. According 
to Schumpeter, both interest and profit arose from 
progressive change, and would not exist in a static 
society, as he defined it. Change, in turn, was the 
work of innovating businessmen or entrepreneurs. 
Since one change was likely to stimulate others, 
there was a tendency for innovations to cluster and 
produce long upswings in profits and business 
activity. 

Much additional interest in entrepreneurship 
arose from the work of two of Schumpeter's col- 
leagues at Harvard, economic historians Edwin F. 
Gay and Arthur H. Cole. In 1944, in his presidential 
address to the Economic History Association, Cole 
offered a historical analysis of the changing char- 
acter of entrepreneurship. Four years later, he and 
Schumpeter cooperated in establishing at Harvard 
the Research Center in Entrepreneurial History. 
From the work of the center, influenced greatly by 
Cole and Leland H. Jenks, there emerged an approach
to entrepreneurship differing from Schumpeter’s.

In the Schumpeterian view, innovation was the
criterion of entrepreneurship; “. . . the defining
characteristic is simply the doing of new things or
the doing of things that are already being done in 
a new way (innovation)” (Schumpeter 1947, 
p. 151). The “new way” was a “creative response” 
to a situation that had, at least, three essential 
characteristics. 


First . . . it can practically never be understood ex ante. 
. . . Secondly, creative response shapes the whole course 
of subsequent events and their “long-run” outcome. . . . 
Thirdly, creative response . . . has . . . something to do 
(a) with the quality of personnel available in the so- 
ciety, (b) . . . with quality available to a particular 
field of activity, and (c) with individual decisions, ac- 
tions and patterns of behavior. Accordingly a study of 
creative response in business becomes coterminous 
with a study of entrepreneurship. (Schumpeter 1947, 
p. 150) 


Thus, to Schumpeter, a manager was an entre- 
preneur only while he was making a creative or 
innovative response. 

The Cole view of entrepreneurship, however,
equates it with the continuing general activities of
managers. It is “the purposeful activity (including
an integrated sequence of decisions) of an indi- 
vidual or group of associated individuals, under- 
taken to initiate, maintain or aggrandize a profit 
oriented business unit for the production or dis- 
tribution of economic goods and services” (Cole 
1959, p. 7). “Novelty is successful in the business 
world only if the institution introducing it is being 
effectively maintained” (p. 15). 

Since the difference in the two definitions in- 
volves only the scope within which the term may 
be applied, the followers of Cole and Schumpeter 
have worked together with a minimum of friction. 
Both definitions implied broad social approaches, 
close to the Germanic tradition of Gustav Schmoller, 
who had seen the need for “a deeper insight into 
the social context of the enterprise” (Lane & 
Riemersma 1953, p. 6), or Alfred Weber, who had 
regarded entrepreneurship as a socioeconomic func- 
tion separate from profit making. Both definitions 
also suggest time spans longer than those of dy- 
namic economic theory. [See the biography of 
SCHUMPETER. ] 

Recent thought. In spite of the difficulties in- 
herent in long-run analysis and in unmeasurable 
human factors, the increasing interest of econo- 
mists in economic development has directed their 
attention to entrepreneurship. While economists in
this field include the entrepreneurial function in 
empirical studies of situations and in recommend- 
ing appropriate economic policies, the theoretical 
problem of finding a place for unmeasurable and 
socially influenced forces in a mathematically ori- 
ented theory has not been solved. More specifically, 
the difficulty is that entrepreneurial earnings would 
have to be accounted for in a theory of profit, but 
no theoretical cost of entrepreneurship can be set. 

Entrepreneurial study continues, however, in the 
unoccupied territory bordered by economics, his- 
tory, and sociology. Two publications—Explora- 
tions in Entrepreneurial History (started by R. 
Richard Wohl and Hugh G. J. Aitken at the re- 
search center at Harvard and continued in the 
1960s under the editorship of Ralph L. Andreano 
at Earlham College) and Economic Development 
and Cultural Change (initiated by Bert F. Hoselitz 
at the University of Chicago)—emphasized, during 
the 1950s and 1960s, the importance of entre- 
preneurship. In addition to scores of articles in 
these two publications, the Harvard center inspired 
a number of larger empirical studies. Of these, 
Fritz Redlich’s volumes on the military entre- 
preneur in Europe cover the longest historical time 
span. In all, a new field of specialization, not firmly 


attached to any single discipline, appears to have 
been established. 

Students of entrepreneurship generally have 
come to agree that while it is a definable function, 
entrepreneur is a term denoting an ideal type 
rather than a term continuously applicable to a 
real person. Any businessman or other official may 
exercise entrepreneurship, but a classification can- 
not be devised that would empirically separate 
entrepreneurs and nonentrepreneurs.

One group of scholars has expanded the Cole 
point of view that entrepreneurship is a continu- 
ous function in business organization. 


To the extent that behavior in a business firm is or- 
ganized (formally or informally), to that extent we 
have entrepreneurship; to the extent that it is disor- 
ganized, random, or self-defeating, to that extent 
entrepreneurship is lacking. . . . The characteristics 
conventionally associated with entrepreneurship—lead- 
ership, innovation, risk-bearing, and so on—are so 
associated precisely because, in a highly commercial- 
ized culture such as ours, they are essential features 
of effective business organization. By the same logic, 
in a differently oriented culture, the typical character- 
istics of entrepreneurship differ. (Aitken 1963, 
p. 6). 


Another school of thought prefers to keep the 
Schumpeterian distinction between the entrepre- 
neurial function of making strategically important 
or innovating decisions and the managerial func- 
tion of maintaining the more routine operations of 
a business organization. Nonspecialists, also, tend 
to emphasize strategic decision making in distin- 
guishing entrepreneurship from management. 


THOMAS C. COCHRAN


[See also HISTORY, article on BUSINESS HISTORY.]


ENVIRONMENT


Environment has been defined as “the aggregate 
of all the external conditions and influences affect- 
ing the life and development of an organism” 
(Webster’s New Collegiate Dictionary). The aim,
then, with either individual organisms or com- 
munities, is to distinguish between factors arising 
from outside the system and factors inherent in 
the system itself. This sounds simple enough, but 
in practice the distinction between organism and 
environment is not always easy to make. 

In the first place, there is a problem of limits. 
This is essentially a matter of definition; to illus- 
trate with an absurd example, when does an apple 
that a person eats cease to be a part of the environ- 
ment and start to be a part of the man? Perhaps as 
soon as it enters the mouth; perhaps not until di- 
gestion has been completed. Most people, in any 
case, would consider indigestion caused by eating 
a green apple to be an environmental effect. Simi- 
larly, internal parasites would be considered part 
of the environment of the individual in which they 
live, even though they are entirely inside the sys- 
tem: the question is one of origin rather than 
present position. 

The limit problem is not very serious. More con- 
fusing is the problem of interaction between organ- 
ism and environment. The earth’s atmosphere, for 
instance, with oxygen and carbon dioxide as com- 
ponent gases, is an essential part of the environ- 
ment for life as we know it. Yet we now believe 
that these gases were not part of the early atmosphere
of the planet; their existence is a consequence
of the action of living organisms, as well
as a necessary condition for life. The relation be- 
tween vegetation and soil provides another type of 
example. The kind of forest growing in a particular 
region is at least partly the consequence of the 
type of soil in that region; yet the nature of the soil 
is partly determined by the sort of vegetation that 
has grown on it. 

The interaction problem is particularly confus- 
ing in the case of man and the human environ- 
ment. In orienting and judging our surroundings, 
we depend on our sensory systems; yet, as psychol- 
ogists can so easily show, what we perceive is in 
part a consequence of conditioning and learning. 
We have thus really created many aspects of the 
shapes, colors, sounds, and smells in the world 
about us, at least as they influence our behavior. 

The concept of culture is responsible for many 
difficulties in this connection. Should culture be 
looked at as a part of the individual or of the com- 
munity? Is it a part of the environment? The 
answer surely varies, depending on the nature of 
the study. 

For instance, if we are concerned with the ways 
in which peoples (or individuals) cope with their 
natural environment, we almost necessarily look 
at the physical man with the cultural equipment 
at his disposal for this purpose. Thus the same 
physical surroundings, the same environment, may 
have quite different meanings for, say, a food- 
gathering pygmy, an agricultural Bantu, or a west- 
ern European. We are here concerned primarily 
with cultural adaptations or maladaptations. This 
is the case with many sorts of geographical and 
anthropological studies; in extreme cases, human 
nature may be taken as a constant to be ignored 
and study concentrated on the interaction between 
culture and environment in this sense of the term. 

In many kinds of psychological studies, on the 
other hand, cultural traits are dealt with as parts 
of the environment: we become interested in the 
ways in which individuals are taught to accept 
their culture, or the ways in which they are frustrated
by it and the forms that rebellion takes. In
physiological studies, by contrast, culture may or
may not be considered as environmental at all. 
Work on heat stress, for instance, may involve a 
naked man on a treadmill under controlled climatic 
conditions, in an attempt to eliminate cultural ef- 
fects; alternatively, it may involve clothed men 
under natural conditions. Investigations of such 
concepts as “comfort zones” assume that people 
will be wearing culturally acceptable clothing. 

Clothing illustrates nicely many aspects of the
man-culture-environment relationship. Clothing is
definitely external, stemming from outside the system
of the individual organism. Yet in many kinds
of practical studies, we must assume that people 
are wearing clothes and observe environmental 
effects on clothed individuals. Furthermore, the 
kind of clothing worn may influence individual 
personality—witness the effect of uniforms, of 
formal dress, or of work clothes. 

Organism and environment, then, are not con- 
trasting or mutually exclusive terms; rather, they 
represent interacting systems, and the distinction 
is useful in analysis only when this is kept in mind. 
Environment is often contrasted with heredity, as 
in the long controversy over “nature versus nur- 
ture” in shaping human personality. There is no 
“versus.” Every individual is the product of a cer- 
tain genetic potentiality finding expression in a 
particular environment or series of environments. 

In studies of biological communities, the concept 
of ecosystem has proved useful: this term covers 
both the living organisms and the abiotic factors 
of the environment in which they occur. Biologists 
thus avoid the dangers of looking at the living 
members of the community as separate from their 
surroundings. The extension of this idea to man, to 
human ecosystems, will surely be rewarding. 

Ecologists have tried various ways of analyzing 
environments to study and compare their different 
components, but no single system is completely 
satisfactory. One way involves the separation of 
the biological and physical environments: the liv- 
ing and the nonliving elements in the surroundings 
of an individual organism or a species. Thus, one 
can isolate climate as an aspect of the physical 
environment. In the case of man, some geographers 
have thought climate to be a controlling factor in 
cultural development, considering civilization, for 
instance, to be a response to a particularly stimu- 
lating climatic situation. On the other hand, they 
see climates that are too warm, too humid, or too 
cold as having a retarding effect on cultural 
development. 

Climate illustrates another aspect of environ- 
ment, that of scale. Climate is ordinarily measured 
in standard meteorological stations that reflect 
conditions under which few organisms live. Con- 
ditions in a forest or meadow, within the soil, or in 
a pond may be quite different from those where the 
standard measurements are made. It is, in fact, 
useful to distinguish among three levels of climate: 
the microclimate in which an individual lives, per- 
haps conditions in a cranny in bark or on the 
underside of a leaf; the ecological climate of some 
particular habitat, like a forest; and the geographical
climate measured by the standard station. The
same kind of difference, depending on the focus
of interest, applies to many other aspects of the 
environment. 

In the case of man, we have to consider the cul- 
tural as well as the biological and physical environ- 
ments, bringing up the whole series of problems 
mentioned earlier. 

Another way of analyzing environmental factors 
is in terms of whether or not they affect the par- 
ticular organism under consideration, and whether 
or not they are perceived. The sum of the forces 
and materials in any situation can be looked at as 
the total environment, but this includes many ele- 
ments that have no influence on behavior. We live, 
for instance, in the magnetic field of the earth, but 
this does not affect our physiology in any known 
way; and we cannot perceive it without the aid of 
some instrument like a compass. Similarly, we can- 
not perceive viruses without instruments, but many 
of them do affect us by causing disease. Viruses 
would, then, form part of our operational environ- 
ment, although not a part of our perceptual 
environment. 

We tend to confuse our own perceptual environ- 
ment with total reality: to think, for instance, that 
the forest we see is the “real” forest—for squirrels, 
birds, and insects as well as for ourselves. Yet it is 
obvious enough, when we stop to think about it, 
that each kind of animal lives in a particular sort 
of perceptual world of its own. The forest that a 
dog sees, hears, and smells is quite different from 
the forest that we perceive. This is something that 
must be clearly kept in mind in studying animal 
behavior as well as animal ecology. 

When we turn to man, the concepts of opera- 
tional and perceptual environments are inadequate 
by themselves, just as are those of biological and 
physical environments. For primitive man, the 
world is full of spirits which form a very real and 
important part of his surroundings—form what 
has sometimes been called the supernatural 
environment. 

This sounds a little condescending; we can see 
that the people themselves created the spirits that 
so dominate their lives—that the spirits are merely 
ideas. Yet our own actions are equally governed 
by ideas. We may not call them spirits or propitiate 
them with elaborate rituals, but ideas play roles 
in our lives that often are at least comparable. We 
might call this world of ideas the conceptual en- 
vironment. This differs somewhat from the cul- 
tural environment: the latter would include all 
human artifacts, such as clothing, housing, and 
tools; the former is concerned with the ideas that 
govern the form and use of the constructs. 


When we look at the human environment, the 
continuing interaction between organism and sur- 
roundings is very evident. It is often said that man, 
far more than any other animal, has developed the 
ability to modify environment and thus to live 
under a wide range of physical and biological con- 
ditions. He has, in fact, created a new “biome” or 
ecological formation—the man-altered landscape, 
which is rapidly replacing other terrestrial land- 
scapes as forests are cleared, grasslands plowed, 
and deserts irrigated. Man, assuming ecological 
dominance within this biome, has affected directly 
or indirectly all other organisms living there; he 
has become a new geological force. 

Most members of the human species now live 
within this man-altered landscape. But this environ- 
ment is not only a consequence of human activity; 
it is also a determinant of that activity. For in- 
stance, man has created the city; but the nature 
of the city, of the urban environment, governs his 
behavior much as the nature of the forest governs 
the behavior of tree squirrels. We could perhaps 
most profitably look at this new kind of biome as 
forming the human ecosystem. 

It is, of course, not a single system; it includes 
a varied collection of differing environments: cities, 
towns, rice paddies, orchards, pastures, highway 
rights of way, abandoned fields, rubber plantations, 
and areas devastated by industrial wastes. Curi- 
ously, both social and biological scientists tend to 
neglect the study of this human ecosystem, per- 
haps because its study requires a blending of both 
social and biological knowledge. 

MARSTON BATES 


[See also CONSERVATION; CULTURE; ECOLOGY; GEOG- 
RAPHY; PLANNING, SOCIAL, article on RESOURCE 
PLANNING; REGION; REGIONAL SCIENCE.] 
ENVIRONMENTALISM


The question of the relations between man in 
society and the geographical environment in which 
he lives is a very old one. Hippocrates (fifth century
B.C.) wrote a treatise, “On Airs, Waters, and
Places,” which is generally regarded as the first 
formed expression of an environmentalist doctrine, 
although in view of the limited data available to 
him it is not appropriate to regard this as a state- 
ment sufficiently definitive for a serious critique 
of environmentalism, as Toynbee does (1934). 
Environmental considerations, especially climatic 
ones, play a considerable role in Montesquieu 
(1748) and perhaps reached their peak in the 
mid-nineteenth century, with Victor Cousin’s “give 
me the [physical] map of a country . . . and I
pledge myself to tell you, a priori, . . . what part
that country will play in history, not by accident, 
but of necessity; not at one epoch, but in all 
epochs” (quoted in Febvre [1922] 1925, p. 10). 
Such extreme necessitarianism could hardly go un- 
challenged, and the first serious attack on geo- 
graphical determinism is associated with the name 
of Paul Vidal de la Blache, who about the turn of 
the century became in effect the founder of an 
opposed doctrine known as “possibilism.” 

Possibilist doctrine is perhaps best, or at least 
most characteristically, summed up in a dictum of 
Lucien Febvre ([1922] 1925, p. 235): “There are 
nowhere necessities, but everywhere possibilities; 
and man, as master of the possibilities, is the judge 
of their use.” A protest against crude predestinar- 
ianism was certainly in order; but Vidal’s own 
qualifications are perhaps not always faithfully 
mirrored by his disciples, and Febvre’s epigram- 
matic statements distracted attention from the real 
task of assessing the probabilities posed by the 
indisputable fact that the possibilities are distrib- 
uted over the face of the earth with great inequality. 
This has recently been elegantly demonstrated by 
Lukermann (1965). It is also, perhaps, insufficiently
noted that French possibilism was itself
to some extent determined by a reaction to what 
we would now call an expansionist Geopolitik, ex- 
pressed in Friedrich Ratzel’s Politische Geographie 
of 1897 (Febvre 1922; cf. Spate 1957). 

Be that as it may, the French school of geog- 
raphy, particularly noted for its meticulous and 
luminous style of regional description, was by that 
very technique often able, quite plausibly, to evade 
the issue, while more general works, such as those 
of Brunhes (1910) and Vallaux (1911), tempered 
possibilism by some allowance for the “influences,” 
although not the “controls,” exercised by the phys- 
ical environment. In Germany, also, the broad 
strokes of Ratzelian anthropogeography were gradu- 
ally succeeded by the more subtle chorographic 
analyses of Landschaft, and already in 1907 Alfred 
Hettner had arrived at a formulation not dissimilar 
from Febvre’s (cf. Hartshorne 1939, p. 123). 

In English-speaking countries the evolution was 
different. With the popularization, or (in both 
senses) vulgarization, of Ratzel’s basically deter- 
minist outlook by Semple’s Influences of Geographic 
Environment (1911), a somewhat naive view 
of environmental “controls” became paramount 
among geographers in the United States and 
Britain, and this is what is generally known as 
“environmentalism.” Another powerful influence 
was that of Ellsworth Huntington, whose numer- 
ous works attached a preponderating role to broad 
climatic factors. But the antienvironmentalist reac- 
tion, if much later than in France and Germany, 
was all the more complete. To some extent both 
the acceptance and the reaction stemmed from a 
rather uncritical empiricism, and this was espe- 
cially notable in Britain. While in Britain possi- 
bilism in its purest form held undisputed sway in 
the 1930s, in the United States environmentalism 
was not so much negated as simply sidetracked. 
There were indeed plenty of overt rejections (Sauer 
1925; Platt 1948—an extremist case), but on the 
whole the emphasis was on geography as simply 
the study of areal differentiation. This, of course, 
has clear analogues with the German development, 
and Hettner in particular was a strong influence, 
especially through the comprehensive methodologic 
study of Richard Hartshorne (1939). In Britain 
there was no comparable searching out of funda- 
mentals, and indeed possibilism fitted well into a 
rather superficial and characteristically “English” 
empirical distrust of theory. The qualifications, 
subtle and unstressed but nonetheless significant, 
of the French school were ignored, and geography 
became in effect an entirely idiographic study in 
which it would be indecent to draw conclusions. 


There were, of course, heretics: in the United 
States, Peattie (1940); in Britain, Markham 
(1942); but they had no effect. 

The old view of geography as primarily a study 
of man-environment relations is now outmoded, 
and it is probable that a reasonable consensus 
would be found in favor of Hartshorne’s formula- 
tion (1959, p. 21): “accurate, orderly, and rational 
description and interpretation of the variable char- 
acter of the earth surface.” However, relationships 
vary with the distribution of the phenomena that 
are in relation, and provided that we do not pre- 
judge the issue by insisting that they are confined 
to those between man and natural environment or 
are one-way only, there is still ample scope for the 
examination of environmental problems. It is not, 
as Toynbee says (1961, p. 635), modern arrogance, 
but humility in face of data still inadequate, which 
refuses to take his refutation of Hippocrates as a 
final judgment. Moreover, while it may be true that 
external demands (as from history and sociology) 
for environmental assessments may represent a 
hangover from days when geographers were all too 
ready with crude causal explanations (and they 
got the habit from historians), it yet remains true 
that very often historical, sociological, economic, 
anthropological, political, and even religious and 
aesthetic phenomena cannot be properly compre- 
hended without careful attention to environmental 
considerations. 

Thus, the question is by no means so decisively 
closed, in an antienvironmentalist sense, as it 
seemed two or three decades ago; and as we shall 
see, it has taken on an entirely new aspect with 
the application of new techniques to geographical 
inquiry. While there have always been individual 
divergences from the general trend and, not infre- 
quently, internal inconsistencies in the work of in- 
dividuals, whether styling themselves environmen- 
talists or possibilists, the question (as in many 
controversies) has been bedeviled by the assump- 
tion by both sides of a too rigid dichotomy. Whether 
tenable in strict logic or not, a more balanced 
probabilistic hypothesis seems warranted. This 
seems avoidable only if, as Hartshorne hints (1959,
p. 55), we altogether abandon any distinction between
man and nature; and this repudiation, dubiously
metaphysical as it is, in practice seems impossible
to maintain (Spate 1963a, pp. 255-259).
In practice, except on an absurdly mechanistic 
plane, it is impossible to hold that all man’s activi- 
ties are absolutely conditioned or determined by 
his natural environment, even if we resort to in- 
tricate rationalizations as to its expression through 
social institutions. But it is absurd, also, to take
Febvre’s dictum at face value and so slide into the
position of ignoring the fact that possibilities vary 
greatly from milieu to milieu and, hence, in any 
given milieu are in fact limited. One may in a sense 
overcome this by saying that anything is possible 
anywhere if only one is willing to pay the price; 
but then, paying the price is itself a compelled 
adjustment to the environment. The flight from 
“controls” into a denial of “influences” takes us 
nowhere; or, if anywhere, into solipsism. 

A reaction against possibilism became apparent 
around 1950. It avoided the crudity of the earlier 
concept of environmental control, as well as the 
dead end of possibilism, by stressing in any given 
situation the balance of probability, as, of course, 
both environmentalists and possibilists had often 
done in practice without admitting it. Some signs 
of rapprochement are found even in contributions 
avowedly committed to one side or the other (Tatham
1951; Taylor 1951). Perhaps the first really
vigorous reassertion of geographic determinism 
was that of Martin (1952). 

This newer and more cautious environmentalism 
gives more play to social factors than did the old. 
It recognizes that the geographical environment is 
only a part of the total environment and allows for 
the modifications of environment introduced by 
human activity; geographical influences act through 
society, and cultural tradition has a certain auton- 
omous and reciprocal effect. Strands of causation 
may therefore be extremely subtle, and dogmatism 
is avoided. At the same time, it is firmly held that 
there is a larger irreducible minimum of influence 
by the physical environment than possibilism allows 
for. Although the impact of this will vary with the 
converse impact of human technological levels, 
nevertheless there will always be at least the ad- 
justment by price and very often a much more 
direct adjustment. 

The mandates of the geographical milieu are, 
however, often more negative and permissive than 
positively imperative. Thus, a total of 200 frost-free
nights does not enforce the growing of cotton but 
does permit it, and fewer frost-free nights inhibit 
it. Further, while in a given situation the general 
cast of development may be very strongly influ- 
enced or conditioned by geographical factors, the 
detail may be dependent on quite other factors. 
This introduces a margin-of-error concept and may 
be illustrated by the difference between the general 
location of a frontier zone or a communications 
node (given the existence of a society with these 
features), which may be fully conditioned by geog- 
raphy, and the precise siting of a boundary within 
the frontier zone or of a city near the node, which 
may depend on historical accident and which may,
perhaps, in turn become a geographical factor in a 
new chain of relationships (Spate 1957). 

This revival of methodological debate in geog- 
raphy owes much to the general increase in sophis- 
tication in the social sciences. This is perhaps more 
particularly true of the newer, quantitative ap- 
proaches, but is by no means confined to them. 
It may fairly be said that the net result of the 
debate has been a material change in the general 
temper of geographical writing. If there has been 
no return to the compulsions of the older environ- 
mentalist school, as exemplified perhaps not so 
much in Ratzel himself as in Semple’s rendition of 
him in the Influences of Geographic Environment 
(1911), it is equally true that pure possibilism, in 
the Febvre version, seems also to be dead. Stimu- 
lating as a protest, it was in the long run stultify- 
ing. Perhaps its most valuable residuum is that, 
indirectly at least, it helped to break away from 
the static concept of environment as a once-and- 
for-all given thing in itself, and it raised the ques- 
tion, Environment for what? This, however, seems 
to have no necessary connection with a possibilist 
view, and it may indeed have gone too far in the 
direction of a metaphysical identification of man 
with nature. The newer, probabilistic approach in 
regional writing is more likely to draw conclusions 
of general import than possibilism did, or at any 
rate to draw them more consciously and responsibly. 

The debate has not been entirely internal to 
geography. One factor was the interest aroused by 
Toynbee’s somewhat cavalier direct treatment of 
environmentalism and by the large if sometimes 
erratic importance he attached to it in such con- 
cepts as “the stimulus of New Ground” and the 
effects of a Völkerwanderung by sea (1934). Toyn- 
bee’s analyses are of great interest, although viti- 
ated by unfamiliarity with the main current of 
geographical writing and lack of a sense of scale; 
but both negatively and positively he contributed 
to putting environmentalism on the map again. 
The environmental component in such studies as 
Wittfogel’s Oriental Despotism (1957) or, on a 
different scale, Sahlins’ Moala (1962) is obvious. 

The new trend has not, of course, gone unchal- 
lenged. As has been noted, it places some stress on 
probability, and the almost accidental introduction 
of the rather clumsy term “probabilism” (for which 
this writer must regretfully accept responsibility)
has naturally attracted some dialectical criticism. 
Important contributions to the debate are those of 
Montefiore and Williams (1955) and the Sprouts 
(1956; 1957; 1965). 

The former appear to approach the problem from 
the standpoint of logical positivism. Their criticism
of a too naive acceptance of cause and effect as 
the only way of looking at scientific explanation is 
acute and vigorous, and they end with calling a 
plague on both houses: “. . . there can be no further 
point in their continuing a dispute which has vir- 
tually no bearing on their activities as working 
geographers.” However, it may be suggested that 
this does not dispose of the issue. Belief does nor- 
mally have some bearing on activity, and the 
dichotomy has been resurrected in a new (and, to 
some, alarming) fashion by quantifiers of the type 
of Warntz and Isard. The fundamentally important 
papers by the Sprouts include a very careful seman- 
tic analysis of hypotheses under the categories 
“environmental determinism,” “mild environmen- 
talism,” “environmental possibilism,” “environmen- 
tal probabilism,” and “cognitive behaviorism.” They 
point out the logical residuum of environmentalist 
thinking implied in the possibilist approach and 
give at least a qualified blessing to probabilism; but 
it may be said that to a geographer their possibilism 
looks more like probabilism, and their probabilism 
seems in turn to hold a more predictive element 
than those who would not call themselves environ- 
mentalists tout court would allow. 

The rise of applied and quantitative geography 
has in some respects given a new emphasis to en- 
vironmental studies. One may instance Soviet 
geography, in which there is theoretically no prob- 
lem: the laws of nature govern physical geography 
but are entirely separate from the social laws which 
govern man, and therefore there can be no unified 
geography (which is the essence of environmental- 
ism) but only physical and economic geographies. 
Practice and large-scale planning do, however,
compel very meticulous attention to environmental 
factors, and even “influences” are not altogether 
banned, as they are in pure possibilism (Spate 
1963b). In practically all fields involving the physi- 
cal application of technology, whether under Soviet 
or Western auspices, very careful attention to prob- 
lems of the physical environment is essential, if 
only as part of estimating costs. 

It is often stated that the impact of modern tech- 
nology has minimized, even annihilated, the sig- 
nificance of the environmental factor. However, on 
analysis it will frequently appear that the role of 
the physical milieu, if less “brutal” than it may be 
for a primitive-subsistence society, is pervasive in 
a more subtle way. It may be theoretically possible 
to grow anything almost anywhere, at a price; but 
the effect of price itself, in alliance with modern 
communications, may well be not to widen the 
range of a given crop but to narrow it to the area
physically best suited for it: witness the formerly
wide and presently restricted extent of flax growing 
in Europe and cotton growing in India. Large-scale 
technical installations may often depend for their 
economic efficiency on a nice balance of environ- 
mental considerations. 

The basic assumptions of the new, quantifying 
schools have strong determinist, if not mechanistic, 
overtones, as suggested by the very title “social
physics” (Stewart & Warntz 1958). At the least, 
they are strongly probabilistic, as is well demon- 
strated by Burton (1963). They aim at being nomo- 
thetic rather than idiographic, as were possibilism 
and much of the work of the chorographic approach 
standard in the Hartshorne era. They avowedly 
seek out laws with a capital L, as did Semple 
(Dodd & Pitts 1959). They work largely in models, 
and a high degree of prediction is regarded as the 
ideal. A culmination of this attitude is that of Isard 
(1956) in his desire for a “true” set of regions suit- 
able for all purposes. There is often a tendency, as 
in the concept of population potential (Stewart & 
Warntz 1958), to abstract all but one or two fac- 
tors, considered determinative; but these, also, are 
considered as some sort of summing up of the 
essence of the total environment. 

It cannot, therefore, be assumed, as it was only 
a few years ago, that the ancient debate regarding 
the role of environmental factors is played out. 
That role changes with every change in technology, 
but it also must enforce technological changes, if 
the full and effective deployment of technical poten- 
tial is to be made possible. Nor would the conquest 
of space necessarily mean the supersession of en- 
vironment; there may be other than terrestrial 
environments for man, and these will compel spe- 
cial adjustments, social and technical. Meanwhile, 
the study of environment on this earth is far from 
complete; and while claims that it would provide 
an all-embracing rationale of society are justly 
dead, its significance must always be reckoned with 
in such studies as anthropology, archeology, soci- 
ology, and political science, to say nothing of his- 
tory, and it forms an essential bridge between 
these social studies and the natural sciences. 
EPIDEMIOLOGY


Epidemiology is a branch of ecology that in- 
cludes both the sum of what is known concerning 
the differential distribution of disease throughout 
a population and the techniques for collecting 
and analyzing data dealing with the prevalence 
and incidence of disease among different social 
groups. While originally limited to the study of
epidemics or the spread of contagious disease,
epidemiology today covers all types of disease,
degenerative as well as communicable, and all
population characteristics—social and psychological as
well as biological and physical—that may help to
describe or explain the prevalence of disease.

Methods of epidemiology. In the broad sense 
of the term, epidemiology deals with the occur- 
rence and distribution of disease among different 
population groups, whether human, animal, or 
plant. The discovery or description of these dif- 
ferences has been called descriptive, or compara- 
tive, epidemiology, whereas the analysis of the 
causal factors and conditions producing these dif- 
ferences is usually referred to as explanatory, or 
analytic, epidemiology. As epidemiology becomes 
increasingly concerned with the study of the origin 
and course of disease, rather than solely with its 
distribution, this distinction is gradually disap- 
pearing. 

Because of its emphasis upon the relationship 
between environmental factors and disease, epi- 
demiology is properly regarded as a major branch 
of human ecology, or “the study of the relations 
between man and his environment, both as it af- 
fects him and as he affects it” (Rogers 1960, 
p. vii). In general, three main sets of interacting 
factors form the focus of epidemiological interest: 
the host, or human individual varying in genetic 
resistance, susceptibility, and degree of immunity 
to the disease; the agent, or carrier of the disease, 
including any adverse process, whether it be an 
excess, deficiency, or interference of a microbial, 
toxic, or metabolic factor, and varying according 
to infectivity, virulence, and pathogenesis; and the 
environment, or surrounding medium, social as 
well as physical, which affects both the suscepti- 
bility of the host, the virulence of the agent or 
disease process, and the quantity and quality of 
contact between host and agent (Paul 1950, pp. 
53-54). These three sets of factors do not exist 
in any simple one-to-one relationship but maintain 
a complex, ever-changing balance. The occurrence 
of disease, especially mass disease, is the result 
of a multiplicity of causal factors, each of which 
contributes to, rather than accounts for, the ap- 
pearance of the disease. 

Epidemiological knowledge consists of the avail- 
able facts and theories concerning the relationships 
between these three factors and the various disease 
entities and health conditions. Social epidemiology, 
as a subdivision of epidemiology, concentrates on 
the social, as opposed to the physical or biological, 
factors in the incidence and prevalence of disease. 
In the case of the chronic, degenerative diseases 
and the mental and behavioral disorders, both of
which constitute primary targets of modern epidemiology,
distinctions between host, agent, and
environmental factors and between social and bio- 
logical or physical factors are becoming increas- 
ingly difficult to maintain. 

As a research method, epidemiology refers to 
“the application of scientific principles to investiga- 
tions of conditions affecting groups in the popula- 
tion [constructive epidemiology]” (Clark [1953] 
1958, p. 65). Predominantly, this involves the ob- 
servation of the occurrence of disease under natu- 
ral conditions in whole populations, as opposed to 
clinical or laboratory investigations. Epidemiologi- 
cal method, for the most part, uses the research 
techniques of the population survey to discover the 
relationship between the occurrence of disease and 
the presence of various biological, physical, and so- 
cial factors. The kind of “proof” that it tries, for the 
most part, to obtain is statistical association be- 
tween the presumed “causal” factor and the occur- 
rence of the disease. Dawber and Kannel (1963, 
pp. 433-434) have spoken of “macroscopic” studies,
which correlate rates of a disease with other statis- 
tical measures for an area or population group 
(ecological correlations), as contrasted with “mi- 
croscopic” studies, which correlate personal char- 
acteristics with the presence or absence of disease 
within the individual (individual correlations). 
Experimental epidemiology, whether it involves the controlled
introduction of epidemic conditions into populations of
experimental animals in the laboratory (Greenwood 1932) or
field experiments to test the efficacy of various immunizing
agents or preventive measures (MacMahon et al. 1960,
pp. 268-279), represents an attempt to apply the
experimental method to epidemiological problems.

Historical background. The scope of epidemi- 
ology, which was “originally concerned only with 
epidemics, . . . was extended first to include in- 
fectious diseases which do not ordinarily occur in 
epidemic form, such as leprosy, syphilis, and tuber- 
culosis, and later to noninfectious diseases” (Doull 
1952, p. 76). The birth of epidemiology as we 
know it may be traced back to England in the 
late seventeenth century, when John Graunt in 
1662 developed the first mortality tables. However, 
it was not until the mid-nineteenth century that 
men like Johann Süssmilch and Adolphe Quetelet
utilized these statistics to help identify etiological 
factors in disease. The major emphasis of epi- 
demiology under such eminent pioneers as John 
Snow (cholera), Peter Panum (measles), William 
Budd (typhoid), and Kenneth Maxcy (endemic 
typhus) was upon the discovery of host, agent,
and environmental factors associated with the
spread of these highly contagious diseases, or what 
has been called “the mass-phenomena of infectious 
diseases” (see Frost 1910-1939). 

The dramatic conquest of the infectious diseases 
in the present century, together with the growing 
importance of the chronic, degenerative diseases, 
soon made it apparent that epidemiology could no 
longer be restricted to epidemics. As a matter of 
fact, epidemiological studies of nutritional (James 
Lind on scurvy) and occupational (Henry B. 
Baker on lead colic) diseases had already demon- 
strated the applicability of epidemiological method 
to noninfectious diseases. The use of statistical 
associations based upon population surveys became 
one of the foremost methods for studying the oc- 
currence of cancer, cardiovascular disease, and 
mental illness and for the difficult task of identi- 
fying specific etiological agents. Today, the value 
of epidemiological research for the study of all 
diseases is well established (James & Greenberg 
1957). 

Uses of epidemiology. As a standard tool of 
medical investigation, epidemiology has been 
brought to bear upon almost all aspects of the pre- 
vention and treatment of disease. Morris (1957) 
has listed seven fundamental applications: the de- 
termination of individual risks on the basis of 
morbidity tables and cohort analysis—for example, 
the chances of a forty-year-old male getting can- 
cer; the securing of data on subclinical and un- 
detected cases; the identification of syndromes or 
clusters of symptoms; the determination of his- 
torical trends of disease; the diagnosis of commu- 
nity health needs and resources; program planning, 
operation, and evaluation; and the search for 
causes of disease. Similar uses are described by 
Breslow (1957) for a large-scale epidemiological 
survey of chronic diseases in California. These in- 
clude a demographic description of the changing 
population composition, a broad picture of the state 
of health and illness in the community, more ex- 
tensive knowledge about disease prevalence, data 
on the utilization of health services, case rosters 
for follow-up investigations, and data on etiologi- 
cal factors. Thus, epidemiology provides a large 
portion of the scientific base for public health 
practice. 

The diversity of these applications would sug- 
gest that epidemiological surveys are often com- 
bined, or confused, with general community health 
surveys. A survey that asks questions about health
conditions and medical care of a population sample 
does not automatically become an epidemiological 
study. From a more rigorous point of view, the 
major contribution of epidemiological research
should be in the development and testing of hy- 
potheses concerning specific factors that may in- 
fluence the distribution of some particular disease 
in a defined population. On the basis of existing 
knowledge, theory, or observation, the epidemiol- 
ogist identifies subgroups of the population believed 
to have varying incidence rates of the disease being 
investigated. He then hypothesizes certain etiologi- 
cal factors related to the disease and also believed 
to differ among the subgroups being studied. By 
means of a field survey or the analysis of existing 
data, he then tests the direction and degree of 
association between the occurrence of the disease 
and the presence or absence of the group charac- 
teristic hypothesized as the etiological factor. 
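
The test of direction and degree of association described above can be illustrated by a minimal sketch in Python. The survey counts are hypothetical, and the choice of a rate difference to express direction and a rate ratio to express degree is an assumption made here for illustration; the text does not prescribe a particular measure.

```python
def association(cases_with, n_with, cases_without, n_without):
    """Compare disease rates between persons with and without a characteristic.

    Returns (rate_with, rate_without, difference, ratio). The sign of the
    difference gives the direction of the association; the ratio is one
    possible measure of its degree. The rate without the characteristic
    must be nonzero for the ratio to be defined.
    """
    rate_with = cases_with / n_with
    rate_without = cases_without / n_without
    difference = rate_with - rate_without
    ratio = rate_with / rate_without
    return rate_with, rate_without, difference, ratio


# Hypothetical counts: 30 cases among 1,000 persons having the hypothesized
# characteristic, 10 cases among 1,000 persons lacking it.
rate_with, rate_without, difference, ratio = association(30, 1000, 10, 1000)
print(rate_with, rate_without, difference, ratio)
```

A positive difference indicates that the disease is more frequent where the characteristic is present. Such a figure remains an association only and does not by itself establish the characteristic as a cause.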

Epidemiology and social science. Epidemiology 
has theoretical and methodological ties to the so- 
cial sciences. Both the epidemiologist and the social 
scientist are concerned with demography and ecol- 
ogy—the relationship of man to his environment 
(Fleck & Ianni 1958). When the environment 
includes sociocultural factors as possible “causes” 
of disease, either indirectly (as in the case of 
poverty leading to malnutrition or unsanitary liv- 
ing conditions) or directly (as in the case of emo- 
tional disturbance leading to mental disease or 
addictive disorders, such as drinking and alcohol- 
ism or drug addiction), then all three basic compo- 
nents of epidemiology—host, agent, and environ- 
ment—take on important social dimensions (King 
1963). Epidemiology is becoming increasingly con- 
cerned with “the social component of environment 
. . . that part which results from the association
of man with his fellow man . . . the attainments,
beliefs, customs, traditions, and like features of a 
people” (Gordon 1952, pp. 124-125). In the cur- 
rent era of chronic, degenerative diseases, in which 
an individual’s whole way of life may become more 
important than any single infectious agent in the 
disease process, social factors become a primary 
target for epidemiological investigation. 

Methodologically, both the epidemiologist and 
the social scientist rely heavily upon the popula- 
tion survey and field experiment. Similar problems 
of research design confront both groups, while 
technical considerations such as sampling, ques- 
tionnaire construction, interviewing, and multi- 
variate analysis are objects of mutual methodologi- 
cal interest (Wardwell & Bahnson 1964). 

Recent research. All major diseases today are 
the subject of epidemiological research, and almost 
all of these include, at the minimum, such social 
groupings as sex, age, marital status and family 
composition, occupation, socioeconomic status, religion,
and race. In addition, many studies are
specifically aimed at the investigation of social 
factors, such as social stress, as possible etiological 
agents in the occurrence of the disease. Compre- 
hensive reviews have been prepared by Glock and 
Lennard (1956) on hypertension, Graham (1960) 
on cancer, Mishler and Scotch (1963) on schizo- 
phrenia, Dawber and others (1959) on heart 
disease, Jaco (1960) and Hoch and Zubin (1961) 
on mental disease, Suchman and Scherzer (1960) 
on childhood accidents, and King and Cobb (1958) on
rheumatoid arthritis, among others. The state of 
knowledge in this field is advancing rapidly, and 
the findings of epidemiological surveys appear reg- 
ularly in such periodicals as the American Journal 
of Public Health and the Journal of Chronic Dis- 
eases. 

In general, these studies reveal a large number 
of significant differences in the occurrence of dis- 
ease among different subgroups of the population 
(Pemberton 1963). For example, coronary artery 
disease is found to vary according to such socio- 
cultural variables as occupation, economic status, 
race, and rural-urban residence. Cancer of the 
uterine cervix occurs much less frequently among 
Jewish women; men are more likely to incur car- 
diovascular disease; and mental illness is found 
more often among the lower socioeconomic groups. 
On a more psychological level, insecurity and 
stress tend to be associated with a higher incidence 
of mental illness, alcoholism, narcotics addiction, 
heart disease, arthritis, and a host of psychoso- 
matic conditions (Leighton 1959). Perhaps the 
most famous of these epidemiological correlations 
deals with the association between smoking behavior
and lung cancer (Dorn & Cutler 1958).

Some problems of research design. The major 
conceptual and methodological problems in epi- 
demiological research stem from its dependence, 
by and large, upon associational evidence. The 
basic research design of epidemiological method 
consists in the comparison of two groups, each 
with varying rates of a disease, with respect to 
other characteristics hypothesized as explanatory 
of these varying disease rates. This is essentially 
an ex post facto form of survey research and one 
that may undertake demographic studies of exist- 
ing vital statistics or several other types of study 
using data specially gathered for the purpose. 
These can be classified as being either retrospective 
studies, which secure data on different group 
characteristics hypothesized as etiological factors 
from at least two groups with varying rates of the 
disease being investigated, or prospective studies, 
which follow up groups of individuals with and
without the hypothesized etiological characteristics
in order to determine the differential development 
of the disease. 

In all three study designs, the objective is the 
determination of a series of statistical associations 
from which etiological inferences may be drawn. 
These three types of design offer progressively 
more rigorous and plausible evidence of causality. 
The demographic method, relying as it does on 
ecological correlations, is the weakest, since varia- 
tions in rates of occurrence between phenomena 
do not necessarily mean that these phenomena are 
related (Clausen & Kohn 1954); it is possible to 
have high ecological associations with little or no 
individual correlation. Retrospective studies do 
provide individual correlations, but there is often 
no way of knowing which of the two factors in 
an observed correlation came first. Prospective 
studies using a longitudinal study of cohorts are 
strongest, since these enable one to define the 
population at risk in advance of the development 
of disease and then to check one’s predictions over 
time [see COHORT ANALYSIS]. 
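
The possibility of a high ecological association with no individual correlation can be shown with a small constructed example. The sketch below, in Python, uses entirely hypothetical counts for three areas; the names `areas` and `pearson` are introduced here for illustration only. Rates of exposure and of disease rise together from area to area, yet among the pooled individuals exposure and disease are unrelated.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: each area maps (exposed, diseased) -> number of persons.
areas = {
    "area_1": {(1, 1): 0, (1, 0): 2, (0, 1): 2, (0, 0): 6},
    "area_2": {(1, 1): 4, (1, 0): 6, (0, 1): 6, (0, 0): 4},
    "area_3": {(1, 1): 6, (1, 0): 2, (0, 1): 2, (0, 0): 0},
}

# Ecological correlation: one (exposure rate, disease rate) pair per area.
exposure_rates, disease_rates = [], []
# Individual correlation: one (exposed, diseased) pair per person, pooled.
exposed, diseased = [], []

for counts in areas.values():
    n = sum(counts.values())
    exposure_rates.append(sum(c for (e, _), c in counts.items() if e) / n)
    disease_rates.append(sum(c for (_, d), c in counts.items() if d) / n)
    for (e, d), c in counts.items():
        exposed.extend([e] * c)
        diseased.extend([d] * c)

ecological_r = pearson(exposure_rates, disease_rates)
individual_r = pearson(exposed, diseased)
print(ecological_r, individual_r)
```

In these constructed data the area rates are perfectly correlated while the individual correlation is zero, which is why an inference from area statistics to individuals requires independent support.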

Smoking and lung cancer. The association be- 
tween smoking and lung cancer provides an excel- 
lent example of the progression from demographic 
to retrospective and finally to prospective studies. 
The initial association was suggested by demo- 
graphic comparisons showing a much higher inci- 
dence of lung cancer among men than women. 
Retrospective studies revealed a correlation be- 
tween smoking histories and the occurrence of 
lung cancer. Finally, intensive prospective studies 
following up smokers and nonsmokers showed a 
higher development of lung cancer among the 
former. The continuing controversy today, how- 
ever, demonstrates the further need and demand 
to prove, through experimental rather than epi- 
demiological studies, that smoking can “cause” 
cancer. 

Validity of epidemiological method. The in- 
ability of the epidemiologist to “randomize” his 
experimental and control groups and to alter de- 
liberately the characteristics of his experimental 
group constitutes an intrinsic conceptual and 
methodological shortcoming that requires a con- 
tinuing close working relationship between epi- 
demiological and experimental research. Certain 
basic prerequisites must be satisfied if epidemio- 
logical method is to produce reliable and valid 
associations. First, the representativeness and 
generalizability of the sample from whom data are 
obtained must be ascertainable. This sample should 
include not only persons who are known to have 
the disease but also persons who are free of the disease.
The definition of what is a “normal,” or disease- 
free, control group presents a particularly difficult 
problem for epidemiological study of the chronic 
diseases, since these may not become apparent 
until a fairly late stage. Second, the disease being 
studied must be defined in such a way that it can 
be reliably and validly diagnosed using field tech- 
niques. Errors due to false positives (the propor- 
tion of individuals classified as diseased among 
those truly not diseased) and false negatives (the 
proportion classified as not diseased among those 
truly diseased) can often lead to spurious associa- 
tions (Rubin et al. 1956). Third, the hypothesized 
etiological factors must be similarly capable of 
objective definition and measurement. These are 
difficult conditions to meet, especially in relation 
to the chronic diseases, which often lack both 
clear-cut diagnostic criteria and well-developed 
theories of etiology and process (Pollack & Krueger 
1960). 
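
How errors of diagnosis may produce a spurious association can be sketched as follows. The Python example below is a simplified illustration with hypothetical figures, using the definitions of false positives and false negatives given above; the function name `observed_rate` is introduced here and is not drawn from the literature cited. Two groups are assumed to have the same true rate of disease, but the field technique yields more false positives in one of them.

```python
def observed_rate(true_rate, false_pos, false_neg):
    """Proportion classified as diseased, given the true rate and error proportions.

    false_pos: proportion classified as diseased among those truly not diseased.
    false_neg: proportion classified as not diseased among those truly diseased.
    """
    correctly_detected = true_rate * (1 - false_neg)
    wrongly_detected = (1 - true_rate) * false_pos
    return correctly_detected + wrongly_detected


# Hypothetical figures: both groups have the same true disease rate.
true_rate = 0.10
group_a = observed_rate(true_rate, false_pos=0.02, false_neg=0.20)
group_b = observed_rate(true_rate, false_pos=0.10, false_neg=0.20)

# The groups differ in observed rate although they do not differ in fact.
spurious_difference = group_b - group_a
print(group_a, group_b, spurious_difference)
```

Under these assumptions the observed rates are about 10 per cent and 17 per cent, so that a characteristic distinguishing the two groups would appear to be associated with the disease solely because of unequal diagnostic error.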

Future developments. Epidemiological method 
is bound to increase in importance as the search 
for etiological factors in the chronic diseases forces 
the medical researcher to supplement his labora- 
tory experiments with field studies, both as source 
and proof of his hypotheses. The multiple nature 
of etiological factors (many, if not most, of which 
cannot be reproduced or controlled in the labora- 
tory) will require greater reliance upon population 
surveys and field trials. Probabilities of disease will 
replace certainties, and associated conditions 
rather than specific causes will dominate the 
picture. Prominent among these conditions will be 
the cultural, social, and psychological forces that 
determine how man lives and which in later years 
influence the degenerative processes. Today we 
deal with these social factors on the most elemen- 
tary level, that of descriptive group memberships. 
Tomorrow we may hope to be able to determine 
the dynamic factors underlying these group mem- 
berships and to develop and test specific hypoth- 
eses of how and why social factors relate to the 
origin and course of disease. 

EDWARD A. SUCHMAN 


[See also DRINKING AND ALCOHOLISM; DRUGS, article
on DRUG ADDICTION: SOCIAL ASPECTS; ECOLOGY, article
on HUMAN ECOLOGY; PUBLIC HEALTH; VITAL
STATISTICS; and the biographies of GRAUNT and
QUETELET.]
I. THE CONCEPT OF EQUALITY
II. EQUALITY AS AN IDEAL
Irving Kristol


I 
THE CONCEPT OF EQUALITY 


In the context of the social sciences, the concept 
of equality refers sometimes to certain properties 
which men are held to have in common but more 
often to certain treatments which men either receive
or ought to receive. Traditional characterizations
of kinds of treatment as either egalitarian
or inegalitarian often turn out to be disguised value
judgments or empty statements. It is possible,
however, to find descriptive criteria apt to capture
the egalitarian and inegalitarian features of prin- 
ciples which have been advocated at different 
times. 

Equality of characteristics. Equality must be 
construed here in the sense of similarity, that is, 
of agreement in certain properties. That men are 
equal means that men share some qualities; these 
must be specified. Men are evidently unequal in 
many characteristics. There are natural differences 
(sex, color, character traits, natural endowments, 
etc.) and institutional variations (citizenship, re- 
ligion, social rank, etc.). Other properties are com- 
mon to all but in varying amounts (age, strength, 
intelligence, possessions, power, etc.). To claim 
that all men are equal in such respects can only 
mean that the resemblances are in some way more 
significant than the differences, as when Hobbes 
states that “nature hath made men so equal, in the 
faculties of the body and mind” that the weakest 
can kill the strongest and no one can outwit the 
other. 

Men are sometimes held to be equal in the sense 
of having a common “human nature”—a tautologi- 
cal assertion, unless it is specified that all are nat- 
urally good or sinful or that they have the same 
basic motives (say, self-interest), or common basic
needs, or similar capacities to feel pleasure and 
pain, or the same ability to act deliberately and to 
choose rationally. 

Equality of treatment. Moralists ever since the 
Stoics have claimed that men, in spite of differences 
of character or intelligence, are of equal dignity, 
worth, or desert. Statements of this kind are to be 
interpreted in a normative sense, to the effect that 
all men are entitled to be treated equally. The same 
applies to the allegation that all men have the 
same moral or natural rights. To say that I have 
a moral right implies that others should let me 
exercise it (whereas to have a legal right means 
that it is conferred by positive law). Thus, Locke 
interprets his own statement “that men by nature 
are equal” as referring, not to “all sorts of equality,” 
since men differ as to “age or virtue,” but to “the 
equal right that every man hath to his natural 
freedom”; this means that men “should also be 
equal one amongst another,” that is, that they should
be given the corresponding legal rights. Analo- 
gously, to claim that those of one nation or race or 
class are “superior” to all others is to hold that they 
ought to receive preferential treatment. 

Whether individuals or groups are, in fact, 
treated equally or unequally by others depends on 
the way in which benefits or burdens are allotted 
to them. These may be legal rights (e.g., to own
property, to vote) and legal duties (e.g., to respect
the rights of others); material benefits (e.g., wages, 
unemployment benefits, social services) and liabil- 
ities (e.g., punishment, taxation, military service); 
and opportunities (e.g., to hold certain positions or 
offices).

Factual statements about equality may be about 
equality of either characteristics or treatment. 
Normative statements about equality are always 
concerned with treatment but may contain refer- 
ences to characteristics as well, as when it is being 
argued that men should be treated equally because 
they are equal in certain characteristics. References 
to both characteristics and treatment are also con- 
tained in general rules of the type: all persons 
having a certain characteristic are to be allotted 
a certain benefit or burden (in such and such an 
amount). This leads to the question of how to de- 
termine whether an actual or proposed kind of 
treatment is egalitarian. 


Traditional criteria of egalitarianism 


(1) Impartiality. Equal treatment means, first 
of all, the impartial allocation of some benefit or 
burden by one actor to another, say, by a judge to 
a claimant. Equality before the law thus means 
impartial application of the law. Allocations are 
impartial or partial only by reference to a rule of 
allocation. With respect to a specified legal or moral 
rule, a person is treated impartially by another 
provided his allotment is determined exclusively by 
the rule itself and not by other factors, such as the 
latter person’s like or dislike of the former. Par- 
tiality (allotments made in violation of some given 
rule) would be the only kind of inegalitarian treat- 
ment in this sense. Since any rule—for example, 
one restricting suffrage to adult citizens or to white 
citizens—can be applied impartially or partially, 
we must determine the conditions under which 
rules themselves are to be considered egalitarian. 

(2) Equal shares to all. According to the utili- 
tarians, “everybody [is] to count for one, nobody 
for more than one” in the allocation of benefits 
and burdens—not of every conceivable kind, of 
course, but of certain specified types. Similarly, 
“equality” to the French revolutionaries meant that 
the same basic legal rights should be granted by 
every government to all its citizens. Rules which 
allocate a benefit or burden in equal amounts to 
everyone are undoubtedly egalitarian. 

(3) Equal shares to equals. Most rules of al- 
location grant equal shares of some kind, not to 
all generally but to all who are equal with respect 
to some property; for example, all adult citizens 
have the right to vote; whoever commits a certain 
crime shall suffer a certain punishment; persons
within the same income bracket are liable to the 
same income tax. According to the previous cri- 
terion, such rules would not be egalitarian. The 
concept of egalitarianism has therefore been en- 
larged to cover rules which allot “equal shares to 
equals”; and a rule is considered inegalitarian by 
Aristotle “when either equals are awarded unequal 
shares or unequals equal shares.” 

Now, every rule may be considered egalitarian 
in this sense; for a rule stipulates that all, or else 
that only those who are equal in a specified respect, 
receive the same specified treatment. Universal suf- 
frage means that the right to vote is given to all 
adult citizens but not to minors and aliens. A gradu- 
ated income tax treats any two taxpayers within 
the same bracket equally and any two within dif- 
ferent brackets unequally. To treat all whites alike 
and all Negroes alike but persons of different color 
differently is to practice racial discrimination. 
Every conceivable rule treats equals (in some speci- 
fied respect) equally and unequals unequally. 

(4) Proportional equality. To narrow down the 
criterion again, unequal allotments have been held, 
ever since Aristotle, to be egalitarian if and only 
if they satisfy the requirement of “proportional 
equality.” A rule is generally considered to satisfy 
this requirement if it provides that the amount of 
benefit or burden is a monotonically increasing 
function of the specified characteristic; that is, the 
more of the characteristic, the more benefit or bur- 
den. And any two persons are treated “in propor- 
tion to their inequality,” provided the difference in 
the amount allotted to each is similarly correlated 
to the degree in which they differ in the character- 
istic specified by the rule. But again, any rule 
which allots “equal shares to equals” implicitly not 
only allots “unequal shares to unequals” but also 
allots them “in proportion to their inequality.” Both 
rules, “to each according to his need” and “to each
according to his height,” assign different shares
to different persons in the proportion in which they 
differ as to need or as to height. A flat rate and a 
graduated income tax both fulfill the requirement 
of proportional equality. 
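
The point that quite different rules satisfy the requirement of proportional equality can be checked mechanically. The following Python sketch is offered only as an illustration: the two tax schedules, their rates, and the sample incomes are hypothetical, and the test simply verifies that the burden increases with the specified characteristic over the sample.

```python
def flat_tax(income):
    """Hypothetical flat-rate tax: 20 percent of income."""
    return 0.20 * income


def graduated_tax(income):
    """Hypothetical graduated tax: 10 percent up to 10,000, 30 percent above."""
    lower = min(income, 10_000)
    upper = max(income - 10_000, 0)
    return 0.10 * lower + 0.30 * upper


def is_monotonically_increasing(rule, values):
    """True if each larger value in the sample receives a strictly larger allotment."""
    allotments = [rule(v) for v in sorted(values)]
    return all(a < b for a, b in zip(allotments, allotments[1:]))


# Hypothetical sample of distinct incomes.
incomes = [1_000, 5_000, 10_000, 20_000, 50_000]

flat_ok = is_monotonically_increasing(flat_tax, incomes)
graduated_ok = is_monotonically_increasing(graduated_tax, incomes)
print(flat_ok, graduated_ok)
```

Both schedules pass the test, although they distribute the burden very differently, which illustrates why the criterion of proportional equality does not discriminate between them.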

(5) Unequal shares corresponding to relevant 
differences. Inequality in allotment has been held 
to be egalitarian provided it is based on relevant 
differences in personal characteristics. Thus, age 
and citizenship are relevant to voting rights but 
not so sex or race or wealth; it is therefore held 
egalitarian to limit the franchise to adult citizens 
but inegalitarian to restrict it to men or whites or 
poll-tax payers. Wealth is relevant to taxation; 
hence, a graduated income tax is viewed as egalitarian
but not a sales tax, which disregards this
relevant criterion by taxing poor and wealthy 
buyers at the same rate. 

Judgments to the effect that characteristic x is 
relevant to treatment y are valuational, not factual. 
That color is not relevant to voting but age is 
means that it is unjust to base the franchise on 
color and just to require a minimum age. Equality 
becomes tantamount to distributive justice: “The 
unjust is unequal, the just is equal” (Aristotle); 
that is, it is unjust to make unequal awards to 
those who share a relevant characteristic. Or, in a 
recent formulation: to be egalitarian, “a difference 
of treatment requires justification in terms of rele- 
vant and sufficient differences between the claim- 
ants” (Ginsberg 1965, p. 79; italics added). The 
same purely normative criterion underlies the idea 
of “equality of consideration,” that is, “that none 
shall be held to have a claim to better treatment 
than another, in advance of good grounds being 
produced” (Benn & Peters 1959, p. 110; italics 
removed). This principle is not only purely valu- 
ational but also purely procedural—compatible 
with whatever substantive discriminatory rule may 
be established on “good grounds.” 

(6) To each according to his desert. Accord- 
ing to Aristotle, a person’s desert is the only char- 
acteristic relevant to allocations. To be both egali- 
tarian and just, these must therefore be based on 
proportionate equality on the basis of desert. The 
problem is here merely pushed a step further back, 
since judgments of someone's relative desert are 
again valuational. Unless there are objective cri- 
teria for relevant or just or good grounds for dif- 
ferential treatment or for a person’s desert or 
worth, it is impossible to refute the racist’s counter- 
claim that color is relevant to franchise or that 
whites are of superior worth. (His claim that color 
is relevant to intelligence would be an empirical 
one and could be refuted on empirical grounds, but 
it is intelligence rather than color or desert which 
he proposes in this case as a relevant criterion for 
granting franchise.) According to criteria 3 and 4, 
every rule of allocation is egalitarian, and any rule 
may be considered just and hence egalitarian ac- 
cording to criteria 5 and 6. 


Operational criteria of egalitarianism 


(1) Egalitarian rules of allocation and distribu- 
tion. Even advocates of racial discrimination are 
likely to consider it egalitarian to give preferential 
treatment to the needy regardless of race but in- 
egalitarian to give it to whites regardless of need. 
The reason seems to be that the first policy aims at
the equal satisfaction of everybody's basic needs,
while the second is incompatible with that prin- 
ciple. This points to a distinction which must be 
made between (1) rules which determine how 
some benefit or burden is to be allocated among 
persons, that is, how much of it is to be given to 
each or to be taken from each, and (2) rules con- 
cerning the distribution of a benefit or burden 
which is to result from some allocation, that is, 
how much each person is to have at the end. 

Rules of allocation and rules of distribution may 
be (a) egalitarian or (b) inegalitarian. Rules of 
allocation are egalitarian if they allocate the same 
kind or amount of benefit or burden to all. Simi- 
larly, rules of distribution are egalitarian if they 
stipulate that all are to have equal shares. Here are 
some examples: (1a) universal suffrage, universal 
head tax; (1b) suffrage only for whites, graduated 
income tax; (2a) political equality, equality of 
possessions; (2b) political inequality, inequality of 
possessions. 

(2) Inegalitarian allocations compatible with 
egalitarian distributions. Egalitarian allocations 
often lead to egalitarian distributions and inegali- 
tarian allocations to inegalitarian distributions. 
Universal suffrage promotes political equality, not 
so suffrage for whites only. But an egalitarian dis- 
tribution does not necessarily require an egalitarian 
allocation. For example, to bring about an equal 
distribution of the holdings of A, who has 8 units, 
and of B, who has 2, it is necessary to take, say, 3
from A and to give 3 to B. But taking 1 from A and 
1 from B would leave the previous inequality of 
their distribution unaffected. Egalitarian alloca- 
tions may thus result in inegalitarian distributions. 
With respect to an egalitarian rule of distribution, 
a rule of allocation may be said to be egalitarian 
if its application is a means to, or a consequence 
of, the former's implementation and inegalitarian 
if it is incompatible with the former. A rule of allocation
which is intrinsically inegalitarian may thus
be egalitarian with respect to some egalitarian rule
of distribution, while an intrinsically egalitarian
rule of allocation may be inegalitarian in this respect.
With respect to equality (or rather, to reducing
inequality) of wealth, a graduated income tax
is egalitarian and a head tax is inegalitarian. 
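
The arithmetic of the example of A and B may be set out in a short Python sketch. It assumes that B's initial holding is 2 units, the figure implied by the transfer of 3 units that equalizes the two holdings; the function names are introduced here for illustration only.

```python
def apply_allocation(holdings, allocation):
    """Return the distribution that results from adding each person's allotment
    (negative for a burden) to that person's current holdings."""
    return {person: holdings[person] + allocation[person] for person in holdings}


def is_equal(shares):
    """True if every person has the same share."""
    return len(set(shares.values())) == 1


# Initial holdings; B's figure of 2 is the value implied by the example.
holdings = {"A": 8, "B": 2}

# Inegalitarian allocation (different amounts), egalitarian distribution.
transfer = apply_allocation(holdings, {"A": -3, "B": +3})

# Egalitarian allocation (the same burden for each), inegalitarian distribution.
head_tax = apply_allocation(holdings, {"A": -1, "B": -1})

print(transfer, is_equal(transfer))   # {'A': 5, 'B': 5} True
print(head_tax, is_equal(head_tax))   # {'A': 7, 'B': 1} False
```

The unequal transfer yields equal holdings, while the equal levy leaves the original gap of 6 units between A and B unchanged.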

(3) Degrees of egalitarianism. A rule of allo- 
cation or of distribution may be considered more 
egalitarian (or less inegalitarian) than another if 
it insures “that a larger number of persons (or 
classes of persons) shall receive similar treatment 
in specified circumstances” (Berlin [1955-1956] 
1961, p. 135), or rather, similar preferential treatment.
Universal suffrage which excludes only
minors and aliens is more egalitarian than suffrage 
which excludes also Negroes and may therefore 
be considered fully egalitarian for practical pur- 
poses. Disenfranchising women is more inegali- 
tarian than disenfranchising Negroes if the latter 
constitute a small segment of the population but 
less inegalitarian if Negroes form a large per- 
centage. 

On the basis of these purely descriptive criteria, 
persons with divergent value commitments can 
agree (or disagree) on an empirical level whether 
a given rule of allocation or of distribution is egali- 
tarian and to what degree, and whether a rule of 
allocation is egalitarian with respect to some egali- 
tarian rule of distribution. The resulting classifica- 
tion corresponds in a satisfactory way to our 
everyday distinctions between egalitarian and in- 
egalitarian treatment. 


Instances of egalitarianism 


Equality of opportunity. Equal treatment of all 
in every respect was advocated by some nineteenth- 
century anarchists: equality of occupation (for 
example, intellectuals would participate in manual 
work), of consumption (all would eat and even 
dress alike), and especially of education would 
ultimately wipe out existing inequalities of talent 
and capacity. Most egalitarians, however, consider 
such an ultimate goal neither desirable nor pos- 
sible. They realize that in every society individuals 
are bound to have varying degrees of ability and 
to hold positions that yield varying degrees of 
status if not of remuneration. How to match un- 
equal individuals with unequal positions has been 
their central concern and equality of opportunity 
their principal answer. This rule deals with the 
distribution of access to positions in society, not 
with the allocation of the positions themselves. 
Opportunities to occupy all positions, including the 
most attractive ones, are to be distributed in an 
egalitarian way to all on a competitive basis, re- 
gardless of such differences as social status or eco- 
nomic resources and regardless even of differences 
of ability, since “the least able and the most able 
are given an equal start in the race for success” 
(Pennock 1950, p. 81). If everyone has an equal 
start, then the position he will occupy at the end 
will, in theory at least, depend exclusively on how 
far and how fast he runs, that is, on his own re- 
sourcefulness (but also on his luck). As the French 
Declaration of the Rights of Man proclaimed: all 
citizens “are equally eligible to all honors, places 
and employments, according to their different 
abilities, without any other distinction than that
of their virtues and talents.” 

Legal equality. “Equality of opportunity, in the 
broad sense of the career open to personality, is 
and has been the inclusive goal within which the 
partial goals of the special equalities have their 
significance” (Hofstadter 1956, p. 137). Equality 
of legal rights has been, historically, the first of 
these special equalities. Classical liberalism held 
that the equal distribution of opportunities required 
merely the equal allocation of the basic rights of 
“life, liberty, and property.” If legal privileges are 
abolished and legal rights protected, no obstacle 
will stand in the way of anyone's pursuit of 
happiness. 

Equal satisfaction of basic needs. Increasing in- 
dustrialization brought about an increasing aware- 
ness that equality of opportunity cannot be achieved 
by the “majestic equality of the law which forbids 
rich and poor alike to steal bread and to sleep 
under bridges” (Anatole France). Equality of op- 
portunity does presuppose the equal allotment of 
certain rights, but it also requires the application 
of another egalitarian rule of distribution, namely, 
equality of the satisfaction of certain basic needs, 
which in turn calls for an inegalitarian rule of 
allotment: privileges for the economically under- 
privileged. Indeed, those who lack the basic phys- 
ical or educational necessities do not have the 
same opportunities to reach the higher positions 
as do the better endowed. To bring the former up 
to the general starting line, government must com- 
pensate them for these initial disadvantages by 
means of social legislation and social services such 
as minimum wages, tax exemptions, unemploy- 
ment benefits, free public schools, and scholarships. 


Equality of opportunity is not simply a matter of legal 
equality. Its existence depends, not merely on the 
absence of disabilities, but on the presence of abilities. 
It obtains in so far as, and only in so far as, each 
member of a community, whatever his birth, or occu- 
pation, or social position, possesses in fact, and not 
merely in form, equal chances of using to the full his 
natural endowments of physique, of character, and of 
intelligence. (Tawney [1931] 1965, pp. 103-104) 


To condemn such inegalitarian allotment is to 
oppose equality of opportunity. Herbert Spencer 
agreed with his neoliberal opponents that “insuring 
to each the right to pursue within the specified 
limits the objects of his desires without let or 
hindrance is quite a separate thing from insuring 
him satisfaction” but insisted that the state should 
“confine itself to guaranteeing the rights of its 
members” and not “assume the role of Reliever-general
to the poor.” Such advocacy of mere
equality of rights had by that time become an in- 
egalitarian policy which deprived the poor of 
equality of opportunity and promoted the “survival” 
of the wealthy at their expense. 

Privileges for nobles or property owners and 
disabilities imposed on a particular sex, religion, 
or race are inegalitarian, not only in themselves 
but probably also with respect to any conceivable 
egalitarian rule of distribution. But privileges for 
religious, racial, or ethnic minorities may constitute 
an egalitarian policy when these are considered 
as constituting economically or socially disad- 
vantaged groups. 

Economic equality. Equality of the right of 
property is compatible with extreme inequality in 
the distribution of property. Equality of opportunity 
does not imply equalization of wealth either, cer- 
tainly not at the end of the “race for success.” 
Nevertheless, to give all an equal start, some must 
be lifted up and others moved down. The equal 
satisfaction of basic needs as a precondition for 
equality of opportunity does require economic 
equality, that is, a reduction of extreme inequalities 
in the distribution of commodities. “By equality, 
we should understand, not that the degree of 
power and riches be absolutely identical for every- 
body, but that . . . no citizen be wealthy enough 
to buy another, and none poor enough to be forced 
to sell himself” (Rousseau). “The socialist seeks 
a distribution of rewards, status, and privileges 
egalitarian enough to . . . equalize opportunities” 
(Crosland [1956] 1957, p. 113). With respect to 
this goal, unequal taxation of unequal incomes is 
egalitarian. 

Common ownership of the means of production. 
Marx, too, realized that “one man is superior to 
another physically and mentally” and interpreted 
equality—at least in the first phase of communism 
—as the opportunity for each to occupy the posi- 
tion which corresponds to his ability. Contrary to 
the neoliberals and socialists, Marx believed that 
this goal could not be reached through a redistribu- 
tion of the means of consumption (the demand for 
fair distribution as well as for equal rights was to 
him “obsolete verbal rubbish”) but only through 
the abolition of private control of the means of 
production. Their “common ownership” would elim- 
inate the possibility of exploitation and class strug- 
gle; and “with the abolition of class distinctions, 
all social and political inequality arising from them 
would automatically disappear by itself.” 

To each according to his merit. If there is 
equality of opportunity and if higher positions bring
higher salaries, both will go to those of greater
merit or ability. The result would ideally be, “not 
an aristocracy of birth, not a plutocracy of wealth, 
but a true meritocracy of talent” (Young [1958] 
1959, p. 19). Unequal allocation of rewards, cor- 
related with inequality of ability, is a consequence 
of equal distribution of opportunities. With respect 
to equality of opportunity, rewards according to 
merit in the sense of ability is therefore an egali- 
tarian principle. 

This is not so with rewards according to merit 
in the sense of desert. Plato and Aristotle held not 
only that people’s relative desert or moral worth 
can be objectively ascertained but also that “there 
are innate differences which fit them for different 
occupations” (Plato), that “a distinction is already 
marked, immediately at birth between those who 
are intended for being ruled and those who are 
intended to rule” (Aristotle), and between those 
who are “by nature” either slaves or free. Each is 
to be assigned the function corresponding to his 
pre-established desert. Aristotle’s principle of “pro- 
portional equality according to desert” is really 
inegalitarian, not only intrinsically but also with 
respect to equality of opportunity and probably 
every other egalitarian rule of distribution. For the 
same reason, all rigidly stratified societies are in- 
egalitarian, from feudalism to the Indian caste 
system. 

To each according to his need. Equality of op- 
portunity does not, however, necessarily entail that 
rewards (as well as positions) go to each according 
to his ability. Marxists believe that the first stage 
of communism, in which means of consumption 
are distributed according to the work performed, 
will inevitably evolve, in Lenin’s words, “from 
formal equality to real equality, i.e., to realizing the 
rule: ‘From each according to his ability, to each
according to his need.’” Positions would still be 
correlated to ability, but everyone (so Marxists be- 
lieve) will work spontaneously to the best of his 
ability even without incentives, and compensations 
will differ according to need, regardless of the type 
of work. With respect to equality of opportunity to 
occupy various positions, this would be another
egalitarian rule of allocation.

Political equality. While some political thinkers 
have advocated the equalization of political power 
through direct democracy or predicted the abolition 
of political power through the withering away of 
the state, it is generally assumed that political 
power is ubiquitous and always unevenly distrib- 
uted and that political equality can only mean 
equality of opportunity to participate in the political
process. Political equality has therefore been associated
with the democratic institutions of suffrage,
representation, and majority rule. Early
liberals did not include political rights among the 
basic rights to be given to all; they demanded 
merely that wealth should replace birth as a cri- 
terion for franchise. Extending suffrage to all prop- 
erty owners was originally an egalitarian demand 
directed against hereditary privileges of the nobility. 
Property qualifications for voting rights became an 
inegalitarian rule when they were invoked in defense
of vested property interests against proponents of 
universal suffrage (which was not instituted in 
most countries until the decline of laissez-faire 
liberalism). 


Egalitarianism and other social goals 


Egalitarian rules may conflict not only with one 
another (for example, equality of rights and of 
opportunities, equality of opportunities and of wel- 
fare) but also with other social goals. The equal 
distribution of welfare does not necessarily lead to 
its maximization. The latter goal might be most 
effectively realized by slavery or by wage incentives 
to higher production far greater than would be 
compatible with equality of welfare and even of 
opportunity. Equal welfare and equal freedom, too, 
are competing goals, since the former goal requires 
government to impose greater restrictions on the 
freedom of economically dominant groups. Free- 
dom of all citizens with respect to government may 
result in suppression of freedom of the minority 
of individualists by the majority of conformists. 
Equalization of wealth does away with the leisure 
class, which some consider essential to cultural de- 
velopment. Egalitarianism may thus lead to down- 
ward leveling and stifle individuality, diversity, and 
cultural excellence. Greater equality of opportunity
may also generate more frustration and greater 
unhappiness. Political equality entails majority 
rule, but the majority may decide on inegalitarian 
policies. Both Edmund Burke and J. S. Mill were in 
substantial agreement as to these causal ramifica- 
tions; yet, the former drew the balance in favor of 
inegalitarianism, while the latter espoused egali- 
tarianism. 

Inegalitarian rules are usually advocated as 
means to other goals, such as order, efficiency, 
diversity, and cultural excellence. Egalitarianism, 
on the other hand, is more often considered in- 
trinsically desirable and morally right. Both egali- 
tarian and inegalitarian principles have been held 
demonstrably valid on the ground that they are “in 
agreement with nature.” That men should receive 
equal treatment has been taken as a normative
conclusion from the factual premise that “men are 
equal”—unless this statement itself is interpreted 
in a normative sense (see above). But it has also 
been argued that men ought to be treated unequally 
because they are of unequal rank or ability or 
race. 

Yet, normative principles cannot be derived from 
factual generalizations; neither equality nor in- 
equality of characteristics entails the desirability 
of either egalitarian or inegalitarian treatment. 
There is surely no inconsistency in maintaining that 
men should be treated equally (e.g., as to rights) 
in spite of the fact that they are unequal (e.g., as 
to natural endowments) or that they should be 
treated unequally (e.g., as to salary) regardless of 
their common features (e.g., as to basic needs). 
Once the causal connections between egalitarianism 
or inegalitarianism and other social goals have 
been clarified, the adoption of one or the other of 
these two normative doctrines remains a matter
of subjective commitment.

FELIX E. OPPENHEIM 


[See also DEMOCRACY; JUSTICE; POLITICAL THEORY.] 
II
EQUALITY AS AN IDEAL 


Among the definitions of “equality” provided by 
the Oxford English Dictionary are the following 
three: (1) “the condition of having equal dignity, 
rank, or privileges with others”; (2) “the condition 
of being equal in power, ability, achievement, or 
excellence”; (3) “fairness, impartiality, equity, due 
proportion, proportionateness.” 

A moment's contemplation will reveal that these 
three definitions of “equality,” although all of them 
are consistent with common usage, are not entirely 
or necessarily consistent with one another. If, for 
example, men are unequal in power, ability, 
achievement, or excellence, then an adherence to 
definition (3) will lead to a violation of definition 
(1), while an adherence to definition (1) will lead 
to a violation of definition (3). It is only if men are 
in fact equal in power, ability, and excellence that 
equity preserves a condition of equal rank. 

But, in fact, men are not equal in power, ability, 
and excellence. From this it would seem to follow 
that justice requires a certain measure of inequal- 
ity. And, indeed, in all social orders, no matter how 
vehement their passion for equality, we observe 
that some inequalities are regarded as inevitable 
and natural. At the same time, no egalitarian soci- 
ety can have an easy conscience about the inequal- 
ities within it. There is a sentiment, inchoate yet 
profound, that no matter how unequal men may 
be in their abilities, in some deeper sense all men 
are equal merely by virtue of being men. 

The issue of legitimacy. It is certainly true 
that, in Western civilization at least, men have 
always believed that equality is in some sense the 
norm from which inequality represents a devia- 
tion. As Wollheim and Berlin have pointed out 
(1955-1956, pp. 281 ff.), the “naturalness” of the 
idea of equality seems to derive from the dual 
assumption that (a) men are all members of one 
species, of a simple class of objects (i.e., human 
beings) and (b) all members of a class should be 
treated uniformly, unless there is good and suffi- 
cient reason not to do so. This assumption, Berlin 
emphasizes, is so pervasive that it has almost the 


status of a category (in the Kantian sense) of 
human rationality: 


If I have a cake, and there are ten persons among 
whom I wish to divide it, then if I give exactly one 
tenth to each, this will not, at any rate automatically, 
call for justification; whereas if I depart from this 
principle of equal division I am expected to produce a 
special reason. It is some sense of this, however latent, 
that makes equality an ideal which has never seemed 
intrinsically eccentric .... (ibid., p. 305) 


This being the case, it is not surprising that all 
the golden ages, all the utopias, and all the para- 
dises created by the human imagination are egali- 
tarian (although not necessarily democratic—there 
may be an infinitely benevolent, if scrupulously 
egalitarian, despot). However, whereas in classical 
antiquity, utopia is located in word, not in deed, 
and in the succeeding Christian centuries it is 
located in transcendent hope rather than in actual- 
ity, in the modern era utopia has become an “ideal” 
to be realized (if never fully realized) by human 
effort. 

For Plato, as later for Augustine and Aquinas, 
utopia is conceived as prehistorical, as existing 
prior to some primordial Fall—a catastrophe that 
implicates the entire human race and that sets the 
conditions of its destiny and its progress. The out- 
standing consequence of this Fall is the abolition 
of original equality and the establishment of the 
principle of hierarchy as the “natural” principle of 
cosmic and social order. The original, harmonious 
prehistorical unity is shattered, and the universe 
becomes subject to differentiation—soul and mat- 
ter, spirit and flesh, idea and reality are now oppo- 
site poles, between which the tension of existence 
tries to maintain an equilibrium. The most perfect 
equilibrium (indeed, the only enduring one) is, 
obviously, that which recognizes the superiority of 
the noble over the base—of soul over matter, spirit 
over flesh, idea over actuality. The articulation of 
this order results in a metaphysics of hierarchy, in 
which both the cosmos and human society are en- 
visaged as part of a “great chain of being,” with 
precedence and consequence clearly defined and 
with noblesse oblige and humble obedience the 
only two reasonable political perspectives available 
to the human imagination. 

It requires more empathy than most twentieth- 
century men possess to realize how utterly “natural” 
the idea of hierarchy came to seem to classical and 
medieval thinkers—and even to most modern 
thinkers prior to the American and French revolutions.
We are inclined to view this as an anti-egalitarian
mode of thought, but the idea of hierarchy
saw itself as containing the only feasible
idea of political equality, rather than as in any way
opposing equality. For equality was defined in 
terms of justice—of giving each man his due, so 
that equal men received equal rewards. That all 
men were not equal—and certainly not equal in all 
respects—was a platitude confirmed daily by the 
most casual observation. The hierarchical idea was 
accepted in good faith and good conscience by 
almost everyone; if we now deem it an ideology, 
then it was the ideology not of a class but of an 
entire historical epoch. 

It is Shakespeare, through the sublimity of his 
language, who makes the older idea of hierarchy 
available to us better than any political philos- 
opher. Ulysses’ speech on “degree,” in Troilus and 
Cressida, is the locus classicus: 


The heavens themselves, the planets, and this centre
Observe degree, priority, and place, . . .
But when the planets
In evil mixture to disorder wander,
What plagues and what portents! what mutiny!
What raging of the sea! shaking of earth!
Commotion in the winds! Frights, changes, horrors,
Divert, and crack, rend and deracinate
The unity and married calm of states
Quite from their fixture! O, when degree is shak’d,
Which is the ladder to all high designs,
Then enterprise is sick! How could communities,
Degrees in schools, and brotherhoods in cities,
Peaceful commerce from dividable shores,
The primogenitive and due of birth,
Prerogative of age, crowns, sceptres, laurels,
But by degree, stand in authentic place?
Take but degree away, untune that string,
And, hark, what discord follows! Each thing meets
In mere oppugnancy. . . . (Act I, scene 3)


Yet even as Shakespeare wrote these lines, the 
planets were on their way “in evil mixture to dis- 
order wander.” As Sanford A. Lakoff pointed out 
in his historical survey Equality in Political Philos- 
ophy (1964), the new astronomy replaced the 
cosmos with a neutral universe in which “the laws 
of nature” applied without distinction to heavenly 
and earthly bodies. Simultaneously, the rise of 
experimental science and the overthrow of Aristo- 
telian teleology nullified previous distinctions be- 
tween “base” and “noble” in nature—and, most 
especially, in human nature. Just as the new physi- 
cal science declared all the parts of nature to be 
equal, so the new scientific (i.e., materialistic)
psychology declared all the parts of man to be 
equal—none was intrinsically base or intrinsically 
noble. The denial of the superiority of spirit over 
matter, and of mind over body, inevitably sug- 
gested that there was no good reason for those who 
worked with their hands to have an inferior status 
compared with those whose work was nonmanual
and nonmenial. In this way the philosophical 
foundations of modern bourgeois society were 
established. 

Christianity itself, in the course of its several 
“reformations,” buttressed these new foundations 
(without, however, necessarily intending this re- 
sult). Luther's denial of the distinction between 
“spiritual” and “carnal” authorities and vocations 
was destructive of churchly hierarchy. The keys of 
St. Peter were distributed among the congregation 
of believers, as the monopoly of the Catholic clergy 
over the apostolic succession was denied and as its 
exclusive authority to interpret Scripture faithfully 
was transferred to the entire body of Christendom. 
Successive generations of reformers carried this 
antiauthoritarian impulse forward, so that Christen- 
dom itself experienced the multiplication of new 
self-governing and (in the literal sense of the 
term) self-righteous sects and denominations. 

The response to inequality. But while these 
secular and religious trends were to create the 
bourgeois world, energies were being released that 
were to point beyond this world. Even during the 
Middle Ages, Christian Messianism was only with 
difficulty kept within the confines of the church; 
and as the authority of the church crumbled, this 
messianism became an independent spiritual and 
political force. The kingdom of God was trans- 
ferred from a transcendental hereafter to this 
world, this time, this place. 

It was during the English revolution of 1640- 
1660 that Christian Messianism first revealed its 
full political ambitions. There had been previous 
incidents, to be sure (e.g., the Anabaptist revolts 
in Germany), but these had as their primary aim 
the creation of local utopias. In contrast, the left- 
wing Commonwealth’s-men sought not only to 
transform their own national society but also to 
prescribe principles for all truly Christian and 
truly just societies. Overton, the Leveller, spoke in 
the recognizable accents of modernity when he 
said: “Every man by nature being a king, priest, 
prophet in his own natural circuit and compass, 
whereof no second may partake but by deputation, 
commission, and free consent from him whose 
right and freedom it is” (Lakoff 1964, p. 65). In 
declarations such as these, the metaphysical foun- 
dations of egalitarian, representative self-govern- 
ment were firmly outlined. It required only the 
slightest amendment for these metaphysical foun- 
dations to be entirely secularized, with “rights” and 
“freedoms” the prerogatives of men qua men, rather 
than merely men qua Christians. The political 
philosophers of the seventeenth and eighteenth 
centuries moved steadily in this direction, and the
political ideologists of the American and French 
revolutions acted violently upon these new prin- 
ciples of civic organization. 

As has been noted, however, even as modern, 
liberal society was being formed, there was an 
egalitarian perspective that looked beyond it. This 
perspective delineated not merely an equality of 
rights and freedoms but a fraternal equality of con- 
dition. Thus, Winstanley, the Digger, prefigured the 
socialist idea of equality with his declaration that 
“the earth was made by Almighty God to be a 
common treasury of livelihood for whole mankind 
in all his branches . . .” (Lakoff 1964, p. 79). This 
is socialism, but of a premodern kind. The “com- 
mon treasury” is a static conception of wealth, and 
all premodern socialist thinkers envisaged a good 
society as one of economic modesty rather than 
of economic abundance—goods were to be dis- 
tributed equally, and everyone was to be content 
with what he had. It was not until the nineteenth 
century that modern socialism, alert to the possi- 
bilities of the industrial and technological revolu- 
tions it was witnessing, put forward the prospect 
of equality conjoined to increasing wealth for all. 
Since premodern socialism demanded a certain 
measure of asceticism from its adherents, while 
modern socialism could appeal simultaneously to 
human idealism and to human appetites, it is not 
surprising that modern socialism has a far more 
powerful popular appeal. 

It is a distinguishing characteristic of the mod- 
ern age that “equality” should be not merely an 
abstract ideal but also a politically aggressive idea. 
It is generally accepted—it is, indeed, one of the 
most deeply rooted conventions of contemporary 
political thought—that the existence of inequality 
is a legitimate provocation to social criticism. Every 
inequality is on the defensive, must prove itself 
against the imputation of injustice and unnatural- 
ness. And where such proof is established, it never 
asserts itself beyond the point where inequality is 
to be tolerated because it is, under particular con- 
ditions, inescapable. That inequality may be per se 
desirable is a thought utterly repugnant to the 
modern sensibility. 

The modern egalitarian impulse has had its ob- 
jective social correlatives. Modern society tends to 
have a more equal distribution of income and 
wealth than previous social orders in Western his- 
tory. Statistics are fragmentary and are open to 
dispute as to their significance. But, for France, 
Jean Fourastié ([1951] 1960, p. 30) has made the 
following estimate: “The salary of a councilor of 
state increased by a factor of at least 40 from 1800
to 1948; the salary of a professor at the Collège de
France by 100; the average salary of an office boy 
in a government agency by 220; the hourly wages 
of labourers in provincial cities by more than 400.” 
The general tendency would seem to be unmis- 
takable. 

The limits and potential of equality. Never- 
theless, there is considerable controversy over the 
issue of whether equality has been adequately 
realized in our modern social arrangements. This 
controversy derives in large part from the ambigu- 
ities inherent in the idea of equality. Thus, a com- 
parative percentile increase for lower-income groups 
represents a step toward equality only from a lim- 
ited, statistical point of view. In another perspec- 
tive, it can be regarded as a movement toward 
further inequality. For when a man’s income is 
increased from a million dollars a year to two mil- 
lion (i.e., doubled), while another’s income is in- 
creased from one thousand to five thousand (i.e., 
quintupled), it can fairly be said that the rich man
has benefited more notably—in absolute magni- 
tudes—than the poor. Whether one wishes to make 
such an assertion will depend entirely upon one’s 
conception of equality—whether, that is, one is 
measuring equality by absolute or relative stand- 
ards. The progressive income tax represents an 
effort by the modern state to mediate between these 
two notions of equality. 
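
The arithmetic behind the two standards can be checked with a brief sketch. It uses only the illustrative incomes given above; the function and variable names are hypothetical labels introduced here, not part of the original argument.

```python
def gains(before, after):
    """Return (absolute gain, relative factor) for a change in income."""
    return after - before, after / before

# Illustrative figures taken from the passage above (dollars per year).
rich_abs, rich_rel = gains(1_000_000, 2_000_000)
poor_abs, poor_rel = gains(1_000, 5_000)

# Distance between the two incomes, measured both ways.
gap_before = 1_000_000 - 1_000        # absolute gap before the increases
gap_after = 2_000_000 - 5_000         # absolute gap after the increases
ratio_before = 1_000_000 / 1_000      # relative gap before the increases
ratio_after = 2_000_000 / 5_000       # relative gap after the increases

print(f"rich man: +{rich_abs:,} (factor {rich_rel:g})")
print(f"poor man: +{poor_abs:,} (factor {poor_rel:g})")
print(f"absolute gap: {gap_before:,} -> {gap_after:,}")
print(f"income ratio: {ratio_before:g} -> {ratio_after:g}")
```

By the relative standard the two incomes have drawn closer together, since the ratio between them falls; by the absolute standard they have drawn further apart, since the dollar gap nearly doubles.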

A similar ambiguity—between equality of con- 
dition and equality of opportunity—plays a most 
significant role in American social and political 
thought. Equality of opportunity will inevitably 
result in inequality of condition, since some men 
are more able, more energetic, and more fortunate 
than others. The American creed sanctions such 
inequality—but only halfheartedly. For there has 
always been an implicit corollary—derived from 
the premise that all men are created equal—that 
equality of opportunity ought to lead to approxi- 
mate equality of condition, and that failure to 
realize this goal reflects a deficiency (if not a 
positive error) in the existing social and economic 
arrangements. The tides of American politics flow
between these two polar conceptions of equality. 

In recent years, some of the leading thinkers of 
American sociology have attempted to transcend 
this debate over equality by declaring inequality to 
be a necessary condition of all social organization. 
To some extent, this effort originates in the experi- 
ence of just how little effect popular or political 
ideas about equality have on comparable social 
structures. The distribution of income in all mod- 
ern industrialized nations is astonishingly similar, 
no matter whether the governing ideology of this
nation be socialist (e.g., Sweden) or capitalist
(e.g., the United States). Instead of wondering 
about the origins of inequality and of social classes 
—as did the sociologists of the eighteenth and nine- 
teenth centuries—such thinkers as Talcott Parsons, 
Kingsley Davis, and Wilbert E. Moore have at- 
tempted to demonstrate that social differentiation 
and social stratification are indispensable to the 
very existence of a social structure—that each soci- 
ety has functional “norms,” both inwardly and out- 
wardly coercive, which prescribe the acceptable 
degrees and kinds of inequality to be tolerated. 
This sociological thesis represents a covert re- 
turn to the hierarchical principles of distributive 
justice and proportional equality elaborated by 
Aristotle in Book V of the Ethica nicomachea and
Book III of the Politics. And since the history of
the idea of equality in the Western world is to a 
considerable extent a record of intermittent, and 
sometimes violent, dissatisfaction with these prin- 
ciples, it is understandable that the debate over 
equality should be an unending one, with every 
new resolution the occasion for a new beginning. 


IRVING KRISTOL 


[See also DEMOCRACY; HUMAN RIGHTS; JUSTICE.]


BIBLIOGRAPHY 

ARISTOTLE Ethica nicomachea. Translated by W. D. Ross. 
Oxford: Clarendon Press, 1925. 

ARISTOTLE Politics. With an English translation. London: 
Heinemann; Cambridge, Mass.: Harvard Univ. Press, 
1959. 

DAHRENDORF, RALF 1962 On the Origin of Social Inequality.
Pages 88-109 in Peter Laslett and W. G.
Runciman (editors), Philosophy, Politics and Society 
(Second Series): A Collection. New York: Barnes & 
Noble. 

EMERSON, RALPH WALDO (1844) 1920 Politics. Pages
310-323 in Ralph Waldo Emerson, Essays: First and 
Second Series. New York: Dutton. 

FOURASTIÉ, JEAN (1951) 1960 The Causes of Wealth.
Glencoe, Ill.: Free Press. → First published in French.

LAKOFF, SANFORD A. 1964 Equality in Political Philoso- 
phy. Cambridge, Mass.: Harvard Univ. Press. 

WOLLHEIM, RICHARD; and BERLIN, ISAIAH 1955-1956
Equality. Parts 1-2. Aristotelian Society for the Sys- 
tematic Study of Philosophy, London, Proceedings 56: 
281-326. 


EQUILIBRIUM 
See ECONOMIC EQUILIBRIUM and HOMEOSTASIS. 


ERASMUS 


Desiderius Erasmus (1466?-1536) (he was baptized
Erasmus Rogerii, or Gerards; Desiderius was
his own addition), a Dutch humanist, spent his 
early youth in Gouda and later attended the Latin 
schools at Deventer and ’s Hertogenbosch. There
he had contact with the devotio moderna and 
joined the canons regular of the Augustinian mon- 
astery of Steyn near Gouda, taking his monastic 
vows in 1488. Between 1493 and 1516 he lived a 
wandering life: in service with the bishop of Cam- 
brai, studying at the University of Paris with Robert 
Gaguin, Jean Vitrier, and Lefèvre d’Étaples, making
three visits to England, where he met John Colet 
and Thomas More and studied theology and Greek, 
and visiting Italy and Basel. From 1517 to 1521 
he taught at Louvain, then lived in Basel until the 
Reformation was instituted there, and in Freiburg 
from 1529 to 1535; he died in Basel. 

Erasmus was above all a remarkable classical 
scholar. He fiercely deplored the corrupt state of 
the Latin of his time and advocated the use of 
Cicero's language, which he used in all his writings 
and for which he wrote manuals. Following the 
example of the Italians, he sought out what, in his 
opinion, were the most authentic texts of the clas- 
sics, of which he drew up annotated editions, the 
Greek with Latin translations. In addition to many 
classical works he edited the New Testament and 
the writings of the church fathers. 

Erasmus also had great influence on his contemporaries
through his hortatory works: humorous
and satirical sketches of the life of his contem- 
poraries (Colloquia 1518; Eng. trans. The Collo- 
quies of Erasmus), paraphrases of the Gospels and 
Epistles, and his most famous essays: Enchiridion 
militis christiani (1503) and Moriae encomium 
(1511; Eng. trans. In Praise of Folly). In them he 
bitterly mocked stupidity, selfishness, and vanity 
and pointed out that man can find the true hap- 
piness that lies in harmony and peace only through 
leading a truly Christian life and increasing one’s 
knowledge. Rulers, including the pope, were sharply 
criticized by Erasmus for their destructive and 
useless wars. Education to rationality can and must 
be the key to a better public life; to this end he 
wrote, among other things, Institutio principis 
christiani (1515-1516; Eng. trans. The Education 
of a Christian Prince), dedicated to Charles V.

Erasmus took up arms early in his career against 
the idle disputations of the Scholastics, the for- 
malism of the church of his time, the wealth and 
temporal power of the priesthood, and later, above 
all, against the monks, whom he regarded as his 
archenemies and whose monastic life he saw as 
useless. He hoped for a reconstruction of the 
church in line with what he saw as primitive Chris- 
tianity: not a doctrine of redemption from sin and 
death, but a philosophia Christi that teaches man 
to live in conformity with the commandments of 
love of neighbor, mercy, self-control, and reason,
as the best of the classics also taught. He did not 
regard the sacraments as means of grace. He con- 
demned the church’s doctrine of absolution by 
penance and good works, the worship of saints 
and their relics, and the practice of pilgrimages. 
Edification by word and example, he held, is the 
only task of the priest. At the same time, he 
wanted to retain the principal dogmas (including 
papal power, provided it was confined to matters 
of faith). However, his interpretation of these 
dogmas differed so far from that professed by the 
church that the authorities, even after the Council 
of Trent, condemned his writings. But his ideas 
found much support and are still alive, both among 
Catholics and liberal Protestants. 

Erasmus greeted Luther’s public stand against 
the church’s doctrine of penance with sympathy. 
But he soon realized that Luther had in mind an 
entirely different reformation of the church than 
he himself desired. At the same time, he continued 
to advocate a conciliatory attitude toward Luther 
and opposed his condemnation. It was only after 
much pressure had been exerted that Erasmus 
wrote the Diatribe de libero arbitrio (1524; Eng. 
trans. Discourse on Free Will) against Luther. In 
it he set forth his own conception of human dignity 
and free will, which was related to the ideas of the 
Italian humanists and the classics, as opposed to 
the total depravity of man that Luther preached.
However, he did this in such a way that his book 
pleased the Catholics no more than the Protestants. 
Until his death he pleaded for reconstruction of the 
church and a rapprochement between the Roman 
and Lutheran factions. 


I. NONSAMPLING ERRORS                    Frederick Mosteller
II. EFFECTS OF ERRORS IN
    STATISTICAL ASSUMPTIONS              Robert M. Elashoff


I
NONSAMPLING ERRORS


The view has sometimes been expressed that 
statisticians have laid such great emphasis on the 
study of sampling errors (the differences between 
the observed values of a variable and the long-run 
average of the observed values in repetitions of the 
measurement) that they have neglected or encour- 
aged the neglect of other, frequently more impor- 
tant, kinds of error, called nonsampling errors. 

Errors in conception, logic, statistics, and arith- 
metic, or failures in execution and reporting, can 
reduce a study’s value below zero. The roster of 
possible troubles seems only to grow with increas- 
ing knowledge. By participating in the work of a 
specific field, one can, in a few years, work up con- 
siderable methodological expertise, much of which 
has not been and is not likely to be written down. 
To attempt to discuss every way a study can go 
wrong would be a hopeless venture. The selection 
of a kind of error for inclusion in this article was 
guided by its importance, by the extent of research 
available, by the ability to make positive recom- 
mendations, and by my own preferences. 

Although the theory of sampling is generally 
well developed, both the theory and practice of the 
control of nonsampling errors are in a less satis- 
factory state, partly because each subject matter, 
indeed each study, is likely to face yet uncatalogued
difficulties. Empirical results of methodological
investigations intended to help research
workers control nonsampling errors have accumu- 
lated slowly, not only because of myriad variables 
but also because the variables produce results that 
lack stability from one study to another. 

This article deals mainly with techniques for 
reducing bias. The portions on variability are not 
exceptions, for they offer ways to avoid under- 
estimating the amount of variability. The presenta- 
tion deals, first, with the meaning of bias and with 
conceptual errors; second, with problems of non- 
sampling errors especially as they arise in the 
sample survey field through questionnaires, panel 
studies, nonresponse, and response errors; and, 
third, with errors occurring in the analysis of 
nearly any kind of quantitative investigation, errors 
arising from variability, from technical problems 
in analysis, in calculations, and in reporting. Some 
discussions of nonsampling errors restrict themselves
to the field of sample surveys, where problems
of bias and blunder have been especially
studied, but this article also treats some nonsam- 
pling errors in experimental and observational 
studies. 


Bias and conceptual errors 


Bias and true values. What is bias? Most defi- 
nitions of bias, or systematic error, assume that for 
each characteristic to be measured there exists a 
true value that the investigation ideally would produce.
Imagine repeatedly carrying out the actual
proposed, rather than the ideal, investigation, get- 
ting a value each time for the characteristic under 
study, and obtaining an average value from these 
many repetitions. The difference between that aver- 
age value and the true value of the characteristic 
is called the bias. The difference between the out- 
come of one investigation and the true value is the 
sum of bias and sampling error. The point of aver- 
aging over many repetitions is to reduce the sam- 
pling error in the average value to a negligible 
amount. (It is assumed that for the process under 
study and for the type of average chosen, this 
reduction is possible.) 
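
The decomposition just described can be checked numerically. The following Python sketch is illustrative only and rests on invented assumptions: the population, the survey size, and the flawed procedure (respondents with large values are missed half the time) are all hypothetical and do not come from any study cited here. It repeats the flawed investigation many times, takes the long-run average, and separates the error of each repetition into bias and sampling error.

```python
import random
import statistics


def run_investigation(rng, population, n=100, miss_above=70.0, miss_prob=0.5):
    """One repetition of a hypothetical flawed survey.

    Units whose value exceeds `miss_above` fail to respond with
    probability `miss_prob`, so the procedure tends to understate the mean.
    """
    responses = []
    while len(responses) < n:
        value = rng.choice(population)
        if value > miss_above and rng.random() < miss_prob:
            continue  # nonrespondent: the source of the bias
        responses.append(value)
    return statistics.fmean(responses)


rng = random.Random(1966)  # fixed seed so the sketch is repeatable
population = [rng.gauss(50.0, 20.0) for _ in range(10_000)]
true_value = statistics.fmean(population)

# Repeat the actual (flawed) investigation many times.
estimates = [run_investigation(rng, population) for _ in range(2_000)]
long_run_average = statistics.fmean(estimates)

# Bias: long-run average of the procedure minus the true value.
bias = long_run_average - true_value
# Sampling error of each repetition: its outcome minus the long-run average.
sampling_errors = [e - long_run_average for e in estimates]
# Total error of each repetition: its outcome minus the true value.
total_errors = [e - true_value for e in estimates]
```

Under these assumptions the bias is negative, and each entry of `total_errors` equals `bias` plus the matching entry of `sampling_errors`, up to rounding. Averaging over more repetitions shrinks the sampling error in the long-run average but leaves the bias untouched.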

Is there a true value? The concept “true value” 
is most touchy, for it assumes that one can de- 
scribe an ideal investigation for making the meas- 
urement. Ease in doing this depends upon the 
degree of generality of the question. For example, 
the measurement of “interventionist attitude” in 
the United States during World War II is discussed
below. For such a broad notion, the concept of a 
true value seems vague, even admitting the possible 
use of several numbers in the description. It is 
easier to believe in a true value for the percentage 
of adults who would respond “Yes” to “Should we 
go to war now?” Even here the training of the 
interviewers, the rapidly changing fraction of the 
population holding given opinions, and the effect 
of the social class and opinions of the interviewer 
upon the responses of those interviewed must raise 
questions about the existence of a true value. At 
the very least, we wonder whether a true value 
could represent a time span and how its conditions 
of measurement could be specified. In designing an 
ideal sample survey, what kind of interviewer 
should be used? 

Today some scientists believe that true values 
do not exist separately from the measuring process 
to be used, and in much of social science this view 
can be amply supported. The issue is not limited 
to social science; in physics, complications arise 
from the different methods of measuring microscopic
and macroscopic quantities such as lengths.
On the other hand, because it suggests ways of 
improving measurement methods, the concept of 
“true value” is useful; since some methods come 
much nearer to being ideal than others, the better 
ones can provide substitutes for true values. (See 
the discussion on describing response error in the 
section on “Response error,” below.)

To illustrate further the difficulty of the notion 
of a true value, consider an example from one of 
the most quantitative social sciences. When the 
economist assesses the change in value of domestic 
product, different choices of weights and of base 
years yield different results. He has no natural or 
unique choice for these weights and years. He can 
only try to avoid extremes and unusual situations. 
While, as noted above, the belief in a true value 
independent of the measuring instrument must be 
especially weak in the area of opinion, similar 
weaknesses beset measures of unemployment, 
health, housing, or anything else related to the 
human condition. [See INDEX NUMBERS.] 
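
A small numerical sketch may make the dependence on weights concrete. Everything in it is hypothetical: the two-good economy, its prices, and its quantities are invented, and valuing both years' output at one year's prices is only one simple weighting scheme, chosen here for illustration.

```python
def output_ratio(q0, q1, weights):
    """Ratio of year-1 to year-0 output, valuing both years at the same weights."""
    value0 = sum(w * q for w, q in zip(weights, q0))
    value1 = sum(w * q for w, q in zip(weights, q1))
    return value1 / value0


# Hypothetical two-good economy (all numbers invented for illustration).
prices_year0 = [1.0, 10.0]        # good A, good B
prices_year1 = [2.0, 5.0]
quantities_year0 = [100.0, 10.0]
quantities_year1 = [90.0, 30.0]

# Same quantities, two defensible choices of weights.
growth_base_weights = output_ratio(quantities_year0, quantities_year1, prices_year0)
growth_current_weights = output_ratio(quantities_year0, quantities_year1, prices_year1)
# growth_base_weights is 390/200 = 1.95; growth_current_weights is 330/250 = 1.32
```

With year-0 prices as weights, output appears to have risen by 95 per cent; with year-1 prices, by 32 per cent. Neither figure is the "true" change, which is the difficulty the paragraph above describes.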

Conceptual errors. Since the variety of sources 
of biases is practically unlimited, this article dis- 
cusses only a few frequently encountered sources. 

Target population-sampled population. Often 
an investigation is carried out on a sample drawn 
from a population—the sampled population—quite 
different from that to which the investigator wants 
to generalize—the target population. This mismatch 
makes the inference from sample to target popu- 
lation shaky. To match target and sampled popula- 
tion perfectly is usually impossible, but often the 
expenditure of time and money or the use of 
special skills or cooperation can patch what cannot 
be made whole. 

Some examples of this process follow: (1) The 
psychologist wants to establish general laws of 
learning for all organisms, and especially for man, 
but he may choose to study only the college sopho- 
more, usually in his own college and rarely outside 
his own country. His principal alternatives are the 
rat and the pigeon. Reallocation of time and money 
may extend the sampled population and bring him 
closer to the target he has in mind. 

(2) The sociologist may want to study the 
actual organization of trade unions and yet be hard 
pressed to study in depth more than a single union. 
This limitation is impossible for an individual to 
overcome, but cooperative research may help. (For
a remarkable cooperative anthropological study of
child rearing, see Six Cultures: Studies of Child
Rearing [Whiting 1963].)

(3) The historian or political scientist may want
to exposit the whole climate of opinion within
which an important decision is made, yet he must 
pick some facts and omit others, emphasize some 
and not others. The sampling of historical records 
offers a compromise between scanning everything, 
which may be impossible or unsatisfactorily super- 
ficial, and the case study of a single document or 
of a small collection.

(4) The man who generalizes on educational 
methods on the basis of his studies in one class, or 
one school subject, or one grade, or one school, or 
one school system, or even one country, needs to 
consider whether the bases of his investigations 
should be broadened. 

(5) The investigator, especially in studies where 
he does not regard his investigation as based on a 
sample, but on a population or census, would be 
wise to consider what population he hopes his in- 
vestigation applies to, whether the full breadth of 
it has had an appropriate chance to contribute 
cases to his study and, if not, how he might get at 
the rest. He may be satisfied with describing the 
population under study, but often he is not. 

(6) More narrowly, in sampling the member- 
ship of a professional society, the investigator may 
find his published membership list out of date by 
some years. For a fee the society may be willing to 
provide its current mailing list, which is probably 
as close as one can get to the target population. 
Obviously, the target population changes even 
while the study is being performed. 

Incompatibility of meaning. While arguing for 
statistical thinking in the attempt to generalize
one’s results, one must not fall into the pit of sta- 
tistical nonsense. Both anthropologists and his- 
torians call attention to mistakes that can come 
from regarding seemingly like objects, rituals, or 
behavior in different cultures as exchangeable com- 
modities for statistical purposes. The notion of 
“father” without distinction between “pater” and 
“genitor” offers an example. In the Trobriands, a 
boy lives with his benign, biological father until he 
is nine or ten years old, then moves to his mother’s 
brothers’ village for training and discipline, and 
there he inherits property. In the United States the 
biological father theoretically plays the role of dis- 
ciplinarian, and the uncles frequently play benign, 
indulgent roles. 

Pilot studies. Toward the completion of a 
study, investigators usually feel that it would have 
been better done in some other way. But a study 
can be petted and patted so long that, before com- 
pletion, its value is past. The huge, never-completed 
study usually damages the investigator’s reputation,
however wise the termination. Much can and
must be learned by trying, and therefore nearly
any investigation requires pilot work. Pilot work is 
little written about, perhaps because it is hard to 
summarize and perhaps because the results usually 
sound so obvious and often would be were they not 
hidden among thousands of other possible obvious 
results that did not occur. The whole spectrum 
from the tightest laboratory experiment to the 
loosest observational study requires careful pilot 
work. Pilot studies pinpoint the special difficulties 
of an investigation and, by encouraging initial ac- 
tion, overcome doctrines of omniscience that re- 
quire a complete plan before starting. While it is 
true that the statistician can often give more 
valuable aid at the planning stage by preventing 
errors than by salvaging poor work through anal- 
ysis, firm plans made in the absence of pilot studies 
are plans for disaster. 

Hawthorne effects. Psychologists sadly say that 
even under the most carefully controlled conditions, 
laboratory animals do as they please. Humans do 
even worse. When Roethlisberger and Dickson 
(1939) carried out their experiments to find con- 
ditions that would maximize productivity of fac- 
tory teams at the Hawthorne Works of Western 
Electric, they found that every change—increasing 
the lighting or reducing it, increasing the wage 
scale or reducing it—seemed to increase the group 
productivity. Paying attention to people, which 
occurs in placing them in an experiment, changes 
their behavior. This rather unpredictable change is 
called the Hawthorne effect. Instead of trying to 
eliminate this effect, it has been suggested that all 
educational efforts should be carried out as por- 
tions of experiments, so as to capitalize on the 
Hawthorne effect. No doubt boredom, even with 
experimentation, would eventually set in. 

The existence of Hawthorne effects seriously re- 
stricts the researcher's ability to isolate variables 
that change performance in a consistent manner. 
Although experimenters, by adjusting conditions, 
may create substantial changes in behavior, what 
causes the changes may still be a mystery. Reliable 
repetition of results by different experimenters 
using different groups can establish results more 
firmly. 

What treatment was applied? In experimental 
work with humans, it is especially difficult to know 
whether the treatment administered is the one that 
the experimenter had in mind. For example, in an
unsuccessful learning experiment on the produc- 
tion of words by individuals, subjects in one group 
were instructed that every word in the class of 
words that they were seeking contained the same 
letters of the alphabet. When no differences in 
learning rates emerged between these subjects and
those told nothing about the class of words being 
sought, further investigations were made. It turned 
out that few subjects listened to this particular in- 
struction, and among those who did, several forgot 
it during the early part of the experiment. If a par- 
ticular instruction is important, special efforts have 
to be made to ensure that the subject has received 
and appreciated it. 

One approach to the problem of Hawthorne 
effects uses, in addition to experimental groups, 
two kinds of control groups: groups who are in- 
formed that they are part of an experiment and 
other groups who are not so informed. As always, 
the investigator has to be alert about the actual 
treatment of control and experimental groups. L. L. 
Thurstone told me about experimenting for the 
U.S. Army to measure the value of instruction 
during sleep for training in telegraphy. Thurstone 
had control squads who were not informed that 
they were in the study. The sergeants instructing 
these control squads felt that the “sleep learning” 
squads were getting favored treatment, and to 
keep their squads “even,” they secretly instituted 
additional hours of wide-awake training for their 
own squads, thereby ruining the whole investi- 
gation. 

Randomization. Generally speaking, random- 
ization is a way to protect the study from bias in 
selecting subjects or in assigning treatments. It 
aids in getting a broad representation from the 
population into the sample. Randomization helps to 
communicate the objectivity of the study. It pro- 
vides a basis for mathematical distribution theory 
that has uses in statistical appraisals and in simu- 
lations. [See EXPERIMENTAL DESIGN; RANDOM 
NUMBERS.]

Bad breaks in random sampling. Valuable as 
randomization is, chance can strike an investigator 
stunning blows. For example, suppose that a psy- 
chological learning experiment is intended to rein- 
force 5 randomly chosen responses in each burst 
of 20. If the randomization accidentally gives rein- 
forcement to the first 5 responses in each burst of 
20, the psychologist should notice this and realize 
that he has selected a special kind of periodic re- 
inforcement. The objectivity of the random assign- 
ment cannot cure its qualitative failure. 

Similarly, suppose that in preparing to study 
fantasy productions under two carefully controlled 
conditions the clinical psychologist observes that 
his randomizing device has put all his scientist 
subjects into one group and all his humanist sub- 
jects into another. In that case, he should recon- 
sider the grouping. 

In principle, one should write down, in advance,
sets of assignments that one would not accept. 
Unfortunately, there are usually too many of these, 
and nobody is yet adept at characterizing them in 
enough detail to get a computer to list them, even 
if it could face the size of the task. One solution is 
to describe a restricted, but acceptable, set of assignments
and to choose randomly from these.
Omitting some acceptable assignments may help 
to make the description feasible while keeping the 
list satisfactorily broad. 
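
One way to choose randomly from a restricted set without listing it is to draw an ordinary random assignment and redraw whenever it fails a rule fixed in advance; each acceptable assignment is then equally likely. The Python sketch below assumes a hypothetical study of 20 subjects, half scientists and half humanists, and a hypothetical acceptance rule (the two groups may differ by at most one scientist). The names and the rule are illustrative, not taken from any study discussed here.

```python
import random


def acceptable(group_a, group_b, field, max_gap=1):
    """Hypothetical rule, written down before drawing: scientists split nearly evenly."""
    count_a = sum(field[s] == "scientist" for s in group_a)
    count_b = sum(field[s] == "scientist" for s in group_b)
    return abs(count_a - count_b) <= max_gap


def restricted_assignment(subjects, field, rng, max_tries=10_000):
    """Draw two equal-sized groups at random, redrawing until the rule is met."""
    half = len(subjects) // 2
    for _ in range(max_tries):
        shuffled = rng.sample(subjects, len(subjects))
        group_a, group_b = shuffled[:half], shuffled[half:]
        if acceptable(group_a, group_b, field):
            return group_a, group_b
    raise RuntimeError("no acceptable assignment found")


# Hypothetical subject pool: S01..S10 are scientists, S11..S20 humanists.
subjects = [f"S{i:02d}" for i in range(1, 21)]
field = {s: ("scientist" if i < 10 else "humanist") for i, s in enumerate(subjects)}

group_a, group_b = restricted_assignment(subjects, field, random.Random(7))
```

The essential design choice is that `acceptable` is specified before any assignment is seen; deciding after the draw which assignments to reject reintroduces the investigator's judgment that randomization was meant to exclude.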

If this solution is not possible, then one prob- 
ably has either to trust oneself (admittedly risky) 
or else get a more impartial judge to decide 
whether a particular random assignment should 
be borne. 

If there are many variables, an investigator can- 
not defend against all the bad assignments. By 
leaning upon subject matter knowledge and accept- 
ing the principle that the variables usually thought 
to be important are the ones to be especially con- 
cerned about, stratification, together with random- 
ization, can still be of some assistance. For example, 
the stratification might enforce equal numbers of 
each sex, with individuals still randomly chosen 
from the potential pools of men and women. In 
studying bad breaks from randomization, the in- 
vestigator can afford to consider rejecting only the 
assignments too closely related to proved first-order 
or main effects and not second-order effects or 
boomerang possibilities conceived in, but never ob- 
served from, armchairs. 
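
The stratification by sex mentioned above might be sketched as follows. The pools and the sample size per stratum are hypothetical, and the function name is introduced only for this illustration.

```python
import random


def stratified_sample(pools, per_stratum, rng):
    """Draw the same number of individuals at random from each stratum's pool."""
    return {label: rng.sample(members, per_stratum) for label, members in pools.items()}


# Hypothetical pools of potential subjects.
pools = {
    "men": [f"M{i}" for i in range(1, 31)],
    "women": [f"W{i}" for i in range(1, 41)],
}

chosen = stratified_sample(pools, per_stratum=10, rng=random.Random(3))
```

Equal numbers of each sex are enforced by construction, while chance still decides which individuals within each pool are selected.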

Random permutations. Although arranging ob- 
jects in a random order can easily be done by using 
an ordinary random number table, few people 
know how to do it. In any case, making these per- 
mutations is tedious, and it is worth noting the 
existence of tables of random permutations in some 
books on the design of experiments and in the book 
by Moses and Oakford (1963) that offers many 
permutations of sets numbering 9, 16, 20, 30, 50, 
100, 200, 500, 1,000 elements. A set of any other 
size less than 1,000 can be ordered by using the 
permutations for the next larger size. With larger 
sets, some stratification is almost sure to be 
valuable. 

Example. One (nonstratified) permutation for
a set of 30 elements is shown in Table 1 (read left
to right).

Table 1
11   5  29  26   3
 1  19  14   4  20
24  25  27   6   9
30  15  13  18  12
28  22   7   8  17
 2  21  16  10  23

To arrange at random the letters of the
alphabet, we might assign the integers, starting 
with 1 for a and ending with 26 for z, to the posi- 
tions of the letters in the alphabet. Then, accord- 
ing to the permutation of Table 1, 11 and 5 corre- 
spond to k and e. 29 is omitted, 26 corresponds 
to z. Continuing gives the permutation: 


kezcasndtxyfiomrlvghqbupjw.


For a random sample of 5 letters from the alpha- 
bet, drawn without replacement, we could just take 
the first 5 listed. 
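
The procedure of the example can be written out as a short Python sketch. It takes the 30 entries of Table 1, read left to right, to be a permutation of the integers 1 through 30; the function names are introduced here for illustration only. The second function is a software stand-in for a printed table and uses the standard library's `random.sample`.

```python
import random
import string

# Table 1 read left to right, taken here as a permutation of 1..30.
TABLE_1 = [11, 5, 29, 26, 3, 1, 19, 14, 4, 20, 24, 25, 27, 6, 9,
           30, 15, 13, 18, 12, 28, 22, 7, 8, 17, 2, 21, 16, 10, 23]


def order_by_permutation(items, permutation):
    """Order `items` using a permutation of a larger set.

    Numbers larger than len(items) are skipped, as 29 is skipped
    when ordering the 26 letters.
    """
    return [items[p - 1] for p in permutation if p <= len(items)]


def random_permutation(n, rng):
    """Generate a fresh random permutation of 1..n."""
    return rng.sample(range(1, n + 1), n)


letters = list(string.ascii_lowercase)
ordered = "".join(order_by_permutation(letters, TABLE_1))
sample_of_5 = ordered[:5]  # random sample of 5 letters, without replacement

fresh = random_permutation(30, random.Random(0))
```

Here `ordered` begins with k, e, z, c, a, so `sample_of_5` is "kezca". The same skipping rule is what allows a set of any smaller size to be ordered from the permutation for the next larger size.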

Simulations for new statistical methods. Large- 
scale simulation of economic, political, and social 
processes is growing in popularity; social scientists 
who invent new statistics would often find it 
profitable to try these out on idealized popu- 
lations, constructed with the aid of random num- 
ber tables, to see how well they perform their 
intended functions under perfectly understood con- 
ditions. This sort of exploration should be encour- 
aged as part of the pilot work. To illustrate the 
lack, many books and hundreds of papers have been 
written about factor analytic methods, yet in 1966 
it is hard to point to more than a single published 
simulation (Lawley & Swanson 1954; Tucker 
1964) of the methods proposed on artificially constructed
populations with random error.

Questionnaires. Questionnaires themselves present
many sources of bias, of which the wording
of questions and the options offered as answers are 
especially important. Some topics discussed below 
(“Panel studies,” “Nonresponse,” and “Response 
error”) also treat questionnaire matters. [See espe- 
cially SAMPLE SURVEYS; SURVEY ANALYSIS; see 
also INTERVIEWING.] 

Wording. The wording and position of ques- 
tions on questionnaires used in public opinion polls 
and other investigations illustrate the difficulties 
surrounding the notion of true value mentioned 
earlier. Rugg and Cantril’s survey article (1944) 
analyzes and illustrates the effects of the manner 
of questioning on responses. For example, prior to 
the U.S. entry into World War II, variations on a
question about American aid to Great Britain, 
asked of American citizens within a period of about 
six weeks, produced the following percentages in 
favor of the “interventionist” position: 76, 73, 58, 
78, 74, 56. Here the interventionist position meant 
approval of “giving aid even at the risk of war.” At 
much the same time, unqualified questions about 
“entering the war immediately” produced the following
percentages in favor: 22, 17, 8, numbers
substantially different from those in the previous
set. Although one would be hard put to choose a 
number to represent degree of support for inter- 
vention, the interval 55 to 80 per cent gives a 
range; this range was clearly higher than that in 
support of entering the war immediately. 

Pilot studies of the wordings of questions test 
their meaning and clarity for the intended popu- 
lation. Phillip Rulon recalls interviews with very 
bright second graders from a geography class to 
discuss a test item that they had “missed”: “Wind- 
eroded rocks are most commonly found in the (a) 
deserts, (b) mountains, (c) valleys.” They chose 
“valleys” because few people would find wind- 
eroded rocks in the mountains or the deserts, how- 
ever many such rocks might be in those places. 
After a question previously found to be unsatisfac- 
tory is reworded, bitter experience advises the test- 
ing of the new version. 

In single surveys, one needs to employ a variety 
of questions to get at the stability and meaning of 
the response. The use of “split ballots” (similar but 
modified questionnaires administered to equivalent 
samples of individuals) offers a way to experiment 
and to control for position and wording. 
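
A split-ballot comparison might be organized as in the following Python sketch. The respondent list, the tallies, and the use of the usual large-sample standard error for a difference of two proportions are assumptions made for illustration; they are not drawn from the surveys cited above.

```python
import math
import random


def split_ballot(respondents, rng):
    """Randomly divide respondents into two equivalent half-samples."""
    shuffled = rng.sample(respondents, len(respondents))
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


def compare_forms(yes_a, n_a, yes_b, n_b):
    """Difference in the proportion answering yes, with an approximate standard error."""
    p_a, p_b = yes_a / n_a, yes_b / n_b
    se = math.sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    return p_a - p_b, se


# Hypothetical sample of 1,000 respondents, identified by number.
form_a, form_b = split_ballot(list(range(1000)), random.Random(11))

# Hypothetical tallies: 380 of 500 say yes to wording A, 290 of 500 to wording B.
difference, standard_error = compare_forms(380, 500, 290, 500)
```

Because chance alone decides who receives which form, a difference many times its standard error can reasonably be attributed to the wording or position of the question rather than to the people asked.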

Changing opinions. To ignore the results of 
the polls because of the considerable variation in 
responses would be as big a mistake as to adopt 
their numbers without healthy skepticism. Since 
opinion in time of crisis may move rapidly, it is 
easy to misappraise the tenor of the times without 
a systematic measuring device. For example, be- 
tween July 1940 and September 1941, the per cent 
of U.S. citizens saying that they were willing to risk 
war with Japan rather than to let it continue its 
aggression rose from 12 to 65 per cent. Again, 
although in June 1940 only 35 per cent thought it 
more important to help England than to keep 
out of war, by September 1940 the percentage had 
risen to the 50s (Cantril 1944, p. 222). In Sep- 
tember 1940, President Roosevelt made a deal that 
gave Great Britain 50 destroyers in return for 
leases of bases (Leuchtenburg 1963, pp. 303-304); 
in the face of the fluctuations of public opinion a 
historian considering the destroyer deal might 
easily believe that Roosevelt acted against, rather 
than with, the majority. (As I recall from experi- 
ence at the Office of Public Opinion Research, 
Roosevelt had his own personal polls taken regularly,
with reports submitted directly to him, usually
on a single question.)

Seemingly minor variations in questions may 
change the responses a good deal, and so to study 
changes over time, one needs to use one well- 
chosen question (or sequence) again and again. 
Naturally, such a question may come under attack
as not getting at the “true value.” If the question is 
to be changed, then, to get some parallel figures, 
it and the new question should be used simulta- 
neously for a while. 

Intercultural investigations. Considering the 
difficulty of getting at opinions and the dependence 
of responses upon the wording of the questions 
asked, even within a country, the problem of ob- 
taining comparable cross-cultural or cross-national 
views looks horrendous. Scholars planning such 
studies will want to see three novel works. Kluck- 
hohn and Strodtbeck’s (1961) sociological and 
anthropological Variations in Value Orientations 
especially exploits ranking methods in the com- 
parison of value orientations in Spanish-American, 
Mormon, Texas, Zuni, and Navajo communities. 
Subjects describe the many values of their culture 
by ordering their preferences, for example, for 
ways of bringing up children: past (the old ways),
present (today’s ways), or future (how to find new 
ways to replace the old). Cantril’s (1966) social- 
psychological and internationally oriented Pattern 
of Human Concerns uses rating methods and sam- 
ple surveys to compare values and satisfactions in 
the populations of 15 nations. For example, the 
respondent's rating, on a scale of 0 to 10, expresses
his view of how nearly he or his society has 
achieved the goal inquired about, and another 
rating evaluates how much either might expect to 
advance in five years. In international economics, 
measurements may be more easily compared, al- 
though the economist may be forced to settle for 
measuring the measurable as an index of what he 
would like to evaluate. Harbison and Myers’ study, 
Education, Manpower, and Economic Growth 
(1964), illustrates this approach. 

Panel studies. Although the single sample sur- 
vey can be of great value, in some problems it is 
desirable to study the changes in the same people 
through time. The set of people chosen for repeated 
investigation is called a panel. One advantage of
the panel study over the single survey is the deeper 
analysis available. For instance, when a net 5 per 
cent change takes place, does this mean that only 
5 per cent of the people changed, or perhaps that 
15 per cent changed one way and 10 per cent an- 
other? Second, additional measurement precision 
comes from matching responses from one interview
to another. Third, panel studies offer flexibility
that allows later inquiries to help explain
earlier findings. [See PANEL STUDIES.] 
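
The distinction between net and gross change can be shown with a turnover table. The panel below is hypothetical, constructed so that 15 per cent of the panelists move one way and 10 per cent the other; the function name is introduced only for this sketch.

```python
from collections import Counter


def turnover(first_wave, second_wave):
    """Cross-tabulate each panelist's answers in two interviews."""
    return Counter(zip(first_wave, second_wave))


# Hypothetical panel of 100 people answering yes or no in two interviews.
first = ["yes"] * 40 + ["no"] * 60
second = ["yes"] * 30 + ["no"] * 10 + ["yes"] * 15 + ["no"] * 45

table = turnover(first, second)
n = len(first)

to_yes = table[("no", "yes")] / n    # 15 of 100 changed from no to yes
to_no = table[("yes", "no")] / n     # 10 of 100 changed from yes to no
net_change = to_yes - to_no          # about 0.05, all that two separate surveys show
gross_change = to_yes + to_no        # 0.25, visible only in the panel
```

Two independent surveys would reveal only the rise from 40 to 45 per cent answering yes; the panel shows that a quarter of the respondents changed their answers.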

Dropouts. Panel studies, even when they start 
out on an unbiased sample, have the bias that the 
less informed, the lower-income groups, and those 
not interested in the subject of the panel tend to
drop out. Sobol (1959) suggested sampling these 
people more heavily to begin with, and she tried to 
follow movers. According to Seymour Sudman of 
the National Opinion Research Center, in national 
consumer panels and television rating panels where 
a fee is paid to the participant, the lower-income 
groups do not drop out. 

Beginning effects. When new individuals or 
households first join a panel, their early responses 
may differ from their later ones. The “first-month”
effect has unknown origins. For example, after the 
first month on the panel, the fraction of unem- 
ployed reported in private households decreases 
about 6 per cent. Over the course of several panel 
interviews, more houses become vacant and con- 
sumer buying decreases (Neter & Waksberg 1964a; 
Waksberg & Pearl 1964). Household repairs decreased
by 9 per cent between the second and third
interview. In consumer panels, the reports made 
during the first six or eight weeks of membership 
are usually not included in the analysis. The start- 
up differences are not clear-cut and emphatic but 
unsettling enough that the data are set aside, ex- 
pensive as that is. 

Sample surveys are not alone in these “first-time” 
effects. Doctors report that patients’ blood pres- 
sures are higher when taken by a strange doctor. 
In the Peirce reaction-time data, presented in Table 
4, the first day’s average reaction time was about 
twice those of the other 23 days. 

Long-run effects. A most encouraging finding 
in consumer panel studies has been the stability of 
the behavior of the panelists. By taking advantage 
of the process of enlarging two panels, Ehrenberg 
(1960) studied the effects of length of panel mem- 
bership in Great Britain and in Holland. When he 
compared reports of newly recruited households 
(after their first few weeks) with those of “old” 
panel members, he found close agreement for pur- 
chasing rates, brand shares of market, and diary 
entries per week. 

Panels do have to be adjusted to reflect changes 
in the universe, and panel families dissolve and 
multiply. 

Nonresponse. The general problem of non- 
response arises because the properties of the non- 
respondents usually differ to some degree from 
those of respondents. Unfortunately, nonresponse 
is not confined to studies of human populations. 
Physical objects can be inaccessible for various 
reasons: records may be lost, manholes may be 
paved over, a chosen area may be in dense jungle, 
or the object may be too small to be detected. One
tries to reduce nonresponse, adjust estimates for
it, and allow for it in measures of variability.


Mail questionnaires. The following advice, 
largely drawn from Levine and Gordon (1958- 
1959) and Scott (1961), is intended to increase 
response from mail questionnaires: 

(1) Respondent should be convinced that the 
project is important. 

(2) Preparatory letter should be on the letter- 
head of a well-known organization or, where ap- 
propriate, should be signed by a well-known per- 
son. In the United States and in Great Britain, 
governmental agencies are more likely to obtain 
responses than most organizations. Indeed, Scott 
(1961) reports 90 per cent response! Special popu- 
lations respond to appeals from their organizations. 

In pilot studies preparatory to using mailed 
census questionnaires in the initial stage of enu- 
meration for the 1970 census (enumerator to fol- 
low up nonrespondents), the U.S. Bureau of the 
Census got the percentages of responses to the 
mailing shown in Table 2. 


Table 2

                          PERCENTAGE OF RESPONSE
                          Long form    Short form
Cleveland, Ohio              78           80
Louisville, Kentucky         85           88


(3) Rewards may be used (gifts, trading stamps,
sweepstakes). Do not offer a copy of the final report
unless you are prepared to give it.

(4) Make questionnaire attractive (printing on 
good paper is preferred), easy to read, and easy to 
fill in, remembering that many people have trouble 
reading fine print. Longer questionnaires usually 
lower the response rate. 

(5) Keep questions simple, clear, as short as 
possible, and where multiple-choice answers ap- 
pear, make sure that they do not force respondent 
to choose answers that do not represent his 
position.

(6) Try to keep early questions interesting and 
easy; do not leave important questions to the end; 
keep related questions together, unless there are 
strong reasons to act otherwise.

(7) Use a high class of mail, first-class, airmail,
and even special delivery, both for sending the
questionnaire and on the return envelope. Do not
expect respondent to provide postage. In Great 
Britain, Scott (1961) found that compared with a 
postcard a card to be returned in an envelope 
raised response. 

(8) Follow hard-core resistance with repeat 
questionnaire (the sixth mailing may still be re- 
warding), telegram, long-distance phone call, or 
even personal interview, as discussed below. Small
response from early mailings may be badly biased;
for example, successful hunters respond more
readily than unsuccessful ones to questions about
their bag (Kish 1965, p. 547).

(9) Do not promise or imply anonymity and 
then retain the respondent's identity by subterfuge, 
however worthy the cause. Views on the effects of 
anonymity are mixed. If respondent's identity is 
needed, get it openly. 

The principles set out above for mail question- 
naires and those below for personal interviews may 
well be culture-bound for they are largely gathered 
from Western, English-speaking experience. For 
example, where paper is expensive, questionnaires 
on better paper may be less likely to be returned 
than those on poorer paper. 

Sample surveys using personal interviews. In 
personal interviews, 80 to 90 per cent response has 
been attained even on intimate topics. In 1966, 
85 per cent was regarded as rather good for pre- 
designated respondents in household surveys. In 
addition to the relevant maxims given above for 
mail surveys, to reduce nonresponse in personal 
interview surveys Sharp and Feldt (1959), among 
others, suggest some of the following:

(1) Send preview letter; use press to announce 
survey. In three lengthy surveys on different topics, 
according to Reuben Cohen of the Opinion Re- 
search Corporation, a letter sent in advance led to 
an average gain of 9 per cent in reaching, after 
four calls, urban adult respondents randomly 
drawn from the household list. Cohen also suggests 
that follow-up letters, after unsuccessful interview- 
ing attempts, can reduce urban nonresponse by 
about one-third. Some students of polling believe 
that the actual impact of the preview letter is 
largely on the interviewer, who thinks that obtain- 
ing cooperation will be easier because of it—and 
so it is. 

(2) Use trained interviewers, that is, interview- 
ers trained especially to handle opening remarks, 
to explain the need for full coverage, and to get 
information about profitable times to make later 
calls (“callbacks”) to reach respondents who are 
initially not at home. Experienced interviewers 
have had 3 per cent to 13 per cent fewer nonre- 
spondents than inexperienced ones. 

(3) Be flexible about calling at convenience of 
respondent, even at his place of work or recreation, 
on evenings and on week ends. 

(4) Allow interviewer to call back many times 
to locate assigned respondent. 

(5) Employ interpreter when appropriate. 

(6) In more esoteric situations, know the cul- 
ture. Do not plan to interview farmers in the peak 
periods of farm activity. An anthropologist sched- 
uled a survey of current sexual behavior among 
South Sea islanders during the season when women
were taboo to fishermen—the natives, finding it 
a great joke, were slow to explain. 

Extra effort. When a survey carried out in the 
usual way produces a surprisingly large nonre- 
sponse, an all-out effort may be mounted using 
many of the devices mentioned earlier. A rule of
thumb is that the nonresponse can be reduced by 
about half. 

Oversampling nonrespondents. Repeated call- 
backs are the traditional method for reducing non- 
response in personal interviews, and careful cost 
analysis has shown that their cost per completed 
interview is lower than was at first supposed when 
quota sampling was popular. Kish and Hess (1959) 
report a procedure for including in the current 
sample nonrespondents from similar previous sur- 
veys, so as to have in advance an oversupply of 
persons likely not to respond. Then the sample sur- 
vey, although getting responses from these people 
at a lower rate than from others, more nearly fills 
out its quotas. 

Subsampling nonrespondents. To reduce non- 
response in mail surveys, subsampling the nonre- 
spondents and pursuing them with personal inter- 
views has been used frequently (formulas for 
optimum design are given in Hansen & Hurwitz 
1946). In methods thus far developed the assump- 
tion is made that nonrespondents can surely be 
interviewed. When this assumption is unjustified, 
the method is less valid. 

Adjusting for respondents not at home. The 
next method adjusts for those not at home but 
does not handle refusals, which often come to 
about half the nonresponse. Bartholomew (1961) 
has got accurate results by assuming that most of 
the bias arises from the composition of the popu- 
lation available at the first call. By finding out 
when to call back, the interviewer reduces later 
biases from this source. The interviewer gets in- 
formation either from others in the house or from 
neighbors. To illustrate, in empirical investigations 
of populations of known composition, Bartholomew 
studied the percentage of men in political wards of 
a city. In four wards, differences between first-call 
and second-call samples in percentage of men were 
17 per cent, 29 per cent, 36 per cent, and 38 per 
cent, substantial differences. But the differences 
between the second-call percentage of men and the 
actual percentage of men not reached by the first 
call were only 6 per cent, 2 per cent, 2 per cent, 
and 2 per cent, supporting Bartholomew’s point. 

Suppose that proportion $p$ of the population has
the characteristic of interest. It is convenient to
regard $p$ as the weighted average $\rho p_1 + (1-\rho)p_2$,
where $\rho$ is the proportion of first-call responders
in the population, $p_1$ is the proportion of first-call
responders having the characteristic, and $p_2$ is the
proportion of others in the population having the
characteristic. (It is assumed that $p_2$ is independent
of response status after the first call.) Now if $N$,
the total sample size, is expressed as $N = N_1 + N_2 + N_3$,
where $N_1$ is the number of first-call responders
in the sample, $N_2$ is the number of second-call
(but not first-call) responders in the sample,
and $N_3$ is the number of others, then $\rho$ is naturally
estimated by $N_1/N$ (the proportion of first-call
responders in the sample), $p_1$ by $n_1/N_1$ (the
proportion of first-call responders in the sample
who have the characteristic), and $p_2$ by $n_2/N_2$ (the
proportion of second-call responders in the sample
having the characteristic). Putting these estimators
in the weighted average gives, as estimator of $p$,

$$\frac{N_1}{N}\cdot\frac{n_1}{N_1} + \left(1-\frac{N_1}{N}\right)\frac{n_2}{N_2} = \frac{1}{N}\left[\,n_1 + n_2\,\frac{N_2+N_3}{N_2}\right].$$

For example, if the number of men in the first call
is $n_1 = 40$ out of $N_1 = 200$ interviewed, the second-call
data are $n_2 = 200$, $N_2 = 400$, and the original
sample size is $N = 1{,}000$, then the estimate of the
proportion of men is $[40 + 200(800/400)]/1{,}000 = 0.44$.
Even if the theory were
exactly true, some increase in variance would arise
from using such weights instead of obtaining the
whole sample (Kish 1965).
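
The weighted-average estimator just described can be written as a short function. This is a sketch only: the function and argument names are illustrative, and the weight given to first-call responders is simply their share of the sample.

```python
def bartholomew_estimate(n1, N1, n2, N2, N):
    """Estimate a population proportion from first- and second-call data.

    n1, N1: number with the characteristic and number interviewed
            at the first call.
    n2, N2: the same two counts for second-call (not first-call)
            responders.
    N:      total sample size, including those never reached.
    """
    if not (0 < N1 <= N and 0 < N2 <= N - N1):
        raise ValueError("inconsistent sample counts")
    first_call_share = N1 / N   # estimated share of first-call responders
    p1_hat = n1 / N1            # proportion with the characteristic, first call
    p2_hat = n2 / N2            # proportion among second-call responders,
                                # applied to everyone missed at the first call
    return first_call_share * p1_hat + (1 - first_call_share) * p2_hat


# The worked example from the text.
print(round(bartholomew_estimate(n1=40, N1=200, n2=200, N2=400, N=1000), 2))  # 0.44
```

The second-call proportion stands in for all 800 people not reached at the first call, which is exactly the assumption the method rests on.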

Extrapolation. Hendricks (1956) suggests plotting
the variable being measured against the percentage
of the sample that has responded on successive
waves and extrapolating to 100 per cent.
This simple, sensible idea could profit from more 
research, empirical and theoretical. 
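
One way to carry out such an extrapolation is to fit a line to the successive waves and read off its value at 100 per cent. The sketch below assumes a straight-line trend, which is an assumption made here for illustration; the choice of curve is not specified in the text, and the wave figures are hypothetical.

```python
def extrapolate_to_full_response(response_pct, estimates):
    """Fit a least-squares line and return its value at 100 per cent.

    response_pct: cumulative percentage of the sample that has responded
                  after each wave.
    estimates:    cumulative estimate of the variable after each wave.
    """
    n = len(response_pct)
    if n < 2 or n != len(estimates):
        raise ValueError("need at least two matching points")
    mean_x = sum(response_pct) / n
    mean_y = sum(estimates) / n
    sxx = sum((x - mean_x) ** 2 for x in response_pct)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(response_pct, estimates))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept + slope * 100.0


# Hypothetical waves: the estimate drifts downward as response accumulates.
pct = [40.0, 60.0, 80.0]
est = [10.0, 9.0, 8.0]
print(extrapolate_to_full_response(pct, est))
```

With only a few waves the fitted line is poorly determined, so the extrapolated value should be read as a rough adjustment rather than a precise estimate.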

Effect on confidence interval. In sample sur- 
veys, nonresponse increases the lengths of the con- 
fidence intervals for final estimates by unknown 
amounts. For dichotomous types of questions, the 
suggestion is often made that all the nonresponses 
be counted first as having the attribute, then as not 
having it. The effect on the 95 per cent confidence 
interval is shown in Table 3. When such extreme
allowances are required, the result of, say, 20 per 
cent nonresponse is frequently disastrous. For large 
random samples, this treatment of nonresponse, as 
may be seen from Table 3, adds approximately 
the per cent of nonresponse to the total length of 
the confidence interval that would have been ap- 
propriate with 100 per cent response. For example, 
with a sample of 2,500 from a large population, a 
95 per cent confidence interval from 58 per cent 
to 62 per cent would be lengthened by 20 per cent 
nonresponse to 48.5 per cent to 71.5 per cent. This 
additional length gives motivation enough for want- 
ing to keep nonresponse low. 


Table 3. Allowance to be added to and subtracted
from the observed percentage to give at
least 95 per cent confidence of covering the
true value*

PER CENT                   SAMPLE SIZE
NONRESPONSE        100       2,500     Infinite
     0             9.8        2.0        0.0
     5            10.7        4.1        2.5
    10            12.9        6.6        5.0
    15            15.1        9.0        7.5
    20            17.4       11.5       10.0

* These numbers are approximately correct for percentages near 0.50; they are
likely to be conservative otherwise.

Source: Cochran, Mosteller, and Tukey 1954, p. 280.
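
The arithmetic of applying Table 3 can be sketched as a simple lookup. The dictionary below only transcribes the tabulated allowances; the function name is illustrative, and no attempt is made to interpolate between tabulated sample sizes or nonresponse rates.

```python
# Allowances in percentage points, transcribed from Table 3.
# Outer key: per cent nonresponse. Inner key: sample size.
TABLE_3 = {
    0:  {100: 9.8,  2500: 2.0,  float("inf"): 0.0},
    5:  {100: 10.7, 2500: 4.1,  float("inf"): 2.5},
    10: {100: 12.9, 2500: 6.6,  float("inf"): 5.0},
    15: {100: 15.1, 2500: 9.0,  float("inf"): 7.5},
    20: {100: 17.4, 2500: 11.5, float("inf"): 10.0},
}


def conservative_interval(observed_pct, sample_size, nonresponse_pct):
    """Return (low, high) using the tabulated worst-case allowance."""
    allowance = TABLE_3[nonresponse_pct][sample_size]
    return observed_pct - allowance, observed_pct + allowance


# The example from the text: 60 per cent observed, sample of 2,500.
print(conservative_interval(60.0, 2500, 0))    # (58.0, 62.0)
print(conservative_interval(60.0, 2500, 20))   # (48.5, 71.5)
```

The interval grows from 4 points wide to 23 points wide, an increase of roughly the 20 per cent nonresponse itself.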


No one believes that these “worst possible” limits 
represent the true state of affairs, nor should any- 
one believe the optimist who supposes that the 
nonrespondents are just like the respondents. In 
large samples, differences as large as 28 per cent 
in the fraction possessing a characteristic between 
the first 60 per cent interviewed and the next 25 
per cent have been reported. To develop a more 
sensible set of limits in the spirit of Bayesian infer- 
ence would be a useful research job for sample 
survey workers and theoretical statisticians. [See 
BAYESIAN INFERENCE.] This urgently needed work
would require both empirical information (pos- 
sibly newly gathered) and theoretical development. 

The laboratory worker who studies human be- 
havior rarely has a defined target population and 
he frequently works with a sample of volunteers. 
Under such circumstances we cannot even guess 
the extent of nonresponse. Again the hope is that 
the property being studied is independent of will- 
ingness or opportunity to serve as a subject—the 
position of the optimist mentioned above.

Since a few experimenters do sample defined 
populations, the argument that such sampling is 
impossible has lost some of its strength. The argu- 
ment that such sampling is too expensive has to 
be appraised along with the value of inferences 
drawn from the behavior of undefined sampled 
populations. 

Studying differences between groups offers more 
grounds for hope that bias from nonresponse works 
in the same direction and in nearly the same 
amount in both groups and that the difference may 
still be nearly right. This idea comes partly from 
physical measurements where sometimes knowl- 
edge can make such arguments about compensat- 
ing errors rigorous. But, as Joseph Berkson warns, 
no general theorem states “Given any two wrong 
numbers, their difference is right.” 

Response error. When incorrect information
about the respondent enters the data, a response
error occurs. Among the many causes are misunderstandings,
failures of memory, clerical errors,
and deliberate falsehoods. The magnitudes of some
of these errors and some ways to reduce them are
discussed below.

Telescoping events. In reporting such things as 
amount of broken crockery or expenditures for 
household repairs, some respondents telescope the 
events of a considerable period into the shorter one 
under study. As a possible cure, Neter and Waks- 
berg (1964b) have introduced a device called 
“bounded recall.” In a study of household repairs, 
the respondent was interviewed twice, first under 
unbounded recall, during which the full story of 
the last month, including the telescoping from 
previous months, was recorded by the interviewer. 
Second, in an interview using bounded recall a 
month later, the respondent was deliberately aided 
by the record of repairs from the first interview. 
The magnitudes of the effects of telescoping are 
considerable, because the “unbounded” interview 
for household repairs gave 40 per cent more jobs 
and 55 per cent higher expenditure than did the 
“bounded” interview. Data from a “bounded” 
interview produce less bias. 

Forgetting. Although telescoping occurs for 
some activities, chronic illness (Feldman 1960), 
which had already been clinically diagnosed, was 
reported only at a 25 per cent rate in household 
interviews. Others report rates in the 40 per cent 
to 50 per cent range. Feldman despairs of the 
household interview for this purpose; but if im- 
provement in reports is to be attempted, he recom- 
mends more frequent interviews by competent, 
trained interviewers and, in a panel study, the use 
of a morbidity diary to improve self-reporting. One 
limitation, not attributable to forgetting, is set be- 
cause physicians choose not to inform their patients 
of every illness they diagnose. 

Sudman (1964) compared consumer panel re- 
ports based upon diary records with reports based 
upon unaided recall. First, he shows that for 72 
grocery products (55 being food), the purchases 
recorded in the diary underestimate the amount 
shipped by the manufacturer (after adjustment 
for nonhousehold use) by a median of about 15 
per cent. The underreporting was highly predict- 
able, depending on both the properties of the prod- 
uct (frequency of purchase, where most often 
purchased) and its treatment in the diary (type 
size, page number, position on page). Second, when 
recall was compared with diary, the median ratio 
of purchases (purchases recalled divided by pur- 
chases in diary) was 1.05 for nonfood products, 
1.83 for perishable food, 1.54 for staple foods. 
Leading nationally advertised brands have their 
market shares overstated under recall by 50 per
cent compared to diary records, and chain brands 
are understated. 

Use of experts. After respondents had valued 
their own homes, Kish and Lansing (1954) ob- 
tained expert appraisals for a sample of the homes. 
The comparison of the experts’ appraisal with the 
homeowners’ appraisal can be used to adjust the 
total valuation or to adjust valuations for groups 
of houses. For example, the homeowners may 
average a few per cent too high. 

Editing records. Whenever a comparison of 
related records can be made, the accuracy of rec- 
ords can probably be improved. For example, Cen- 
sus Bureau editors, experienced in the lumber 
business, check annual sawmill production reports 
against those of the previous year, and large 
changes are rechecked with the sawmill. 

Describing response errors. One common measure
in the analysis of nonsampling errors puts bias
and sampling variability into one index, called the 
mean square error. The larger the mean square 
error, the worse the estimate. The mean square 
error is the expected squared deviation of the ob- 
served value from the true value. This quantity 
can be separated into the sum of two parts, the 
variance of the observation around its own mean 
and the square of the bias. Although true values 
are not available, in the United States, the Bureau 
of the Census, for instance, tries to find standards 
more accurate than the census to get an estimate 
of the response bias to particular questions. For
example, using the Current Population Survey as 
a standard, the Bureau of the Census not only finds 
out that the census underestimates the percentage 
in the labor force, but the bureau also gets data on 
the portions of the population not being satisfac- 
torily measured, either because of variability or 
bias. Using such information, the bureau can profit- 
ably redesign its inquiries because it knows where 
and how to spend its resources. 
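
The separation of mean square error into variance plus squared bias can be checked numerically. The sketch below uses a hypothetical estimate that takes three values with known probabilities; the numbers are chosen only so that the arithmetic is easy to follow.

```python
def mse_decomposition(values, probs, true_value):
    """Return (mse, variance, bias) for a discrete distribution of estimates."""
    mean = sum(p * v for v, p in zip(values, probs))
    variance = sum(p * (v - mean) ** 2 for v, p in zip(values, probs))
    bias = mean - true_value
    # Expected squared deviation of the estimate from the true value.
    mse = sum(p * (v - true_value) ** 2 for v, p in zip(values, probs))
    return mse, variance, bias


# Hypothetical estimate: 9, 10, or 11 with probabilities 1/4, 1/2, 1/4,
# while the true value is 9.5.
mse, variance, bias = mse_decomposition([9.0, 10.0, 11.0],
                                        [0.25, 0.5, 0.25], 9.5)
print(mse, variance, bias ** 2)   # 0.75 0.5 0.25
```

Here one third of the mean square error comes from bias, a share that no increase in sample size would remove.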

Response uncertainty. In attitudinal studies 
(Katz 1946), the investigator must be especially 
wary of reports obtained by polling the public on 
a matter where opinion is not crystallized. The 
No Opinion category offers one symptom of trouble: 
for example, Katz reports that in 1945 only 4 per 
cent had No Opinion about universal military 
training in the United States, 13 per cent had No 
Opinion about giving the atom bomb secret to the 
United Nations, and 32 per cent had No Opinion 
about U.S. Senate approval of the United Nations 
charter. Even though the vote was 66 per cent to 
3 per cent in favor of approving the charter, the 
32 per cent No Opinion must suggest that the 69 
per cent who offered an opinion contained a large 
subgroup who also did not hold a well-formed
opinion. 


Errors in analysis 


Troubles with variability. In analyzing data, 
the presence of variability leads to many unsus- 
pected difficulties and effects. In addition to treat- 
ing some of the common traps, this section gives 
two ways to analyze variability in complicated prob- 
lems where theoretical formulas for variance are 
either unavailable or should be distrusted. 

Inflated sample size. The investigator must 
frequently decide what unit shall be regarded as 
independent of what other unit. For example, in 
analyzing a set of responses made by 10 individ- 
uals, each providing 100 responses, it is a common 
error to use as the sample size 10 × 100 = 1,000
responses and to make calculations, based perhaps
on the binomial distribution, as if all these responses
were independent. Unless investigation has
shown that the situation is one in which inde- 
pendence does hold from response to response 
both within and between individuals, distrust this 
procedure. The analysis of variance offers some ways 
to appraise both the variability of an individual 
through time and the variation between individuals. 
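
The size of the error from an inflated sample size can be illustrated by computing the standard error both ways. The counts below are hypothetical, and treating each individual's proportion as a single observation is only one simple alternative to a full analysis of variance.

```python
from math import sqrt
from statistics import mean, stdev


def compare_standard_errors(successes, trials_each):
    """Return (p, naive_se, between_se) for a pooled proportion.

    successes:   number of 'successes' for each individual.
    trials_each: number of responses given by every individual.
    """
    k = len(successes)
    proportions = [s / trials_each for s in successes]
    p = mean(proportions)
    # Naive: pretend all k * trials_each responses are independent.
    naive_se = sqrt(p * (1 - p) / (k * trials_each))
    # Alternative: treat each individual's proportion as one observation.
    between_se = stdev(proportions) / sqrt(k)
    return p, naive_se, between_se


# Hypothetical data: 10 individuals, 100 responses each.
counts = [20, 35, 50, 65, 80, 30, 45, 55, 70, 50]
p, naive_se, between_se = compare_standard_errors(counts, 100)
print(f"p = {p:.2f}, naive SE = {naive_se:.4f}, between-individual SE = {between_se:.4f}")
```

When individuals differ as much as these do, the binomial calculation on 1,000 responses understates the standard error several times over.
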

Use of matched individuals. Some investigators
fail to take advantage of the matching in their
data. Billewicz (1965, p. 623) reports that in 9 of
20 investigations that he examined for which the
data were gathered from matched members in experimental
and control groups, the analysis was
done as if the data were from independent groups.
Usually the investigator will have sacrificed considerable
precision by not taking advantage in his
analysis of the correlation in the data. Usually, but
not always, the statistical significance of the results
will be conservative. When matched data are
analyzed as if independent, the investigator owes
the reader an explanation for the decision.
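
The loss of precision from ignoring matching can be seen by computing Student's t both ways on the same data. The six matched pairs below are hypothetical, and the unpaired statistic shown is the ordinary equal-sample-size two-group form.

```python
from math import sqrt
from statistics import mean, stdev


def paired_and_unpaired_t(treated, control):
    """Return (t_paired, t_unpaired) for equal-length matched samples."""
    n = len(treated)
    if n != len(control) or n < 2:
        raise ValueError("need matched samples of equal size")
    # Paired analysis: work with within-pair differences.
    diffs = [t - c for t, c in zip(treated, control)]
    t_paired = mean(diffs) / (stdev(diffs) / sqrt(n))
    # Unpaired analysis: treat the two groups as independent.
    se_unpaired = sqrt((stdev(treated) ** 2 + stdev(control) ** 2) / n)
    t_unpaired = (mean(treated) - mean(control)) / se_unpaired
    return t_paired, t_unpaired


# Hypothetical matched pairs: large differences between pairs,
# small and steady differences within pairs.
control = [10.0, 12.0, 14.0, 16.0, 18.0, 20.0]
treated = [11.0, 12.5, 15.5, 17.0, 18.5, 21.5]

t_paired, t_unpaired = paired_and_unpaired_t(treated, control)
print(f"paired t = {t_paired:.2f}, unpaired t = {t_unpaired:.2f}")
```

The same mean difference of one unit looks decisive in the paired analysis and negligible in the unpaired one, because the variation between pairs swamps the effect when the matching is ignored.
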

Pooling significance tests. In the same vein,
investigators with several small effects naturally
wish that they could pool these effects to get more
extreme levels of significance than those given by
the single effects. Most methods of pooling significance
tests depend upon independence between the
several measures going into the pool. And that
assumption implies, for example, that data from
several items on the same sample survey cannot
ordinarily be combined into a significance test by
the usual pooling methods because independence
cannot be assured. Correlation is almost certainly
present because the same individuals respond to
each item. Sometimes a remedy is to form a battery
or scale that includes the several items of
interest and to make a new test based upon the
battery (Mosteller & Bush 1954, pp. 328-331).
Naturally, the items would be chosen in advance
for the purpose, not based post hoc upon their
results. In the latter case, the investigator faces
problems of multiplicity, discussed below.

Outlying observations. Frequently data contain 
suspicious observations that may be outliers, ob- 
servations that cannot be rechecked, and yet that 
may considerably alter the interpretation of the 
data when taken at their face value. An outlier is 
an observation that deviates much more from the 
average of its mates or has a larger residual from 
a predicted value than seems reasonable on the 
basis of the pattern of the rest of the measure- 
ments. The classic example is given by the in- 
come distribution for members of a small college 
freshman class, exactly one member of which hap- 
pens to be a multimillionaire. The arithmetic mean 
is not typical of the average member's income; but 
the median almost ignores an amount of income 
that exceeds the total for all the others in the class. 
Sometimes the outliers can be set aside for special 
study. 

One current approach tailors the analysis to the 
type of outlier that is common in the particular 
kind of investigation by choosing statistics that are 
both appropriate and not especially sensitive to 
outliers. For example, as a measure of location, one 
might systematically use the median rather than 
the mean, or for more efficiency, the trimmed 
mean, which is the average of the measurements 
left after the largest and smallest 5 per cent (or
$100\alpha$ per cent) of the measurements have been
removed (Mosteller & Tukey 1966, secs. A5, B5).
In normal populations, the median has an efficiency
of about 64 per cent, but the trimmed mean
has most of the robustness of the median and an
efficiency of about $1 - \tfrac{2}{3}\alpha$, where $\alpha$ is the proportion
trimmed off each end. For $\alpha = 0.10$, the efficiency
is 93 per cent. [See ERRORS, article on the
EFFECTS OF ERRORS IN STATISTICAL ASSUMPTIONS; 
NONPARAMETRIC STATISTICS, article on ORDER 
STATISTICS; STATISTICAL ANALYSIS, SPECIAL PROB- 
LEMS OF, article on OUTLIERS.] 
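
The behavior of the three measures of location on data with one extreme value can be sketched as follows. The incomes are hypothetical, echoing the freshman-class example, and the trimming rule used here (drop the integer part of $\alpha n$ observations from each end) is one simple convention among several.

```python
from statistics import mean, median


def trimmed_mean(values, alpha):
    """Average after dropping the smallest and largest alpha fraction."""
    if not 0 <= alpha < 0.5:
        raise ValueError("alpha must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(alpha * len(ordered))   # observations dropped from each end
    kept = ordered[k:len(ordered) - k]
    return mean(kept)


# Hypothetical incomes in thousands; one member is a multimillionaire.
incomes = [3, 4, 4, 5, 5, 5, 6, 6, 7, 5000]

print(mean(incomes))                 # 504.5
print(median(incomes))               # 5.0
print(trimmed_mean(incomes, 0.10))   # 5.25
```

The arithmetic mean is dominated by the single large income, while the median and the 10 per cent trimmed mean both describe the typical member.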

Shifting regression coefficients. When one fits 
a regression equation to data, this regression equa- 
tion may not forecast well for a new set of data. 
Among the reasons are the following: 

(1) The fitted regression coefficients are not 
true values but estimates (sampling error). 

(2) If one has selected the best from among 
many predictive variables, the selected ones may 
not be as good as they appeared to be on the basis 
of the sample (regression effect). 


(3) Worse, perhaps none of the predictive vari- 
ables were any good to start with (bad luck or poor 
planning). 

(4) The procedure used to choose the form of 
the regression curve (linear, quadratic, exponen- 
tial, . . .) has leaned too hard on the previously 
available data, and represents them too well as 
compared with the total population (wrong form). 

(5) The new sample may be drawn from a population
different from the old one (shifting population).

What are the effects of (2) and (5)? Consider
the regression of height ($Y$) on weight ($X$) for a
population of boys. Suppose that the true regression
equation for this population is

$$E(Y) = a + b(X - \mu_X),$$

where $a$ and $b$ are unknown constants, $E(Y)$ is the
expected value of $Y$ for a given value of $X$, and
$\mu_X$ is the mean of $X$. Suppose that an individual's
height has a predictive error $e$ that has mean 0,
variance $\sigma^2$, and is unrelated to $X$, $Y$, and the true
values of $a$ and $b$.

Suppose that the experimenter chooses fixed values
of $X$, $x_i$, such as 70, 80, 90, 100, 110, 120, 130,
140 pounds, obtaining boys having each of these
weights and measuring their heights $y_i$. Then the
data are paired observations $(x_i, y_i)$, $i = 1, 2, \ldots, n$.

Estimating $a$ and $b$ from the sample by the usual
least squares formulas one gets $\hat{a}$ and $\hat{b}$. Given a
new sample with the same values of $X$ from the
same population, one can estimate the $Y$'s for the
new sample by

$$\hat{Y}_i = \hat{a} + \hat{b}(x_i - \bar{x}),$$

(where $\bar{x}$ is the average of the $x_i$) and then the
expected mean square error of the estimates for the
new sample is

$$\text{expected value of } \sum_i (\hat{Y}_i - Y_i')^2/n = \sigma^2(1 + 2/n),$$

where $Y_i'$ is the height for an individual in the new
sample. Note that $\sigma^2$ is the expected mean square
error that would obtain were $a$ and $b$ known exactly
instead of having been estimated.

Suppose that in addition to this population, there
is a new population with different values of $a$ and $b$,
say $a'$ and $b'$. Both populations come from a group
of several populations with $a$ and $b$ varying from
population to population and having $\bar{a}$ and $\bar{b}$ as the
mean values of $a$ and $b$, respectively, and $\sigma_a^2$ and
$\sigma_b^2$ as the variances of these sets of regression
coefficients. For the example of the boys’ weights,
consider the distribution of values of $a$ and $b$ from
one city to another.

The regression line fitted on the basis of the
sample from one population and then used on another
population yields expected mean square error

$$\left(1 + \frac{2}{n}\right)\sigma^2 + 2(\sigma_a^2 + \sigma_b^2\sigma_x^2),$$

where $\sigma_x^2$ is the variance of the chosen set of $x$'s.
The first term comes as before from ordinary sampling
variation of the $Y$'s around the fitted regression
line (the 2 of $2/n$ being the dimension of the
parameter space), but the $2(\sigma_a^2 + \sigma_b^2\sigma_x^2)$ comes
from drawing two sets of regression coefficients
from the population of regression coefficients. This
term may be substantial compared to $2\sigma^2/n$ or
even $\sigma^2$.

We need extensive empirical results for such
experiments to get a notion of the size of
$2(\sigma_a^2 + \sigma_b^2\sigma_x^2)$ in various settings of interest to social and
natural scientists. These investigations have not
yet been carried out. The formulas for mean square
error in this realistic situation must cause concern
until more empirical studies are done. The existence
of the added term should be recognized and
an attempt made to assess its contribution numerically.
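
A numerical assessment of the added term might begin with a small calculator like the one below. It is a sketch that assumes the expected mean square error has the form $(1 + 2/n)\sigma^2 + 2(\sigma_a^2 + \sigma_b^2\sigma_x^2)$; the weights are those mentioned in the text, while the values given to the variances are hypothetical and serve only to show the arithmetic.

```python
from statistics import pvariance


def expected_mse(sigma2, n, var_a=0.0, var_b=0.0, var_x=0.0):
    """Expected mean square error of regression forecasts.

    sigma2: variance of an individual's predictive error.
    n:      number of (x, y) pairs used to fit the line.
    var_a, var_b: population-to-population variances of a and b
            (zero when the new sample comes from the same population).
    var_x:  variance of the chosen x values (divisor n).
    """
    same_population = (1 + 2 / n) * sigma2
    shifting_population = 2 * (var_a + var_b * var_x)
    return same_population + shifting_population


weights = [70, 80, 90, 100, 110, 120, 130, 140]   # x values from the text
var_x = pvariance(weights)                        # 525 for these weights

# Hypothetical variances, chosen only to illustrate the two cases.
same = expected_mse(sigma2=4.0, n=len(weights))
shifted = expected_mse(sigma2=4.0, n=len(weights),
                       var_a=1.0, var_b=0.01, var_x=var_x)
print(same, shifted)
```

With these illustrative values the shift between populations contributes far more to the forecast error than the sampling error of the fitted coefficients does.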

Uncontrolled sources of variation. Although the
important formula $\sigma_{\bar{X}} = \sigma/\sqrt{n}$ for the standard
deviation of a mean $\bar{X}$, a random variable, is correct
when $n$ uncorrelated measurements are drawn
from a distribution with standard deviation $\sigma$, two
difficulties arise. The measurements may not be
uncorrelated, and the distribution may change from
one set of measurements to another.

Peirce’s data illustrate these difficulties. In an
empirical study intended to test the appropriateness
of the normal distribution, C. S. Peirce (1873)
analyzed the time elapsed between a sharp tone
stimulus and the response by an observer, who
made about 500 responses each day for 24 days.
Wilson and Hilferty (1929) reanalyzed Peirce’s
data. Table 4 shows sample means, $\bar{X}$, estimated
standard deviations of the mean, $s_{\bar{X}}$, and the ratio
of the observed to the estimated interquartile range,
$Q_3 - Q_1$. The observed interquartile range is based
on percentage points of the observed distribution;
the estimated interquartile range is based on the
assumption of a normal distribution and has the
value $2(0.6745s)$, where $s$ is the sample standard
deviation. In passing, note that the ratio is systematically
much less than unity, defying the normality
assumption. More salient for this discussion is the
relation of day-to-day variation to the values of $s_{\bar{X}}$
based on within-day variation. The latter varies
from 1.1 to 2.2 (after the first day’s data, whose
mean and standard deviation are obviously outliers,
are set aside). These limits imply naive standard
deviations of the difference between means for
pairs of days ranging from 1.6 to 3.1. If these
applied, most differences would have to be less
than twice these, 3.2 to 6.2, and practically all less
than 4.8 to 9.3. Table 4 shows that the actual differences
−38, +2, −57, +27, ..., +11, −4, −8 impolitely
pay little attention to such limitations.


Table 4. Daily statistics from Wilson and Hilferty’s
analysis of C. S. Peirce’s data

       $\bar{X} \pm s_{\bar{X}}$       $(Q_3 - Q_1)$
Day    (milliseconds)     $/\,2(0.6745s)$
  1    475.6 ± 4.2        0.932
  2    241.5 ± 2.?        0.842
  3    203.1 ± 2.0        0.905
  4    205.6 ± 1.8        0.730
  5    148.5 ± 1.6        0.912
  6    175.6 ± 1.8        0.744
  7    186.9 ± 2.2        0.753
  8    194.1 ± 1.4        0.840
  9    195.8 ± 1.6        0.756
 10    215.5 ± 1.3        0.850
 11    216.6 ± 1.7        0.782
 12    235.6 ± 1.7        0.759
 13    244.5 ± 1.2        0.922
 14    236.7 ± 1.8        0.529
 15    236.0 ± 1.4        0.662
 16    233.2 ± 1.7        0.612
 17    265.5 ± 1.7        0.792
 18    253.0 ± 1.1        0.959
 19    258.7 ± 1.8        0.502
 20    255.4 ± 2.0        0.521
 21    245.0 ± 1.2        0.790
 22    255.6 ± 1.4        0.688
 23    251.4 ± 1.6        0.610
 24    243.4 ± 1.1        0.73?

Source: Wilson & Hilferty 1929.
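
The comparison between the naive limits and the actual day-to-day differences can be reproduced from the daily means in Table 4. The sketch below transcribes those means for days 2 to 24 and uses the most generous limit discussed in the text, three times the naive standard deviation of a difference computed from the largest within-day standard error; the variable names are illustrative.

```python
from math import sqrt

# Daily means in milliseconds for days 2 to 24, transcribed from Table 4
# (day 1 is set aside as an outlier, as in the text).
DAILY_MEANS = [241.5, 203.1, 205.6, 148.5, 175.6, 186.9, 194.1, 195.8,
               215.5, 216.6, 235.6, 244.5, 236.7, 236.0, 233.2, 265.5,
               253.0, 258.7, 255.4, 245.0, 255.6, 251.4, 243.4]

# Range of within-day standard errors of the mean quoted in the text.
S_LOW, S_HIGH = 1.1, 2.2


def naive_sd_of_difference(s):
    """SD of the difference of two independent means, each with SD s."""
    return sqrt(2.0) * s


# Difference between each day's mean and the previous day's mean.
differences = [b - a for a, b in zip(DAILY_MEANS, DAILY_MEANS[1:])]

generous_limit = 3 * naive_sd_of_difference(S_HIGH)   # about 9.3
too_large = [d for d in differences if abs(d) > generous_limit]

print(f"naive SD of a difference: {naive_sd_of_difference(S_LOW):.1f} "
      f"to {naive_sd_of_difference(S_HIGH):.1f}")
print(f"{len(too_large)} of {len(differences)} successive differences "
      f"exceed {generous_limit:.1f} in absolute value")
```

On these figures, 10 of the 22 successive differences exceed even the most generous limit, which the within-day variation says practically none should.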

In the language of analysis of variance, Peirce’s
data show considerable day-to-day variation. In the
language of Walter Shewhart, such data are “out
of control”: the within-day variation does not
properly predict the between-days variation [see
QUALITY CONTROL, article on PROCESS CONTROL]. Nor is
it just a matter of the observer “settling down” in
the beginning. Even after the twentieth day he still
wobbles.

Need for a plurality of samples. The wavering 
in these data exemplifies the history of the “per- 
sonal equation” problem of astronomy. The hope 
had been that each observer's systematic errors 
could be first stabilized and then adjusted for, thus 
improving accuracy. Unfortunately, attempts in 
this direction have failed repeatedly, as these data 
suggest they might. The observer's daily
idiosyncrasies need to be recognized, at least by
assigning additional day-to-day variation.

Wilson and Hilferty (1929, p. 125) emphasize
that Peirce’s data illustrate “the principle that we
must have a plurality of samples if we wish to
estimate the variability of some statistical quantity,
and that reliance on such formula as $\sigma/\sqrt{n}$ is not
scientifically satisfactory in practice, even for
estimating unreliability of means” (see Table 4).

Direct assessment of variability. One way to
get a more honest estimate of variability breaks the
data into rational subgroups, usually of equal or
nearly equal sizes. For each subgroup, compute the
statistic (mean, median, correlation coefficient,
spectral density, regression equation, or whatever),
base the estimate for the whole group on the
average of the statistic for the subgroups, and base the
estimate of variability on Student's t with one
degree of freedom less than the number of subgroups.
That is, treat the k group statistics like a sample of
k independent measurements from a normal
distribution. [This method, sometimes called the
method of interpenetrating samples, generalizes the
method for calculating the sampling error for
nonprobability samples described in SAMPLE SURVEYS,
article on NONPROBABILITY SAMPLING.]

At least five groups (preferably at least ten) are 
advisable in order to get past the worst part of the 
t-table. This suggestion encourages using more, not 
fewer, groups. For two-sided 5 per cent levels, see 
Table 5. 
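The direct assessment just described can be sketched in a few lines. The function name `subgroup_interval` and the six subgroups of measurements are made up for illustration; the critical points are the ones listed in Table 5, so the sketch only works for subgroup counts whose degrees of freedom appear there.

```python
import math
import statistics

# Two-sided 5 per cent points of Student's t, copied from Table 5.
T_05 = {1: 12.7, 2: 4.3, 3: 3.2, 5: 2.6, 10: 2.23, 20: 2.09, 60: 2.00}

def subgroup_interval(groups, statistic):
    """Treat the k subgroup statistics as k independent normal measurements."""
    values = [statistic(g) for g in groups]
    k = len(values)
    center = statistics.mean(values)
    std_err = statistics.stdev(values) / math.sqrt(k)
    allowance = T_05[k - 1] * std_err      # KeyError if k - 1 is not tabled
    return center, center - allowance, center + allowance

# Hypothetical data: six subgroups of four measurements each.
groups = [[12, 15, 11, 14], [13, 17, 12, 16], [10, 14, 13, 11],
          [15, 18, 14, 17], [11, 13, 12, 12], [14, 16, 15, 13]]

center, low, high = subgroup_interval(groups, statistics.median)
print(f"estimate {center:.2f}, 95 per cent limits {low:.2f} to {high:.2f}")
```

Any statistic that accepts a list of observations can be passed in place of the median.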

Two major difficulties with this direct assessment
are (a) that it may not be feasible to calculate
meaningful results for such small amounts of
data as properly chosen groups would provide, or
(b) even if the calculations yield sensible results,
they may be so severely biased as to make their use
unwise.

A method with wide application, intended to 
ameliorate these problems, is the jackknife, which 
offers ways to reduce bias in the estimate and to 
set realistic approximate confidence limits in com- 
plex situations. 

Table 5 — Two-sided 5 per cent levels for Student's t for
selected degrees of freedom

Degrees of     5 per cent
freedom        critical point
     1         12.7
     2          4.3
     3          3.2
     5          2.6
    10          2.23
    20          2.09
    60          2.00
   500          1.96
     ∞          1.96


Assessment by the jackknife. Again the data
are divided into groups, but the statistic to be
jackknifed is computed repeatedly on all the data except
an omitted group. With ten groups, the statistic is
computed each time for about 90 per cent of the
data.

More generally, for the jackknife, the desired 
calculation is made for all the data, and then, after 
the data are divided into groups, the calculation is 
made for each of the slightly reduced bodies of data 
obtained by leaving out just one of the groups. 

Let $y_{(j)}$ be the result of making the complex
calculation on the portion of the sample that omits the
jth subgroup, that is, on a pool of k − 1 subgroups.
Let $y_{\text{all}}$ be the corresponding result for the entire
sample, and define pseudo values by

(1)   $$y_{*j} = k\,y_{\text{all}} - (k-1)\,y_{(j)}, \qquad j = 1, 2, \ldots, k.$$

These pseudo values now play the role played by
the values of the subgroup statistics in the method
of interpenetrating samples. For simple means, the
jackknife reduces to that method.

As in the method of interpenetrating samples, in
a wide variety of problems, the pseudo values can
be used to set approximate confidence limits through
Student's t, as if they were the results of applying
some complex calculation to each of k independent
pieces of data.

The jackknifed value $y_*$, which is the best single
result, and an estimate, $s_*^2$, of its variance are
given by

$$y_* = \frac{1}{k}\,(y_{*1} + \cdots + y_{*k}),$$

$$s^2 = \Bigl[\sum_{j} y_{*j}^2 - k\,y_*^2\Bigr] \Big/ (k-1),$$

$$s_*^2 = s^2/k.$$
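A minimal sketch of these formulas follows. The function name `jackknife` and the small data set are illustrative assumptions; the statistic may be any function of a list of observations. With the mean as the statistic and equal group sizes, the pseudo values come out equal to the group means, as the text states.

```python
import math
import statistics

def jackknife(groups, statistic):
    """Return (y_star, s_star, pseudo_values) for k groups of observations."""
    k = len(groups)
    pooled = [x for g in groups for x in g]
    y_all = statistic(pooled)
    pseudo = []
    for j in range(k):
        # Recompute the statistic with the j-th group left out.
        rest = [x for i, g in enumerate(groups) if i != j for x in g]
        y_j = statistic(rest)
        pseudo.append(k * y_all - (k - 1) * y_j)
    y_star = statistics.mean(pseudo)
    # stdev uses the divisor k - 1, matching the formula for s^2.
    s_star = statistics.stdev(pseudo) / math.sqrt(k)
    return y_star, s_star, pseudo

# Made-up data: three groups of three observations.
groups = [[1, 2, 3], [4, 5, 9], [7, 8, 12]]
y_star, s_star, pseudo = jackknife(groups, statistics.mean)
print(pseudo)          # close to the group means 2, 6, 9
print(y_star, s_star)
```

Approximate limits then follow as `y_star` plus or minus the Student's t point for k − 1 degrees of freedom times `s_star`.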


If the statistic being computed has a bias that
can be expressed as a series in the reciprocal of the
sample size, N, the jackknife removes the leading
term (that in 1/N) in the bias. Specifically,
suppose that $\hat{\mu}$, the biased estimate of $\mu$, has expected
value

$$E(\hat{\mu}) = \mu + \frac{a}{N} + \frac{b}{N^2} + \cdots,$$

where a, b, and so on are constants. If $\hat{\mu}_*$ is the
jackknifed estimate, its expected value is

$$E(\hat{\mu}_*) = \mu + \frac{\alpha}{N^2} + \frac{\beta}{N^3} + \cdots,$$

where $\alpha$, $\beta$, and so on are constants. To give a trivial
example, $\sum (X_i - \bar{X})^2/N$ is a biased estimate of $\sigma^2$.
Its expected value is $\sigma^2 - (\sigma^2/N)$, and so it has the
sort of bias that would be removed by jackknifing.
To understand how the first-order bias terms are
removed by jackknifing, one might compute the
expected value of $y_*$ for the special case where

$$E(\hat{\mu}) = \mu + a/N.$$

Then with the use of k groups of equal size, n, so
that kn = N,

$$E(y_{(j)}) = \mu + \frac{a}{(k-1)n},$$

$$E(y_{\text{all}}) = \mu + \frac{a}{kn},$$

$$E(y_{*j}) = E[k\,y_{\text{all}} - (k-1)\,y_{(j)}] = k\mu + \frac{a}{n} - \Bigl((k-1)\mu + \frac{a}{n}\Bigr) = \mu.$$

Finally,

$$E(y_*) = k\mu/k = \mu.$$

The leading term in the bias was removed in the
construction of the $y_{*j}$’s. Even if the sample sizes
are not equal, the leading term in the bias is likely
to have its coefficient reduced considerably.
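The trivial variance example can be checked numerically. In the sketch below each observation is treated as its own group (k = N), a choice made here for illustration, and the data are made up. Jackknifing the divisor-N estimate then reproduces the familiar divisor-(N − 1) estimate, whose expected value is exactly the population variance.

```python
import statistics

def biased_variance(xs):
    """Sum of squared deviations divided by N (not N - 1)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

data = [2.0, 4.0, 4.0, 5.0, 7.0, 9.0, 11.0]   # made-up observations
k = len(data)                                  # one observation per group

y_all = biased_variance(data)
pseudo = [k * y_all - (k - 1) * biased_variance(data[:j] + data[j + 1:])
          for j in range(k)]
y_star = sum(pseudo) / k

print(f"biased estimate     {y_all:.4f}")
print(f"jackknifed estimate {y_star:.4f}")
print(f"divisor N - 1       {statistics.variance(data):.4f}")
```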

Example of the jackknife; ratio estimate. In 
expounding the use of ratio estimates, Cochran 
([1953] 1963, p. 156) gives 1920 and 1930 sizes
(number of inhabitants) for each city in a random 
sample of 49 drawn from a population of 196 large 
U.S. cities. He wishes to estimate the total 1930 
population for these 196 cities on the basis of the 
results of the sample of 49, whose 1920 and 1930 
populations are both known, and from the total 
1920 population. The example randomly groups his 
49 cities into 7 sets of 7 each. Table 6 shows their 
subtotals. 

The formula for the ratio estimate of the 1930
population total is

$$\left(\frac{\text{1930 sample total}}{\text{1920 sample total}}\right) \times (\text{1920 population total}),$$

so that the logarithm of the estimated 1930
population total is given by log(1930 sample total) −
log(1920 sample total) + log(1920 population
total). Consequently the jackknife is applied to
z = log(1930 sample total) − log(1920 sample
total), since this choice minimizes the number of
multiplications and divisions.

Table 6 — Subtotals in thousands for sets of 7 cities

               1920      1930
First 7         751       915
Second 7        977     1,122
Third 7         965     1,243
Fourth 7        385       553
Fifth 7         696       881
Sixth 7         830       937
Seventh 7       450       611

Total         5,054     6,262

Source: Cochran [1953] 1963, p. 156.


Further computation is shown in Table 7, where
in the “all” column the numbers 5,054 and 6,262
come directly from the totals of Table 6,
and in the “i = 1” column the numbers 4,303 =
5,054 − 751 and 5,347 = 6,262 − 915 are the
results of omitting the first 7 cities, and so on for the
other columns. Five-place logarithms have
obviously given more than sufficient precision, so that
the pseudo values of z are conveniently rounded to
three decimals. From these are computed the mean
$z_*$ and the 95 per cent limits = mean ± allowance.
Table 8 gives all the remaining details. The
resulting point estimate is 28,300, about 100 lower than
the unjackknifed estimate. (Since the correct 1930
total is 29,351, the automatic bias adjustment did
not help in this instance. This is a reminder that
bias is an “on the average” concept.) The limits on
this estimate are ordinarily somewhat wider than
would apply if each city had been used as a
separate group, since the two-sided 95 per cent level
for Student's t with 6 degrees of freedom is
$|t_6|_{.05} = 2.447$, while with 47 degrees of freedom it is
$|t_{47}|_{.05} = 2.012$. The standard error found here was
.0125 in logarithmic units, which converts to about
840 in the final total ($4.360 + z_* + s_* = 4.464$;
antilog 4.464 = 29,140; 29,140 − 28,300 = 840).
The conversion from logarithmic units to original
units for the confidence interval represents an
approximation that may not always be appropriate
[see STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF,
article on TRANSFORMATIONS OF DATA]. (Further
material on the jackknife can be found in Mosteller
and Tukey 1966, sec. E.)


Table 7 — Details of jackknifing the ratio estimate

                              all      i=1      i=2      i=3      i=4      i=5      i=6      i=7
x_(i) (1920 sample)         5,054    4,303    4,077    4,089    4,669    4,358    4,224    4,604
log x_(i)                 3.70364  3.63377  3.61034  3.61162  3.66922  3.63929  3.62572  3.66314
y_(i) (1930 sample)         6,262    5,347    5,140    5,019    5,709    5,381    5,325    5,651
log y_(i)                 3.79671  3.72811  3.71096  3.70062  3.75656  3.73086  3.72632  3.75213
z_(i) = log[y_(i)/x_(i)]   .09307   .09434   .10062   .08900   .08734   .09157   .10060   .08899
z_*i = 7z_all − 6z_(i)               .08545   .04777   .11749   .12745   .10207   .04789   .11755
rounded z_*i                           .085     .048     .117     .127     .102     .048     .118

Sum = .645; .645/7 = .092 = mean = z_*
Sum of squares = .065979; .065979 − (.645)²/7 = .006547 = sum sq. deviations
.006547/(6 × 7) = .00015588 = s_*²
√.00015588 = .0125 = s_*
|t_6|.05 = 2.447; (.0125)(2.447) = .0306 = allowance


Table 8 — Final computations for the ratio estimate*

                                            95 PER CENT
                                            CONFIDENCE
             VALUE OF ESTIMATE              INTERVALS
log ratio    z_* = 0.092                    0.062 to 0.123
log total    4.360 + z_* = 4.452            4.422 to 4.483
total        antilog 4.452 = 28,300         26,000 to 30,400

* Base data: 1920 total = 22,919, log (1920 total) = 4.360;
  log total = log (1920 total) + log ratio.
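The computations in Tables 6 to 8 can be retraced with a short script. The sketch below uses the Table 6 subtotals, the 1920 population total from the Table 8 footnote, and the critical point 2.447 quoted in the text; the variable names are illustrative. Because it carries full machine precision instead of five-place logarithms and three-decimal pseudo values, its results may differ from the tables in the last digit or so.

```python
import math
import statistics

# Table 6 subtotals (thousands) for the seven sets of seven cities.
x_1920 = [751, 977, 965, 385, 696, 830, 450]
y_1930 = [915, 1122, 1243, 553, 881, 937, 611]
total_1920 = 22919      # 1920 population total (Table 8 footnote)
t_6 = 2.447             # two-sided 5 per cent point, 6 degrees of freedom

k = len(x_1920)
z_all = math.log10(sum(y_1930) / sum(x_1920))

# Pseudo values: z_*i = k * z_all - (k - 1) * z_(i), leaving out set i.
pseudo = []
for i in range(k):
    x_rest = sum(x_1920) - x_1920[i]
    y_rest = sum(y_1930) - y_1930[i]
    pseudo.append(k * z_all - (k - 1) * math.log10(y_rest / x_rest))

z_star = statistics.mean(pseudo)
s_star = statistics.stdev(pseudo) / math.sqrt(k)
allowance = t_6 * s_star

estimate = total_1920 * 10 ** z_star
low = total_1920 * 10 ** (z_star - allowance)
high = total_1920 * 10 ** (z_star + allowance)

print(f"z_* = {z_star:.3f}, s_* = {s_star:.4f}, allowance = {allowance:.4f}")
print(f"estimated 1930 total: {estimate:,.0f} ({low:,.0f} to {high:,.0f})")
```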


Analytical difficulties. In analyzing data or
planning for its analysis, the choice of a base for
rates is not always obvious; comparing many
things leads to biases that need adjustment,
selection reduces correlation, and selection for
excellence leads to disappointments. This section treats
these matters.

Bases for rates. The investigator should think 
about more than one possible base for a percent- 
age or a rate and consider the value of reporting 
results using different bases. Examples from acci- 
dent statistics may suffice. Are young women safer 
drivers than young men? Yes: in the United States 
in 1966 insurance rates for young women were 
ordinarily lower because they caused less expensive 
damage. On the other hand, these rates were based 
on total disbursements in a fixed period of time. 
Young women may well drive much less than 
young men, and if so, their accident rate per mile 
may be the higher. 

Coppin, Ferdun, and Peck (1965) sent a
questionnaire on driving in 1963 to a sample of 10,250
California drivers who were aged 16 to 19½ at the
beginning of the period. Based on the information
from the 65 per cent of questionnaires returned,
where respondents estimated mileage driven per 
week, and on accidents reported in the respond- 
ents’ Motor Vehicle Department files, the accident 
rates per 100,000 miles shown in Table 9 were 
found. On accidents, nonrespondents were very 
similar to respondents, but nonrespondents had 
considerably more violations. Since the mileage is 
estimated, the evidence is weak; but it seems to be 
the best available. Boys had more accidents per 
mile at 16, girls at 17, and after that their rates 
were nearly equal. 

Table 9 — Accidents and violations per 100,000 miles

           ACCIDENTS             VIOLATIONS
Age     Males    Females      Males    Females
16       2.9       2.1         7.1       4.9
17       1.9       2.3         6.1       4.2
18       1.4       1.4         5.6       3.7
19       1.4       1.5         5.4       3.5

Source: Coppin et al. 1965, pp. 27-28.


How should airplane safety (or danger) be
assessed? Deaths per million passenger miles, deaths
per trip, and casualties per hour flown suggest
themselves, and each can be supported.

In general, different answers may be appropri- 
ate for different questions, as was the case in the 
insurance companies’ view versus the accident-per- 
mile view of the safety of young drivers given 
above. Ease and economy may recommend giving 
several answers as well as the investigator's judg- 
ment about their merits. In some problems, no 
resolution may be possible, and then the investi- 
gator would do well to admit it. 

Problems of multiplicity. When methods of ap- 
praisal designed for single comparisons are used 
to compare many things, the multiplicity may mis- 
lead. When means of two samples drawn from the 
same normal population are compared, they differ 
by more than twice the standard deviation of their 
difference in less than 5 per cent of the sample 
pairs. Among ten sample means from the same 
population, some pair is more likely than not to 
differ this much (Table 11). Although statistics has 
come a long way in providing honest methods of 
making comparisons when there are many to be 
made, it has largely done this in the framework of 
a closed system, where the particular items to be 
compared have already been specified. For ex- 
ample, many workers have offered suitable ways 
to measure the significance not only of all pos- 
sible differences but also of all possible linear con- 
trasts (weighted sums, the weights adding to zero) 
on the same data. [See LINEAR HYPOTHESES,
articles on MULTIPLE COMPARISONS.]
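The contrast between one comparison and many can be illustrated by simulation. The sketch below draws sample means with unit standard deviation from a single normal population and asks how often the largest and smallest differ by more than twice the standard deviation of a difference, that is, by more than 2√2. The seed, the number of trials, and the function name are arbitrary choices made for illustration.

```python
import random

def exceeds(n_means, rng):
    """True if the extreme means among n_means differ by more than 2*sqrt(2)."""
    means = [rng.gauss(0.0, 1.0) for _ in range(n_means)]
    return max(means) - min(means) > 2 * 2 ** 0.5

rng = random.Random(1966)      # arbitrary seed, fixed for repeatability
trials = 20000

share_pair = sum(exceeds(2, rng) for _ in range(trials)) / trials
share_ten = sum(exceeds(10, rng) for _ in range(trials)) / trials

print(f"two means: {share_pair:.3f} of trials exceed the limit")
print(f"ten means: {share_ten:.3f} of trials exceed the limit")
```

For a single pair the share should fall a little under 5 per cent; for ten means it should be above one half.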

Statistics has not yet provided a way to test the 
significance of results obtained by peeking at large 
bodies of data and developing hypotheses as one 
goes along. The facility of the human brain for 
rationalizing almost any observed fact immediately 
after its realization is something that cannot yet be 
allowed for. This means that it is rarely possible 
to validate a hypothesis on the same body of data 
that suggested it and usually new studies are neces- 
sary to test hypotheses developed on completely 
different data (Mosteller & Tukey 1966, sec. B6). 

Selection effects. Users of tests for purposes
of selection (admission to college, personnel
selection) often complain that the scores used to make
the selection do not correlate well with the in- 
service performance of the individuals after selec- 
tion. Possibly the chosen test does not give scores 
that correlate well with the performance being 
measured, but one must remember that when a 
population is truncated on one of its variables, the 
correlation of that dimension with the others is 
likely to be reduced toward zero. To illustrate,
suppose that freshman calculus grades Y and
precourse examination grades X are bivariately
normally distributed with correlation coefficient $\rho$.
Suppose that only individuals whose pretest scores
exceed a certain value X = x are admitted to the
calculus course. This means that selection is based
on the variable X with the criterion x. The new
correlation $\rho'$ between the grades of those taking
the course and their pretest scores would be given
(see Cochran 1951, p. 453) as

$$\rho' = \rho \sqrt{\frac{1 - \frac{z}{p}\left(\frac{z}{p} - t\right)}{1 - \rho^2\,\frac{z}{p}\left(\frac{z}{p} - t\right)}},$$

where

p = proportion that the selected group is
    of the whole population,

t = standard normal deviate having
    proportion p to the right,

z = height of the standard univariate
    normal at the position t.


If the proportion selected p = 0.05, then t =
1.645, z = 0.1031. Values of $\rho'$ for selected values
of $\rho$ and p are shown in Table 10. To return to the
example, if pretest scores and calculus grades had
originally been correlated $\rho$ = 0.8, in the 5 per cent
selected the correlation would drop to 0.44.
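The worked figure can be reproduced with a short sketch. It assumes the standard result for a bivariate normal population truncated on X: selection shrinks the variance of standardized X from 1 to 1 − (z/p)(z/p − t), and the new correlation is ρ times the square root of [1 − (z/p)(z/p − t)] / [1 − ρ²(z/p)(z/p − t)]. The function name is illustrative.

```python
from statistics import NormalDist

def truncated_correlation(rho, p):
    """Correlation after keeping the top proportion p on X (bivariate normal)."""
    std_normal = NormalDist()
    t = std_normal.inv_cdf(1.0 - p)     # deviate with proportion p to the right
    z = std_normal.pdf(t)               # height of the normal curve at t
    a = (z / p) * (z / p - t)           # shrinkage in the variance of X
    return rho * ((1.0 - a) / (1.0 - rho ** 2 * a)) ** 0.5

print(round(truncated_correlation(0.8, 0.05), 2))    # the 5 per cent example
print(round(truncated_correlation(0.5, 0.50), 3))    # half the population kept
```

Looping the function over a grid of ρ and p gives a table of the same form as Table 10.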

Table 10 shows that as the percentage truncated 
increases the correlation in the remaining popula- 
tion slowly decreases from its initial value. For 
initial correlations between .1 and .8 the reduction 
is between a half and a third of the original cor- 
relation when 75 per cent of the population has 
been removed. A very rough approximation for
$\rho'$ is $(.7p + .3)\rho$ for $p > .25$ and $0 < \rho < .7$. The
new correlation decreases sharply for the higher 
initial correlation coefficients when more than 90 
per cent of the population is deleted. Unfortunately, 
these results may be rather sensitive to the de- 
tailed shape of the bivariate population studied and 
so this bivariate normal example can only
illustrate the possibilities.


Table 10 — Values of ρ′ for various (100(1 − p), ρ) pairs

PER CENT                        VALUES OF ρ                        PER CENT
TRUNCATED    .100   .300   .500   .700   .800   .900   .950        SELECTED
    5        .090   .272   .461   .661   .768   .881   .939           95
   10        .085   .256   .438   .637   .747   .867   .932           90
   25        .073   .224   .389   .583   .698   .834   .912           75
   50        .060   .186   .329   .509   .626   .780   .878           50
   75        .049   .153   .273   .434   .548   .712   .831           25
   90        .041   .128   .231   .374   .481   .647   .781           10
   95        .037   .116   .210   .342   .446   .609   .749            5
   99        .031   .098   .178   .293   .385   .542   .689            1

Regression effect. Suppose that a fallible meas- 
ure selects from many individuals a few that appear 
to be best. On a reassessment based on fresh per- 
formance data, the selected ones will ordinarily 
not do as well as they originally appeared to do on 
the selection test. The reason is that performance 
varies and on the occasion of the test some indi- 
viduals accidentally perform much better than 
their average and are selected. Happily, individuals 
selected to be worst do not do as badly on reassess- 
ment. This phenomenon is known as regression 
toward the mean; instances are sometimes called 
regression effects or shrinkage effects. To illus- 
trate, Mosteller and Wallace (1964, p. 209) se- 
lected words and obtained weights for their rates 
of use with intent to discriminate between the 
writings of Alexander Hamilton and James Madi- 
son. Writers differ in their rate of use of such 
words as of, and, the, to, and upon. On the basis of 
the writings used for the selection and weighting 
of the word counts, the two statesmen’s writings 
were separated by 6.9 standard deviations. When 
the same words and weights were applied to fresh 
writings not used in selecting or weighting, the new 
writings were separated by 4.5 standard devia- 
tions—still good discrimination, but a loss of 2.4 
standard deviations is substantial and illustrates 
well the effect. Losses are usually greatest among 
the poorer discriminants. Usages of the word upon 
originally separated the writings by 3.3 standard 
deviations, and did even better, 3.8, in the fresh 
validating materials; but a less effective set of 
words giving originally a separation of 1.3 stand- 
ard deviations dropped to 0.3 on retesting. The 
lesson is that optimization methods (such as least 
squares and maximum likelihood) do especially 
well on just the data used to optimize. Plan for
validation, and, where hopes are high for much
gain from many small effects, prepare for
disappointment.
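Regression toward the mean is easy to see in a small simulation. In the sketch below, which is an illustration and not a reanalysis of any data mentioned above, each of 1,000 hypothetical individuals has a fixed ability and is measured twice with independent error of the same size. The seed and sample sizes are arbitrary.

```python
import random
import statistics

rng = random.Random(1964)                  # arbitrary seed
n_people, n_selected = 1000, 20

ability = [rng.gauss(0.0, 1.0) for _ in range(n_people)]
test_1 = [a + rng.gauss(0.0, 1.0) for a in ability]   # fallible selection test
test_2 = [a + rng.gauss(0.0, 1.0) for a in ability]   # fresh performance data

# Rank everyone on the first test and pick the extremes.
ranked = sorted(range(n_people), key=lambda i: test_1[i])
best, worst = ranked[-n_selected:], ranked[:n_selected]

best_then = statistics.mean(test_1[i] for i in best)
best_later = statistics.mean(test_2[i] for i in best)
worst_then = statistics.mean(test_1[i] for i in worst)
worst_later = statistics.mean(test_2[i] for i in worst)

print(f"best 20:  selection test {best_then:.2f}, reassessment {best_later:.2f}")
print(f"worst 20: selection test {worst_then:.2f}, reassessment {worst_later:.2f}")
```

Both selected groups move back toward the overall average of zero on reassessment.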

Weights. If individuals are sampled to find out 
about their families, as in investigations carried
out in schools, unless some account is taken of
weights, a peculiar distribution may arise. For ex- 
ample, if a sample of girls is asked to report the 
numbers of sons and of daughters (including 
themselves) in their families, it turns out that the 
average number of daughters observed in the 
sample is approximately one more than the aver- 
age number of sons. (More precisely, mathematics 
not given here shows the difference to be: [variance 
of number of daughters minus covariance of num- 
ber of sons and daughters] divided by [average 
number of daughters]. When the distribution of 
the number of daughters is approximately Poisson 
and the numbers of sons and of daughters are 
independent, the ratio is approximately unity.) Es- 
sentially, families of three girls report three times 
as often as families with one girl, and families with 
no girls do not report at all. If account is taken of 
the dependence of frequency of reporting upon the 
number of daughters, this matter can be adjusted, 
provided information about families with no daugh- 
ters is available or is not needed. 
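The weighting effect can be worked exactly for a small made-up population. The sketch below assumes two-child families in which each child is independently a girl with probability one half, so that the average numbers of sons and daughters per family are equal, as the stated formula requires. Families are reported once for each daughter, so the sample weights each family by its number of daughters.

```python
from fractions import Fraction

# Hypothetical population: (daughters, sons) -> share of all families.
families = {(0, 2): Fraction(1, 4), (1, 1): Fraction(1, 2), (2, 0): Fraction(1, 4)}

def expect(f, weights):
    """Weighted average of f(daughters, sons)."""
    total = sum(weights.values())
    return sum(w * f(d, s) for (d, s), w in weights.items()) / total

# Sampling girls weights each family by its number of daughters.
reported = {fam: share * fam[0] for fam, share in families.items()}

gap_observed = (expect(lambda d, s: d, reported)
                - expect(lambda d, s: s, reported))

# The formula in the text: (variance of daughters - covariance) / mean daughters.
mean_d = expect(lambda d, s: d, families)
mean_s = expect(lambda d, s: s, families)
var_d = expect(lambda d, s: d * d, families) - mean_d ** 2
cov_ds = expect(lambda d, s: d * s, families) - mean_d * mean_s
gap_formula = (var_d - cov_ds) / mean_d

print(gap_observed, gap_formula)
```

In this population the girls report, on average, one more daughter than son, although the population averages are equal.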

Similarly, in studying the composition of special 
groups, unless the analysis is done separately for 
each family size, one needs to remember that more 
children are first-born than second, and so on. 

Errors in calculation. A well-planned format
for laying out calculations and careful checking
aid in getting correct answers. To give a base line,
a sample survey by the Internal Revenue Service 
(Farioletti 1952, pp. 65-78) found arithmetical 
errors in only 6 per cent of 160,000 personal in- 
come tax returns. Considering that the task is 
sometimes troublesome and often resented, this 
record appears good. 

In scientific work, misreadings of numbers, mis- 
placed decimals, errors in the application of for- 
mulas all take their toll. As a first step in the con- 
trol of error, regard any unchecked calculation as 
probably wrong. 

Overmechanization. Overmechanization of
computing puts great pressure on the analyst to make
one enormous run of the data and thereby
economically get all the analyses he wished. Alas, one
great sweep is never the way of good data analysis.
Instead, we learn a little from each analysis and 
return again and again. To illustrate, in deciding 
whether to transform the data to square roots, 
logarithms, inverse sines, or reciprocals before 
launching on the major analysis, tests may be run 
for each function separately, leading to the choice 
of one or two transformations for use in the next 
stage. Otherwise the whole large calculation must 
be run too many times because there are many 
branch points in a large calculation with several 
choices available at each. Furthermore, data anal- 
ysis requires extensive printout, little of which will 
be looked at; therefore the data analyst must resist 
the notion that the good computer user makes the 
machine do all the work internally and obtains 
very little printout. He must also resist the idea of 
having ever speedier programs at the cost of more 
and more time for programming and less and less 
for analysis. Fine programs are needed, but the 
cost of additional machine time from slow pro- 
grams may be less than the cost of improvements 
in programming and of the waiting time before 
analysis can begin. 

Possibly with the increase of time-sharing in 
high-speed computation and the handy packaging 
of general purpose programs for the analysis of 
data, the opportunities for making studied choices 
at each point in the analysis will become easier 
and less time-consuming. 

Preserving data from erasure. After process- 
ing, data should usually be preserved in some form 
other than a single magnetic tape. Contrary to 
theory and rumor, magnetic tapes containing basic 
data are occasionally erased or made unusable in 
the high-speed computing process, and all the ex- 
planations in the world about how this could or 
could not have happened cannot restore a bit of 
information. One remedy is to have a spare tape 
with your data or program copied upon it. When 
disaster strikes, remember that few things seem 
more likely to recur than a rare event that has just 
happened, and so copy your spare tape before you 
submit it to the destroyer. 

Hand copying. Since human copying is a major 
source of error, keep hand copying to a minimum 
and take advantage where possible of the high- 
speed computer's ability to produce tables in im- 
mediately publishable form and of mechanical re- 
production processes. Editing can be done by 
cutting, pasting, and painting out. When copying
is necessary, checks of both column totals and row
totals are believed superior to direct visual
comparisons of individual entries with the manuscript.

Checks. Checking the programming and cal- 
culations of a high-speed computer presents a 
major unsolved problem. One might suppose that 
once a machine began producing correct answers, 
it always would thereafter. Not at all. It may re- 
spond to stimuli not dreamed of by the uninitiated. 
To find, for example, that it throws a small error 
into the fifth entry in the fourth column of every 
panel is disconcerting and scary, partly because 
small errors are hard to find and partly because 
one wonders whether undetected errors may still 
be present. Thorough and systematic checking is 
advised. Some ways are through sample problems; 
through fuller printout of the details of a problem 
already worked by hand; by comparing correspond- 
ing parts of several problems, including special 
cases whose answers are known by outside means; 
and by solving the problem in more than one way. 

In addition to the checks on the final calcula- 
tions, check the input data. For input punched on 
cards, for example, some process of verifying the 
punching is required. Methods of checking will 
vary with the problem. Partly redundant checking 
may not be wasteful. Look for impossible codes in 
columns, look for interchanges of columns. Try to 
set up checks for inconsistency in cards. (In West- 
ern cultures, nursery school children are not mar- 
ried, wives don’t have wives, and families with 42 
children need verification.) Consider ways to 
handle blanks based on internal consistency. 
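Checks of this kind are simple to mechanize. The sketch below is one hypothetical layout: the field names, the codes, and the thresholds (age under 6, more than 20 children) are assumptions made for illustration, and a real study would choose its own.

```python
def check_record(rec):
    """Return a list of complaints about one hypothetical survey record."""
    problems = []
    if rec.get("sex") not in {"M", "F"}:
        problems.append("impossible sex code")
    if rec.get("age") is None:
        problems.append("blank age")
    elif rec["age"] < 6 and rec.get("marital_status") == "married":
        problems.append("married child of nursery school age")
    if rec.get("children", 0) > 20:
        problems.append("family size needs verification")
    return problems

records = [
    {"id": 1, "sex": "F", "age": 34, "marital_status": "married", "children": 2},
    {"id": 2, "sex": "M", "age": 4, "marital_status": "married", "children": 0},
    {"id": 3, "sex": "X", "age": 51, "marital_status": "single", "children": 42},
]

# Keep only the records that draw at least one complaint.
flagged = {r["id"]: check_record(r) for r in records if check_record(r)}
for record_id, problems in flagged.items():
    print(record_id, problems)
```

A flagged record is a candidate for verification against the source document, not proof of an error.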

In working with computers, be wary of the way 
symbols translate from keyboard to card or tape— 
dashes and minus signs or zeros, letter O’s, and 
blanks are a few sources of confusion. In dealing 
with numbers using, say, a two-digit field, a number 
such as 6, unless written 06, may wind up as 60 or
as a meaningless character. The possibilities here
are endless, but in a given problem it is usually 
worth organizing systematic procedures to combat 
these difficulties. 

Order-of-magnitude checking. When calcula- 
tions are complete, order-of-magnitude checks are 
always valuable. Are there more people in the state 
of New York than in the United States? Does 
leisure plus work plus sleep take much more than 
24 hours per day? Exercises in calculations of 
comparative orders of magnitude can be reward- 
ing in themselves because new connections are 
sometimes made between the research and the rest 
of the subject matter. 

Significant figures. Both hand and high-speed 
calculations require numbers to be carried to more
places than seem meaningful and to more places
than simple rules learned in childhood would
suggest. These rules seem dedicated to rounding early
so as not to exaggerate the accuracy of one’s result.
But they may erase the signal with the noise. About 
the only reassuring rule for complex calculations 
is that if the important digits are the same when 
the calculation is carried to twice as many places, 
enough accuracy has likely been carried. 
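The doubling rule can be tried on a calculation that is known to lose digits. The sketch below uses Python's `decimal` module to carry a chosen number of significant digits through the shortcut formula for a sum of squared deviations; the data are made up, and the true answer is 2.

```python
from decimal import Decimal, localcontext

def sum_sq_deviations(values, places):
    """Shortcut formula sum(x^2) - (sum x)^2 / n, carried to `places` digits."""
    with localcontext() as ctx:
        ctx.prec = places                  # significant digits kept at each step
        xs = [Decimal(v) for v in values]
        total = sum(xs)
        total_sq = sum(x * x for x in xs)
        return total_sq - total * total / len(xs)

data = [1000001, 1000002, 1000003]         # made-up measurements
results = {places: sum_sq_deviations(data, places) for places in (8, 16, 32)}

for places, value in results.items():
    print(f"{places:2d} digits carried: {value}")
```

Eight digits erase the signal entirely. Sixteen and thirty-two digits agree, which by the rule above suggests that sixteen were enough.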

The old rules for handling significant figures 
come from a simplified idea that a number can re- 
port both its value and its accuracy at the same 
time. Under such rules the numbers 3.26 and 
0.0326 were thought of as correct to within half a 
unit in the last place. Sometimes in mathematical 
tables this approach is satisfactory. For data-based 
numbers, the uncertainty in a number has to be 
reported separately. 

One-of-a-kind calculations. One-of-a-kind cal- 
culations, frequent in scientific reports, are espe- 
cially error prone, both because the investigator 
may not set up a standard method of calculation, 
complete with checks, and because he does not 
have the aid of comparisons with other members 
of a long sequence. For example, some pollsters 
believed that their wrong forecast about a vote 
would have been close had proper weighting for 
household size been applied, a claim worth check- 
ing. Their ultimate error was in thinking that this 
claim was right. How did they make it? Pages of 
weightings carefully checked down to, but not
including, the final estimate showed no error in
their reanalysis. But their one-of-a-kind calculation
leading to the final estimate was a ratio composed
of an inappropriate numerator and an
inappropriate denominator grabbed from the many column
totals. By accident this meaningless quotient gave
a number nearly identical with that produced by
the voters. And who checks further an answer
believed to be correct? Actually, the weighting for
household size scarcely changed their original
forecast. The moral is that the one-of-a-kind
calculation offers grave danger.

Consequently, each new calculation can well be 
preceded by a few applications of the method to 
simple made-up examples until the user gets the 
feel of the calculation, of the magnitudes to be 
expected, and of a convenient way to lay out the 
procedure. Having someone else check the
calculation independently requires that the investigator
not teach the verifier the original mistakes. Yates
has suggested that, in a large hand calculation,
independence could nearly be preserved when
different individuals calculate on separate machines
in parallel in two different numerical units; for
example, one computes in dollars, the other in
pounds. At the end, the final answers are converted
for comparison.

Gross errors in standard deviations. Since the
sample range w (largest measurement minus
smallest measurement) is easy to compute, it is often used
to check the more complicated calculation of a
sample standard deviation s. In the same sample,
the ratio w/s must lie between the lower and upper
bounds given in Table 11 or else the range, sample
standard deviation, or quotient is in error. The
table shows the 2.5 per cent and 97.5 per cent
points of the distribution of w/s for a normal
distribution. When calculations lead to ratios falling
outside these limits but inside the bounds, they are
not necessarily wrong; but further examination
may pay.
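The bounds check is easy to automate. The sketch below assumes closed forms for the bounds that are not given in the text: an upper bound of √(2(n − 1)), and a lower bound of 2√((n − 1)/n) for even n and 2√(n/(n + 1)) for odd n, with s computed using the divisor n − 1. These expressions reproduce the bound columns of Table 11 for the sample sizes tried. The function names and the data are illustrative.

```python
import math
import statistics

def ratio_bounds(n):
    """Assumed closed-form bounds on range / sample standard deviation."""
    upper = math.sqrt(2 * (n - 1))
    if n % 2 == 0:
        lower = 2 * math.sqrt((n - 1) / n)
    else:
        lower = 2 * math.sqrt(n / (n + 1))
    return lower, upper

def range_check(values, reported_s, tol=1e-9):
    """Return (passes, ratio) for a reported standard deviation."""
    lower, upper = ratio_bounds(len(values))
    ratio = (max(values) - min(values)) / reported_s
    return lower - tol <= ratio <= upper + tol, ratio

data = [3, 5, 8, 9, 12]                              # made-up sample
print(range_check(data, statistics.stdev(data)))     # correct s passes
print(range_check(data, 0.3507))                     # misplaced decimal fails
```

A ratio inside the bounds does not prove the standard deviation correct; it only rules out gross errors.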

Table 11 also shows the median of the
distribution of the range of a sample of size n drawn from
a standard normal distribution. It gives one an
idea of the spread measured in standard deviations
to be expected of the sample means of n
equal-sized groups whose population means are identical.
Note that through n = 20 a rough rule is that the
median distance between the largest and smallest
sample mean is $\sqrt{n}\,\sigma_{\bar{x}}$.


Table 11 — Bounds on the ratio: range/standard deviationᵃ

          Lower         w/s          Upper     Median
     n    bound     2.5%    97.5%    bound     of w/σ
     2    1.41      1.41    1.41      1.41     0.95ᵇ
     3    1.73      1.74    2.00      2.00     1.59ᵇ
     4    1.73      1.93    2.44      2.45     1.98
     5    1.83      2.09    2.78      2.83     2.26
     6    1.83      2.22    3.06      3.16     2.47
     7    1.87      2.33    3.28      3.46     2.65
     8    1.87      2.43    3.47      3.74     2.79
     9    1.90      2.51    3.63      4.00     2.92
    10    1.90      2.59    3.78      4.24     3.02
    15    1.94      2.88    4.29      5.29     3.42
    20    1.95      3.09    4.63      6.16     3.69
    30    1.97      3.37    5.06      7.62     4.04
    50    1.98      3.73    5.54      9.90     4.45
   100    1.99      4.21    6.11     14.07     4.97
   200    1.99      4.68    6.60     19.95     5.49ᶜ
   500    2.00      5.25    7.15     31.59     6.07ᶜ
 1,000    2.00      5.68    7.54     44.70     6.48ᶜ

a. Lower bound, 2.5% point, 97.5% point, and upper bound for the ratio:
   range/sample standard deviation (w/s); median of the distribution of
   the ratio: range/population standard deviation (w/σ); the sample size is
   n. The upper and lower bounds apply to any distribution and sampling
   method; the percentage points and the median are computed for random
   sampling from a normal distribution, but they should be useful for other
   distributions.

b. This is not an error. The median is expressed as the multiplier of the
   population standard deviation, whereas the bounds relate range and sample
   standard deviation.

c. The mean of the distribution is given as an approximation to the median
   because the latter is not available.

Sources: Pearson & Stephens 1964, p. 486, for lower and upper bounds and
for 2.5% and 97.5% points; Harter 1963, pp. 162-164, for medians;
Pearson & Hartley 1954, p. 174, for means.

Reporting. When writing the final report, re- 
member that making clear the frame of reference 
of a study helps the reader understand the dis- 
cussion. 

Need for full reporting. In reporting on the in- 
vestigation, be sure to give detailed information 
about the populations studied, the operational defi- 
nitions used, and the exceptions to the general 
rules. Unless the details are carefully reported,
they are quickly forgotten and are soon replaced 
by cloudy fancies. Discussions of accuracy, checks, 
and controls are needed in the final report. 

Full and careful reporting can lead to ample 
prefaces, numerous appendixes, some jargon, and 
lengthy discussions. Shrink not from these para- 
phernalia, so amusing to the layman, for without 
them the study loses value; it is less interpretable, 
for it cannot be properly compared with other 
studies. Jargon may be the price of brevity. 

The reader may object that editors will not allow 
such full reporting. Certainly the amount of detail 
required does vary with the sort of report to be 
made. Many studies that are published in short re- 
ports turn out to present a long sequence of short 
articles, and these, in one place or another, can 
give the relevant details. 

Try to go beyond bare-bones reporting by giving 
readers your views of the sorts of populations, cir- 
cumstances, or processes to which the findings of 
the study might apply. Warn the reader about gen- 
eralizations that you are wary of but that he, on 
the basis of your findings, might reasonably expect 
to hold. While such discussions can be criticized as 
speculation, you owe it to the reader to do your 
best with them and to be as specific as you can be. 

Beyond all this, where appropriate, do write as 
nontechnical a summary as you can for the inter- 
ested public. 

Suppression of data. In pursuit of a thesis, 
even the most careful may find it easy to argue 
themselves into the position that the exceptions to 
the desired proposition are based upon poorer data, 
somehow do not apply, would be too few to be 
worth reporting if one took the trouble to look them 
up, would mislead the simpleminded if reported, 
and therefore had best be omitted. Whether or not 
these views are correct, and some of them may 
well be, it is preferable to present the whole picture 
and then to present one’s best appraisal of all the 
data. The more complete record puts readers in a
much better position to consider both the judg- 
ments and the proposition. 

FREDERICK MOSTELLER 
[Directly related are the entries EXPERIMENTAL DESIGN;
FALLACIES, STATISTICAL; SAMPLE SURVEYS.] 
II
EFFECTS OF ERRORS 
IN STATISTICAL ASSUMPTIONS 

All physical and social laws or models rest ulti- 
mately upon assumptions. These laws do not yield 
exact numerical statements. Even the much ad- 
mired exactness of the physicists’ laws means only 
very close approximation—how close depends upon 
the circumstances. So, too, techniques for sta- 
tistical analysis require assumptions about the 
data to justify the use of the techniques in par- 
ticular situations. When these assumptions are not 
correct for the data under study, the results of the 
statistical analysis may be very misleading. This 
article discusses the effects on statistical analysis 
of incorrect assumptions and considers some ways 
of mitigating the problem. The discussion is set in 
the frameworks of the matched-pairs design, a 
time-series design, the one-way analysis of vari- 
ance, and a repeated-measurements design. 


The matched-pairs design 


In the matched-pairs design two treatments or 
conditions are studied by assigning one treatment 
to each member of a pair of matched individuals.
For example, a department of Slavic languages is 
interested in finding out whether one of two dif- 
ferent teaching methods for a first-year language 
course is better than the other. A language aptitude 
and proficiency examination is given on the first 
day of class, and the scores on these tests are used 
to pair students who have approximately the same 
aptitude and proficiency. Then, for each pair of 
students, one student is randomly assigned to one 
teaching method and the other student is assigned 
to the other method. An examination is given at 
the end of the term to determine whether differ- 
ences exist between the teaching methods. 

Comparisons between the treatments are based 
on the difference between the responses to the 
treatments within each pair. Thus the data consist 
of the n differences $X_1, X_2, \cdots, X_n$, where $X_j$ denotes
the difference between the response scores in the
jth pair. It is assumed that $X_1, X_2, \cdots, X_n$ constitute
a simple random sample from some population;
possible further assumptions will be discussed
in the next section.
A social scientist may want to make inferences
about several features of the probability distribu- 
tion underlying his matched-pairs experiment. He 
may ask which estimators or formulas should be 
used to estimate the unknown mean or median $\mu$.
In addition, he may want to know which signifi- 
cance test to use to find out whether the treatments 
differ. The following sections indicate how answers 
to these questions may be obtained. 

Criteria for point estimation. An investigator 
is frequently faced with a dilemma in his choice 
of an estimator. Suppose in the matched-pairs design
he wants to estimate $\mu$, the mean difference
between treatments. If he is willing to assume
that the observations are randomly drawn from
a normal distribution, then the sample mean
$\bar{X} = (1/n)\sum X_j$ is the unique “best” unbiased
estimator for $\mu$.

Evidence may exist, however, that although the 
underlying distribution is symmetrical, there are 
too many extreme values for the normality assump- 
tion to be correct. For distributions of this long- 
tailed kind, the sample mean may have a very 
large variance. The sample median is generally a 
reasonable estimator to use for such distributions 
but has a higher variance than the sample mean 
for more nearly normal distributions. How can one 
achieve a reasonable compromise—an estimator 
that is good under reasonable assumptions? 

Here, consideration is restricted to long-tailed 
symmetrical distributions and, in particular, to 
compound normal distributions that arise in the 
following way: (1) An observation is randomly 
drawn from one of two normal populations. (2) 
With probability $1 - \tau$, this observation is randomly
drawn from a normal population that has mean $\mu$
and variance $\sigma^2$. (3) With probability $\tau$, this
observation is randomly drawn from another normal
population having mean $\mu$ and variance $K^2\sigma^2$. The
values of K and $\tau$ that are considered are $K \geq 2$ and
$0 \leq \tau \leq .10$. In short, the compound normal
distributions considered are mixtures of two normal
distributions with a common mean. [See DISTRIBUTIONS,
STATISTICAL, article on MIXTURES OF DISTRIBUTIONS.]

A useful way to compare any two unbiased esti- 
mators of the same parameter is to compute their 
efficiency. The efficiency of estimator 1 relative to 
estimator 2 is defined as 


$$e(1, 2) = \frac{\text{variance of estimator 2}}{\text{variance of estimator 1}} \times 100.$$


An estimator is chosen that compares favorably in 
terms of efficiency with its competitors over the 
range of plausible distributions. An estimator has 
robustness of efficiency relative to another if the
above ratio does not dip far below 100 per cent for 
plausible alternatives. 

Estimators of the mean (or median). One pos- 
sible compromise between the sample mean and 
the sample median is 


$$\frac{X_{(2)} + X_{(3)} + \cdots + X_{(n-1)}}{n - 2},$$


where $X_{(j)}$ is the jth smallest observation; that is,
the largest and smallest observations are discarded 
and the mean of the remaining observations com- 
puted. [See NONPARAMETRIC STATISTICS, article on 
ORDER STATISTICS. ] 

In general, an arbitrary percentage of the observations
may be discarded. Define the $\alpha$ per cent
trimmed mean, $\bar{X}_\alpha$, as the mean of the observations
remaining after the smallest $(\alpha/100)n$ observations
and the largest $(\alpha/100)n$ observations are
excluded. The 0 per cent and 50 per cent trimmed
means are the sample mean and the sample median,
respectively; thus, in particular, $\bar{X}_0 = \bar{X}$.

Efficiency comparisons of the 0 and 6 per cent 
trimmed means for the compound normal distribution
in large samples with K = 3 and $0 \leq \tau \leq .10$
are given by Tukey (1960). He shows, for example,
that the 0 per cent trimmed mean $\bar{X}$ is the best
possible estimator if $\tau = 0$ (that is, if all observations
are from one population), but that, even in
this extreme case, $\bar{X}_6$ has efficiency 97 per cent
relative to $\bar{X}$. On the other hand, if $\tau = .05$, $\bar{X}_6$ has
approximately 143 per cent efficiency with respect
to $\bar{X}$. These computations (and many more) indicate
that there is more to gain than to lose by discarding
some extreme observations. It is important
to add that in many problems the study of extreme 
observations may give important clues to the im- 
provement of the experimental or observational 
technique. [See STATISTICAL ANALYSIS, SPECIAL 
PROBLEMS OF, article on OUTLIERS.] 
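
The efficiency comparison above can be explored with a small simulation. The following is a minimal sketch, not Tukey's large-sample calculation: the function names, the sample size of 100, the number of replications, and the seed are all illustrative assumptions. It draws samples from the compound normal distribution, computes the sample mean and a trimmed mean for each sample, and estimates the efficiency of the trimmed mean relative to the sample mean as 100 times the ratio of the two simulated variances.

```python
import random
import statistics


def trimmed_mean(xs, alpha):
    """Alpha per cent trimmed mean (alpha < 50): drop the smallest and the
    largest (alpha/100)*n observations, rounded down, and average the rest."""
    n = len(xs)
    g = int(alpha * n / 100 + 1e-9)  # observations dropped from each end
    if 2 * g >= n:
        raise ValueError("trimming would discard every observation")
    kept = sorted(xs)[g:n - g]
    return sum(kept) / len(kept)


def compound_normal(rng, mu, sigma, tau, K):
    """One draw: N(mu, sigma^2) with probability 1 - tau,
    N(mu, K^2 sigma^2) with probability tau."""
    scale = K * sigma if rng.random() < tau else sigma
    return rng.gauss(mu, scale)


def relative_efficiency(alpha, tau, K=3.0, n=100, reps=2000, seed=1):
    """Monte Carlo estimate of 100 * var(sample mean) / var(trimmed mean),
    that is, the efficiency of the trimmed mean relative to the mean."""
    rng = random.Random(seed)
    means, trimmed = [], []
    for _ in range(reps):
        xs = [compound_normal(rng, 0.0, 1.0, tau, K) for _ in range(n)]
        means.append(sum(xs) / n)
        trimmed.append(trimmed_mean(xs, alpha))
    return 100 * statistics.pvariance(means) / statistics.pvariance(trimmed)


if __name__ == "__main__":
    for tau in (0.0, 0.05):
        print(tau, round(relative_efficiency(6, tau), 1))
```

With these settings the estimates should fall in the neighborhood of the large-sample figures quoted above (somewhat below 100 per cent when $\tau = 0$, well above 100 per cent when $\tau = .05$), subject to simulation noise of a few percentage points.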

Test and confidence interval criteria. A test is 
to be chosen to compare the null and alternative 
hypotheses 


$H_0$: $\mu = 0$
$H_1$: $\mu \neq 0$ (or $H_1'$: $\mu > 0$),


respectively. From the test, confidence intervals for
$\mu$ are to be obtained. The one-sided and two-sided
t tests are the “best” tests of $H_0$ against $H_1$ or $H_1'$
if the underlying distribution is normal, but the 
goal is to choose a good test under less stringent 
assumptions. 

Two requirements for a good test are that it must 
possess robustness of validity and robustness of 
Table 1 — Values of $\gamma_1$ and $\gamma_2$ for some familiar distributions


                  CHI-SQUARE DISTRIBUTION*

                 df = 1     df = 5     df = 10
$\gamma_1$        2.83       1.26       0.89
$\gamma_2$       12          2.40       1.20


* Degrees of freedom denoted by df.


efficiency over the range of plausible underlying 
distributions (Box & Tiao 1964; Tukey 1962). 

A statistical test or confidence interval has va- 
lidity if the basic probability statements asserted 
for the procedure are correct or nearly so. Thus, from 
t tables, the one-sided t test with 9 degrees of free- 
dom has probability .05 of exceeding 1.833 under 
the null hypothesis $\mu = 0$. This statement is valid
if the normal assumption holds. But suppose the
normality assumption is in error. Can t tables still
be used to find the probability that t will be greater
than an arbitrary value $t_0$ for plausible underlying
distributions? If the answer to this question is “yes,” 
then the one-sided t test and associated confidence 
intervals are said to have robustness of validity. 
Robustness of validity has not been defined rigor- 
ously. A quantitatively precise definition of robust- 
ness of validity is difficult to give, since it must 
depend upon the interpretation of the outcome of 
a significance test in the given experiment. 

In addition to possessing robustness of validity, 
the test should be a good discriminator between the 
hypotheses. The discriminating ability of a test is
measured by its power, which is the probability of
rejecting $H_0$ given that $H_1$ (or $H_1'$) is true [see
HYPOTHESIS TESTING]. Both the one-sided and 
two-sided t tests have the strong property that their 
power is higher than the power of any other reason- 
able test if the normality assumption obtains. This 
is no longer true for nonnormal distributions, how- 
ever, and competitors must be sought. 

Thus a way to compare two tests is necessary. 
It is natural to make such comparisons by defining 
a concept of relative efficiency for two tests, 1 and 
2, representable by a numerical index e(1, 2). 
Efficiency of tests and efficiency of estimators are
related concepts [see NONPARAMETRIC STATISTICS for a
discussion of efficiency]. If e(1, 2) is greater than one,
test 1 is more powerful than test 2. A test is said to 
have robustness of efficiency if its efficiency relative 
to its competitors is not appreciably below one for 
credible alternative distributions. 

Tests and confidence intervals. The informa- 
tion available on the validity and efficiency proper- 
ties of the t test, the Wilcoxon signed-rank test, 


                COMPOUND NORMAL DISTRIBUTION

                 $\tau = 0$    $\tau = .05$    $\tau = .10$
                 K = 3      K = 3        K = 3
$\gamma_1$        0.00       0.00         0.00
$\gamma_2$        0.00       4.65         5.33


and the sign test under nonnormality is now sum- 
marized [see NONPARAMETRIC STATISTICS for these 
tests]. The trimmed t promises to be a strong com- 
petitor to the preceding tests (Tukey & McLaughlin 
1963). 

The one-sided and two-sided t tests are valid in 
large samples, although this validity does not
extend to very high significance levels, such as .001
and .0001 (see Hotelling 1961).

The validity of the t test will be considered for 
two different nonnormal distributions. First, as- 
sume that the nonnormal distribution has the com- 
pound normal form considered above. Second, the 
nonnormal distribution is assumed to be an Edgeworth
distribution with skewness parameter $\gamma_1$ and
kurtosis (or peakedness) parameter $\gamma_2$.

Langley and Elashoff (1966) have conducted a 
Monte Carlo investigation of the performance of 
the one-sided t test. One thousand samples of n 
(n = 6, 9, 16) observations each were taken from 
compound normal distributions with K = 3 and
$\tau$ = 0.0, 0.20, 0.40. In all these situations, the
empirical probability of t being greater than the
normal theory .05 point was between .04 and .06.
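
A sampling experiment of this general kind is easy to sketch. The code below is an illustration under stated assumptions, not a reconstruction of the Langley and Elashoff study: the function names, the choice n = 9, the number of replications, and the seed are hypothetical, and 1.860 is the upper 5 per cent point of the t distribution with 8 degrees of freedom. It estimates how often the one-sided t statistic exceeds the normal-theory critical value when the null hypothesis $\mu = 0$ is true and the data come from a compound normal distribution.

```python
import math
import random


def t_statistic(xs):
    """Student's t statistic for the null hypothesis mu = 0."""
    n = len(xs)
    mean = sum(xs) / n
    s2 = sum((x - mean) ** 2 for x in xs) / (n - 1)
    return math.sqrt(n) * mean / math.sqrt(s2)


def rejection_rate(tau, K=3.0, n=9, crit=1.860, reps=4000, seed=7):
    """Fraction of simulated samples with t > crit when mu = 0 and each
    observation has standard deviation K with probability tau, else 1."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        xs = [rng.gauss(0.0, K if rng.random() < tau else 1.0)
              for _ in range(n)]
        if t_statistic(xs) > crit:
            hits += 1
    return hits / reps


if __name__ == "__main__":
    for tau in (0.0, 0.20, 0.40):
        print(tau, rejection_rate(tau))
```

A rejection rate close to .05 for each value of $\tau$ would be in line with the robustness of validity reported above; the simulated rates carry sampling error of roughly .003 to .004 with 4,000 replications.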

The effects on the one-sided t test when sampling
is from an Edgeworth population with parameters
$\gamma_1$, $\gamma_2$ will be studied next. For symmetrical
distributions $\gamma_1 = 0$, while $\gamma_1 > 0$ for distributions
with a long right tail; for normal distributions
$\gamma_2 = 0$, but $\gamma_2 > 0$ for bell-shaped symmetrical
distributions with long tails and $\gamma_2 < 0$ for similar
distributions with short tails (see Scheffé 1959, pp.
331-333). In order to provide some feel for the
descriptive meaning of the $\gamma_1$, $\gamma_2$ parameters, Table 1
gives values of $\gamma_1$ and $\gamma_2$ for several nonnormal
distributions. For normal distributions $\gamma_1 = \gamma_2 = 0$.
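
The entries of Table 1 can be checked from standard moment formulas. The sketch below assumes the usual definitions (skewness $\gamma_1$ as the standardized third moment, kurtosis $\gamma_2$ as the standardized fourth moment minus 3); the function names are illustrative. For a chi-square distribution with df degrees of freedom these give $\gamma_1 = \sqrt{8/\mathrm{df}}$ and $\gamma_2 = 12/\mathrm{df}$, and for the compound normal distribution the fourth moment follows from mixing the two normal components.

```python
import math


def chi_square_shape(df):
    """Skewness and kurtosis (excess) of a chi-square distribution."""
    return math.sqrt(8.0 / df), 12.0 / df


def compound_normal_shape(tau, K):
    """Skewness and kurtosis (excess) of the compound normal mixture:
    N(mu, sigma^2) w.p. 1 - tau and N(mu, K^2 sigma^2) w.p. tau."""
    m2 = (1 - tau) + tau * K ** 2        # variance in units of sigma^2
    m4 = 3 * ((1 - tau) + tau * K ** 4)  # fourth central moment / sigma^4
    return 0.0, m4 / m2 ** 2 - 3         # symmetric, so skewness is 0


if __name__ == "__main__":
    for df in (1, 5, 10):
        print("chi-square", df, chi_square_shape(df))
    for tau in (0.0, 0.05, 0.10):
        print("compound normal", tau, compound_normal_shape(tau, 3.0))
```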

Table 2 indicates the performance of the one- 
sided t test on a sample of 10. Each entry denotes 
the probability that t > 1.833 if the null hypothesis 
that $\mu = 0$ is true (under normal theory this probability
is .05). The performance of the one-tailed
t test as shown in Table 2 may be summarized as
follows: (1) The true significance level $\alpha$ is always
slightly less than .05 for long-tailed symmetrical 
distributions and slightly greater than .05 in short- 
Table 2 — Probability that t > 1.833


                              $\gamma_1$
$\gamma_2$        -0.6     -0.2      0       0.2      0.6
 -1.0            .072     .057     .051     .045     .036
  0.0            .071     .056     .050     .044     .035
  1.0            .070     .055     .049     .043     .034
  2.0            .069     .054     .048     .043     .033


Source: Srivastava 1958, p. 427.


tailed symmetrical distributions. (2) Skewness ($\gamma_1$)
is more important than kurtosis ($\gamma_2$). In fact, the
true significance level is almost constant in each 
column. A long right tail means that the true sig- 
nificance level is much less than .05; a long left 
tail leads to a true significance level much greater 
than .05. The skewness values covered in Table 2 
represent very moderate skewness. The two-sided 
t test is less affected by skewness and kurtosis (see 
Srivastava 1958). 

The Wilcoxon signed-rank test is the most fre- 
quently used competitor to the t test. It was designed 
to have perfect robustness of validity with respect 
to significance level for symmetrical distributions; 
its validity in asymmetrical distributions is un- 
known. The sign test has perfect robustness of 
validity with respect to significance level for both 
symmetrical and asymmetrical distributions. 

Some efficiency computations are reported here 
[they use the definition of large sample efficiency 
given in the article NONPARAMETRIC STATISTICS, 
which also discusses the properties of these tests]. 
Table 3 presents some efficiency computations of 
the Wilcoxon and sign tests relative to the t test 
when the underlying distribution is a compound 
normal type. If the underlying distribution is normal, 
the corresponding efficiencies are e(w, t) = .955 
and e(s, t) = .636. 

The preceding validity analysis and suggestive
efficiency study permit these recommendations:
(1) The Wilcoxon signed-rank test should be used 
for symmetrical distributions with moderately long 
tails. If confidence intervals are desired, the Walsh 


Table 3 — Efficiency of the Wilcoxon and sign tests relative
to the t test


                     K = 2            K = 3            K = 5
$\tau$             .05     .10      .05     .10      .05     .10
$e(w, t)^a$       1.23    1.53     1.45    2.07     2.20    3.52
$e(s, t)^b$       0.69    0.74     0.83    0.99     1.29    1.88


a. The efficiency of the Wilcoxon signed-rank test relative to the t test.
b. The efficiency of the sign test relative to the t test.
procedure should be employed. (2) If samples are
large and the underlying symmetrical distribution 
has long tails, the routine use of the Wilcoxon test 
and Walsh confidence procedure may require a 
large computing cost. In these instances, if the 
distribution has only moderately long tails, the t 
test and its confidence procedure should be a rea- 
sonable compromise. If the distribution has very 
long tails, the sign test and its associated con- 
fidence interval procedure provide a reasonable 
compromise. 


A simple time-series design 


Nonrandomness among observations can occur 
in two ways: (1) the observations may not be in- 
dependent or (2) the observations may not have a 
common distribution. Each type of nonrandomness 
will be considered below to show the important 
effects such nonrandomness may have on statistical 
methods based upon the assumption of random- 
ness. 

Dependence among the observations. A psychol- 
ogist observes an individual’s response at n points 
in time. Suppose that $X_t$, $t = 1, 2, \cdots, n$, denotes
the individual's response at time t. The psychologist
assumes that the response has a linear regression
over time; that is,


(1) $X_t = \alpha + \beta\left(\frac{t}{n} - \frac{n+1}{2n}\right) + e_t.$


(The use of $(t/n) - [(n+1)/(2n)]$ instead of
just t represents merely a convenient coding of the
t values. In particular, $(n+1)/(2n)$ is just the
average of the $(t/n)$’s: $(n+1)/(2n) = (1/n)\sum_{t=1}^{n}(t/n)$.)
The $e_t$ may represent errors of measurement
or errors in the assumption of linear
regression and are assumed to be random. The
psychologist suspects that the $e_t$, and hence the
$X_t$, may be correlated; that the $e_t$ all have the same
distribution is not questioned here. The goal of the
experiment is to estimate and test hypotheses about
the intercept, $\alpha$, and the slope, $\beta$.

In the estimation problem only two estimators 
for $\alpha$ and two estimators for $\beta$ will be studied here.
Furthermore, each estimator is a linear combination
of the X’s ($a_1X_1 + a_2X_2 + \cdots + a_nX_n$). The
reasons for this restriction are that such estimators 
have been studied most thoroughly and that under 
normality they have optimal properties. Johnston 
(1963) studies the estimation problem in some 
detail. 

Suppose, at first, the psychologist believes that 
there is no dependence among the observations. 
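Then reasonable estimators for $\alpha$ and $\beta$ are found
by the method of unweighted least squares, which
gives


$$\hat{\alpha} = \bar{X}, \qquad \hat{\beta} = \frac{\sum_{t=1}^{n} X_t\left(\frac{t}{n} - \frac{n+1}{2n}\right)}{\sum_{t=1}^{n}\left(\frac{t}{n} - \frac{n+1}{2n}\right)^2}.$$


If no dependence exists, then these estimators have
minimum variance among all linear estimators
that are unbiased. The standard errors of $\hat{\alpha}$ and $\hat{\beta}$ are


(2) $\text{s.e.}\ \hat{\alpha} = \sqrt{\sigma^2/n}, \qquad \text{s.e.}\ \hat{\beta} = \sqrt{\sigma^2 \Big/ \sum_{t=1}^{n}\left(\frac{t}{n} - \frac{n+1}{2n}\right)^2},$


where $\sigma^2$ is the variance of $X_t$.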
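
As a concrete illustration of these formulas, the following sketch computes the unweighted least squares estimates for model (1) with the coded time variable. The function names are hypothetical, and one step is an assumption made for the example: the unknown $\sigma^2$ in (2) is replaced by the residual mean square with n - 2 degrees of freedom.

```python
import math


def coded_times(n):
    """Coded time values t/n - (n + 1)/(2n) for t = 1, ..., n; they sum to 0."""
    return [t / n - (n + 1) / (2 * n) for t in range(1, n + 1)]


def least_squares(xs):
    """Return (alpha_hat, beta_hat, se_alpha, se_beta) for model (1).

    The standard errors follow (2), with sigma^2 estimated by the
    residual mean square (an assumption of this sketch)."""
    n = len(xs)
    c = coded_times(n)
    sxx = sum(ci * ci for ci in c)
    alpha_hat = sum(xs) / n
    beta_hat = sum(ci * x for ci, x in zip(c, xs)) / sxx
    resid = [x - alpha_hat - beta_hat * ci for ci, x in zip(c, xs)]
    s2 = sum(r * r for r in resid) / (n - 2)
    return alpha_hat, beta_hat, math.sqrt(s2 / n), math.sqrt(s2 / sxx)


if __name__ == "__main__":
    responses = [2.1, 2.9, 3.2, 4.1, 4.8, 5.2]  # made-up illustrative data
    print(least_squares(responses))
```

Because the coded times sum to zero, the intercept estimate is simply the sample mean, and the two estimators are uncorrelated when the observations are independent.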

Now, suppose that dependence exists among the 
observations and assume that the correlation between
$X_s$ and $X_t$, denoted by $\rho_{st}$, is given by the
relation


(3) $\rho_{st} = \rho^{|s-t|},$


where $|s - t|$ denotes the absolute value of the difference
$s - t$. What are the effects of nonzero
values of $\rho$ on the estimators $\hat{\alpha}$ and $\hat{\beta}$? First,
while these estimators are still unbiased, they are
in this case no longer the minimum variance
linear unbiased estimators (see Johnston 1963 for
the way to construct the latter estimators). Efficiency
computations comparing $\hat{\beta}$ and the minimum
variance linear unbiased estimator, $\beta^*$, are
given in Table 4, where cell entries are the ratio
of the variance of $\beta^*$ to the variance of $\hat{\beta}$ for a
sample of size 5.


Table 4 — Efficiency comparison of $\hat{\beta}$ and $\beta^*$


$\rho$                                    -.40     -.20      0      .20      .40
var $\beta^*$/var $\hat{\beta}$           .921     .982      1      .990     .972


The estimator $\beta^*$ used in Table 4 is computed
under the assumption that $\rho$ is known. When the
sample size is large, $\hat{\beta}$ has efficiency one compared
with $\beta^*$ for the particular pattern of correlation
considered.
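
The kind of computation behind Table 4 can be sketched directly. The code below is an illustration, not the authors' procedure: function names are hypothetical, variances are expressed in units of $\sigma^2$, and the minimum variance estimator is taken to be generalized least squares with $\rho$ known. It relies on the known tridiagonal form of the inverse of the correlation matrix with entries $\rho^{|s-t|}$.

```python
def corr_matrix(n, rho):
    """Correlation matrix with entries rho**|s - t|, as in relation (3)."""
    return [[rho ** abs(s - t) for t in range(n)] for s in range(n)]


def corr_inverse(n, rho):
    """Inverse of corr_matrix(n, rho) for n >= 2 and |rho| < 1 (tridiagonal)."""
    inv = [[0.0] * n for _ in range(n)]
    for i in range(n):
        inv[i][i] = 1.0 if i in (0, n - 1) else 1.0 + rho * rho
        if i + 1 < n:
            inv[i][i + 1] = -rho
            inv[i + 1][i] = -rho
    scale = 1.0 - rho * rho
    return [[v / scale for v in row] for row in inv]


def quad_form(u, m, v):
    """Compute u' m v for vectors u, v and a square matrix m."""
    n = len(u)
    return sum(u[i] * m[i][j] * v[j] for i in range(n) for j in range(n))


def slope_efficiency(n, rho):
    """Ratio var(beta*) / var(beta_hat) for the slope in model (1)."""
    c = [t / n - (n + 1) / (2 * n) for t in range(1, n + 1)]
    ones = [1.0] * n
    sxx = sum(ci * ci for ci in c)
    # Variance of the unweighted least squares slope under correlation.
    var_ols = quad_form(c, corr_matrix(n, rho), c) / sxx ** 2
    # Slope entry of the generalized least squares covariance matrix.
    inv = corr_inverse(n, rho)
    a = quad_form(ones, inv, ones)
    b = quad_form(ones, inv, c)
    d = quad_form(c, inv, c)
    var_blue = a / (a * d - b * b)
    return var_blue / var_ols


if __name__ == "__main__":
    for rho in (-0.40, -0.20, 0.0, 0.20, 0.40):
        print(rho, round(slope_efficiency(5, rho), 3))
```

For n = 5 the ratios computed this way agree with Table 4 to roughly two decimal places; small differences in the third decimal may reflect details of the original computation that are not given in the text.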

A second effect of nonzero $\rho$ is that the standard
errors of $\hat{\alpha}$ and $\hat{\beta}$ given in (2) are incorrect. The
correct standard errors in large samples for the 
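example are

(4) $\text{s.e.}\ \hat{\alpha} = \sqrt{\frac{\sigma^2(1+\rho)}{n(1-\rho)}}, \qquad \text{s.e.}\ \hat{\beta} = \sqrt{\frac{\sigma^2(1+\rho)}{(1-\rho)\sum_{t=1}^{n}\left(\frac{t}{n} - \frac{n+1}{2n}\right)^2}}.$
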
These standard errors may depart quite radically 
from (2) as Table 5 shows. The ratios obtained in 
Table 5 are identical to the corresponding ratios 
of s.e. $\hat{\alpha}$. Since the standard errors in (2) may
be in serious error, it is clear that the standard
error of prediction, that is, the standard error of
$\hat{\alpha} + \hat{\beta}[(t/n) - (n+1)/(2n)]$, may also be very
wrong. The third effect of ignoring correlation between
the observations concerns $s^2$, the conventional
estimator of $\sigma^2$, the underlying variance:


$$s^2 = \sum_{t=1}^{n}\left\{X_t - \hat{\alpha} - \hat{\beta}\left[\frac{t}{n} - \frac{n+1}{2n}\right]\right\}^2 \Big/ (n - 2).$$


If $\rho \neq 0$, then $s^2$ is a biased estimator of $\sigma^2$. Some
sampling experiments by Cochrane and Orcutt
(1949) suggest that the expected value of $s^2$ is less
than $\sigma^2$; when there are a large number of observations,
the bias is negligible.


Table 5 — Ratio of incorrect to correct s.e. $\hat{\beta}$


P —.50  —.20 =40; 7055.10 .20 50 
s.e. $\hat{\beta}$ from (2)


5 1.78" Vota? = NTO ERSO 82 -57 
s.e. $\hat{\beta}$ from (4)


In testing hypotheses about $\alpha$ and $\beta$ a primary
concern is with the behavior of the standard t test,


$$t_\beta = \hat{\beta}\sqrt{\sum_{t=1}^{n}\left(\frac{t}{n} - \frac{n+1}{2n}\right)^2} \Big/ \sqrt{s^2},$$


to examine $H_0$: $\beta = 0$ against the alternative
$H_1$: $\beta \neq 0$ in the presence of the correlation model
(3). Table 6 gives the probability that $|t_\beta| > 1.96$ if
$H_0$: $\beta = 0$ is true when the sample size is large (if
$\rho = 0$ this probability is .05). Table 6, and additional
tables when $H_1$ is true, vividly demonstrates
the sensitivity of the standard t test to nonzero correlation
between the observations when (1) and
(3) hold. The nonrobustness of t comes primarily
from the use of an incorrect standard error of $\hat{\beta}$ in
the denominator of t. The probability computations
in Table 6 also hold when $H_0'$: $\alpha = 0$ is tested
against $H_1'$: $\alpha \neq 0$ using the statistic $t_\alpha = \sqrt{n}\,\hat{\alpha}/\sqrt{s^2}$.
In many social science problems $\rho > 0$; as seen
from Table 5 the null hypothesis would be rejected
more often than the nominal 5 per cent level in
such situations, assuming that the null hypothesis
is true.
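
The size of this effect in large samples can be sketched numerically. The code below rests on an assumption that is not spelled out in the surviving text: under correlation pattern (3), the large-sample variance of the least squares estimators is taken to be inflated by the factor $(1+\rho)/(1-\rho)$ relative to (2), so that the t statistic behaves like a normal variable with that variance. The function names are illustrative.

```python
import math


def two_sided_normal_tail(z):
    """P(|Z| > z) for a standard normal variable Z and z >= 0."""
    return math.erfc(z / math.sqrt(2.0))


def true_level(rho, crit=1.96):
    """Large-sample probability that |t| > crit under the null hypothesis
    when the observations follow correlation pattern (3).

    Assumes the variance inflation factor (1 + rho)/(1 - rho)."""
    inflation = math.sqrt((1.0 + rho) / (1.0 - rho))
    return two_sided_normal_tail(crit / inflation)


if __name__ == "__main__":
    for k in range(-5, 6):
        rho = k / 10
        print(rho, round(true_level(rho), 4))
```

Under this assumption the computed probabilities agree closely with the entries of Table 6, rising from well under .001 at $\rho = -.50$ to roughly .26 at $\rho = .50$.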

Hoel (1964) has reported on a sampling experi- 
ment to assess the effects of correlation in small 
sample sizes on an F test for a polynomial trend. 
His results support the conclusions reached in the 
preceding paragraph. Readers interested in robust 
tests, assuming the correlational structure (3), 
should consult Hannan (1955). 
Table 6 — Large sample probability that $|t_\beta| > 1.96$ when correlation exists


$\rho$           -.50     -.40     -.30     -.20
Probability


It is important to remember that the magnitude 
of effects on a statistical technique from depend- 
ence among the observations is a function of the 
technique, the model of dependence, and the values 
of correlational parameters. 

Nonidentically distributed observations. Suppose
that the psychologist is principally interested
in testing the hypothesis $H_0$: $\alpha = 0$ against $H_1$: $\alpha \neq 0$.
Now, however, assume that strong evidence
exists that $\rho = 0$. The psychologist believes $\beta = 0$
in (1), but he is not certain about this belief. Thus
he asks the question, “What is the effect on the
t test of examining $H_0$: $\alpha = 0$ against $H_1$: $\alpha \neq 0$,
assuming $\beta = 0$, if in fact $\beta \neq 0$?” The t statistic
the psychologist wants to employ (assuming
$\beta = 0$) is $t = \sqrt{n}\,\bar{X}/\sqrt{s^2}$, where now $s^2$ is given by
$\sum (X_t - \bar{X})^2/(n - 1)$.

Table 7 gives the probability that $|t| > 1.96$ if
$H_0$: $\alpha = 0$ is correct for various values of $|\beta|$ and
$\sigma^2$ when n is large (the nominal significance level
is .05).


Table 7 — Probability that |t| > 1.96 when a slope,
assumed zero, is not zero


                           $|\beta|$
$\sigma^2$         0       .25       .50       1.0
  .25            .05     .0414     .0238     .0028
  .50            .05     .0477     .0414     .0238
 1.00            .05     .0488     .0477     .0414
 2.00            .05     .0500     .0488     .0477


Two important effects of incorrectly assuming that 
$\beta = 0$ are apparent from Table 7: (1) the behavior
of the t statistic depends upon the unknown $\sigma^2$ as
well as $|\beta|$, and (2) the stated significance level .05
is always at least as large as the true significance 
level. 

It must be remembered that the effects of non- 
identically distributed observations depend on the 
model underlying the observations and the statis- 
tical method being used. 


The one-way analysis of variance 

The one-way analysis of variance may arise when 
n individuals are randomly assigned to k treatments.
The data consist of the $n_i$ response scores
$X_{ij}$ ($i = 1, \cdots, k$; $j = 1, \cdots, n_i$) with $\sum_i n_i = n$.
The observations are assumed to be independent, 
and the probability distributions of the response 


$\rho$           -.50     -.40    -.30    -.20    -.10     0      .10     .20     .30     .40     .50
Probability    .00068   .0026   .0076   .0164   .0302   .0500   .0762   .1096   .1498   .1994   .2576


variable are assumed identical for individuals re- 
ceiving the same treatment. 

The one-way layout is frequently employed to 
estimate the means or medians, $\mu_i$, and the variances,
$\sigma_i^2$, of the treatments, to establish confidence
intervals on the differences, $\mu_i - \mu_j$, and to test
hypotheses about the $\mu_i$ and $\sigma_i^2$. The point estimation
problems present no essentially new questions.

Tests and confidence intervals for the median or 
mean. To discriminate between the null and alter- 
native hypotheses 


(5) $H_0$: $\mu_1 = \mu_2 = \cdots = \mu_k$
(6) $H_1$: $\mu_i \neq \mu_j$ for some i, j,


one ordinarily employs the F test, the Kruskal- 
Wallis H test, or the k sample median test [see 
NONPARAMETRIC STATISTICS, article on RANKING 
METHODS; LINEAR HYPOTHESES, article on ANALY- 
SIS OF VARIANCE]. The F test is usually a very good 
way to make this discrimination if (1) the observa- 
tions are drawn from a population in which each 
treatment has the same underlying normal dis- 
tribution except for possible differences among the 
$\mu_i$, and (2) the alternative hypothesis is not further
specified.

The following sections discuss the validity of 
these tests. Note that the discussion is germane to 
the validity of Scheffé’s method of multiple com- 
parisons [see Scheffé 1959, chapter 3; see also 
LINEAR HYPOTHESES, article on MULTIPLE COM- 
PARISONS], since that method is equivalent to the 
confidence set based on the F test. 

Validity assuming $\sigma_i^2 = \sigma^2$ for all i. The F test
has perfect validity for all sample sizes if the populations
are normal with equal variances and the
observations are independent; that is, if the k samples
are drawn from the same normal distribution.
In large samples the standard F statistic


$$F_{k-1,\,n-k} = \frac{\sum_{i} n_i(\bar{X}_i - \bar{X})^2/(k - 1)}{\sum_{i}\sum_{j}(X_{ij} - \bar{X}_i)^2/(n - k)}$$


provides a valid test for the hypothesis (5) except
at high significance levels. For small samples it
will be assumed that departures from normality
can be represented by an Edgeworth distribution
with skewness and kurtosis parameters $\gamma_{1i}$ and $\gamma_{2i}$
in each population. Table 8 gives the probability
that F > 2.87 if (5) is true (under normal theory
this probability is .05) for $\gamma_{1i} = \gamma_1$ and $\gamma_{2i} = \gamma_2$ for
all i, k = 5, all $n_i$ = 5.

Table 8 — Probability that F > 2.87 for k = 5 and all $n_i$ = 5


                              $\gamma_2$
$\gamma_1$        -1       -0.5       0        0.5       1
  0             0.053     0.051     0.050     0.049     0.048
  0.5           0.053     0.051     0.050     0.049     0.048
  1             0.053     0.052     0.050     0.050     0.049


Source: Box & Andersen 1955, p. 14.


Table 8 and other work indicate that the F test for 
(5) possesses robustness of validity relative to 
significance levels (type 1 error) and power when 
$\gamma_{1i} = \gamma_1$, all i (Gayen 1949; Pearson 1931). The
kurtosis parameter $\gamma_2$ has practically no effect on
the F test for (5). However, when $\gamma_{11} \neq \gamma_{12}$, it is
known that for k = 2 the one-sided t test does not 
have robustness of validity. 

The Kruskal-Wallis test and the median test are 
valid under the null hypothesis that the k samples 
come from the same population. When k = 2, the 
Kruskal-Wallis test is equivalent to the two-tailed 
Wilcoxon rank-sum test. In this case of k= 2, if 
$\gamma_{11} \neq \gamma_{12}$ and $\gamma_{21} \neq \gamma_{22}$, the one-sided Wilcoxon test
appears to be less robust than the one-sided t test
relative to significance levels when the null hypothesis
is (5) and $\mu$ is a median or mean (Wetherhill
1960). 

Inequality of variance. The validity of the pre- 
ceding tests and some further tests when the as- 
sumption of equal variances is dropped will be 
studied; the normality assumption is retained un- 
less otherwise indicated. 

It is necessary at this point to examine the 
rationale for carrying out a test. Suppose a random 
sample of n mental patients is drawn and $n_1$ are
assigned to treatment 1 and $n_2$ ($= n - n_1$) are assigned
to treatment 2. After a period of treatment,
each patient is tested and given a score that is
assumed to be normally distributed in each population.
The null hypothesis tested is that $\mu_1 = \mu_2$
($\mu_i$ is the mean for treatment i); suppose the conclusion
is that $\mu_1 < \mu_2$. If high scores are indicative
of improvement, a decision is made to use the sec- 
ond treatment. Why? 

When the variances are equal, the treatment 
with the higher mean is more likely to give rise to 
scores greater than or equal to any given score $w_0$.
Thus the significance test gives the psychiatrist
usable results, especially if a score at $w_0$ or above
means release from the psychiatric hospital. 

Suppose now that each treatment has a different 
variance and that $\mu_1 < \mu_2$. Then it is by no means
uniformly true that the treatment with the higher
mean is more likely to give rise to scores greater
than or equal to a release score of $w_0$. For example,
suppose that the scores for treatment 1
and treatment 2 follow normal distributions with
$\mu_1 = 0$, $\sigma_1^2 = 9$ and $\mu_2 = 1$, $\sigma_2^2 = 1$, respectively ($\sigma_i^2$ =
variance of treatment i scores). In this case Table 9
gives the probability that a randomly chosen score
on treatment i exceeds $w_0$, i = 1, 2.
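
The probabilities in this example are ordinary normal tail areas and can be computed directly. The sketch below uses the complementary error function from the Python standard library; the function name is illustrative.

```python
import math


def upper_tail(w, mean, variance):
    """P(score > w) for a normally distributed score."""
    z = (w - mean) / math.sqrt(variance)
    return 0.5 * math.erfc(z / math.sqrt(2.0))


if __name__ == "__main__":
    for w0 in (1.0, 1.5, 2.0, 2.5):
        p1 = upper_tail(w0, 0.0, 9.0)  # treatment 1: mean 0, variance 9
        p2 = upper_tail(w0, 1.0, 1.0)  # treatment 2: mean 1, variance 1
        print(w0, round(p1, 4), round(p2, 4))
```

The treatment with the lower mean has the larger chance of exceeding the release score once the score is above 1.5, because its variance is larger. Values computed this way can differ from Table 9 by about .001 for treatment 1, as would happen if the table was read from normal tables with the standardized score rounded to two decimals.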

Nonetheless, it is often appropriate to test 
equality of means even when the variances may 
be different, and the remainder of this section 
deals with that case. This problem is often called 
the Behrens—Fisher problem. Consider, first, the 
large sample validity of the two-tailed, two-sample 
t test (equivalent to F when k = 2) based on 


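$$t = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{s^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}},$$


where


$$s^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}, \qquad s_i^2 = \frac{\sum_{j}(X_{ij} - \bar{X}_i)^2}{n_i - 1}.$$


The denominator of t is not a consistent estimator
of the standard error of $\bar{X}_1 - \bar{X}_2$ unless either
$\sigma_1^2 = \sigma_2^2$ or $n_1 = n_2$. This fact partly explains the
nonrobustness of t clearly shown in Table 10.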

Table 10 gives the probability that |t| > 1.96 for 
different values of $\theta = \sigma_1^2/\sigma_2^2$ and $R = n_1/n_2$ when
the null hypothesis of equal means holds. Table
10 indicates the importance of equal sample
sizes in controlling the effects of unequal variances:
the significance level remains at .05 irrespective
of the value of $\theta$ if R = 1. Moreover, if
$\theta < 1$ and R > 1 so that the more variable population
has the smaller sample, the true significance
level is always larger than .05 and may be seriously
so. On the other hand, if $\theta > 1$ and R > 1, the
true significance level is always less than .05.
These results are essentially independent of $\gamma_1$ and
$\gamma_2$ because of the large sample sizes. This lack of
robustness of significance-level validity extends to 
power. The small sample validity of t follows along 
the lines of the large sample theory (see Scheffé 
1959, p. 340). 
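
The large-sample behavior described here can be sketched with a short calculation. The derivation is an assumption of this example rather than a quotation from the source: in large samples the pooled variance estimate converges to $(n_1\sigma_1^2 + n_2\sigma_2^2)/(n_1 + n_2)$, so the ratio of the true variance of the difference in means to the limit of its pooled estimate is $(\theta + R)/(R\theta + 1)$, with $\theta = \sigma_1^2/\sigma_2^2$ and $R = n_1/n_2$. The function name is illustrative.

```python
import math


def pooled_t_level(theta, R, crit=1.96):
    """Large-sample probability that |t| > crit under equal means, where
    theta = sigma_1^2 / sigma_2^2 and R = n_1 / n_2.

    Assumes t is approximately normal with variance
    (theta + R) / (R * theta + 1), as derived in the lead-in."""
    ratio = (theta + R) / (R * theta + 1.0)
    return math.erfc(crit / math.sqrt(2.0 * ratio))


if __name__ == "__main__":
    for R in (1, 2, 5):
        row = [round(pooled_t_level(theta, R), 3)
               for theta in (0.2, 0.5, 1.0, 2.0, 5.0)]
        print(R, row)
```

When R = 1 the ratio equals one for every $\theta$, which is why equal sample sizes protect the significance level; the other rows computed this way agree with Table 10 to within about .005.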

Since equal sample sizes are sometimes difficult 
to obtain, even approximately, considerable re- 
search has been focused upon alternative ways to 


Table 9 — Probability that a randomly chosen score on
treatment i exceeds $w_0$


Release score          Treatment
    $w_0$             1          2
     1.0            .3707      .5000
     1.5            .3085      .3085
     2.0            .2514      .1587
     2.5            .2033      .0668
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Table 10 — Large sample probability that |t| > 1.96 
when sample sizes and variances differ 


.20 50 10 2.0 5.0 
1 0.050 0.050 0.050 0.050 0.050 
R 2 0.120 0.080 0.050 0.029 0.014 
5 0.220 0.120 0.050 0.014 0.002 


Source: Scheffé 1959, p. 340. 


test (5) versus (6). Transformation of the response 
variable may achieve equality of variance for the 
transformed variable, so that t may be used. But 
the user must note that a hypothesis on the means 
of the transformed variates is being tested. [See 
STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF, AT- 
ticle on TRANSFORMATIONS OF DATA.] 

Welch (1938; 1947) investigates the alternative 
test statistic, 


$$v = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{s_1^2/n_1 + s_2^2/n_2}},$$


and indicates how to obtain significance levels (see
also Dixon & Massey [1951] 1957, p. 123). Note
that the statistics v and t are the same if $n_1 = n_2$.
Furthermore, Welch (1938) shows that the approximate
significance level of v may be obtained
from tables of the t distribution with f degrees of
freedom, where

$$f = \frac{(s_1^2/n_1 + s_2^2/n_2)^2}{\dfrac{s_1^4}{n_1^2(n_1 - 1)} + \dfrac{s_2^4}{n_2^2(n_2 - 1)}}.$$
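
As a rough illustration, the v statistic and the approximate degrees of freedom f can be computed directly from two samples. This is a minimal sketch using only the standard library; the function name `welch_v` and the sample data are illustrative, and the degrees-of-freedom expression is the one with $n_i - 1$ in the denominators.

```python
from math import sqrt
from statistics import mean, variance

def welch_v(x1, x2):
    """Return (v, f): Welch's statistic and its approximate degrees of freedom."""
    n1, n2 = len(x1), len(x2)
    a = variance(x1) / n1  # s_1^2 / n_1
    b = variance(x2) / n2  # s_2^2 / n_2
    v = (mean(x1) - mean(x2)) / sqrt(a + b)
    f = (a + b) ** 2 / (a ** 2 / (n1 - 1) + b ** 2 / (n2 - 1))
    return v, f

v, f = welch_v([1, 2, 3, 4, 5], [2, 4, 6, 8, 10, 12])
print(f"v = {v:.4f}, f = {f:.3f}")
```

The value of v is then referred to a t table with f degrees of freedom, interpolating when f is not an integer.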


The v test is valid for both significance level and
power in large samples and is much less sensitive
to $\theta$ than the usual t in small samples. For example,
if $\theta = 1$ and $n_1 = 5$, $n_2 = 15$, the exact probability
that |v| > 5.2 is .05, where 5.2 is the 5 per cent
point of the exact distribution of v. The probability
that |v| > 5.2 for any other $\theta$ value is always between
.035 and .085. In addition, even if $\theta = 1$, so that
the t test is valid, the v test is nearly as efficient as
t. When both $n_1$ and $n_2$ are small, $s_1^2$ and $s_2^2$ will
have low precision. In these situations, compute f
from the following formula:


$$f + 2 = \frac{(s_1^2/n_1 + s_2^2/n_2)^2}{\dfrac{s_1^4}{n_1^2(n_1 + 1)} + \dfrac{s_2^4}{n_2^2(n_2 + 1)}}.$$

Alternative testing methods for k = 2 exist (see
Behrens 1963; Cochran 1964).

Inequality of variance also affects the significance
levels of the Wilcoxon rank-sum test w and
the median test. H. R. van der Vaart (1961) gives
significance levels of the Wilcoxon test for various
$\theta$ values, assuming normal distributions and large
samples. Pratt (1964) extends van der Vaart’s investigation
in several ways. Surprisingly, w is sensitive
to $\theta$ in the case of equal samples, while t remains
unaffected. The median test appears to have
greater robustness of significance-level validity
than t or w near $\theta = 1$ when the three tests are
comparable.

The conclusions concerning the effects of in- 
equality of variance on the usual F test for k = 2 
generally hold for arbitrary k. Robust tests along 
the lines of Welch’s v test have been developed for 
general k (see James 1951). 

Efficiency considerations. When the shapes 
(including variances) of the k distributions are 
different, it is important to be precise about which 
null hypothesis is being tested. The null hypothesis 
of equality of means is tested by F; the null hypothesis
that p = 1/2 is tested by the Kruskal-Wallis
method (when k = 2, p is the probability that a
random observation under treatment 1 is greater
than a random observation under treatment 2).
The null hypothesis of equality of medians is tested
by the median test. The means may be equal but
the medians different, or conversely. Either the
means or the medians may be equal, but $p \neq 1/2$, or
conversely. These facts imply the noncomparability 
of the three tests if shape differences exist. They 
also imply that a satisfactory analysis of the data 
may require an investigator to assemble evidence 
from all three significance tests—and possibly 
additional tests. 

If the k distributions have the same shape, the 
following conclusions are justified: the F test does 
not have robustness of efficiency for bell-shaped 
symmetrical distributions with moderately long 
tails. No adequate study has been made of the ro- 
bustness of efficiency of the Kruskal-Wallis test, 
but for distributions such as that described above, 
no competitor is in sight. The median test should 
have high efficiency for very long tailed sym- 
metrical distributions. 

Tests for equality of variances. In many data 
analyses an investigator is interested in comparing 
variability among the k treatments; thus he may 
carry out a test of

$$H_0: \sigma_1^2 = \sigma_2^2 = \cdots = \sigma_k^2$$

against

$$H_1: \sigma_i^2 \neq \sigma_j^2 \text{ for some } i, j,$$

and find confidence intervals for all ratios $\sigma_i^2/\sigma_j^2$,
$i \neq j$ [see the article on VARIANCES, STATISTICAL
STUDY OF]. The robustness of validity of Bartlett’s
test for homogeneity of variance, and hence the va- 
lidity of confidence intervals derived from Bartlett’s 
test, will next be investigated. Bartlett’s test is based 
upon the statistic M: 


$$M = \Big(\sum n_i - k\Big) \ln s^2 - \sum (n_i - 1) \ln s_i^2,$$

where

$$s^2 = \sum (n_i - 1) s_i^2 \Big/ \Big(\sum n_i - k\Big), \qquad s_i^2 = \sum_j (X_{ij} - \bar{X}_i)^2/(n_i - 1).$$

The significance level of M may be approximated
from a $\chi^2$ table with k - 1 degrees of freedom. The
M test requires normality and has almost no robustness
of validity, as may be seen from Table 11,
where the nonnormality is characterized by the
$\gamma_2$ parameter ($\gamma_1 = 0$).
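
The statistic M is simple to compute from grouped data. The following is a minimal sketch, with `bartlett_m` an illustrative name and the data invented for the example. It returns M only; the comparison with a $\chi^2$ table on k - 1 degrees of freedom is left to the reader.

```python
from math import log
from statistics import variance

def bartlett_m(groups):
    """Bartlett's M for a list of samples, one list of scores per treatment."""
    dfs = [len(g) - 1 for g in groups]          # n_i - 1
    s2 = [variance(g) for g in groups]          # s_i^2
    pooled_df = sum(dfs)                        # sum(n_i) - k
    pooled = sum(d * v for d, v in zip(dfs, s2)) / pooled_df  # s^2
    return pooled_df * log(pooled) - sum(d * log(v) for d, v in zip(dfs, s2))

print(bartlett_m([[1, 2, 3], [4, 5, 6]]))  # equal sample variances give M = 0
print(bartlett_m([[1, 2, 3], [2, 4, 6]]))  # variances 1 and 4 give M > 0
```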

The disastrous behavior of M for long-tailed
symmetrical distributions may be explained by the
following suggestive argument by Box (1953). Let
T denote a statistic; for example, $\bar{X}$, $s_1^2/s_2^2$, M are
statistics. Then, in large samples, T divided by its
estimated standard error is usually normally distributed
by the central limit theorem and associated
mathematical facts. Thus, even though sampling
may be from a nonnormal distribution, $\bar{X}/(s/\sqrt{n})$
is normally distributed. This result explains the
robustness to nonnormality of the t test in the
matched-pairs design and the F test in the one-way
analysis of variance when the populations have the
same shape. But Bartlett’s M test does not have
this structure of T divided by its standard error;
hence, it does not find protection under the central
limit theorem.

The nonrobustness of the M test requires the use 
of an alternative test. Scheffé (1959) has devel- 
oped a robust test and a robust multiple compari- 
son method for this problem from a suggestion by 
Box (1953). 

The extreme sensitivity of the M test for equality
of variances to the value of $\gamma_2$ indicates that one
may expect trouble with the normal theory analysis
of the random effects model, sometimes called
model II [see LINEAR HYPOTHESES, article on ANALYSIS
OF VARIANCE; see also Dixon & Massey (1951)
1957, p. 174]. Real difficulties do exist with such
analyses, even in large samples.


Table 11. True probability of exceeding the .05 normal
theory point of M in large samples

                    $\gamma_2$
k        -1        0        1        2
2        .0056     .05      .110     .166
3        .0025     .05      .136     .224
5        .0008     .05      .176     .315
10       .0001     .05      .257     .489

Source: Box 1953, p. 320.


A repeated-measurements design 


An investigator records the response of each individual
to a stimulus repeated at each of p different
points in time. The data consist of response
scores, $X_{it}$, where $X_{it}$ denotes the score of the ith
individual at time t, i = 1, 2, ..., n and t = 1, 2,
..., p. Each individual has p response scores. The
investigator assumes that

(7)    $X_{it} = \mu + \alpha_i + \tau_t + e_{it}$.

The $\alpha_i$ and $\tau_t$ denote the individual and time effects,
respectively. It is assumed that the random errors
$e_{it}$ have a common distribution and that the correlation
between $e_{it}$ and $e_{is}$ is given by $\rho^{|t-s|}$. It is
assumed then that an individual's response at
time t is correlated with his response at another
time s, but that the responses of different individuals
are independent. The investigator's principal
interest lies in testing the hypotheses

(8)    $H_0^{\alpha}: \alpha_1 = \alpha_2 = \cdots = \alpha_n$
       against $H_1^{\alpha}: \alpha_i \neq \alpha_j$ for some i, j;

(9)    $H_0^{\tau}: \tau_1 = \tau_2 = \cdots = \tau_p$
       against $H_1^{\tau}: \tau_t \neq \tau_s$ for some t, s.

Assume that the $e_{it}$ are normal and study the
effects of nonzero $\rho$ on the standard F tests in the
two-way analysis of variance.


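The hypotheses (8) and (9) may be examined,
respectively, by the statistics

$$F^{\alpha}_{n-1,\,(n-1)(p-1)} = \frac{p \sum_i (\bar{X}_{i\cdot} - \bar{X}_{\cdot\cdot})^2/(n - 1)}{\text{MSE}}$$

and

$$F^{\tau}_{p-1,\,(n-1)(p-1)} = \frac{n \sum_t (\bar{X}_{\cdot t} - \bar{X}_{\cdot\cdot})^2/(p - 1)}{\text{MSE}},$$

where MSE represents the mean square error,
that is, $\sum_{i,t} (X_{it} - \bar{X}_{i\cdot} - \bar{X}_{\cdot t} + \bar{X}_{\cdot\cdot})^2/[(n - 1)(p - 1)]$.
Table 12 gives the probability that $F^{\alpha} > 3.01$ for
different values of $\rho$ (the exact probability is .05
for $\rho = 0$) and the probability that $F^{\tau} > 3.01$ when
n = p = 5.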
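
The two F ratios are the usual row and column ratios of a two-way layout with one observation per cell. A minimal sketch, assuming the data are given as a list of n rows (individuals) with p entries each (times); the function name and the tiny data set are illustrative only.

```python
def repeated_measures_f(X):
    """Return (F_alpha, F_tau) for an n-by-p table X[i][t] of scores."""
    n, p = len(X), len(X[0])
    grand = sum(map(sum, X)) / (n * p)
    row_means = [sum(row) / p for row in X]                              # individuals
    col_means = [sum(X[i][t] for i in range(n)) / n for t in range(p)]   # times
    ms_rows = p * sum((r - grand) ** 2 for r in row_means) / (n - 1)
    ms_cols = n * sum((c - grand) ** 2 for c in col_means) / (p - 1)
    mse = sum(
        (X[i][t] - row_means[i] - col_means[t] + grand) ** 2
        for i in range(n) for t in range(p)
    ) / ((n - 1) * (p - 1))
    return ms_rows / mse, ms_cols / mse

f_alpha, f_tau = repeated_measures_f([[1, 2], [3, 5]])
print(f_alpha, f_tau)
```

With n = p = 5 each ratio would be referred to the F distribution on 4 and 16 degrees of freedom, whose 5 per cent point is the 3.01 used in Table 12.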

Table 12 shows clearly the considerable effect
of correlation on the test for individuals and the
slight effect such correlation has for the test on
times. In the terminology of the two-way analysis
of variance, if individuals denote the rows and
times denote the columns, then correlation within
a row seriously affects the test on rows and only
slightly affects the test on columns, with the given
model for the correlation. Two explanations for the
nonrobustness of the $F^{\alpha}$ test are (1) the numerator
and denominator of $F^{\alpha}$ are correlated, contrary to
the ideal condition, and (2) essentially the wrong
standard error of the means for individuals is used.


Table 12. Probability that $F^{\alpha}$ > 3.01 and probability
that $F^{\tau}$ > 3.01 in the presence of correlation*

                                        Correlation, $\rho$
                                 -0.40    -0.20    0.0     +0.20    +0.40
Exact probability for the
$F^{\alpha}$ test on individuals        .0003    .0101    .05     .1305    .2470
Exact probability for the
$F^{\tau}$ test on different times      .0590    .0527    .05     .0537    .0668

* The cell entries were computed assuming $\rho^{|t-s|} = \rho$ if |t-s| = 1 and $\rho^{|t-s|} = 0$
if |t-s| ≥ 2 in order to simplify the computations. This approximation correctly
indicates the order of magnitude of the more general model $\rho^{|t-s|}$.

Source: Box 1954, p. 497.


It has been shown that statistical analyses based 
on assumptions that are incorrect for the data can 
produce misleading inferences. Furthermore, ways 
have been indicated to choose good statistical anal- 
yses, based on plausible assumptions, so that infer- 
ences will not be distorted. The question arises, 
“How does one decide which assumptions to make?” 
For example, suppose that there is interest only in 
making inferences about the mean difference. It 
then seems preferable to use an inference proce- 
dure that is robust against suspected departures 
from assumptions rather than to make preliminary 
significance tests of the assumptions of equality on 
variances, normality and/or symmetry, random- 
ness, and so forth. A procedure that is robust 
against all failures in assumptions cannot be found, 
so a procedure must be chosen that is robust against 
those failures in assumptions that are known to be 
likely from experience with the problem under 
study or that would distort the inferences most 
severely.

ROBERT M. ELASHOFF 


[See also FALLACIES, STATISTICAL; NONPARAMETRIC 
STATISTICS. ] 
I
POINT ESTIMATION 


How many fish are in this lake? What propor- 
tion of the voting population favors candidate A? 
How much paint is needed for this particular 
room? What fuel capacity should this airplane have 
if it is to carry passengers safely between New York
and Paris? How many items in this shipment have
the desired quality? What is the specific gravity of 
this metal? Questions like these represent problems 
of point estimation. In present-day statistical meth- 
odology, such problems are usually cast in the fol- 
lowing form: A mathematical model describing a 
particular phenomenon is completely specified except
for some unknown quantity or quantities.
These quantities must be estimated. Galileo’s model 
for freely falling bodies and many models in learn- 
ing theory, small group theory, and the like provide 
examples. 

Exact answers are often impossible, difficult, 
expensive, or merely inconvenient to obtain. How- 
ever, approximate answers that are quite likely to 
be close to the exact answer may be fairly easily 
obtainable. The theory of point estimation provides 
a guide for obtaining such answers; above all, it 
makes precise, or provides enough framework so 
that one could make precise, such phrases as “quite 
likely to be close” and others such as “this estimator 
is better than that one.” 

As an introduction to some of the problems in- 
volved, consider estimating the number N of fish 
in a given lake. Suppose that M fish are taken from 
the lake, marked, and returned to the lake un- 
harmed. A little later, a random sample of size n 
of fish from the lake is observed to contain x marked 
fish. A little thought suggests that probably the 
ratio x/n is near M/N or that the unknown N and 
the ratio Mn/x (defined only if x > 0) are not too 
far apart. For example, if M = 1,000, n = 1,000, 
and x = 20, it might be reasonable to believe that 
N is close to 50,000. [A similar example, concern- 
ing moving populations of workers, is discussed in 
SAMPLE SURVEYS.] 

Clearly, this procedure may lead one badly astray. 
For example, it is possible, although highly unlikely,
that the same value x = 20 could be obtained,
and hence, using the above procedure, N
be estimated as 50,000, even if N is actually as
small as 1,980 or as large as 10,000,000. Clearly,
considerations of probability are basic here. If L(N)
denotes the probability of obtaining 20 marked fish
when N fish are in the lake, it can be shown that
0 = L(1,979) < L(1,980) < ... < L(49,999) =
L(50,000) and L(50,000) > L(50,001) > ...; that
is, N = 50,000 maximizes the likelihood of obtaining
20 marked fish.
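
The chain of inequalities for L(N) can be checked with exact arithmetic. The sketch below assumes the hypergeometric model for the number of marked fish in the sample; the function names are illustrative. It uses the fact that L(N)/L(N - 1) = (N - M)(N - n)/[N(N - M - n + x)], which is at least 1 exactly when N ≤ Mn/x.

```python
from fractions import Fraction
from math import comb

def likelihood(N, M, n, x):
    """Probability of x marked fish in a sample of n when M of N fish are marked."""
    if x < 0 or x > M or n > N or n - x > N - M:
        return Fraction(0)
    return Fraction(comb(M, x) * comb(N - M, n - x), comb(N, n))

def likelihood_ratio(N, M, n, x):
    """L(N) / L(N - 1), valid whenever L(N - 1) > 0."""
    return Fraction((N - M) * (N - n), N * (N - M - n + x))

M, n, x = 1000, 1000, 20
print(likelihood(1979, M, n, x) == 0)        # too few unmarked fish
print(likelihood(1980, M, n, x) > 0)
print(likelihood_ratio(50000, M, n, x))      # L(50,000) = L(49,999)
print(likelihood_ratio(50001, M, n, x) < 1)  # likelihood falls beyond 50,000
```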

Design of experiments. What values of M and 
n are most satisfactory in the above experiment? 
Clearly, the bigger n is, the better it is for estima- 
tion purposes, but the more expensive the experi- 
ment [see EXPERIMENTAL DESIGN]. A balance has 
to be reached between the conflicting goals of minimizing
error and minimizing expense. Also, perhaps
another experimental design might give better
results. In the above problem, let M = 1,000, but 
instead of pulling a fixed number of fish out of the 
lake, pull out fish until exactly x marked fish have 
been obtained, where x is fixed in advance. Then 
n, the sample size, is the observation of interest 
[see SEQUENTIAL ANALYSIS]. Which design, of all 
the possible designs, should be used? This kind of 
question is basic to any estimation problem. 

Testing hypotheses. An altogether different 
problem would arise if one did not really want the 
value of N for its own sake but only as a means of 
deciding whether or not the lake should be re- 
stocked with small fish. For example, it might be 
desirable to restock the lake if N is small, say less 
than 100,000, and undesirable otherwise. In this 
case, the problem of whether or not the lake should 
be restocked is equivalent to testing the hypothesis 
that N is less than 100,000 [see HYPOTHESIS TEST- 
ING]. In general, a good estimator does not neces- 
sarily lead to a good test. 

Confidence intervals. The value of an estimator, 
that is, a point estimate, of N for a particular sam- 
ple is a number, hopefully one close to N; the value 
of a confidence interval, that is, an interval esti- 
mate, of N for a particular sample is an interval, 
hopefully one that is not only small but that also 
contains N [see ESTIMATION, article on CONFIDENCE 
INTERVALS AND REGIONS]. The problem of finding a 
good interval estimate is more closely related to 
hypothesis testing than it is to point estimation. 

Note that certain problems are clearly point esti- 
mation problems rather than problems of interval 
estimation: when deciding what the fuel capacity 
of an airplane should be, the designers must settle 
on one particular number. 


Steps in solving an estimation problem 


The first step in the solution of an estimation 
problem, as suggested above, is to design an ex- 
periment (or method of taking observations) such 
that the outcome of the experiment—call it x—is 
affected by the unknown quantity to be estimated, 
which in the above discussion was N. Typically, x 
is related to N probabilistically rather than deter- 
ministically. This probability relation must be speci- 
fied. For example, the probability of obtaining x 
marked fish in a sample of size n is given by the 
hypergeometric distribution, 


$$\binom{M}{x}\binom{N - M}{n - x} \Big/ \binom{N}{n},$$

provided the sample has been drawn randomly
without replacement [see DISTRIBUTIONS, STATISTICAL,
article on SPECIAL DISCRETE DISTRIBUTIONS].
(The denominator is the number of combinations
of N things taken n at a time, and so forth.) If the
randomness assumption is not quite satisfied, then 
the specified probability relation will be only ap- 
proximately true. Such specification problems and 
their implications will be discussed later. Next, 
after the experiment has been designed and the 
probability model specified, one must choose a func- 
tion f defined for each possible x such that if x is 
observed, then f(x), the value of the function f at 
x, is to be used as a numerical estimate of N. Such 
a function f is called an estimator of N. The prob- 
lem of the choice of f will be discussed later. Finally, 
after a particular estimator f has been tentatively 
settled on, one might want to calculate additional 
performance characteristics of f, giving further in- 
dications of how well f will perform on the average. 
If the results of these calculations show that f will 
not be satisfactory, then changes in the design of 
the experiment, for example, an increase in sample 
size, might be contemplated. Clearly, there is a 
good deal of interplay among all the steps in the 
solution of an estimation problem outlined here. 

Terminological note. Some authors distinguish 
terminologically between the estimator, the func- 
tion f, and its numerical value for a particular 
sample, the estimate. Another distinction is that 
between a random variable and a generic value of 
the random variable. (Some authors use X for the 
former and x for the latter.) Such distinctions are 
sometimes important, but they are not generally 
made in this article, although special comments 
appear in a few places. Otherwise it should be clear 
from context whether reference is made to a func- 
tion or its value, or whether reference is made to a 
random variable or its value. 


Choice of estimator 

As a means of illustrating the various consider- 
ations influencing the choice of an estimator, a few 
typical examples will be discussed. 

Example 1. Let x be the number of successes 
in n independent trials, the probability of a success 
on an individual trial being p. (For example, x 
might be the number of respondents out of n ques- 
tioned in a political poll who say they are Demo- 
crats, and p is the probability that a randomly 
chosen individual in the population will say he is a 
Democrat.) Here p is unknown and may be any 
number between 0 and 1 inclusive. An estimator f 
of p ideally should be such that f(x) is close to p 
no matter what the unknown p is and no mat- 
ter what the observation x is. That is, the error 
f(x) — p committed by using f(x) as an approxi- 
mation to p should always be small. This is too 
much to expect since x can, by chance, be quite 
misleading about p. However, it is not too much to 
expect that the error be small in some average
sense. For example, the mean squared error,

$$E_p(f - p)^2 = \sum_{x=0}^{n} [f(x) - p]^2 \binom{n}{x} p^x (1 - p)^{n-x},$$

should be small no matter what the unknown p is,
or the mean absolute error $E_p|f - p|$ should be small
no matter what p is, or the like. For the time being,
estimators will be compared only on the basis of
their mean squared errors. A more general approach,
the underlying ideas of which are well
illustrated in this special case, will be mentioned
later. The first question that arises is, Can one find
an estimator f such that, for every p satisfying
$0 \le p \le 1$, the mean squared error of f at p is
smaller than (or at least not greater than) the
mean squared error at p of any other estimator?
Obviously, such an estimator would be best in this
mean squared error sense. Unfortunately, and this
is what makes the problem of choosing an estimator
a nontrivial problem, a best estimator does
not exist. To see this, consider the estimators $f_1$ and
$f_2$ defined by $f_1(x) = x/n$ and $f_2(x) = 1/2$. It is not
hard to show that $E_p(f_1 - p)^2 = p(1 - p)/n$, and
clearly, $E_p(f_2 - p)^2 = (1/2 - p)^2$. If a best estimator f
existed it would have to satisfy $E_p(f - p)^2 \le
E_p(f_2 - p)^2$. But the latter quantity is zero for
p = 1/2, implying that $f = f_2$. However, $f_2$ is not best
since $E_p(f_1 - p)^2$ is smaller than $E_p(f_2 - p)^2$ for p
near 0 or 1.

Although no best estimator exists, many good
estimators exist. For example, there are many estimators
f satisfying $E_p(f - p)^2 \le 1/(4n)$ for
$0 \le p \le 1$. The estimator $f_1$, defined above, is such an
estimator. The estimator $f_3$ defined by $f_3(x) =
[\sqrt{n}(x/n) + 1/2]/(\sqrt{n} + 1)$, with mean squared error
$E_p(f_3 - p)^2 = 1/[4(1 + \sqrt{n})^2]$, is another. If n is
large, the mean squared error of any such estimator
is small for each possible value of p. In this
problem, as is typical, any one of many good available
estimators would no doubt be reasonable to use
in practice. Only by adding further assumptions,
for example, assumptions giving some information
about the unknown p, can the class of reasonable
estimators be narrowed. Note that estimators are
still being compared on the basis of their mean
squared errors only.

The estimator $f_3$ is minimax in the sense that
$f_3$ minimizes $\max_p E_p(f - p)^2$ with respect to f. The
minimax approach focuses attention on the worst
that can happen using f and chooses f accordingly
[see DECISION THEORY]. Note that the estimator
$f_1$ ($f_1(x) = x/n$) does have slightly larger mean
squared error than does $f_3$ for values of p near 1/2;
for values of p near 0 or 1 the advantage lies
wholly with $f_1$. Other properties of these estimators
will be discussed later.
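
The comparison of the three estimators of p in example 1 can be reproduced by summing over the binomial distribution. This is a minimal sketch in which the names `mse`, `f1`, `f2`, `f3` mirror the text; the choice n = 16 is only for illustration.

```python
from math import comb, sqrt

def mse(estimator, n, p):
    """Exact mean squared error of estimator(x, n) when x is binomial(n, p)."""
    return sum(
        (estimator(x, n) - p) ** 2 * comb(n, x) * p ** x * (1 - p) ** (n - x)
        for x in range(n + 1)
    )

def f1(x, n):
    return x / n                                           # sample proportion

def f2(x, n):
    return 0.5                                             # ignores the data

def f3(x, n):
    return (sqrt(n) * (x / n) + 0.5) / (sqrt(n) + 1)       # constant-risk estimator

n = 16
for p in (0.05, 0.25, 0.5, 0.75, 0.95):
    print(p, *(round(mse(f, n, p), 5) for f in (f1, f2, f3)))
```

For n = 16 the mean squared error of `f3` is 1/[4(1 + 4)^2] = .01 at every p, while that of `f1` ranges from 0 at the endpoints to 1/64 at p = 1/2.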

Example 2. Suppose that $x_1, x_2, \ldots, x_n$ are
observations on n independent random variables,
each having the Poisson distribution with parameter
$\lambda$, where $\lambda$ is unknown and may be any nonnegative
number [see DISTRIBUTIONS, STATISTICAL,
article on SPECIAL DISCRETE DISTRIBUTIONS]. For
example, $x_k$ could be the number of occurrences
during the kth time interval of unit length of any
phenomenon occurring “randomly” over time, possibly
telephone calls coming into an exchange, customers
coming into a store, and so forth [see
QUEUES]. Knowing that $\lambda$ is both the mean and
variance of the Poisson distribution, it might not
be unreasonable to suppose that both the sample
mean,

$$m(x) = m(x_1, \ldots, x_n) = \sum_{k=1}^{n} x_k/n = \bar{x},$$

and the sample variance,

$$s^2(x) = \sum_{k=1}^{n} (x_k - \bar{x})^2/(n - 1)$$

(here, one must assume that n > 1), provide good
estimators of the unknown $\lambda$. It is not hard to show
that m is better than $s^2$, that is, $E_\lambda(m - \lambda)^2 \le
E_\lambda(s^2 - \lambda)^2$ for all $\lambda \ge 0$, with strict inequality for
some $\lambda > 0$.
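
A small simulation makes the comparison of m and $s^2$ concrete. The sketch below is illustrative only: it draws Poisson variates with the standard multiplication method, uses a fixed seed, and the values $\lambda = 4$, n = 10 are arbitrary choices. The mean squared error of m should come out near $\lambda/n = 0.4$, and that of $s^2$ should be much larger.

```python
import random
from math import exp

def poisson_draw(lam, rng):
    """One Poisson(lam) variate by multiplying uniforms until below exp(-lam)."""
    limit, k, prod = exp(-lam), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def simulate_mse(lam, n, reps, seed=0):
    """Monte Carlo mean squared errors of the sample mean and sample variance."""
    rng = random.Random(seed)
    se_m = se_s2 = 0.0
    for _ in range(reps):
        xs = [poisson_draw(lam, rng) for _ in range(n)]
        m = sum(xs) / n
        s2 = sum((x - m) ** 2 for x in xs) / (n - 1)
        se_m += (m - lam) ** 2
        se_s2 += (s2 - lam) ** 2
    return se_m / reps, se_s2 / reps

mse_m, mse_s2 = simulate_mse(lam=4.0, n=10, reps=5000)
print(f"MSE of m: {mse_m:.3f}   MSE of s^2: {mse_s2:.3f}")
```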

An estimator is inadmissible (with respect to a
given criterion like mean squared error) if a better
one exists; accordingly, the estimator $s^2$ is here inadmissible.
An estimator is admissible if it is not
inadmissible. Although it is not obvious, the estimator
m is admissible. In fact the class of admissible
estimates is very large here, as is typically the
case. In example 1, all three estimators discussed,
$f_1$, $f_2$, and $f_3$, are admissible.

Example 3. Let $x_1, x_2, \ldots, x_n$ be observations
on n independent random variables each having the
normal distribution with mean $\mu$ and variance $\sigma^2$,
where both $\mu$ and $\sigma^2$ are unknown: $\mu$ may be any
real number and $\sigma^2$ may be any positive number.
One might be interested in estimating only $\mu$, only
$\sigma^2$, the pair $(\mu, \sigma^2)$, or perhaps some combination
such as $\mu/\sigma$.

Example 4. Let $x_1, x_2, \ldots, x_n$ be observations
on n independent random variables each having
the uniform distribution over the set of integers
{1, 2, ..., N}, where N may be any positive integer.
For example, in a state where automobile license
plates are numbered from 1 to N, each $x_i$ would be
the number of a randomly chosen license plate.
What is a good estimator of N?

Sufficient statistics. A simple and effective way
to narrow the class of estimators that one ought
to consider when choosing a good estimator is to
identify a sufficient statistic for the problem and
to consider only those estimators that depend on
the sufficient statistic [see SUFFICIENCY]. Roughly
speaking, if t is a sufficient statistic, knowing t(x)
is as useful as knowing x. The following result is
important. If t is a sufficient statistic and f is an
estimator with finite mean squared error and f does
not depend on t (that is, f is not essentially expressible
as f = h(t) for some function h), then
there is another estimator $f_0$ that does depend on t
and such that $f_0$ is better than f (in the technical
sense defined above). One $f_0$ that works is the conditional
expectation of f relative to t.

In example 2, m is a sufficient statistic, hence
only estimators depending on m need be considered.
In particular $s^2$, which does not depend on
m in the sense defined above, need not be considered.
In example 3, the ordered pair $(m, s^2)$,
where m and $s^2$ are defined as in example 2, is a
sufficient statistic. In example 4, the estimator 2m
might seem at first to be a plausible estimator of N.
However, it does not depend on the sufficient statistic
u defined by u(x) = the largest of the $x_i$.
Much better estimators than 2m exist. For example,
the rather complicated

$$f_4(x) = \frac{u^{n+1} - (u - 1)^{n+1}}{u^n - (u - 1)^n}$$

is such an estimator. (Note that $f_4$ is approximately
equal to (n + 1)u/n.)
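
The behavior of the estimator based on the largest observation can be checked exactly, because the largest of n draws from {1, ..., N} equals j with probability $[j^n - (j - 1)^n]/N^n$. The sketch below uses exact rational arithmetic. The function names are illustrative, and the expression used for the mean squared error of 2m, namely $(N^2 - 1)/(3n) + 1$, is derived here from the variance $(N^2 - 1)/12$ of a single draw rather than taken from the article.

```python
from fractions import Fraction

def f4(u, n):
    """Estimator of N based on the sample maximum u."""
    return Fraction(u ** (n + 1) - (u - 1) ** (n + 1), u ** n - (u - 1) ** n)

def max_pmf(j, N, n):
    """P(largest of n independent draws from {1, ..., N} equals j)."""
    return Fraction(j ** n - (j - 1) ** n, N ** n)

def moments_f4(N, n):
    """Exact mean and mean squared error of f4 as an estimator of N."""
    mean = sum(f4(j, n) * max_pmf(j, N, n) for j in range(1, N + 1))
    mse = sum((f4(j, n) - N) ** 2 * max_pmf(j, N, n) for j in range(1, N + 1))
    return mean, mse

def mse_2m(N, n):
    """Mean squared error of 2m: variance (N^2 - 1)/(3n) plus squared bias 1."""
    return Fraction(N * N - 1, 3 * n) + 1

mean, mse = moments_f4(N=10, n=3)
print(mean, float(mse), float(mse_2m(10, 3)))
```

The sum for the mean telescopes to $N^{n+1}/N^n = N$, so the estimator has mean exactly N for every N and n.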

Further criteria for choice of estimator. So far 
estimators have been compared on the basis of 
their mean squared errors only. Since no best es- 
timator exists, a unique solution to the problem of 
choosing an estimator is generally not obtainable 
by this approach. This is not really too regrettable 
since many good estimators usually exist. Even de- 
manding that an estimator be minimax, not neces- 
sarily always a reasonable demand, does not always 
lead to a unique estimator. In example 2, every
estimator of $\lambda$ has unbounded mean squared error;
and in example 3 every estimator of $\mu$ has unbounded
mean squared error. Hence, in these two
examples, all estimators of $\lambda$ and $\mu$, respectively,
are minimax, but the concept loses all interest. In
example 1, demanding minimaxity does lead to the
unique minimax estimator $f_3$. A unique minimax
estimator is clearly admissible.

The strong intellectual and psychological tend- 
ency of human beings to be satisfied only with 
unique answers has often led to further demands 
being placed on estimators in addition to the one 
that their mean squared errors be small. 

Unbiasedness. An estimator is unbiased if the
mean value of the estimator is equal to the quantity
being estimated. In example 1, $f_1$ is unbiased
since $E_p f_1 = p$, $0 \le p \le 1$. Both m and $s^2$ are unbiased
estimators of $\lambda$ in example 2. In example 3,
m is an unbiased estimator of $\mu$ and $s^2$ is an unbiased
estimator of $\sigma^2$. In example 4, both 2m - 1 and
$f_4$ are unbiased estimators of N (2m itself has mean
N + 1, since each $x_i$ has mean (N + 1)/2). The search for a
best unbiased estimator often leads to a unique
answer. In example 2, an estimator f would be best
unbiased (or minimum variance unbiased) if it
is unbiased and satisfies

$$E_\lambda(f - \lambda)^2 \le E_\lambda(f' - \lambda)^2$$

for every $\lambda \ge 0$ and every unbiased estimator $f'$.
The estimator m is such an estimator for this problem
and is the only such estimator. The estimator
$f_1$ is the best unbiased estimator of p in example 1;
the estimator m is the best unbiased estimator of $\mu$
in example 3; and the estimator $f_4$ is the best unbiased
estimator of N in example 4. The relative
efficiency of two unbiased estimators is the ratio of
their reciprocal variances. Relative efficiency may
well depend on the parameter value.

Unbiased estimators fail to exist in some important
problems. Using the first design mentioned
in the problem of estimating the number of fish in
a lake, N, no unbiased estimator of N exists. Although
in example 3, $s^2$ is a best unbiased estimator
of $\sigma^2$, another estimator of $\sigma^2$, $(n - 1)s^2/(n + 1)$,
despite being biased, is actually better than $s^2$
in the sense of mean squared error. This shows
that placing extra demands on estimators can
actually come into conflict with the small mean
squared error demand. Of course, the relative importance
of the various properties an estimator may
have will no doubt be judged slightly differently
by different reasonable individuals.
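
The claim about the biased variance estimator in example 3 can be verified in a few lines. The derivation below is a sketch that uses the standard fact that, under normality, $S/\sigma^2$ with $S = \sum (x_k - m)^2 = (n - 1)s^2$ has a chi-square distribution with n - 1 degrees of freedom; the symbols S and c are introduced here for convenience and do not appear in the article.

```latex
% Estimators of the form cS, where S = \sum_k (x_k - m)^2 = (n-1)s^2.
% Under normality: E(S) = (n-1)\sigma^2 and Var(S) = 2(n-1)\sigma^4.
\begin{align*}
E(cS - \sigma^2)^2
  &= \operatorname{Var}(cS) + \bigl[E(cS) - \sigma^2\bigr]^2 \\
  &= \sigma^4 \bigl\{ 2c^2(n-1) + [c(n-1) - 1]^2 \bigr\}. \\[4pt]
c = \tfrac{1}{n-1}\ (\text{that is, } s^2):\quad
  & E(s^2 - \sigma^2)^2 = \frac{2\sigma^4}{n-1}, \\[4pt]
c = \tfrac{1}{n+1}:\quad
  & E\Bigl(\tfrac{(n-1)s^2}{n+1} - \sigma^2\Bigr)^2
    = \sigma^4\,\frac{2(n-1) + 4}{(n+1)^2} = \frac{2\sigma^4}{n+1}.
\end{align*}
% Setting the derivative in c to zero gives 2c + c(n-1) - 1 = 0,
% so c = 1/(n+1) minimizes the mean squared error within this class.
```

Since $2\sigma^4/(n + 1) < 2\sigma^4/(n - 1)$ for every $\sigma^2$, the biased estimator has uniformly smaller mean squared error.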

Invariance. Notions of invariance can sometimes
be invoked so that a best invariant estimator
exists. For example, if $x = (x_1, \ldots, x_n)$, b is a real
number, and $y = (x_1 + b, \ldots, x_n + b)$, the estimator
m is invariant in the sense that it satisfies m(y) =
m(x) + b. It turns out that among all the estimators
of $\mu$ in example 3 with this property of
invariance under a change of origin (location invariance),
the estimator m is best in the
usual mean squared error sense. The argument for
invariance may be stated rather loosely as follows.
Irrelevancies in the data (for example, whether
time is measured from 12 noon New York time or
from 12 noon Greenwich time) should not make a
fundamental difference in the results obtained
from the analysis of the data.

A different kind of invariance problem can be 
troublesome in some circumstances. Suppose in 
example 1 that interest centers not on p but on 
some function of p, say 1/p. If f is a satisfactory 
estimator of p, it need not follow that 1/f is a
satisfactory estimator of 1/p, for properties like
unbiasedness, mean squared error functions, etc., 
can change drastically under nonlinear trans- 
formations. Fortunately, in many problems the 
parameter itself, or a single function of it, is of 
central interest, so that this kind of noninvariance 
is not serious. 

Specification problems. So far, estimators have 
been chosen relative to given probability models. If 
an estimator seems satisfactory for a given proba- 
bility model, it may be relevant to ask if this esti- 
mator is also good for probability models closely 
related to the given one. For example, it is too 
much to expect that a model postulating normal 
distributions describes exactly the practical situa- 
tion of interest. Fortunately, in many common 
problems slight changes in the probability model 
will not materially affect the goodness of an estimator
reasonable for the original model [see
ERRORS, article on the EFFECTS OF ERRORS IN STATISTICAL
ASSUMPTIONS]. For example, the estimator
m of $\mu$ in example 3 is actually a fairly reasonable
estimator of the population mean $\mu$ in a large
variety of cases, particularly if the population variance
$\sigma^2$ is finite and the sample size n is not too
small, as can be seen from the formula for its
mean squared error, $\sigma^2/n$. Circumstances arise,
however, in which alternative estimators, for example,
the sample median, not so much affected
by slight changes in the tails of the distribution,
may need to be considered. [A process for arriving
at other such estimators, called Winsorization, is
discussed in NONPARAMETRIC STATISTICS, article
on ORDER STATISTICS; the closely related concept
of trimming is discussed in ERRORS, article on the
EFFECTS OF ERRORS IN STATISTICAL ASSUMPTIONS.]

More than one parameter. Most of the material 
of this article deals with estimation of a single 
parameter. The multiparameter case is, of course, 
also important and multiparameter analogues of 
all the topics in this article exist. They are treated 
in the references, for example by Kendall and 
Stuart (1946), Cramér (1945), and Wilks (1962). 


Constructive estimation methods 


Maximum likelihood estimators. In example 1,
the estimator f₁ is the maximum likelihood estimator
of p: For each x, f₁(x) is that value of p
maximizing $\binom{n}{x} p^x (1-p)^{n-x}$, the probability of
obtaining x. In example 2, m is the maximum likelihood
estimator of λ. In example 3, (m, [n − 1]s²/n)
is the maximum likelihood estimator of (μ, σ²). In
example 4, u is the maximum likelihood estimator
of N. In the problem of estimating the number of
fish in a lake, N, using the first design, no maximum
likelihood estimator exists since no such
estimate can be defined for x = 0, although for
x > 0 no trouble occurs. In some examples, there is
no unique maximum likelihood estimator.
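
The maximization in example 1 can be checked numerically. The sketch below is a hedged illustration rather than a recommended procedure: the names `binomial_likelihood` and `grid_mle` and the grid of 1,001 candidate values of p are assumptions introduced here. It evaluates the binomial probability of the observed x at each candidate p and returns the candidate giving the largest value, which should agree with x/n whenever x/n lies on the grid.

```python
from math import comb


def binomial_likelihood(p, x, n):
    """Probability of x successes in n trials when the success probability is p."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


def grid_mle(x, n, steps=1000):
    """Candidate p in {0, 1/steps, ..., 1} that maximizes the likelihood of x."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda p: binomial_likelihood(p, x, n))
```

For instance, `grid_mle(7, 20)` should land on the grid point nearest 7/20. In practice the closed form x/n makes a search unnecessary; the grid only makes the definition concrete.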

Maximum likelihood estimators are often easy 
to obtain. A maximum likelihood estimator does 
not necessarily have small mean squared error nor 
is it always admissible. So the maximum likelihood 
principle can sometimes conflict with the small 
mean squared error principle. Nevertheless, maxi- 
mum likelihood estimators are often quite good 
and worth looking at. If the sample size is large, 
they tend to behave nearly as nicely as the esti- 
mator m of u in example 3. 

Maximum likelihood estimation is often con- 
structive, that is, the method provides machinery 
that often gives a unique estimating function. 
There are other constructive methods, three of 
which are described here: the method of moments, 
least squares, and Bayes estimation. One or another 
of these constructive methods may provide a 
simpler or a better behaved estimator in any par- 
ticular case. 

The method of moments. The approach of the 
method of moments (or of expected values) is to 
set one or more sample moments equal to the cor- 
responding population moments and to “solve,” if 
possible, for the parameters, thus obtaining esti- 
mators of these parameters. The method is particu- 
larly appropriate for simple random sampling. In 
example 1, if the sample is regarded as made up 
of n observations, the kth being a 1 (success) or 
0 (failure), the sample mean is x/n and the population
mean is p, so the resulting method of moments
estimator is x/n. In example 4, the method
of moments, as it would ordinarily be applied, 
leads to a poor estimator. The method can, none- 
theless, be very useful, especially in more complex 
cases with several parameters. 
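
As a sketch of the several-parameter case, consider a population with unknown mean μ and variance σ², for which E X = μ and E X² = σ² + μ². Setting the first two sample moments equal to these expressions and solving gives estimators of both parameters. The function name `moments_mean_variance` is an assumption made for this illustration.

```python
def moments_mean_variance(sample):
    """Method of moments estimates of (mu, sigma^2) from the first two moments."""
    n = len(sample)
    m1 = sum(sample) / n                  # first sample moment
    m2 = sum(v * v for v in sample) / n   # second sample moment
    # Solve m1 = mu and m2 = sigma^2 + mu^2 for the two parameters.
    mu_hat = m1
    var_hat = m2 - m1 * m1
    return mu_hat, var_hat
```

The variance estimate obtained this way divides by n rather than n − 1, so for normal samples it coincides with the maximum likelihood estimator [n − 1]s²/n of example 3.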

Least squares. The least squares approach is 
especially useful when the observations are not 
obtained by simple random sampling. One considers
the formal sum of squares Σᵢ (xᵢ − EXᵢ)²,
where xᵢ is an observation on the random variable
Xᵢ with expectation EXᵢ (depending on the parameters
to be estimated). Then one attempts to minimize
the sum of squares over possible values of
the parameters. If a unique minimum exists, the
minimizing values of the parameters are the values
of their least squares estimators.

The method is particularly appropriate when the
Xᵢ are independent and identically distributed except
for translational shifts that are given functions
of the parameters. If the Xᵢ all have the same
expectation, as in examples 1-4, the least squares
estimator of that expectation is the sample mean.
Least squares estimation, without modification or
extension, does not provide estimators of parameters
(like σ² in example 3) that do not enter into
expectations of observations. [A fuller treatment of
this topic appears in LINEAR HYPOTHESES, article
on REGRESSION.]
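
A minimal sketch of the least squares recipe follows, under the assumption (made here for illustration) that the expectations have the form EXᵢ = a + b·tᵢ with known values tᵢ. The function names are hypothetical. The first function minimizes the sum of squares in closed form; the second evaluates the sum of squares for a common expectation c, which can be used to confirm that the sample mean is the minimizer when all expectations are equal.

```python
def least_squares_line(t, x):
    """Values (a, b) minimizing sum_i (x_i - a - b*t_i)^2."""
    n = len(t)
    t_bar = sum(t) / n
    x_bar = sum(x) / n
    spread = sum((ti - t_bar) ** 2 for ti in t)
    if spread == 0:
        raise ValueError("all t_i are equal, so the slope is not determined")
    b = sum((ti - t_bar) * (xi - x_bar) for ti, xi in zip(t, x)) / spread
    a = x_bar - b * t_bar
    return a, b


def sum_of_squares(c, x):
    """Sum of squares when every observation has the same expectation c."""
    return sum((xi - c) ** 2 for xi in x)
```

The error raised for identical tᵢ corresponds to the case in which no unique minimum exists.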

Bayes estimation. Consider example 1 again, 
this time supposing that the unknown p is itself 
the outcome of some experiment and that the prob- 
ability distribution underlying this experiment is 
known. For example, x could be the number of 
heads obtained in n tosses of a particular coin, the 
probability of a head for the particular coin being 
p where p is unknown, but where the coin has been 
picked randomly from a population of coins with 
a known distribution of p values. Then it would be 
reasonable to choose an estimator f that minimizes 
the mean value of the squared error [f(x) − p]²
where the averaging is done with respect to the 
known joint distribution of x and p. Such a min- 
imizing estimator is called a Bayes estimator; of 
course it depends on the distribution assigned to p 
[see BAYESIAN INFERENCE]. A distribution may be 
assigned to p merely as a technical device for ob- 
taining an estimator and completely apart from the 
question of whether p actually is the outcome of an 
experiment. This is the spirit in which Bayes esti- 
mators are often introduced, as a way of obtaining 
an estimator that may or may not have good properties.
On the other hand, one may assign a distribution
to p in such a way that those values of p
that seem more likely to obtain are given greater 
weight. Of course, different individuals might assign 
different distributions, for this is a matter of judg- 
ment. However, this approach does provide one 
possible method for using any previously obtained 
information about p that may be available. It would 
be rather rare that the only information available 
about p before the experiment is that 0 ≤ p ≤ 1.

Examples of Bayes estimators include the estimator
f, of example 1, obtained by assigning a certain
beta distribution to p, and the estimator f₅ of
p, defined by f₅(x) = (x + 1)/(n + 2), obtained by
assigning to p the uniform distribution on the
interval between 0 and 1. [See DISTRIBUTIONS,
STATISTICAL, article on SPECIAL CONTINUOUS DIS- 
TRIBUTIONS, for discussions of these specific dis- 
tributions.] Even f, is a Bayes estimator. However, 
f₁ is not a Bayes estimator but is rather the limit
of a sequence of Bayes estimators. 
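
These statements can be made concrete with a beta distribution for p. Under squared error, the minimizing estimator is the mean of the conditional distribution of p given x; for a beta distribution with parameters a and b this mean is (x + a)/(n + a + b), a standard result that is assumed rather than derived here. The function name `beta_bayes_estimate` is hypothetical. Exact rational arithmetic is used so that the special cases can be compared without rounding.

```python
from fractions import Fraction


def beta_bayes_estimate(x, n, a, b):
    """Bayes estimate of p under squared error when p has a beta(a, b)
    distribution and x successes are observed in n trials."""
    return (Fraction(x) + a) / (Fraction(n) + a + b)


# a = b = 1 is the uniform distribution on the interval from 0 to 1.
uniform_case = beta_bayes_estimate(7, 20, 1, 1)

# Letting a and b shrink toward 0 moves the estimate toward x/n.
tiny = Fraction(1, 10**6)
near_proportion = beta_bayes_estimate(7, 20, tiny, tiny)
```

With a = b = 1 the formula reduces to (x + 1)/(n + 2), and as a and b approach 0 the estimates approach x/n, which illustrates how an estimator can be a limit of Bayes estimators.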

Restricting attention to estimators that are Bayes 
or the limits (in a certain sense) of sequences of 
Bayes estimators usually assures one of not over- 
looking any admissible estimator. Bayes methods 
frequently prove useful as technical devices in 
solving for minimax estimators and in many other 
situations.


Asymptotic estimation theory


Because it is often difficult to compare estimators 
for small sample sizes, much research on point 
estimation is in terms of large sample sizes, work- 
ing with limits as the sample size goes to infinity. 
In this context, an estimator itself is not considered, 
but rather a sequence of estimators, each member 
of which corresponds to a single sample size. For 
example, consider the sequence of sample means
m₁, m₂, ···, where mₙ(x₁, ···, xₙ) = Σᵢ xᵢ/n. If
a sequence of estimators has desirable properties in 
a limiting large sample sense, it is often presumed 
that particular members of the sequence will to 
some extent partake of these desirable properties. 

Consistency. An asymptotic condition that is 
often regarded as essential is that of consistency, 
in the sense that the sequence of estimators is 
close to the true value of the parameter, with high 
probability, for large sample sizes. More precisely,
if {tₙ} is the sequence of estimators, and if θ is the
parameter being estimated, the sequence {tₙ} is
said to estimate θ consistently if, for every interval
I containing θ in its interior, the probability that
the value of tₙ belongs to I approaches 1 as n approaches
infinity, no matter what the value of θ is.
(There is also a nonasymptotic concept of consistency,
closely related to the above. Both ideas,
and their applications, originated with R. A. Fisher.)
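
The definition can be illustrated with the sequence of sample proportions x/n from a binomial experiment. The sketch below computes exactly, without simulation, the probability that x/n falls in the interval (p − ε, p + ε). The names `binomial_pmf` and `prob_within`, and the particular values p = .3 and ε = .05, are assumptions chosen for the illustration; logarithms are used only to avoid overflow for large n.

```python
from math import exp, lgamma, log


def binomial_pmf(x, n, p):
    """Binomial probability of x successes in n trials, for 0 < p < 1."""
    log_comb = lgamma(n + 1) - lgamma(x + 1) - lgamma(n - x + 1)
    return exp(log_comb + x * log(p) + (n - x) * log(1 - p))


def prob_within(n, p, eps):
    """Probability that the sample proportion x/n lies within eps of p."""
    return sum(binomial_pmf(x, n, p)
               for x in range(n + 1)
               if abs(x / n - p) < eps)


# Probabilities for increasing sample sizes; they should climb toward 1.
probabilities = [prob_within(n, 0.3, 0.05) for n in (10, 100, 1000)]
```

Repeating the computation for other values of p and smaller ε shows the same pattern, as consistency requires.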

Comparison of estimators. For simplicity, consider
now independent identically distributed random
variables with common distribution depending
on a single parameter, θ. Let $\phi_\theta$ be the density
function (or frequency function) corresponding to
that common distribution for the parameter value
θ. A large number of regularity conditions are traditionally,
and often tacitly, imposed on $\phi_\theta$; for
example, distributions like those of example 4 do
not come under the standard theory here. In this 
brief summary, the regularity conditions will not 
be discussed. With almost no modifications, the 
discussion applies to qualitative, as well as numer- 
ically valued, random quantities. 

Two sequences of estimators, competing as estimators
of θ, are often compared by considering the
ratios of their asymptotic variances, that is, the 
variances of limit distributions as n approaches 
infinity. In particular, one or both sequences may 
have the lowest possible asymptotic variance. In 
discussing such matters, the following constructs, 
invented and named by R. A. Fisher, are important. 

Score, Fisher information, and efficiency. The
score of the single observation $x_i$ is a function of
both $x_i$ and θ, defined by

$$s_\theta(x_i) = \frac{\partial}{\partial\theta} \ln \phi_\theta(x_i),$$

and it provides the relative change in $\phi_\theta$ (for each
possible value of $x_i$) when θ is slightly changed.
Two basic facts about the score are $E_\theta s_\theta = 0$ and
$\operatorname{var}_\theta s_\theta = -E_\theta(\partial s_\theta/\partial\theta)$. The quantity $-E_\theta(\partial s_\theta/\partial\theta)$
is often called the Fisher information contained in
a single observation and is denoted by I(θ).

For the entire sample, $x = \{x_1, x_2, \cdots, x_n\}$, the
sample score is just the sum of the single observation
scores,

$$S_{\theta n}(x) = \sum_{i=1}^{n} s_\theta(x_i).$$

The Fisher information $I_n(\theta)$ contained in the entire
sample is defined as above with $S_{\theta n}$ replacing
$s_\theta$; it is just the sum of the Fisher information
values for the n single observations. Under the
assumptions, each observation contributes the same
amount to total information (that is, I(θ) is the
same for each observation), so that $I_n(\theta) = n I(\theta)$.
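
For a single observation that is 1 with probability p and 0 otherwise, the score and the Fisher information can be written out and checked exactly. The sketch below is an illustration under that assumed model; the function names are hypothetical. It verifies the two basic facts about the score and the additivity of information over n observations.

```python
from fractions import Fraction


def score(x, p):
    """Derivative with respect to p of ln[p^x (1-p)^(1-x)], for x in {0, 1}."""
    return Fraction(x) / p - Fraction(1 - x) / (1 - p)


def score_derivative(x, p):
    """Derivative of the score with respect to p."""
    return -Fraction(x) / p**2 - Fraction(1 - x) / (1 - p) ** 2


def expectation(g, p):
    """Expected value of g(X) when X is 1 with probability p and 0 otherwise."""
    return p * g(1) + (1 - p) * g(0)


def fisher_information(p):
    """Variance of the score of a single observation."""
    mean = expectation(lambda x: score(x, p), p)
    return expectation(lambda x: (score(x, p) - mean) ** 2, p)


def sample_information(n, p):
    """Information in n independent observations: n times the single value."""
    return n * fisher_information(p)
```

Under this model the information works out to 1/[p(1 − p)], so observations are most informative about p when p is near 0 or 1.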

Except for sign, Iₙ(θ) is the expected curvature of the
logarithm of the likelihood function near the true value of θ. Roughly
speaking, sharp curvature of the likelihood function
corresponds to sharper estimation, or lower
variance of estimation. The information inequality
says that, for sequences of estimators {tₙ} such that
√n (tₙ − θ) converges in distribution to a distribution
with mean zero and variance σ²,

$$\sigma^2 \ge \frac{1}{I(\theta)}.$$

Nonasymptotic variants of this inequality have
been explored by Darmois, Dugué, Cramér, Rao,
and others. The basic variant, for an unbiased estimator
tₙ, based on a sample of size n, is

$$\operatorname{var}_\theta t_n \ge \frac{1}{I_n(\theta)}.$$

(This is usually called the Cramér-Rao inequality.)
Under the tacit regularity conditions, this inequality
becomes an equality just when

$$t_n = \theta + \frac{S_{\theta n}}{I_n(\theta)}.$$

This can happen only if the right side is not a
function of θ, and this in turn occurs (under regularity)
when and only when the distributions given
by $\phi_\theta$ form an exponential family [see DISTRIBUTIONS,
STATISTICAL, article on SPECIAL CONTINUOUS
DISTRIBUTIONS].

The maximum likelihood estimator of θ based
on x = (x₁, ···, xₙ), say $\hat\theta_n$, is (under regularity)
the solution of the likelihood equation $S_{\theta n}(x) = 0$.
Under these circumstances, $S_{\theta n}/\sqrt{I_n(\theta)}$ and
$\sqrt{I_n(\theta)}\,[\hat\theta_n - \theta]$ are both asymptotically normal
with zero mean and variance unity. Further, the
difference between these two quantities converges
to zero in probability as n increases.
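
The case of equality can be checked exactly for the sample proportion x/n in a binomial experiment, a member of an exponential family. In the sketch below, the function names and the values n = 12 and p = 1/4 are assumptions made for the illustration. It computes the variance of x/n directly from the binomial probabilities, compares it with the reciprocal of the sample information, and confirms that p plus the ratio of sample score to sample information equals x/n for every outcome, so that the right side does not depend on p.

```python
from fractions import Fraction
from math import comb


def variance_of_proportion(n, p):
    """Exact variance of x/n when x is binomial with parameters n and p."""
    weights = [comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)]
    mean = sum(w * Fraction(x, n) for x, w in enumerate(weights))
    return sum(w * (Fraction(x, n) - mean) ** 2 for x, w in enumerate(weights))


def sample_information(n, p):
    """Fisher information about p in n independent 0-1 observations."""
    return n / (p * (1 - p))


def sample_score(x, n, p):
    """Sum of the n single observation scores when x of them equal 1."""
    return Fraction(x) / p - Fraction(n - x) / (1 - p)
```

Here p should be supplied as a `Fraction` strictly between 0 and 1 so that the arithmetic is exact.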

Thus the maximum likelihood estimator is
asymptotically efficient, in the sense that its
asymptotic variance is as low as possible, for it
attains the bound in the asymptotic information inequality. In
general there exist other (sequences of) estimators
that also attain this bound; these are
called regular best asymptotically normal (RBAN)
estimators. The RBAN estimators are those that 
are indistinguishable from the maximum likelihood 
estimator in terms of asymptotic distribution, as it 
is traditionally construed. Often some RBAN esti- 
mator distinct from the maximum likelihood es- 
timator is easier to compute and work with. 

The word “regular,” used above, refers in part to 
regularity conditions on the estimators themselves, 
considered as functions of the sample. Without 
that restriction, somewhat strange superefficient
estimators can be constructed. 

The concept of asymptotic treatment has been 
extended recently in other directions than those 
summarized above, in particular by the work of 
R. R. Bahadur and C. R. Rao. 


A more general approach to estimation 


So far the discussion has been based largely on 
comparing estimators through their mean squared 
errors. The mean absolute error could, of course, 
have been used. More generally, suppose that
W(θ,d) is the loss incurred when the numerical
estimate d is used as if it were the value g(θ). Here
θ is the unknown parameter of the probability distribution
underlying the outcome x of an experiment,
and g(θ) is to be estimated. If f is an
estimator, x has been observed, and f(x) is used
as if it were the value of g(θ), then the loss
incurred is W[θ,f(x)]. The mean loss, $E_\theta W(\theta,f)$,
denoted by r(θ,f), a function of both θ and f, is of
interest. The function r is called the risk function.
Now such terms as better, admissible, minimax,
Bayes, and so forth could be defined using the risk
function, r, rather than mean squared error. For
example, f is better than f* (relative to the loss W)
if r(θ,f) ≤ r(θ,f*) for all θ with strict inequality for
some θ [see DECISION THEORY].

In the earlier discussion, W was taken to be 
W(θ,d) = [d − g(θ)]² and r was therefore mean
squared error. 
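
For squared error loss in a binomial experiment, the risk function can be computed exactly by summing the loss over the possible outcomes. The following sketch is an illustration under assumptions made here: the function names are hypothetical, and the two estimators compared are the sample proportion x/n and the estimator (x + 1)/(n + 2).

```python
from fractions import Fraction
from math import comb


def proportion(x, n):
    return Fraction(x, n)


def add_one_rule(x, n):
    return Fraction(x + 1, n + 2)


def risk(estimator, n, p):
    """Mean squared error of estimator(x, n) as an estimate of p,
    where x is binomial with parameters n and p (p given as a Fraction)."""
    return sum(comb(n, x) * p**x * (1 - p) ** (n - x) * (estimator(x, n) - p) ** 2
               for x in range(n + 1))
```

With n = 10, the second estimator has the smaller risk at p = 1/2 and the larger risk at p = 0, so by the definition above neither of the two is better than the other.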

In the more general multiparameter context 
mentioned earlier, θ is a vector of more than one
ordinary (scalar) parameter, and so may be g(θ),
the quantity to be estimated. For example, in example
3, θ = (μ,σ²), g(θ) could be θ, and W(θ,d)
could be (d₁ − μ)² + (d₂ − σ²)², where d = (d₁,d₂)
is an ordered pair of real numbers. Or consider the
following example in which an infinite number of 
quantities are simultaneously estimated. 

Example 5. Let x₁, x₂, ···, xₙ be observations
on n independent random variables each having
the same distribution function F, where F may be
any distribution function on the real line. The
problem is to estimate the whole function F, that
is, to estimate F(a) for each real number a. Here
θ = F, g(θ) = F, d may be any distribution function,
and W(θ,d) may be given, for example, by
sup |d(a) − F(a)|, where the supremum (least
upper bound) is taken over all real a. A quite satisfactory
estimator, the sample distribution function,
exists here. For large n, its risk function is near 0.
For x = (x₁, x₂, ···, xₙ), the value of the sample
distribution function is that distribution that places
probability 1/n on each of x₁, x₂, ···, xₙ if these
values are distinct, with the obvious differential
weighting otherwise.
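
A short sketch of the sample distribution function and of the loss sup |d(a) − F(a)| follows. The function names are hypothetical, and the computation of the supremum assumes that F is continuous, in which case the largest discrepancy occurs at one of the observed values, just before or just after the jump there.

```python
from bisect import bisect_right


def sample_distribution_function(sample):
    """Return the function a -> proportion of observations not exceeding a."""
    ordered = sorted(sample)
    n = len(ordered)

    def f_hat(a):
        return bisect_right(ordered, a) / n

    return f_hat


def sup_distance(sample, F):
    """Supremum over real a of |F_hat(a) - F(a)|, for continuous F."""
    ordered = sorted(sample)
    n = len(ordered)
    return max(max(abs((i + 1) / n - F(v)), abs(i / n - F(v)))
               for i, v in enumerate(ordered))


def uniform_cdf(a):
    """Distribution function of the uniform distribution on (0, 1)."""
    return min(max(a, 0.0), 1.0)
```

For the hypothetical sample (.1, .4, .7) and the uniform distribution function, the largest discrepancy is about .3 and occurs at .7.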

One difficulty with the more general approach to 
estimation outlined here is that the loss function W 
is often hard to define realistically, that is, in such 
a way that W(θ,d) approximates the actual loss
incurred when d is used as if it were the value of
g(θ). Fortunately, an estimator that is good relative
to one loss function, say squared error, is often 
good relative to a wide class of loss functions. 

Perhaps the key concept in estimation theory is 
better. Once it has been decided what “this estimator 
is better than that one” should mean, a large part 
of the theory follows naturally. Many definitions 
of better are possible. Several others besides the 
one mentioned here appear in the literature, but 
none has been so deeply investigated. 


History 


The theory of point estimation has a long history 
and a huge literature. The Bernoullis, Moivre, 
Bayes, Laplace, and Gauss contributed many im- 
portant ideas and techniques to the subject during 
the eighteenth century and the early part of the 
nineteenth century. Karl Pearson stressed the meth- 
od of moments and the importance of computing 
approximate variances of estimators. During the 
early twentieth century, no one pursued the subject 
with more vigor than R. A. Fisher. His contribu- 
tions include the development of the maximum 
likelihood principle and the introduction of the im- 
portant notion of sufficiency. Neyman’s systematic 
study of interval estimation appeared in 1937. 
Although the possibility of a loss function approach 
to statistical problems had been mentioned by Ney- 
man and E. S. Pearson in 1933, its extensive devel- 
opment was not initiated until the work of Abraham 
Wald in 1939 [see the biographies of BAYES; BERNOULLI
FAMILY; FISHER, R. A.; GAUSS; LAPLACE;
MOIVRE; PEARSON; WALD].


New and nonstandard estimation problems re- 
quiring new and nonstandard techniques of solu- 
tion will no doubt continue to arise. Remarkable 
solutions to two such problems have recently been 
proposed under the general name of stochastic 
approximation [see SEQUENTIAL ANALYSIS]. 

Ideally, scientific constructs should possess not 
only great explanatory power but simplicity as 
well. The search for both will, no doubt, encourage 
more and more mathematical model building in 
the social sciences. Moreover, it is quite likely that 
these models will have to become more and more 
probabilistic if they are to achieve these aims. As 
a consequence, the statistical problems involved, 
checking the goodness of fit of the model, esti- 
mating the unknown parameters, and so forth, will 
have to be handled with ever-increasing care and 
knowledge. 

D. L. BURKHOLDER 


[See also STATISTICS, DESCRIPTIVE.]
II
CONFIDENCE INTERVALS AND REGIONS 


Confidence interval procedures—more generally, 
confidence region procedures—form an important 
class of statistical methods. In these methods, the 
outcome of the statistical analysis is a subset of 
the set of possible values of unknown parameters.
Confidence procedures are related to other kinds 
of standard statistical methods, in particular to 
point estimation and to hypothesis testing. In this 
article such relationships will be described and 
contrasts will be drawn between confidence methods
and superficially similar methods of other
kinds, for example, Bayesian estimation intervals 
[see BAYESIAN INFERENCE; ESTIMATION, article on 
POINT ESTIMATION; HYPOTHESIS TESTING]. 

As an example of this sort of procedure, sup- 
pose the proportion of voters favoring a candidate 
is to be estimated on the basis of a sample. The 
simplest possible answer is to give a single figure, 
say 47 per cent; this is the type of procedure called 
point estimation. Since this estimate of the propor- 
tion is derived from a sample, it will usually be 
different from the true proportion. How far off the 
true value is this estimate likely to be? This question
can be answered by supplementing the estimate
with error bounds, say ±.5 per cent. Thus,
one might say that the true proportion lies between 
46.5 per cent and 47.5 per cent. This statement 
might be false. One task of the statistician is to 
develop a procedure for the computation of such 
intervals, a procedure that guarantees that the 
statements are true in, say, 99 per cent of all applications
of this procedure. Such procedures are
called confidence procedures. 

Estimation by confidence intervals. It is per- 
haps easiest to begin with a simple example from 
normal sampling theory. 

Example 1. Let X₁, ···, Xₙ be a random sample
of size n from a normal distribution with unknown
mean μ and known variance σ². Then the sample
mean X̄ = Σᵢ Xᵢ/n is a reasonable point estimator
of μ. Hence ±2.58σ/√n are reasonable error
bounds in the following sense: the estimator X̄
lies between μ − 2.58σ/√n and μ + 2.58σ/√n
with probability .99. In other words, the interval
(μ − 2.58σ/√n, μ + 2.58σ/√n) contains the estimator
X̄ with probability .99; that is, whatever the
value of μ really is,

$$P_\mu\{\mu - 2.58\sigma/\sqrt{n} < \bar{X} < \mu + 2.58\sigma/\sqrt{n}\} = .99.$$

This probability statement follows directly from the
facts that (X̄ − μ)/(σ/√n) has a unit normal
distribution and that a unit normal random variable
lies in the interval (−2.58, +2.58) with probability
.99.

This statement can be given a slightly different
but equivalent form: the interval (X̄ − 2.58σ/√n,
X̄ + 2.58σ/√n) covers μ with probability .99, or
whatever μ really is,

$$P_\mu\{\bar{X} - 2.58\sigma/\sqrt{n} < \mu < \bar{X} + 2.58\sigma/\sqrt{n}\} = .99.$$

The interval (X̄ − 2.58σ/√n, X̄ + 2.58σ/√n) is
called a confidence interval for μ with confidence
coefficient (or confidence level) .99. The confidence
interval is a random interval containing the
true value with probability .99. Note that it would
be incorrect to say, after computing the confidence
interval for a particular sample, that μ will fall in
this interval with probability .99; for μ is an unknown
constant rather than a random variable. It
is the confidence interval itself that is subject to
random variations.
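
The interval of example 1 is easy to compute. The sketch below is one possible rendering, with a hypothetical function name and made-up data; it obtains the unit normal point from Python's `statistics.NormalDist` rather than using the rounded constant 2.58, so its endpoints differ slightly from those given by the formula in the text.

```python
from math import sqrt
from statistics import NormalDist, fmean


def normal_mean_interval(sample, sigma, level=0.99):
    """Confidence interval for the mean of a normal distribution whose
    standard deviation sigma is known."""
    z = NormalDist().inv_cdf((1 + level) / 2)   # about 2.576 for level .99
    half_width = z * sigma / sqrt(len(sample))
    center = fmean(sample)
    return center - half_width, center + half_width


# Hypothetical sample of size 4 from a population with sigma = 2.
lower, upper = normal_mean_interval([4.0, 5.0, 6.0, 5.0], sigma=2.0)
```

The half-width depends only on σ, n, and the confidence coefficient; the data enter only through the sample mean at the center.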

Generally speaking, there is an unknown parameter,
say θ, to be estimated and an estimator
f(X) depending on the sample X = (X₁, ···, Xₙ).
In example 1, θ is called μ and f(X) is X̄. As this
estimator f is based on a random sample, it is itself
subject to random variations. If f is a good estimator,
its probability distribution will be concentrated
closely around the true value, θ. From this
probability distribution of f, one can often derive
an interval, with lower bound $\underline{c}(\theta)$ and upper
bound $\bar{c}(\theta)$, containing the estimator f(X) with
high probability β (for example, β = .99). That is,
whatever the actual value of θ,

$$P_\theta\{\underline{c}(\theta) < f(X) < \bar{c}(\theta)\} = \beta. \tag{1}$$
Often these inequalities can be inverted, that is,
two functions $\underline{\theta}(X)$ and $\bar{\theta}(X)$ can be specified such
that $\underline{\theta}(X) < \theta < \bar{\theta}(X)$ if and only if $\underline{c}(\theta) < f(X) < \bar{c}(\theta)$.
Then, whatever θ really is,

$$P_\theta\{\underline{\theta}(X) < \theta < \bar{\theta}(X)\} = \beta. \tag{2}$$

This means that the interval $(\underline{\theta}(X), \bar{\theta}(X))$ contains
the true value θ with probability β. Quantities like
$\underline{\theta}(X)$ and $\bar{\theta}(X)$ are often called confidence limits.
In example 1, the bounds $\underline{c}(\theta)$, $\bar{c}(\theta)$ and $\underline{\theta}(X)$, $\bar{\theta}(X)$
are given by μ ± 2.58σ/√n and X̄ ± 2.58σ/√n, respectively.

It is also possible to develop the concept of a confidence
region procedure in general, without reference
to point estimation. Denote by $P_\theta$ the assumed
probability distribution depending on a parameter θ
(which may actually be a vector of several univariate,
that is, real valued, parameters). Let Θ be the set
of all possible parameter values θ. By a confidence
procedure is meant a rule for assigning to each sample
X a subset of the parameter space, say Θ(X). If
Θ(X) contains the true value θ with probability β,
regardless of the true value of θ (that is, if for all
θ ∈ Θ, $P_\theta\{\theta \in \Theta(X)\} = \beta$), then Θ(X) is called a
confidence region for θ. The probability β that the
true parameter value is covered by Θ(X) is called
the confidence coefficient.

In example 1, the interval (X̄ − 2.58σ/√n,
X̄ + 2.58σ/√n) is the confidence region for the
sample X = (X₁, ···, Xₙ) with confidence coefficient
.99.

The probability specified by the confidence coefficient
has the following frequency interpretation:
If a large number of confidence regions are
computed on different, independent occasions,
each with a confidence coefficient β, then, in the
long run, a proportion β of these confidence regions
will contain the true parameter value. There is
some danger of misinterpretation. This occurs if
θ itself is erroneously considered as a random
variable and the confidence statement is given the
following form: the probability is β that θ falls
into the computed confidence set Θ(X). It should
be clear that Θ(X) is the random quantity and
not θ.
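
The frequency interpretation lends itself to a simulation. In the sketch below, the parameter μ is held fixed while fresh samples, and therefore fresh intervals, are generated repeatedly; the function name, the seed, and all numerical settings are assumptions made for this illustration. The returned value is the proportion of intervals that contain μ.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean


def coverage_frequency(mu, sigma, n, level, repetitions, seed=0):
    """Proportion of repeated confidence intervals that contain the fixed mu."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf((1 + level) / 2)
    half_width = z * sigma / sqrt(n)
    hits = 0
    for _ in range(repetitions):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        center = fmean(sample)   # the interval moves from sample to sample
        if center - half_width < mu < center + half_width:
            hits += 1
    return hits / repetitions


frequency = coverage_frequency(mu=3.0, sigma=2.0, n=25, level=0.99,
                               repetitions=2000)
```

With a confidence coefficient of .99 the observed proportion should be close to .99, although it will vary somewhat with the seed and the number of repetitions.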

In the simplest applications, θ is a real parameter
and the confidence region Θ(X) is either a
proper interval $(\underline{\theta}(X), \bar{\theta}(X))$ or a semi-infinite
interval, $(-\infty, \bar{\theta}(X))$ or $(\underline{\theta}(X), +\infty)$. If for all θ,
$P_\theta\{\theta < \bar{\theta}(X)\} = \beta$, then $\bar{\theta}(X)$ is called an upper
confidence bound for θ with confidence coefficient
β. Similarly, $\underline{\theta}(X)$ is a lower confidence bound.

Let $\underline{\theta}(X)$ and $\bar{\theta}(X)$ be lower and upper confidence
bounds with confidence coefficients β₁ and β₂,
and suppose that $\underline{\theta}(X) < \bar{\theta}(X)$ for all samples X.
Then the interval $(\underline{\theta}(X), \bar{\theta}(X))$ is a confidence
interval with confidence coefficient β₁ + β₂ − 1. If
β₁ = β₂, that is, if $P_\theta\{\underline{\theta}(X) < \theta\} = P_\theta\{\theta < \bar{\theta}(X)\}$,
the confidence interval $(\underline{\theta}(X), \bar{\theta}(X))$ is called
central.

Example 2. As in example 1, let X = (X₁, ···,
Xₙ) be a sample of n independent normally distributed
random variables with unknown mean μ and
known variance σ². Then $\bar{\theta}(X)$ = X̄ + 2.33σ/√n
is an upper confidence bound for μ at confidence
level .99. Thus (−∞, X̄ + 2.33σ/√n) is a semi-infinite
confidence interval for μ with confidence
coefficient .99, as is (X̄ − 2.33σ/√n, +∞). Hence
(X̄ − 2.33σ/√n, X̄ + 2.33σ/√n) is a central confidence
interval for μ with confidence coefficient
.98 = .99 + .99 − 1. This central confidence interval
differs from that in example 1 in that the latter
has confidence coefficient .99 and is correspondingly
wider.
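
The arithmetic of example 2 can be sketched as follows. The function names and data are hypothetical, and the one-sided unit normal point is taken from `statistics.NormalDist` (about 2.326) instead of the rounded 2.33 used in the text.

```python
from math import sqrt
from statistics import NormalDist, fmean


def one_sided_bounds(sample, sigma, level=0.99):
    """Lower and upper confidence bounds for a normal mean with known sigma,
    each holding separately with the stated confidence coefficient."""
    z = NormalDist().inv_cdf(level)
    margin = z * sigma / sqrt(len(sample))
    center = fmean(sample)
    return center - margin, center + margin


def combined_level(beta_lower, beta_upper):
    """Confidence coefficient of the interval formed from two one-sided bounds."""
    return beta_lower + beta_upper - 1


lower, upper = one_sided_bounds([4.0, 5.0, 6.0, 5.0], sigma=2.0)
level = combined_level(0.99, 0.99)
```

Taken together, the two bounds give a central interval whose confidence coefficient is .98, as in the example.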

Example 3. Let X = (X₁, ···, Xₙ) be a random
sample from a normal distribution with known
mean μ = 0 and unknown variance σ². In this
case S₀² = Σᵢ Xᵢ²/n is a reasonable estimator of σ².
(A subscript is used in “S₀²” because “S²” will later
denote a more common, related, but different quantity.)
Suppose n = 10. Then the central confidence
interval for σ² with confidence coefficient .98 is
given by (10S₀²/23.21, 10S₀²/2.56). The constants
23.21 and 2.56 are readily obtained from a table
of quantiles for the chi-square distribution, for
nS₀²/σ² has a chi-square distribution with 10 degrees
of freedom. This example shows that the endpoints
of a confidence interval are generally not
symmetric around the usual point estimator.
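
The two constants in example 3 can be checked without a table, because the chi-square distribution function has a closed form when the number of degrees of freedom is even. The sketch below uses that closed form; the function names are hypothetical, and the interval function simply applies the constants quoted in the text, so it is valid only for n = 10 and a known mean of 0.

```python
from math import exp


def chi_square_cdf_even(x, df):
    """Chi-square distribution function at x >= 0 for even degrees of freedom."""
    if df <= 0 or df % 2:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2
    term, total = 1.0, 1.0
    for k in range(1, df // 2):
        term *= half / k          # (x/2)^k / k!
        total += term
    return 1 - exp(-half) * total


def variance_interval(sample, lower_point=2.56, upper_point=23.21):
    """Central .98 confidence interval for the variance, mean known to be 0."""
    if len(sample) != 10:
        raise ValueError("the default constants are for 10 degrees of freedom")
    total = sum(v * v for v in sample)    # n times the estimator of the variance
    return total / upper_point, total / lower_point
```

Evaluating the distribution function at 23.21 and at 2.56 with 10 degrees of freedom should give values near .99 and .01, which is what makes the resulting interval central with confidence coefficient .98.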

Relation to point estimation. The computation 
of confidence intervals is often referred to as inter- 
val estimation, in contrast to point estimation. As 
outlined above, in many practical cases, interval 
estimation renders information about the accuracy 
of point estimates. The general definition of con- 
fidence intervals is, however, independent of the 
problem of point estimation. 

In many cases, a particular point estimator is 
related to the set of central confidence intervals. 
One forms the estimator for a given sample by 
thinking of the progressively narrowing intervals 
as the confidence level decreases toward zero. Ex- 
cept in pathological cases, the interval will squeeze 
down to a point, whose numerical value furnishes 
the estimator. Such an estimator is, for continuous
distributions, median unbiased; that is, it is equally 
likely to be above and below the parameter under 
estimation. 

Relation to hypothesis testing. The theory of
confidence intervals is closely related in a formal
way to the theory of hypothesis testing [see HYPOTHESIS
TESTING].

Example 4. In example 1, the confidence interval
for μ with confidence coefficient .99 was given
by X̄ − 2.58σ/√n < μ < X̄ + 2.58σ/√n. To test
the hypothesis μ = μ₀ against the alternative μ ≠ μ₀
at significance level .01, accept the hypothesis if

$$\mu_0 - 2.58\sigma/\sqrt{n} < \bar{X} < \mu_0 + 2.58\sigma/\sqrt{n}; \tag{3}$$

reject it otherwise. This is the customary two-sided
test.

Observe that, given X̄, the confidence interval
consists of all those values μ₀ for which the hypothesis
μ = μ₀ would be accepted. In other words,
the confidence interval consists of all μ₀ whose
acceptance region contains the given X̄.

On the other hand, given the confidence interval
with confidence coefficient .99, it is easy to perform
a test of a hypothesis μ = μ₀: Accept the
hypothesis if the hypothetical value μ₀ belongs to
the confidence interval; otherwise reject the hypothesis.
Proceeding in this way, the pattern is
precisely that of testing the hypothesis μ = μ₀, since
μ₀ belongs to the confidence interval if and only if
(3) is fulfilled, that is, if the hypothesis μ = μ₀
would be accepted according to the test procedure.
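
The equivalence just described can be written as two small functions that should always agree. This is a sketch with hypothetical names and data; the constant 2.58 is the rounded unit normal point used in the text.

```python
from math import sqrt
from statistics import fmean

Z_99 = 2.58  # rounded two-sided .99 unit normal point, as in the text


def accepts(mu0, sample, sigma):
    """Two-sided test at level .01: is the sample mean inside the
    acceptance region belonging to the hypothetical value mu0?"""
    half = Z_99 * sigma / sqrt(len(sample))
    return mu0 - half < fmean(sample) < mu0 + half


def in_interval(mu0, sample, sigma):
    """Does mu0 belong to the .99 confidence interval computed from the sample?"""
    half = Z_99 * sigma / sqrt(len(sample))
    center = fmean(sample)
    return center - half < mu0 < center + half


sample = [4.0, 5.0, 6.0, 5.0]                     # hypothetical data
candidates = [k / 4 for k in range(0, 41)]        # values of mu0 from 0 to 10
agreement = all(accepts(m, sample, 2.0) == in_interval(m, sample, 2.0)
                for m in candidates)
```

Scanning a range of hypothetical values in this way traces out the confidence interval as the set of values that the test would accept.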

This duality is illustrated generally in Figure 1.
The figure is directly meaningful when there is a
single (real) parameter θ and when the sample
can be reduced to a single (real) random variable.
The latter reduction can frequently be accomplished
via a sufficient statistic [see SUFFICIENCY]. When
the problem is more complex, the figure is still of
schematic use.

The figure shows that for each value of θ there
is an acceptance region, A(θ), illustrated as an
interval. The two curves determine the lower and
upper bounds of this interval respectively. The set
of all those θ for which A(θ) contains a given X,
Θ(X), is the interval on the vertical through X between
the two curves.

Figure 1. A confidence region for the parameter θ

If the graphic representation is considered in
a horizontal way (in terms of the X axis), the
lower curve represents the lower confidence bound
$\underline{\theta}(X)$ as a function of X, and similarly the upper
curve represents the upper confidence bound $\bar{\theta}(X)$.
If it is considered from the left (in terms of the
θ axis), the functions $\underline{\theta}(X)$ and $\bar{\theta}(X)$ depending
on X are inverted into the functions $\bar{c}(\theta)$ and $\underline{c}(\theta)$
respectively, depending on θ. (For this reason the
letters are turned.)

The general duality between the testing of simple
hypotheses and confidence procedures may be described
as follows: Let Θ be the set of unknown
parameter values and assume that to each sample
X a confidence set Θ(X) is assigned, such that
$P_\theta\{\theta \in \Theta(X)\} = \beta$ for all θ ∈ Θ. On the basis of such
a confidence procedure, a test for any hypothesis
θ = θ₀ can easily be defined as follows: Let A(θ)
be the set of all X such that θ ∈ Θ(X). Then the
events X ∈ A(θ) and θ ∈ Θ(X) are equivalent,
whence $P_\theta\{X \in A(\theta)\} = P_\theta\{\theta \in \Theta(X)\} = \beta$. Therefore,
if A(θ₀) is taken as the acceptance region for
testing the hypothesis θ = θ₀, a test with acceptance
probability β (or significance level α = 1 − β)
is obtained. On the other hand, given a family of
acceptance regions (that is, for each hypothesis
θ ∈ Θ an acceptance region A(θ) contains the
sample X with probability β when θ is the case),
it is possible to define a confidence procedure by
assigning to the sample X the set Θ(X) of all θ for
which A(θ) contains X (that is, the set of all
parameter values θ for which the hypothesis θ
would be accepted on the evidence X). Then,
again θ ∈ Θ(X) if and only if X ∈ A(θ), whence
$P_\theta\{\theta \in \Theta(X)\} = P_\theta\{X \in A(\theta)\} = \beta$. These remarks
refer only to the case of simple hypotheses. In
practice the more important case of composite hypotheses
arises if several real parameters are present
and the hypothesis consists in specifying the
value of one of these. (This case is dealt with in
“Nuisance parameters,” below.)
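
The construction of a confidence procedure from a family of acceptance regions can be sketched for a binomial experiment. Everything specific in the sketch is an assumption made for illustration: the function names, the rule for choosing each acceptance region (outcomes taken in order of decreasing probability until their total probability reaches β), and the restriction of p to a finite grid. Because the outcome is discrete, an acceptance region generally has probability somewhat greater than β rather than exactly β, so the resulting confidence sets cover the true value with probability at least β.

```python
from math import comb


def pmf(x, n, p):
    """Binomial probability of x successes in n trials."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


def acceptance_region(n, p, beta):
    """Outcomes x, taken in order of decreasing probability under p,
    until their total probability is at least beta."""
    outcomes = sorted(range(n + 1), key=lambda x: pmf(x, n, p), reverse=True)
    region, total = set(), 0.0
    for x in outcomes:
        if total >= beta:
            break
        region.add(x)
        total += pmf(x, n, p)
    return region


def confidence_set(x, n, beta, grid):
    """All grid values of p whose acceptance region contains the observed x."""
    return [p for p in grid if x in acceptance_region(n, p, beta)]


grid = [i / 20 for i in range(21)]          # candidate values of p
observed = confidence_set(3, 10, 0.9, grid)
```

Other rules for choosing the acceptance regions lead to other confidence procedures with the same guarantee, which is the point at which questions of optimality arise.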

Under exceptional circumstances the confidence
set Θ(X) may show an unpleasant property: For
some X, Θ(X) might be empty, or it might be identical
with the whole parameter space, Θ. Those
cases are usually of little practical relevance.

Thus a confidence statement contains much 
more information than the conclusion of a hypoth- 
esis test: The latter tells only whether a specified 
hypothesis is compatible with the evidence or not,
whereas the confidence statement gives compati- 
bility information about all relevant hypotheses. 

Optimality. The duality between confidence 
procedures and families of tests implies a natural 
correspondence between the optimum properties of 
confidence procedures and optimum properties 
of tests. 

A confidence procedure with confidence region
Θ′(X) is called most accurate if Θ′(X) covers any
value different from the true value with lower
probability than any other confidence region Θ(X)
with the same confidence coefficient:

$$P_{\theta_0}\{\theta \in \Theta'(X)\} \le P_{\theta_0}\{\theta \in \Theta(X)\} \quad \text{for any } \theta \ne \theta_0.$$


Another expression occasionally used instead of 
“most accurate” is “most selective.” The term 
“shortest,” originally introduced by Neyman, is now 
unusual because of the danger of confusing short- 
est confidence intervals and confidence intervals 
of minimum length. 

The family of tests corresponding to most accurate
confidence procedures consists of uniformly
most powerful tests: Let $A'(\theta)$ and $A(\theta)$ be the
acceptance regions corresponding to the confidence
regions $\Theta'(X)$ and $\Theta(X)$ respectively; then


$P_{\theta_0}\{X \in A'(\theta)\} = P_{\theta_0}\{\theta \in \Theta'(X)\}$
$\le P_{\theta_0}\{\theta \in \Theta(X)\} = P_{\theta_0}\{X \in A(\theta)\}$
for any $\theta \ne \theta_0$.


Therefore, by using the acceptance region $A'$ the
false hypothesis $\theta$ is accepted with lower probability
than by using $A$.

Uniformly most powerful tests exist only in exceptional
cases. Therefore, the same holds true for
most accurate confidence procedures. If, however,
the class of tests is restricted (to unbiased tests or
invariant tests, for example), the restricted class
often contains a uniformly most powerful test within
that class. Similarly, tests most powerful against
a restricted class of alternatives can often be obtained.
In the case of a real parameter a test for
the hypothesis $\theta_0$ that is most powerful against all
$\theta > \theta_0$ may typically be found. All these restricted
optimum properties of tests lead to corresponding
restricted optimum properties of confidence procedures.

A confidence procedure is called unbiased if the 
confidence region covers no parameter value dif- 
ferent from the true value with probability higher 
than its probability of covering the true value. The 
corresponding property of tests is also called un- 
biasedness. Therefore, families of uniformly most 
powerful unbiased tests lead to most accurate unbiased
confidence procedures, that is, confidence
procedures that are most accurate among the unbiased
confidence procedures: No other unbiased
confidence procedures exist leading to confidence
regions that contain any value different from the
true value with lower probability. The confidence
interval given in example 1 is unbiased and most
accurate among all unbiased confidence procedures
with confidence coefficient .99. On the other hand,
the confidence interval given in example 3 is not
unbiased.

The optimum properties discussed above are re- 
lated to concepts of optimality derived from the 
duality to the testing of hypotheses. A completely 
different concept is that of minimum length. For 
instance, the confidence interval given in example 
1 is of minimum length. In general the length of 
the confidence interval is itself a random variable, 
as in example 3. It is therefore natural to consider 
a confidence procedure as optimal if the expected 
length of the confidence intervals is minimal. This 
concept is appropriate for two-sided confidence intervals.
For one-sided confidence intervals the concept
is not applicable immediately, as in this case
the length is infinite. However, the expected value 
of the boundary value of the one-sided confidence 
interval can be substituted for expected length. 

In general, confidence intervals with minimum 
expected length are different from, for example, 
most accurate unbiased confidence intervals (where 
such intervals exist). Under special circumstances, 
however (including the assumption that the dis- 
tributions of the family have the same shape and 
differ only in location), invariant confidence pro- 
cedures are of minimum expected length. The con- 
fidence procedure given in example 3 is not of 
minimum expected length. 

Two objections that may be raised against the 
use of expected length as a criterion are (1) when 
a confidence interval fails to cover the true param- 
eter value, a short interval is undesirable in that it 
pretends great accuracy when there is none, and 
(2) expected length depends strongly on the mode 
of parameterization; for example, there is no sharp
relation between the expected length of a confidence
interval for $\theta$ and that of the induced
interval for $\theta^2$.

Discrete distributions. In the general consideration
above it was assumed that there exists a confidence
procedure with confidence coefficient $\beta$ in
the sense that, for all $\theta$ in $\Theta$, the probability of
covering the parameter $\theta$ is exactly $\beta$ when $\theta$ is
the true parameter. This means that for each $\theta$
there exists an acceptance region $A(\theta)$ such that
$P_\theta\{A(\theta)\} = \beta$. This is, however, in general true
only for distributions of the continuous type, not
for discrete distributions such as the binomial and
Poisson distributions [see DISTRIBUTIONS, STATISTICAL].
Thus acceptance regions $A(\theta)$ of probability
approximately $\beta$ must be chosen, with the
degree of approximation depending on $\theta$. In practice
the acceptance region is selected such that
$P_\theta\{A(\theta)\}$ approximates $\beta$ as closely as possible,
either with or without the restriction $P_\theta\{A(\theta)\} \ge \beta$.
These acceptance regions $A(\theta)$ define the confidence
regions $\Theta(X)$ with (approximate) confidence
coefficient $\beta$. When the restriction $P_\theta\{A(\theta)\} \ge \beta$ is
made, the term “bounded confidence region” is
often used, and the region is said to have bounded
confidence level $\beta$.

Example 5. Let $X$ be the number of successes
in $n$ independent dichotomous trials with constant
probability $p$ of success. Then $X$ is binomially distributed,
that is, $P_p\{X = k\} = \binom{n}{k} p^k (1-p)^{n-k}$. Choose
the confidence coefficient $\beta = .99$. Choose for each
$p$, $0 < p < 1$, the smallest integer $c(p)$ such that

$\sum_{k=0}^{c(p)} \binom{n}{k} p^k (1-p)^{n-k} \ge 0.99.$

Inverting the bound $c(p)$ one obtains one-sided
confidence intervals of confidence coefficient .99
for $p$.

As an illustration, let $n = 20$ and $p = .3$. Since
$P\{X \le 11\} = .995$ and $P\{X \le 10\} = .983$, the smallest
integer such that $P\{X \le c(p)\} \ge 0.99$ is $c(p) = 11$.
Troublesome computations of $c(p)$ can be
avoided by use of one of the tables or figures pro- 
vided for this purpose. For references see Kendall 
and Stuart ([1943-1946] 1961, p. 118). 
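
The computation in example 5 can be reproduced without tables. The following is a minimal sketch using only the Python standard library; the function names (`binom_cdf`, `smallest_c`, `lower_bound`) and the grid search over $p$ are illustrative assumptions, not part of the original article.

```python
from math import comb

def binom_cdf(c, n, p):
    """P{X <= c} for X binomially distributed with parameters n and p."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(c + 1))

def smallest_c(n, p, beta=0.99):
    """Smallest integer c(p) with P{X <= c(p)} >= beta."""
    for c in range(n + 1):
        if binom_cdf(c, n, p) >= beta:
            return c
    return n  # reached only through rounding error in the last sum

def lower_bound(x, n, beta=0.99, steps=1000):
    """Grid approximation to the smallest p with x <= c(p).

    Because c(p) is nondecreasing in p, the set {p : x <= c(p)} is a
    one-sided interval; this returns its lower end on a grid of width 1/steps.
    """
    for i in range(1, steps):
        p = i / steps
        if x <= smallest_c(n, p, beta):
            return p
    return 1.0

if __name__ == "__main__":
    print(smallest_c(20, 0.3))              # bound c(p) for n = 20, p = .3
    print(round(binom_cdf(11, 20, 0.3), 3)) # P{X <= 11}
    print(lower_bound(11, 20))              # one-sided bound for p when X = 11
```

The grid search is a convenience; a root-finder on $p \mapsto P_p\{X \le x - 1\} - \beta$ would give the same bound with less work.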

Nuisance parameters. In many practical problems,
more than one parameter is involved. Often
the interest is concentrated on one of these parameters,
say $\theta$, while the others are regarded as
nuisance parameters. The aim is to make a confidence
statement about $\theta$ that is true with high
probability regardless of the values of the nuisance
parameters. The corresponding test problem is that
of testing a composite hypothesis that specifies the
value of $\theta$ without making any assertion about the
nuisance parameters. The test is required to have
significance level less than or equal to a prescribed
$\alpha$ regardless of the nuisance parameters. The corresponding
confidence procedure will yield confidence
intervals that cover the true value at least
with probability $1 - \alpha$ regardless of the nuisance
parameters, that is, confidence intervals with
bounded confidence level $1 - \alpha$. A special role is
played by the so-called similar tests, having exactly
significance level $\alpha$ for all values of the nuisance
parameters. They lead to confidence intervals covering
the true value with probability exactly $1 - \alpha$
regardless of the nuisance parameters.

Example 6. Let $X_1, \ldots, X_n$ be a random sample
from a normal distribution with unknown mean
$\mu$ and unknown variance $\sigma^2$. The variance $\sigma^2$ is
to be considered a nuisance parameter. Let $\bar{X} = \sum X_i/n$
and $S^2 = \sum (X_i - \bar{X})^2/(n - 1)$. For $n = 10$
a (similar) confidence interval for $\mu$ with confidence
coefficient .99 is given by
$\bar{X} - 3.25\,S/\sqrt{10} < \mu < \bar{X} + 3.25\,S/\sqrt{10}$.
For general $n$, the confidence
interval with confidence coefficient .99 is given
by $\bar{X} - t_{.005,n-1}\,S/\sqrt{n} < \mu < \bar{X} + t_{.005,n-1}\,S/\sqrt{n}$, where
$t_{.005,n-1}$ is the upper .005 point of the tabled t-distribution
with $n - 1$ degrees of freedom, for
example $t_{.005,9} = 3.25$. Hence the above confidence
procedure corresponds to the usual t-test. For
large $n$, because $t_{.005,n-1}$ is close to 2.58, the confidence
interval given here corresponds
to the confidence interval given in example 1. The
confidence procedure given here is most accurate
among unbiased confidence procedures.
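
As a sketch of how the interval of example 6 is computed in practice, the following Python function takes the sample and a critical value read from a t-table. The function name `t_interval`, the illustrative data, and the choice of passing the critical value as an argument (the standard library has no t quantile function) are assumptions made here for illustration.

```python
from math import sqrt
from statistics import mean, stdev

def t_interval(sample, t_crit):
    """Two-sided confidence interval for the mean of a normal sample.

    t_crit is the upper point of the t-distribution with n - 1 degrees
    of freedom, taken from a table (for confidence coefficient .99 it is
    the upper .005 point).
    """
    n = len(sample)
    xbar = mean(sample)
    s = stdev(sample)          # uses the divisor n - 1, as S in the text
    half = t_crit * s / sqrt(n)
    return xbar - half, xbar + half

if __name__ == "__main__":
    # Hypothetical data with n = 5; the tabled upper .005 point for
    # 4 degrees of freedom is 4.604.
    data = [4.8, 5.1, 5.0, 4.9, 5.2]
    print(t_interval(data, 4.604))
```

The interval is centered at the sample mean, and its length $2\,t\,S/\sqrt{n}$ is itself random because it depends on $S$.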

Example 7. Consider $\mu$ in example 6 as the
nuisance parameter. Define $S^2$ as in example 6
and again take $n = 10$. Then a one-sided confidence
interval for $\sigma^2$ of confidence coefficient .99 is given
by $\sigma^2 < 9S^2/2.09$. In general, the one-sided confidence
interval for $\sigma^2$ with confidence coefficient
.99 is given by $\sigma^2 < (n - 1)S^2/\chi^2_{.01,n-1}$, where $\chi^2_{.01,n-1}$
is the lower .01 point of the chi-square distribution
with $n - 1$ degrees of freedom. Observe that here
the number of degrees of freedom is $n - 1$ while in
example 3 it is $n$.

Confidence coefficient. The expected length of 
the confidence interval depends, of course, on the 
confidence coefficient. If a higher confidence coeffi- 
cient is chosen, that is, if a statement that is true 
with higher probability is desired, this statement 
has to be less precise; the confidence interval has 
to be wider. 

It is difficult to give general rules for the selec- 
tion of confidence coefficients. Traditional values 
are .90, .95, and .99 (corresponding to significance 
levels of .10, .05, and .01, respectively). The con- 
siderations to be made in this connection are the 
same as the considerations for choosing the size of 
a test [see HYPOTHESIS TESTING]. 

Nested confidence procedures. One would ex- 
pect the wider confidence interval (belonging to the 
higher confidence level) to enclose the narrower 
confidence interval (belonging to the lower confi- 
dence level). A confidence procedure with this prop- 
erty is called “nested.” All the usual confidence 
procedures are nested, but this is not a fully general
property of confidence procedures.

Sample size. Given the confidence coefficient, 
the expected length of the confidence interval de- 
pends, of course, on the sample size. Larger sam- 
ples contain more information and therefore lead 
to more precise statements, that is, to narrower 
confidence intervals. 

Given a specific problem, the accuracy that it is
reasonable to require can be determined. In order
to estimate the number of housewives knowing of
the existence of the superactive detergent X, a confidence
interval of ±5 per cent will probably be
sufficiently accurate. If, on the other hand, the aim
is to forecast the outcome of elections and the percentage
of voters favoring a specific party was
48 per cent in the last elections, an accuracy of
±5 per cent would be quite insufficient. In this
case, a confidence interval of length less than ±1
per cent would probably be required.

Given the accuracy necessary for the problem at
hand, the sample size that is necessary to achieve
this accuracy can be determined. In general, however,
the confidence interval (and therefore the
necessary sample size as well) depends on nuisance
parameters. Assume that a confidence interval for
the unknown mean $\mu$ of a normal distribution with
unknown variance $\sigma^2$ is needed. Although in example 6
a confidence interval is given for which no
information about $\sigma^2$ is needed, such information
is needed to compute the expected length of the
confidence interval: The length of the confidence
interval is $2t_{.005,n-1}S/\sqrt{n}$; its expected value for
large $n$ is therefore nearly equal to $2t_{.005,n-1}\sigma/\sqrt{n}$.
Therefore, in order to determine the necessary sample
size $n$, some information about $\sigma^2$ is needed.
Often everyday experience or information obtained
from related studies will be sufficient for this purpose.
If no information whatsoever is at hand, a
relatively small pilot study will yield a sufficiently
accurate estimate for $\sigma^2$. This idea is treated rigorously
in papers on sequential procedures for obtaining
confidence intervals of given length (Stein
1945). In the case of the binomial distribution,
no prior information at all is needed, for $\sigma^2 =
p(1 - p) \le 1/4$, whatever $p$ might be. Using 1/4 instead
of $\sigma^2$ can, however, lead to wastefully large samples
if $p$ is near 0 or 1.
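
The binomial case lends itself to a short calculation. The sketch below assumes the large-sample normal approximation (so that the tabled t point is replaced by the normal point, close to 2.58 for confidence coefficient .99) and solves $z\sqrt{\sigma^2/n} \le d$ for $n$, where $d$ is the desired half-width. The function name `binomial_sample_size` and its parameters are illustrative assumptions.

```python
from math import ceil
from statistics import NormalDist

def binomial_sample_size(half_width, beta=0.99, p=None):
    """Approximate n so that the interval for a proportion has the given half-width.

    Uses the normal approximation. With p=None the worst-case variance
    bound p(1 - p) <= 1/4 is used; otherwise the supplied guess for p.
    """
    z = NormalDist().inv_cdf(1 - (1 - beta) / 2)  # about 2.58 for beta = .99
    variance = 0.25 if p is None else p * (1 - p)
    return ceil(z * z * variance / half_width**2)

if __name__ == "__main__":
    print(binomial_sample_size(0.05))           # accuracy of +/- 5 per cent
    print(binomial_sample_size(0.01))           # accuracy of +/- 1 per cent
    print(binomial_sample_size(0.01, p=0.05))   # much smaller when p is near 0
```

Comparing the last two calls shows how much the worst-case bound of 1/4 can overstate the required sample when $p$ is known to be near 0 or 1.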

Robustness—nonparametric procedures. Any 
statistical procedure starts from a basic model on 
the underlying family of distributions. In exam- 
ple 1, for instance, the basic model is that of a 
number of independent normally distributed random
variables. Since it is never certain how closely
these basic assumptions are fulfilled in practice,
desirable statistical procedures are those that are
only slightly influenced if the assumptions are violated.
Statistical procedures with this property
are called robust [see HYPOTHESIS TESTING]. Another
approach is to abandon, as far as possible,
assumptions about the type of distribution, leading
to nonparametric procedures.

As the duality between families of tests and con- 
fidence procedures holds true in general, robust or 
nonparametric tests lead to robust or nonpara- 
metric confidence procedures, respectively. [Exam- 
ples showing the construction of confidence inter- 
vals for the median of a distribution from the sign 
test and from Wilcoxon's signed rank test are given 
in NONPARAMETRIC STATISTICS.]

Relationship to Bayesian inference. If the pa- 
rameter is not considered as an unknown constant 
but as the realization of a random variable with 
given prior distribution, Bayesian inference can be 
used to obtain estimating intervals containing the 
true parameter with prescribed probability [see 
BAYESIAN INFERENCE]. 

Confidence statements can be made, however, 
without assuming the existence of a prior distribu- 
tion, and hence confidence statements are preferred 
by statisticians who do not like to use “subjective” 
prior distributions for Bayesian inference. A some- 
what different, and perhaps less controversial, ap- 
plication of subjective prior distributions is their 
use to define so-called subjective accuracy. Subjec- 
tively most accurate confidence procedures are 
defined in analogy to the most accurate ones by 
averaging the probability of covering the fixed pa- 
rameter with respect to the subjective prior distri- 
bution. It can be shown that a most accurate con- 
fidence procedure is subjectively most accurate 
under any prior distribution with a positive density 
function (Borges 1962). 

Relation to fiducial inference. Fiducial infer- 
ence was introduced by R. A. Fisher (1930). This 
paper and succeeding publications of Fisher con- 
tain a rule for determining the fiducial distribution 
of the parameter on the basis of the sample X [see 
FIDUCIAL INFERENCE]. 

As in Bayesian inference, this distribution can
be used to compute “fiducial intervals,” giving information
about the parameter $\theta$. The fiducial interval
is connected with a probability statement, which
admits, however, no frequency interpretation (although
some advocates of fiducial methods might
disagree).

For many elementary problems, fiducial intervals
and confidence intervals are identical. But this is
not true in general. One of the attractive properties
of fiducial inference is that it leads to solutions
even in cases where the classical approach failed
until now, as in the case of the Behrens-Fisher
problem.

Many scholars, however, find it difficult to see 
a convincing justification for Fisher’s rule of com- 
puting fiducial distributions and to find an intuitive 
interpretation of probability statements connected 
with fiducial intervals. 

A reasonable interpretation of fiducial distribu- 
tions would be as some sort of posterior distri- 
butions for the unknown parameter. It can be 
shown, however, that fiducial distributions cannot 
be used as posterior distributions in general; a 
Bayesian inference, starting from two independent 
samples and using the fiducial distribution of the 
first sample as prior distribution to compute a 
posterior distribution from the second sample, 
would in general lead to a result different from the 
fiducial distribution obtained from both samples 
taken together. For the comparison of fiducial and 
Bayesian method, see Richter (1954) and Lindley 
(1958). 

Prediction intervals, tolerance intervals. Where- 
as confidence intervals give information about an 
unknown parameter, prediction intervals give in- 
formation about future independent observations. 
Hence prediction intervals are subsets of the sam- 
ple space whereas confidence intervals are subsets 
of the parameter space. 

Example 8. If $X_1, \ldots, X_n$ is a random sample
from a normal distribution with unknown mean $\mu$
and unknown variance $\sigma^2$, the interval given by
$(\bar{X} - t_{\alpha,n-1}\,S\sqrt{(n+1)/n},\ \bar{X} + t_{\alpha,n-1}\,S\sqrt{(n+1)/n})$ is
a prediction interval containing a future independent
observation $X_{n+1}$ with probability $1 - 2\alpha$, if
$t_{\alpha,n-1}$ is the upper $\alpha$ point of the t-distribution with
$n - 1$ degrees of freedom. Note that the probability
of the event

$\bar{X} - t_{\alpha,n-1}\,S\sqrt{(n+1)/n} < X_{n+1} < \bar{X} + t_{\alpha,n-1}\,S\sqrt{(n+1)/n}$

is $1 - 2\alpha$ before the random variables $X_1, \ldots, X_n$
are observed. For further discussion of this example
see Proschan (1953); for discussion of a similar
example, see Mood and Graybill (1950, pp.
220-244, 297-299).

The prediction interval computed in example 8
above must not be interpreted in the sense that it
covers a proportion $1 - 2\alpha$ of the population. In a
special instance, the interval computed according to
this formula might cover more or less than the
proportion $1 - 2\alpha$. Only on the average will the
proportion be $1 - 2\alpha$.
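
The statement that the coverage holds only on the average, over repeated samples, can be checked by simulation. The following sketch draws many samples of size $n$ together with one further observation each and counts how often the prediction interval of example 8 contains that observation. The function names, the seed, the number of trials, and the choice $n = 10$ with the tabled value $t_{.05,9} = 1.833$ are illustrative assumptions.

```python
import random
from math import sqrt

def prediction_interval(sample, t_crit):
    """Interval for one future observation from the same normal population."""
    n = len(sample)
    xbar = sum(sample) / n
    s = sqrt(sum((x - xbar) ** 2 for x in sample) / (n - 1))
    half = t_crit * s * sqrt((n + 1) / n)
    return xbar - half, xbar + half

def coverage(n, t_crit, trials=20_000, mu=0.0, sigma=1.0, seed=0):
    """Fraction of trials in which the interval contains the next observation."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        lo, hi = prediction_interval(sample, t_crit)
        future = rng.gauss(mu, sigma)   # the (n+1)-th observation
        if lo < future < hi:
            hits += 1
    return hits / trials

if __name__ == "__main__":
    # alpha = .05, so the nominal probability is 1 - 2*alpha = .90
    print(coverage(10, 1.833))
```

The simulated frequency should be close to .90, although any single interval may cover a larger or smaller share of the population.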


In many cases, there is a need for intervals covering
a proportion $\gamma$ with high probability, say $\beta$.
This is, however, not possible. In general, it is possible
only to give rules for computing intervals
covering at least a proportion $\gamma$ with high probability
$\beta$. Intervals with this property are called
$\gamma$-proportion tolerance regions with confidence coefficient
$\beta$. In the normal case, one might, for
example, seek a constant $c$, for given $\gamma$ and $\beta$, such
that, whatever the values of $\mu$ and $\sigma$,


$P\left\{\int_{\bar{X}-cS}^{\bar{X}+cS} f(u; \mu, \sigma)\,du \ge \gamma\right\} = \beta,$


where $f(u; \mu, \sigma)$ is the normal density with mean $\mu$
and variance $\sigma^2$.

The constants $c$, leading to a $\gamma$-proportion tolerance
interval $(\bar{X} - cS, \bar{X} + cS)$ with confidence coefficient
$\beta$, cannot be expressed by one of the standard
distributions (as was the case in the example
of the prediction interval dealt with above). Tables
of $c$ can be found in Owen (1962, p. 127 ff.).
For further discussion see Proschan (1953), and
for nonparametric tolerance intervals see Wilks
(1942). [See also NONPARAMETRIC STATISTICS.]

Confidence regions. In multivariate problems, 
confidence procedures yielding intervals are gen- 
eralized to those yielding confidence regions. 

Example 9. Let $X$ and $Y$ be two normally distributed
random variables with unknown means
$\mu$ and $\nu$, known variances 2 and 1, and covariance
$-1$. A confidence region for $(\mu, \nu)$ with confidence
coefficient .99 is given by $(X - \mu)^2 + 2(X - \mu)(Y - \nu)
+ 2(Y - \nu)^2 \le 9.21$. The figure 9.21 is obtained
from a chi-square table, since the quadratic form 
on the left is distributed as chi-square with two 
degrees of freedom. The confidence region is an 
ellipse with center (X, Y). When such a region is 
described in terms, say, of pairs of parallel tangent 
lines, the result may usefully be considered in the 
framework of multiple comparisons. [See LINEAR 
HYPOTHESES, article on MULTIPLE COMPARISONS.] 
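
The region of example 9 can be checked numerically. The inverse of the covariance matrix with variances 2 and 1 and covariance $-1$ has entries 1, 1, 1, 2, which yields the quadratic form in the example, and for two degrees of freedom the chi-square point has the closed form $-2\ln(1 - \beta)$. The sketch below uses these two facts; the function names are illustrative assumptions.

```python
from math import log

def chi2_2df_quantile(beta):
    """Point q with P{chi-square(2 df) <= q} = beta.

    For two degrees of freedom the distribution function is
    1 - exp(-q / 2), so q = -2 * ln(1 - beta).
    """
    return -2.0 * log(1.0 - beta)

def in_region(x, y, mu, nu, beta=0.99):
    """True if (mu, nu) lies in the confidence ellipse centered at (x, y)."""
    dx, dy = x - mu, y - nu
    q = dx * dx + 2 * dx * dy + 2 * dy * dy   # quadratic form of example 9
    return q <= chi2_2df_quantile(beta)

if __name__ == "__main__":
    print(round(chi2_2df_quantile(0.99), 2))   # the tabled figure
    print(in_region(1.0, 0.5, 0.0, 0.0))       # a point inside the ellipse
    print(in_region(4.0, 0.0, 0.0, 0.0))       # a point outside the ellipse
```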
I
ETHICAL SYSTEMS AND SOCIAL STRUCTURES 


In a consideration of the relationship of ethical 
systems to social structure, it is important to show 
how these terms are being used; different mean- 
ings can represent different degrees of abstraction, 
and the kinds of relationship possible will vary 
accordingly. 

“Social structure” has been taken by Radcliffe- 
Brown (1952, p. 11; compare pp. 188-204) to 
mean “an arrangement of persons in institutionally 
controlled or defined relationships”; in this case, 
the term stands for a social organization with 
actual individuals as its constituents. It may be 
taken, as by Evans-Pritchard ([1940] 1963, p. 
262), to exclude relations between persons, but to 
describe such relations between groups as have a 
high degree of constancy and consistency. Or it 
may be taken in a still more abstract sense—as a 
network of relationships between sets of institu- 
tionalized social roles (Firth 1954; Nadel 1957; 
Emmet 1960; 1966). 

The view here adopted is that although observa- 
tion must start from the first of these senses (in- 
teractions between persons) and may proceed 
through the second (regularities in group inter- 
actions), the systematic notion of a “social struc- 
ture” will need to be couched in the abstract 
terminology of relationships between roles. 

The notion of an ethical system is even less 
clearly determined. It may be taken to mean (a) 
the mores of a given society as a sociologist ob- 
serves them; (b) a systematic code of moral prin- 
ciples, such as that of the Roman Catholic church; 
and (c) a philosophical theory about the rationale 
of moral action, such as utilitarianism. 

Ethics and social structure. In considering the
relationship to a social structure, we would be tempted
to say that we need be concerned only with (a).
This, however, would be unsatisfactory, since to 
talk of an ethical system is to imply far more 
than a pattern of observed forms of behavior; rules 
of conduct, as derived from ethical notions, may be 
honored in the breach as well as in the observance. 
In order to discover a people’s ethical system even 
in sense (a), it will therefore be necessary to take 
into account their statements about what is con- 
sidered right and wrong and why, as well as to 
describe conformities in their behavior and the
working of sanctions against deviation. 

For this reason, it might be logically preferable 
to consider an ethical system simply in sense (b), 
as a body of beliefs about right and wrong, although 
these are unlikely in many cases to be as system- 
atic as those connected with a formulated theo- 
logical position, such as that of the Roman Catholic 
church. Sense (b), however, can be related to 
social structure only by showing how the ethical 
beliefs in question affect the ways members of the 
society behave in their social roles. 

Social structure in theory and practice. We also 
need to distinguish here between an idealized view 
of the social structure, seen as a network of roles 
played according to the rules (or, where rules are 
broken, corrected by sanctions) and the social 
structure as a generalized description of typical 
role behavior that may fall short of official ethical 
prescriptions. In the latter case, however, ethical 
prescriptions must be taken into account in seeking 
to understand the behavior, if only to show ways 
in which the prescriptions are being evaded; the 
notion of “role expectations,” often used in speak- 
ing of social structures, can thus be ambiguous. It 
may stand for predictions of how a person is likely 
to behave in a given role. It may also stand for 
“what is expected of him” (normatively) in that 
role; and notoriously people do not always live up 
to these “expectations.” An ethical system as a set 
of norms for action needs to be distinguished, 
therefore, from a descriptive account of the mores 
as customary ways of behavior (cf. Sumner [1906] 
1959, chapter 2). 

Distinctiveness of ethical judgments. It is also 
important to try to distinguish those aspects of the 
mores that should properly be called ethical from 
those more properly called religious, legal, political, 
or matters of etiquette. This is a matter in which 
the anthropologists of the late nineteenth and early 
twentieth centuries, such as Westermarck, were 
interested, but to which less direct attention has 
been paid more recently, perhaps because moral 
values pervade these other aspects of social life 
and are difficult to isolate from them (Edel 1962). 
Some recent work has been done by moral phi- 
losophers on what may be the distinctive criteria 
of moral as distinct from other kinds of judgment 
(Ladd 1957; Brandt 1954; cf. Macbeath 1952). But
the question of whether these criteria are logically 
necessary to anything that can be called an ethical 
system, or whether they are culture bound, is a 
matter calling for cooperative work between phi- 
losophers and anthropologists. Until there is a 
larger body of material for comparative study directed
to such questions as whether primitive
peoples have specifically ethical notions that are 
independent of their religious or political notions, 
the field of study will remain largely speculative. 

Two kinds of ethical relativism. The distinc- 
tion between the logical criteria of what makes a 
system an ethical system and the substantive prin- 
ciples it contains has not always been drawn by 
writers on the cultural conditioning of ethical be- 
liefs. While generally holding that ethical beliefs 
are “relative” to a culture, they do not always dis- 
tinguish the “reductionist” form of ethical relativ- 
ism, which presents the ethical beliefs of a people 
as functionally dependent on their other beliefs and 
practices, and the kind of “content” relativism 
which, while allowing that substantive ethical be- 
liefs and practices may be affected by other factors 
within the society, nevertheless recognizes that 
there may be distinctive moral interests not ex- 
haustively explicable in terms of other interests. 
The difference between these two approaches can 
be summarized by saying that the reductionist 
maintains that the ethical beliefs of a people can 
be exhaustively rendered in terms of their non- 
ethical interests, such as the familial or economic, 
whereas the “content” relativist is prepared to ad- 
mit that the belief that “X is right” can provide a 
bona fide reason for acting accordingly, although 
the content of X may be culturally variable (Em- 
met 1966, chapter 5). 

The latter approach would be concerned with 
seeing how these different interests may affect each 
other in producing a particular “way of life”; an 
instance is Max Weber’s well-known thesis on the 
relation between the Puritan ethic and capitalism 
in the seventeenth century (Weber 1904-1905). 
This need not be taken (as by some Marxists) to 
mean that the ethical ideas of the Puritans were 
simply a superstructure rationalizing their eco- 
nomic behavior. On the contrary, it can mean that 
the kinds of behavior, such as hard work and 
thrifty living, prescribed by their ethical beliefs 
fitted the kinds of behavior needed for successful 
entrepreneurial activity in the early stages of a 
capitalist economy. Thus, a mutual reinforcement 
of two strong human interests—the ethical and 
the economic—would be produced and a way of 
life with survival value established. This type of 
analysis aims at finding functional interrelations 
between ethical and other practices within a so- 
ciety without prejudging the question of whether, 
nevertheless, there may not be distinctively ethical 
motives and interests; for instance, the belief that 
hard work is morally commendable need not only 
be a disguised way of saying hard work is eco- 
nomically profitable, nor is it necessarily caused
by the fact that hard work is profitable. 

Ethical systems—form and content. The ques- 
tion of the distinctive criteria of ethical as distinct 
from other kinds of judgment has not been over- 
looked, however, by all writers on the social rela- 
tivism of morals. Westermarck, in particular, held 
that there was a universal form of ethical judg- 
ments inasmuch as they expressed disinterested 
retributive emotions (1906-1908; 1932). This 
question of the distinction between the general 
logical character and the particular substantive 
content of an ethical system is a point where the 
third meaning of the term that we distinguished 
earlier—philosophical theories about the nature of 
ethical systems—becomes relevant. Edel and Edel 
(1959) have suggested that an ethical system may 
be distinguished by certain broad notions that any 
such system may be supposed to provide for; for 
example, it will contain some kind of sanction, 
reasons justifying some kinds of conduct and not 
others, and, more specifically, some means of con- 
trolling aggression and some notion of distributive 
justice. This may be compared with what Hart 
(1961, p. 189) has called the minimum content of 
natural law. This is not a notion of natural law as 
a universal rational code of ethical principles but 
a listing of certain basic requirements that any 
code must somehow meet if people are to live to- 
gether sufficiently permanently to satisfy the logical 
and empirical requirements of constituting a “so- 
ciety” (see also Levy 1952, chapter 4 on “The Func- 
tional Requisites of Any Society”). Comparative 
work on these requirements, and on what differ- 
ences of emphasis may be given them, would be 
one of the ways in which the study of ethical sys- 
tems and social structures could be brought 
together. 

Some alternatives to functionalism. The struc- 
tural-functionalist approach reflected in the terms 
“ethical system” and “social structure” has some- 
times been interpreted as assuming a more highly 
integrated and normatively controlled unity within 
a society than need in fact obtain. The work of 
Parsons, especially The Social System, 1951, has 
been criticized on these grounds (Lockwood 1956; 
see also Emmet 1958 for a more extended discus- 
sion of the issues). An approach of perhaps more 
immediate empirical applicability has been out- 
lined by Merton (1957, chapters 8 and 9), who uses 
the term “reference group” to denote the group or 
groups from which an individual may take his 
ethical cues. Modern societies in particular may 
contain many persons who, although they are con- 
formists from the point of view of their own refer- 
ence group, are deviants by the values of the larger
society of which their group is a part. Study of 
deviance and conformity in terms of reference 
groups may have the effect of reviving interest in 
the formal means, such as political and legal systems,
of preserving social cohesion within pluralistic
societies. Structural-functional studies, on the
other hand, have been mainly concerned with the 
less formal sanctions of custom and unplanned in- 
stitutional practices [see REFERENCE GROUPS]. 

Students of organizations have also drawn atten- 
tion to the importance of informal as well as formal 
structures. The workings of a large industry cannot
be understood simply by looking at the organization 
chart or by consulting official statements of aims; 
it is also necessary to discover the unofficial net- 
works of communication, interaction, and leader- 
ship. In some cases, elements within these unof- 
ficial structures may have their own ethical systems 
(for example, views on the amount of work that 
ought to be done), and these can frustrate the 
official system unless they are taken into account 
[see ORGANIZATIONS, article on EFFECTIVENESS 
AND PLANNING OF CHANGE]. 

Role, status, and the individual. If we recog- 
nize the looseness of the texture of actual social 
life, in contrast with any simplified model of the 
social structure, we see the individual not only as 
carrying specified role obligations but also as hav- 
ing to meet the demands of a number of different 
and perhaps conflicting roles. A variety of social 
structures can be abstracted from the whole field 
of human relationships: professional, political, 
family, and friendship roles may all be played by 
the same individual and are likely to produce com- 
peting pressures. Barnard (1938) has called atten- 
tion to this in the case of high executives, showing 
how positions of responsibility produce conflicting 
claims that make heavy demands on an individual's 
intellectual and moral resources. It is unlikely that 
any ethical system can be so structured as always 
to show the priorities among these claims, or any 
social structure be so simple as not to produce 
these conflicts. 

In relating ethical systems to social structure, 
therefore, it may be asked whether the former can 
thereby be explained in terms of the latter. A “socio- 
logical explanation,” following Durkheim (1895), 
may here be taken to mean an account of behavior 
not in terms of historical or psychological causa- 
tion, but in terms of the ways groups are related 
to one another within the society. Role behavior in 
social groups is defined partly with reference to 
ethical norms of expected conduct (cf. Durkheim 
1893 on how this is so even in the economic field); 
we may therefore say that the ethical system of
defining role obligations can be considered as an 
aspect of the social structure, insofar as ethical 
notions enter into the ways roles are seen and per- 
formed. Here a mutual conditioning between 
ethical beliefs and social arrangements, as we have 
said, seems more plausible than a one-way causa- 
tion of the one by the other. 

Role performance and social change. It may, 
of course, be asked whether an individual may not 
be conditioned by the training he receives through 
the institutions of his society in order to see his 
role obligations in only one particular way. How- 
ever, individuals can have their own styles in role 
performance; they may deviate in various ways; 
they will have to decide between conflicting role 
obligations; and in some cases they may create a 
new role for themselves. There may thus be much 
individual behavior that will not enter into the 
description of a social structure except insofar as 
it may produce innovations that alter the image of 
an existing role or create a new one. Indeed, it may 
be said that individual innovation becomes socio- 
logically important only when it modifies role be- 
havior to such an extent that social structure is 
affected (Emmet 1966, chapters 7 and 8). Indi- 
vidual conduct, therefore, is not being considered 
as such and is more properly left to psychologists 
and philosophers. 


Nevertheless, the study of social structure can 
show how certain kinds of behavior will be ex- 
pected and certain possibilities will be foreclosed 
because of features in the social situation; and to 
study the nature of ethical systems in relation to 
the social structures in which they are embedded 
may help us to understand why certain actions are 
thought of as right or wrong in particular societies. 
These two kinds of understanding can thus fructify 
one another without being thought of as mutually 
reducible. 

DOROTHY EMMET 


[See also ROLE; SOCIAL STRUCTURE; STATUS, SOCIAL; 
UTILITARIANISM; and the biographies of BARNARD; 
DURKHEIM; WESTERMARCK.] 
II
ETHICAL ISSUES IN THE SOCIAL SCIENCES 


Ethics is concerned with standards of conduct 
among people in social groups; for this reason, re- 
search in social science is inextricably bound up 
in ethical problems. The initial choice of a problem 
for investigation by the social scientist is often
value-laden. The process of inquiry in the social 
sciences, engaging as it frequently does the lives 
of people, must meet moral as well as scientific 
standards. And the product of inquiry constantly 
adds new data and new theories requiring the re- 
vision of established ethical systems. Ethics and 
social science thus move in contrapuntal relation- 
ship, each adding to the character of the other 
(Shils 1959). 

Old issues and new. There are a number of 
principles of ethics in social science research that 
are so widely recognized and honored that they do 
not need detailed discussion. Among these are 
maintaining highest standards of work, reporting 
procedures and results faithfully, protecting in- 
formation given in confidence, giving appropriate 
credit to co-workers, making appropriate acknowl- 
edgment of other writers’ materials, representing 
accurately one’s own qualifications, and acknowl- 
edging, when appropriate, sources of financial sup- 
port. The central issue in all of these is integrity, 
as indeed it is in every step of a true research 
endeavor. For this reason some social scientists
have objected to proposals to define ethical standards
for research, arguing that the canons of science
are an exacting and sufficient guide to conduct.
However, new problems arise as scientists
move into new areas under new auspices; old prob- 
lems appear in new contexts and require new 
solutions. Ethical standards must be redefined con- 
tinually to keep them relevant to contemporary 
situations. Below are several issues that are sub- 
jects of concern and of lively debate as this article 
is written. If these issues are soon dated and no 
longer lively, it is probably a healthy sign that con- 
sensus is being reached on them and that new 
issues are capturing concern. 

Deception in social science research. In many 
experiments or inquiries in the social sciences, it 
is necessary, or has been widely considered neces- 
sary, to disguise the nature of the task assigned 
to the subject. The procedure arises usually from 
the need to control the “set” or “expectancy” with 
which the subject approaches the task, since set 
is known to be an important determinant of re- 
sponses. While in most instances the consequences 
are trivial, in some instances they may not be 
trivial at all. In all instances the issue is raised, 
Is deception ever justified? 

Clearly, scientists think that deception is some- 
times required to achieve a good that would not 
otherwise be achievable. For example, it is common 
practice in medical research to administer a placebo
to a control group in order to assess the effects
of a drug. No harm is done; the control subjects 
might still be given the drug if it proves effective. 
But the outcome of deception is not always be- 
nign. In one of the classical experiments on deceit, 
the investigators tempted children to steal and de- 
ceived them into believing that their action could 
not be detected. Some children did indeed steal. 
The investigators concluded that honesty is often 
influenced by the situation, a point demonstrated 
as much in their own behavior as in that of the 
children (Hartshorne & May 1928). In a second 
well-known investigation, social psychologists in- 
filtrated a religious group, posing as converts 
(Festinger et al. 1956); their conduct has been 
questioned (Smith 1957). In an experiment on the 
effects of group pressure on judgment, five co- 
workers of the experimenter were represented as 
uninstructed subjects, just like the person whose 
resistance to social pressure was to be tested (Asch 
1948). Both the deception and the stress generated 
thereby may be questioned, from an ethical view- 
point. Russian psychologists investigating the same 
problem have avoided the need for deception by 
using all naive subjects and analyzing the data for 
trends that occur naturally, accepting the loss in 
experimental efficiency. 

A reasonable ethical standard for such a situa- 
tion would be that the investigator has an obliga- 
tion to inform his prospective subject of any aspect 
of the experiment that might be considered an 
important factor in the subject's decision to serve. 
While such an ethical policy obviously has much 
to commend it, the losses would be great; many 
experiments concerned with the dynamics of hu- 
man behavior would be made impossible. Ethics 
aside, there are pragmatic arguments in favor of 
a policy of full disclosure of intent. With growing 
sophistication, the public may come to regard all 
social science experiments as situations in which 
deception is to be expected. At this point even truth 
is suspect. The problem is not simple, nor is it 
unimportant. Perhaps a minimum obligation of 
the social scientist is to make the public aware of 
the problem. 

Stress in social science research. While many 
experimenters have subjected participants in re- 
search to stress, one investigator has been taken 
to task for his seeming insensitivity to the excru- 
ciating ordeal his subjects were going through and 
for his failure to see the larger implications of his 
methodology. The critic (Baumrind 1964) very 
reasonably questioned the ethics of subjecting 
people to extreme stress and pointed out the moral 
parallels to historical situations in which innocent
people have been tortured in the interest of science. 
The experimenter’s rejoinder (Milgram 1964) pro- 
vides further instruction in the complexity of the 
problem and demonstrates the value of a continu- 
ing debate of ethical issues in research. 

The customary routine is to talk with the subject 
after an experiment involving stress, to explain the 
procedure, and to try to relieve any residual dis- 
comfort. This procedure may suffice in many in- 
vestigations, but there are others reported in the 
literature in which the stress is so severe that one 
could not realistically hope to repair the damage 
by such a postsession conference. A suitable topic 
for cross-disciplinary research would be an investi- 
gation of possible lingering or delayed effects of 
experiments involving stress or deception. 

It has been proposed that there is already enough 
stress in life arising from natural causes and that
social scientists should not add to it. An alternative 
is to study stress reactions in natural settings. 
Many of these are unpredictable and are amenable 
only to observational study after the event, but 
some excellent research has been done following 
disasters, such as tornadoes and earthquakes, by 
sociologists who were prepared to take advantage 
of an unpredictable event. There are also predict- 
able and necessary stressful situations that are a 
normal part of living and could be used in research. 
A first-grade classroom on the first day of school 
and the father’s waiting room at a maternity hos- 
pital are settings where stress can be studied with- 
out the investigator's causing it. Webb and his co- 


cluding attention to ethical problems. 

Protection of research data. The right of the clinician
to keep data confidential is widely (though
not universally) recognized by custom and in some
states and countries by law. But the scientific investigator
does not as clearly enjoy such protection.
For example, the social scientist engaged in survey 
research may encounter a serious ethical problem, 
and lack clear guidelines for conduct, when his 
evidence is introduced in a court as legal testi- 
mony. The court or either contending party may 
have a legitimate interest in the reliability of the 
survey and may demand that respondents be iden- 
tified in order to call them as witnesses. But survey 
data are generally obtained with assurances of 
anonymity; a violation of this pledge would not 
only involve a betrayal of confidence but would 
also impair the survey method as a research technique
by diminishing public confidence in agencies
that use the procedure. In at least one ruling, a
court has sustained the right of a survey agency
to keep confidential the names of persons interviewed,
but other judges may rule differently.
Obviously, the social scientist engaged in survey 
research has a minimum obligation to inform him- 
self on the issues involved so that he can behave 
responsibly toward people who supply him with 
information (King & Spector 1963). He might also
be expected to anticipate such problems in the 
planning stages of a study and to take protective 
measures against a number of contingencies. The 
issue of proper protection of data, here discussed 
with reference to surveys, may be equally relevant 
in other kinds of research. The problem is compli- 
cated by the investigator's obligation to keep his 
work open for scrutiny by competent scientists.

The invasion of privacy. Privacy is a most 
cherished right of the individual in a free society, 
and it may well be an important condition for the 
integration of experience and the achievement of 
autonomous selfhood. Social scientists are engaged 
in a number of enterprises that can lead to a re- 
duction of individual privacy. The ethical issue 
that seems most frequently involved is that in- 
formation about a person or his family may be 
collected, and perhaps used officially, without the 
individual's being aware of what is happening. The 
use of personality tests for appraising prospective 
employees, screening school children, and so on 
has recently attracted public attention. In some 
instances, restrictive regulations have been im- 
posed to prevent what is seen as an undue invasion 
of privacy. 

Privacy is not always an individual matter but 
may involve social institutions which depend for 
their effectiveness on assurances against intrusion; 
such is true of the jury system in the United States. 
In 1955, some sociological investigators, with the 
permission of the trial judge and the contending 
lawyers, concealed microphones in a jury room and 
recorded the jury's deliberation. Although the in- 
formation obtained was treated with scrupulous 
care by the investigators, the incident created a 
national furor. The jurors had clearly been de- 
ceived and were appropriately indignant. An issue 
of broader concern involved in this instance was 
the appropriateness of scientific inquiry into an 
established social institution; the social scientist 
who undertakes such studies must be uncommonly 
concerned with ethical issues, since damage may 
be done both to social science and to the institutions
studied by social scientists.

As computers become increasingly available and 
efficient in both storage and processing capacities, 


we face the prospect of an invasion of 


sufficient period of time, and with the data centrally
stored and processed, the possibility is imminent
that extensive and reliable inferences


or from the sheer tedium and expense of analysis
may indeed be lost.

One example may suffice to indicate the further 
significance of technological developments: it is 
now possible to obtain, by mail order, a detailed 
analysis of an individual's responses to the Minnesota
Multiphasic Personality Inventory; the
tion that once required the services of a 
skilled clinician can be provided now, in 
shorter time, by a computer. The ethical 
tions of advances in computer 
to be explored.

The invasion of privacy 
point of intersection of two 
goods: the need for knowledge about 
opinions, motivations, and expectancies
and the need for preservation
While the conflict of social values 
ancient one (the rack and screw 
tion-obtaining devices), the problem 
contemporary importance because of 
increase in amount of, and reliance 
computers, personal inventories,
niques, and planted informers or confederates, on 
the other hand. 

Among the issues that must be considered in 
achieving a proper balance of conflicting social 
and individual interest are the importance of the 
investigation, the informed consent of subjects, 
the preservation of confidentiality, and the judi- 


Search it has generally been the practice to obtain 


the professional anthropologist (see, for example, Redfield 1953 and
also the “Statement on Ethics of the Society for 
Applied Anthropology” 1963-1964), were thrust 
into public prominence, in 1965, by the debacle 
of Project Camelot, an inquiry sponsored by the 
U.S. Department of Defense into “the causes of 
revolutions and insurgency in underdeveloped
areas of the world.” Exposure of the project in a 
South American country led to protests from the 
U.S. ambassador, a Congressional investigation, 
the cancellation of the project, and a policy re- 
quiring that all government-sponsored, foreign-area 
research be approved by the U.S. Department of 
State. The fact that Camelot became a national 
and international cause célèbre involving ambassadors,
senators, cabinet members, newspapermen,
university officials, social scientists, and the presi- 
dent himself, and that it was interpreted as a cloak- 
and-dagger operation in spite of the sincerity and 
good will of the participating scientists, has served 
to obscure the ethical issues involved, issues that 
demand serious and sophisticated consideration by 
the social scientist, whether involved in cross- 
cultural studies or not. 

Among the ethical issues are these: Should 
the intentions of a sponsoring agency be the con- 
cern of a social scientist even when he is person- 
ally allowed full freedom of inquiry? Should the 
social scientist be concerned with the uses to which 
the results of his studies will be put? What is the 
responsibility of the social scientist for ensuring 
that the very process of inquiry does not have a 
deleterious effect on the people being studied? 
Does the social scientist have an obligation to pre- 
serve access to people for subsequent investigators? 
Is there a point at which inadequacies of design or 
procedure, or lack of scientific merit in a study, 
become intrinsically ethical issues by virtue of their 
imposition on others? These and similar questions 
may appear to have easy answers, but a sym- 
pathetic study of Project Camelot will show their 
complexity and emphasize the need for social sci- 
entists to consider them anew in the context of 
every proposed investigation (Horowitz 1965). 

Social science and social issues. Social science 
may often have relevance to crucial matters of 
public policy. With increasing frequency advo- 
cates of diverse political and social policies turn 
to the social scientist for support of their position. 
Or the social scientist himself, exercising the pre- 
rogative of a citizen to make public statements 
on social and political issues, may find his state- 
ments given credence beyond what could be sup- 
ported by data, by virtue of his being recognized 
as a scientist, regardless of his competence on the 
particular topic. Drawn into such an unaccustomed 
arena, the social scientist must be especially mind- 
ful of how he presents his qualifications and of 
the ethical implications of his statements. Issues 
related to racial characteristics, for example, have 
so conjoined science and public policy that they 
have been made the subject of study by the Committee
on Science in the Promotion of Human Welfare
of the American Association for the Advancement
of Science (“Science and the Race
Problem” 1963). 

Care of animals in research. The psychologist 
has relied heavily on animals—rats, dogs, birds, 
primates—as subjects in research. To protect lab- 
oratory animals from neglect or abuse, formal 
regulations governing the management of animal 
laboratories have been developed. These require 
the provision of adequate food, water, and medical 
care, the maintenance of sanitary living quarters, 
the use of anesthetics to prevent pain in opera- 
tions and other procedures, the provision of post- 
operative care, and the destruction of animals by 
humane means. Committees on care of laboratory 
animals review problems periodically. The U.S. 
Public Health Service publishes a booklet entitled 
“Guide for Laboratory Animal Facilities and Care” 
(Animal Care Panel 1963) and requires recipients 
of grant support to observe the requirements to 
assure proper and humane treatment of research 
animals. The American Psychological Association 
requires posting in “all rooms where animals are 
housed and where animal experimentation is con- 
ducted” of regulations titled “Guiding Principles for 
the Humane Care and Use of Animals.” In spite of 
these efforts to assure highest ethical standards in 
the care of laboratory animals, there is a perennial 
demand for federal legislation to control practices, 
especially with respect to dogs and cats. In 1964 
there were eight bills introduced in the 88th Con- 
gress of the United States, two of which would 
have been severely restrictive. Although there are 
occasional cases of negligence or of needless inflic- 
tion of pain, animals are generally well cared for, 
and the Congress has shied away from enacting 
legislation on the matter (Brayfield 1963). 

Communication in social science research. Marin 
Mersenne promoted science in seventeenth-century 
France by copious letter writing; the problem of 
communication in science has since become ex- 
ceedingly complex, with many attendant ethical 
issues. Ethical problems have involved such issues 
as plagiarism, misrepresentation of data, the be- 
trayal of confidence, claiming undue credit, and 
other clearly unacceptable behavior. With the de- 
velopment of what has been called “big science” 
with extensive government support, problems of 
a new and more subtle character have emerged. 
For example, the assignment of credit for research 
accomplished by a large organization seems to be 
solved neither by crediting the director alone, as 
has been done and protested, nor by crediting 
30 contributors, as was done in a recent listing
of authors. Although promotions may depend on 
publications, there is a growing need to limit pub- 
lication to significant findings likely to be of value 
to others. The sheer volume of reports threatens 
to overwhelm our most efficient systems for cod- 
ing, storing, and finding information. Thus, for 
an investigator to impose the same findings twice 
on about the same audience constitutes an offense 
to the development and dissemination of knowl- 
edge. The following statement has been proposed 
to control the volume of publication: “. . . scien- 
tific publication [should] be considered a privilege 
consequent upon the finding of something which 
people may need to read, rather than as a duty 
consequent upon the spending of time and money. 
. . . Furthermore . . . no paper [should] be com- 
mitted more than once to the published literature 
without very special pleading” (Price 1964). 

Research on moral development. Thus far cer- 
tain theoretical and practical problems relating to 
ethics and social science research have been con- 
sidered. It should be noted now that social science 
research itself is a potential major source of un- 
derstanding of ethical conduct, of the origins and 
development of moral standards. Pioneer work 
was done by Hartshorne and May (1928). Piaget 
(1932) provided a theoretical matrix for illuminat- 
ing stages in the moral development of the child. 
Anthropologists and social psychologists (Whiting 
1963) have studied the influence of the family on 
character formation in different cultures. Russian 
pedagogical specialists are working explicitly to 
provide educational experiences to instill com- 
munist values in children (Bronfenbrenner 1962). 
In the United States, the establishment of the 
National Institute of Child Health and Human 
Development, to promote research on normal de- 
velopment, can be expected to encourage basic re- 
search on the problem. 

Social control of scientific inquiry. Various 
professional, trade, labor, and fraternal groups 
exert a major influence on the behavior of indi- 
viduals in contemporary society. Perhaps because 
of their very diversity they escape attention as in- 
struments of social control, yet it has been con- 
tended that they speak with more authority today 
than do organized religious groups and, further, 
that they influence day-by-day conduct even more 
than do local, state, and national governments. 

Many of these associations have formal codes 
of ethics. For the most part these codes have been 
found to have little effect on the behavior of mem- 
bers of the group (American Academy of Political 
and Social Science 1955). They are one of the
appurtenances of associations and are designed
with an eye to building public confidence. How- 
ever, the traditions, mores, and expectancies that 
are generated in professional groups do affect be- 
havior, often holding members to extraordinarily 
high standards of conduct. When codes of ethics 
are in harmony with long-established tradition (as 
in The Principles of Medical Ethics) or when they 
are backed up by effective machinery for enforce- 
ment, they can be powerful instruments of social 
control. 

The American Psychological Association has 
applied social science theory and methodology to 
the task of developing a code of ethics (Hobbs 
1948). The critical incidents technique was used 
to obtain the basic data for the construction of 
the code. Members of the association were asked 
to supply descriptions of situations in which a psy- 
chologist took some action that either upheld or 
violated ethical standards. From over a thousand 
such incidents a committee extracted the principles 
that appeared to be involved in the behavior re- 
ported. The result is two documents: a succinct 
code (American Psychological Association 1963) 
and a book-length statement (American Psycho- 
logical Association 1953) of ethical standards that 
includes principles, discussions of issues, and illus- 
trations drawn from the collection of critical inci- 
dents. Now underway is a new inquiry directed 
specifically at ethical issues in psychological re- 
search; the critical incident technique is again 
being used to develop basic data from which ethical 
principles will be derived. 

The psychologists’ statement of ethical stan- 
dards is being augmented by a collection of case 
studies drawn from the files of ethics committees 
responsible for the enforcement of the code. The 
assumption is made that the definition of ethical 
standards is an ongoing, never-finished process 
and that participation in the process by members 
of the association may be more important than 
the written code itself in nurturing high ethical 
standards in the profession. The Committee on 
Cooperation Among Scientists of the American 
Association for the Advancement of Science is collecting
similar descriptions, not necessarily to
prepare a code of ethics but to illuminate the ethical
problems encountered by scientists in all fields.

When scientists fail to regulate their own be- 
havior to the satisfaction of informed members of 
the community, one can confidently predict that 
controls will be imposed by legislation or by ad- 
ministrative regulations. In 1965-1966, two major 
federal agencies adopted procedures governing 
ethical issues in research supported by their grants. 
One agency requires that tests, questionnaires,
and other data-gathering devices be approved in 
Washington by a special review group composed 
of staff members, with the assistance of consult- 
ants. The other agency has established a require- 
ment that grant requests involving possible ethical 
issues must be reviewed by a recognized local com- 
mittee of peers of the investigator. The second 
solution appears to offer protection to research 
subjects on the basis of competent review without 
the danger of overcentralized control of scientific 
inquiry. However, there are responsible investi- 
gators who contend that a prescribed review by 
local peers is an invidious requirement implying 
incompetence and guilt when competence and 
rectitude should be assumed, with intervention in- 
dicated only when there is some evidence to the 
contrary. Here again a social process to define 
appropriate procedures is underway, with the 
proper resolution still unclear. 


It can be expected that society will develop, in 
time, a productive balance between its need for 
knowledge and the individual’s need for protection 
against intrusion, inconvenience, or discomfort. 
A dialectic tension involving values fundamental 
to a democracy must be resolved, both in terms 
of general principles and in terms of particular in- 
stances. For example, freedom of inquiry must be 
balanced against rights of privacy, both cherished 
values in our society. While the issues are complex, 
resolution is possible. The accommodation, both in 
substance and in process, will probably be com- 
parable in character to rules governing the right 
of eminent domain and the right of the individual 
to own property. 

The individual investigator is not without com- 
mon-sense guidelines. While the answers may not 
always be clear, some of the questions are: Is the 
knowledge to be gained worth the imposition in- 
volved in obtaining it? Would another design be 
equally productive but less intrusive? Has fullest 
advantage been taken of the subject’s informed 
willingness to cooperate? Has the proposed inquiry 
been designed to minimize effects on the subject 
population so that subsequent investigators will 
not be handicapped? To what extent are the pro- 
posed procedures consonant with emerging stan- 
dards, or a calculated departure from them? 

Nor is the investigator without criteria to assess 
and perhaps discover the adequacy of his answers 
to such questions: first, his own standards as an 
investigator, concerned quite as much with ethical 
as with statistical elegance of design; then the 
approbation of other competent scientists; and,
finally, the appreciation of the larger community,
or of significant sections of it, whose support is 
essential to the continued development of the social 
sciences. 

It is of greatest importance to keep ethical 
problems under continuing scrutiny and debate, in 
journals, in training programs, in public forums, 
with social scientists themselves taking the initia- 
tive in the process, in order to provide increasingly 
instructive principles for clarifying ethical issues 
in social science research. 

NICHOLAS HOBBS 


[See also MORAL DEVELOPMENT; PRIVACY; SCIENCE.] 
ETHNIC GROUPS


An ethnic group is a distinct category of the 
population in a larger society whose culture is 
usually different from its own. The members of 
such a group are, or feel themselves, or are thought 
to be, bound together by common ties of race or 
nationality or culture. The nature of an ethnic 
group's relationships with the society as a whole, 
and with other groups in it, constitutes one of the 
main problems in describing and analyzing such 
societies. As Ruth Benedict said of race conflict, it 
is not race that we need to understand, but con- 
flict; so, for an understanding of ethnic groups in a 
social system, it is not on racial or cultural differ- 
ences that we need to focus our attention, but on 
group relations. 


Historical outline 


The existence of distinct ethnic and cultural 
groups within societies is widespread and ancient 
and occurs at most levels of culture, ranging from 
the Bushmen of the Kalahari, who live within the 
framework of Tswana society, to modern Europe 
and America. Ethnic groups in the Near East were 
recorded by Herodotus almost 2,500 years ago and 
remained a persistent feature of the Byzantine, the 
Ottoman, and other Near Eastern empires. Similar 
situations also occurred in ancient India and in 
Chinese civilization at all stages of its expansion. 

Although scholars in the past have often noted 
the existence of multiracial and multicultural so- 
cieties, systematic examination of the sociological 
consequences of the phenomenon did not begin 
before the eighteenth century. And then it was 
principally in connection with the concepts of race 
and race relations as developed in the next century 
by writers such as Gobineau (1853-1855) and 
Chamberlain (1899). Linguistic scholars like Sir 
William Jones, the Grimm brothers, and Max 
Müller not only examined the construction and
development of Indo-European languages but also
inadvertently encouraged the growth and elevation 
of the idea of race as an ideology and as the most 
significant index distinguishing culturally different 
groups from one another. 

Earlier historians, including the writers of the 
Old Testament, had noted that ethnic groups might
be found in a society as a result of the gradual 
migration of either whole populations or of seg- 
ments, such as religious refugees, traders, crafts- 
men, or manual laborers. They also observed that 
military conquest might bring in its train soldiers 
and civilians, who either settled permanently in the 
area or administered their conquests for a period 
of years before retiring and being replaced from 
the homeland. Or, again, ethnic groups might be 
incorporated into a society by altered political bound- 
aries. Sometimes a combination of processes was 
at work; but however a multiracial or multicultural 
system came into existence, the types of society in 
which ethnic groups could be found varied as 
widely as the processes that brought them into 
being. 

Most investigations of ethnic groups have been 
made in connection with studies of race relations 
and stratified societies such as are found in Africa 
(MacCrone 1937; Patterson 1953), the southern 
states of the United States (Dollard 1937), parts 
of the Caribbean (Smith 1955; 1956), Central and 
South America (Freyre 1933), and in the plural 
societies of former colonial areas of Asia (Furnivall 
1942). Ethnic groups that are not an integral part 
of a system of over-all social stratification are also 
found in countries like Switzerland and Nigeria, 
where they form units in the political system 
which, although perhaps internally stratified, are 
not ranked in relation to one another. Other types 
of multiracial and multicultural situations, as, for 
example, in northern Laos, Thailand, Burma, and 
India, have as yet hardly been examined. Fre- 
quently in these countries adjacent villages, or even 
sections of one village, may be linguistically and 
culturally different and yet be held together in a 
traditional system of social relations that is not 
part of the apparatus of a central government 
(Leach 1954). Similar conditions have been ob- 
served, although seldom analyzed, in the Indo- 
nesian archipelago, New Guinea, and parts of 
Africa. 

Definitions. At this point it would be wise, for 
the sake of clarity, to make the distinction between 
a social group and a social category. By a group, 
sociologists usually mean an aggregation of people 
recruited on clear principles, who are bound to one 
another by formal, institutionalized rules and char- 
acteristic, informal behavior. Unless a group is to 
be no more than a temporary aggregation, it must 
in addition be organized for cohesion and persistence;
that is to say, the rights and duties of membership
must regulate internal order and relations
with other groups. Members usually identify themselves
with a group and give it a name. In practice
social groups vary in the degree to which they are 
corporate; and in certain situations one of the 
principal difficulties of analysis may be to decide 
whether a particular social entity is in fact a social 
group or a mere category of the population, such 
as red-haired people, selected by a criterion that in 
the context is socially neutral and that does not 
prescribe uniform behavior. For any study of group 
relations this distinction is essential. 

In east Africa, the African, the Arab, the Euro- 
pean, and the Indian elements of society are closer 
to being categories of the population than social 
groups. Although, for example, a fully institution- 
alized Indian group, recruited from the general 
category of Indians, is likely to act in the Indian 
sphere of life, there is no certainty that it will; 
relations between the ethnic categories may there- 
fore become blurred. The sections of an ethnically 
and culturally divided population may, according 
to circumstances, be institutionalized groups re- 
lated to one another in a system of stratification, 
or they may be groups living side by side and 
related in other ways. Ethnic divisions may simply 
be categories of the population, as are Welshmen 
and Scotsmen living in England, or Indians, Chi- 
nese, and Creoles in Mauritius, who are beginning 
to lose a sense of ethnic separateness. It is, there- 
fore, always important to be sure what is the exact 
sociological status of an ethnic or cultural division. 
Clarity in analysis depends upon it. 


Ethnic groups in stratified societies 


The division of society into broad strata, which 
form a hierarchy of prestige, wealth, and power, 
is a feature common to most societies and is one 
that has been used for classification. A few socie- 
ties, mostly primitive or small in size, may not be 
stratified; social positions may not be sufficiently 
numerous or diverse to be easily grouped into strata 
or aggregates of individuals sharing an equivalent 
status that would differentiate them from members 
of other similar aggregates. This is not to say, of 
course, that statuses in such small societies may 
not be ranked but is merely to point out that they 
do not constitute groups. Sociologists traditionally 
classify the types of stratification as caste, estate, 
or class systems. As ethnic and subcultural groups 
may form the basis of a system of stratification, a
closer examination of the matter is needed. 

In any system of social stratification the follow- 
ing apply: (1) Individuals belong to strata that are 
groups in the sense that everybody in them shares
some obligatory ways of acting that are typically
and intentionally different from those in other 
strata. (2) Strata must be exclusive, so that no- 
body may belong to more than one at the same 
time. (3) Strata must be exhaustive, so that every- 
body in the society belongs to one. (4) Strata must 
be ranked. In using this criterion, differential ac- 
cess to political and economic resources is taken 
to be the most significant aspect of the ranking. 
These criteria do not distinguish different types of 
stratification, any one of which may be exemplified 
in a society where ethnic groups are a component 
element of the system. 

Caste groups. One sociological definition of a 
caste system is that it is a hierarchy of endogamous 
groups in which status is rigidly ascribed by birth 
and in which mobility from one group to another 
is not possible. Correct relations between groups 
are maintained and validated by religious rules, 
especially the rule that improper contact between 
castes produces a state of impurity that entails 
ritual, legal, and other penalties. 

In this definition no careful distinction is usually 
made between the four-fold division of Indian so- 
ciety into castes (varna) and subcastes (jati). In 
early literature the varna (priests, soldiers, busi- 
nessmen, and laborers) are sometimes described in 
terms of what appear to be ethnic differences; but 
in historical times they have not constituted more 
than categories of value against which individuals 
and members of subcastes could measure their own 
and other people’s prestige. They were not groups, 
in the sense of imposing duties that were uniform 
throughout India. In short, the varna were ranked 
categories, not stratified groups. Not dissimilar ar- 
rangements are also found in some of the multi- 
racial societies of Africa and the Caribbean. 

Empirically, Indian society was made up of many 
small self-contained caste systems, each of which 
was a hierarchy of subcaste groups. Subcastes in 
turn were organized so that social labor was di- 
vided among them. Each subcaste traditionally held 
a monopoly of a particular service, so that all 
washermen, for example, although not bound to 
that occupation, could prevent others from prac- 
ticing it. The essence of this division of labor was 
that it was cooperative and complementary, not 
competitive. In these small, closed caste systems 
relationships between individuals tended to be mul- 
tiple in that two individuals could fill a number of 
roles in relation to each other. This “summation” 
or “involution” of roles is an attribute of small-scale 
and not large-scale social systems (Nadel 1957, 
pp. 64-72). The argument may be summarized 
thus: (1) Caste groups must be recruited by birth,
that is to say, they must be closed. (2) Relation- 
ships between groups must be cooperative, not 
competitive. 

In India subcastes are not usually separate 
ethnic or cultural groups, but an understanding of 
caste systems is essential in the analysis of society 
in Iran and parts of the Near East where a multi- 
plicity of ethnic and cultural groups appears to 
be organized in small-scale caste systems (Barth 
1960). It is possible, too, that society in certain 
parts of the southern states of the United States 
is similarly arranged (Dollard 1937). 

If we are content to say, as are many students, 
that South Africa and India both exhibit a caste 
system, then no further distinctions need be sought. 
But writers on India do not usually agree that 
“color-bar” societies are ipso facto caste systems. 
Although the population of India was always very 
large, society there was characteristically composed 
of separate small-scale involute caste systems. 
South Africa, although less populous, is typical of 
large-scale Western society where relationships be- 
tween roles are usually single-stranded and not the 
multiple ties of a small-scale society. 

In certain ways the system of closed groups in 
South Africa is nearer the model of an estate sys- 
tem than that of a caste or social-class system, but 
the sociological status of “color-bar” societies needs 
to be carefully re-examined. Studies of them have 
for the most part either used the concepts of strati- 
fication without careful consideration or have di- 
rected attention to economic functions (Boeke 
1953) or to attitudes and other psychological con- 
comitants of the existence of ethnic groups. [See 
CASTE.]

Social class groups. In some places, such as 
parts of the West Indies, ethnic groups are regarded 
as, and may in fact be, social classes. Sociologists 
generally consider a class to be an aggregate of peo- 
ple occupying roughly the same status, which is dif- 
ferent from that of people in other classes and 
which, unlike status in a caste or estate system, 
allows movement from one stratum to another. It is 
never easy to decide to what degree a social class is 
an institutionalized group or exactly how it is related
to economic and political status and prestige. When 
some of the qualifications for membership are also 
those for belonging to an ethnic category or group, 
the difficulties of analysis may become very great 
indeed. An aggregate of people is not a social class 
just because they think of themselves as one; it is 
a social class because some activities are obligatory 
to all or most members and act as a sign that the 
people form a group and are eligible for access
(appropriately graded according to their class) to 
resources that are valued by the society. When 
these activities are also qualifications for member- 
ship of ethnic or cultural groups, then ethnic and 
class groups coincide. 

The types of stratification that have been men- 
tioned are, of course, models; and a particular sys- 
tem, whether its constituent elements are ethnic 
groups or not, may not correspond with the model. 
Racial differences used as insignia or badges to 
mark off groups from one another are not different 
in kind from clothing, speech, manners, property, 
or other cultural emblems that may serve the same 
ends. But since physical differences are permanent
and may be strikingly visible and may also carry 
much emotion, the understanding of societies such 
as are found in Mexico, Nigeria, or Kenya has been 
made difficult by treating them as if they were 
altogether different from those more familiar to 
sociologists. The fact that signs of a special kind 
are used for distinguishing groups in multiracial 
societies does not mean that such societies are 
radically different from others. 

In a study of early twentieth-century Burma and 
Netherlands Indonesia, Furnivall argued that coun- 
tries in which “there is a plural society, with differ- 
ent sections of the community living side by side, 
but separately, within the same political unit” 
([1948] 1956, p. 304) were “a distinctive form of 
society with a characteristic political and economic 
constitution” (1942, p. 195). In such a situation, 
he believed, members of society are unable to de- 
velop the common values and demands generated 
by sharing common institutions. Another writer, 
M. G. Smith, regards Furnivall’s concepts as essen- 
tial to comparative sociology, arguing that a plural 
society is composed of readily identifiable sections 
held together only by the fact that they are part of 
one central political system. Such sections, it 
should be noted, are not necessarily ethnic groups. 
Each is distinguished by having its own “core” of 
“basic” or “compulsory” institutions. Social systems 
may therefore be placed on a scale ranging from 
those that are fully plural with distinct sections ful- 
filling particular economic, political, religious, or 
other functions, to homogeneous systems in which 
one set of basic institutions is shared by all mem- 
bers (Smith 1960). Models of this kind have 
attracted anthropologists, historians, and econo- 
mists, especially those working in multiracial or 
multicultural areas; but most sociologists have 
found the concept of a basic core of differentiating 
institutions even harder to define and handle than 
the concepts with which they were more familiar,
and they have preferred to rely on older and better
tested theories of social differentiation. [See STRAT- 
IFICATION, SOCIAL.] 


Ethnic groups in nonstratified societies 


Not all societies having ethnic groups within 
their boundaries incorporate them into a unified 
system of social stratification, and relations among 
ethnic groups may (to use a political metaphor) be 
more of a “federal” nature than one of ranked 
access to social resources.

Societies with a single administrative system. In 
Switzerland or Canada, for instance, cultural 
groupings are clearly differentiated and main- 
tained, and each may be separately stratified; but 
access to power in the wider society is not limited 
by either ethnic or cultural origins, nor is it con- 
ditioned by a ranked evaluation of ethnic groupings 
within the society. In Malaya, too, although there 
are distinct ethnic groups, they are not stratified in 
relation to one another. The population is divided 
between Malays and immigrant Chinese and In- 
dians. Constitutionally the machinery of govern- 
ment is in the hands of people selected by voting. 
The workings of electoral procedures and the staf- 
fing of the civil service and the armed forces have, 
in fact, placed most of the political machinery in 
the hands of Malays, leaving economic power 
largely with the Chinese. This distribution of the 
different kinds of power, with its open possibilities 
of real loss or gain, has tended to consolidate ethnic 
categories into political parties (Freedman 1960). 

In Mexico, on the other hand, ethnic criteria 
have, on the whole, been abandoned in the forma- 
tion of social groups because they no longer mark 
off differences considered significant. The people 
who control political, legal, and economic matters 
attain their positions without reference to race, 
although not without acquiring the dominant Span- 
ish culture. In Thailand, too, the balance struck be- 
tween demographic, political, economic, and cul- 
tural factors is such that the dominant Buddhist, 
Thai-speaking group is able to pursue a policy of 
assimilating ethnic minorities. The policy has had 
a measure of success with Chinese immigrants, 
but no attempt has been made to absorb the Mus- 
lim, Malay-speaking inhabitants of the southern 
provinces.

Where ethnic or cultural differences coincide 
with groups that tenaciously hold different religious 
opinions, the relationships of one group to another 
and to the central government may become very 
complex indeed and lead to serious conflict. But 
the problem is not one peculiar to ethnically diverse 
societies. In the sixteenth century, the division of
France into Roman Catholic and Protestant Christians
did not make it a plural society, even though
some Protestants, especially in the south, were cul- 
turally and linguistically separate. Nor did it mean 
that groups that held these different opinions were 
different in kind from other groups that had been 
competing for political power before the religious 
differences arose. 

The point here is an essentially simple one. In 
any society the immediately effective determinants 
of most social action lie in the political, legal, and 
economic spheres; and whether or not the main 
component groups of the society are stratified in 
relation to one another, an examination of the 
social system must be primarily concerned not only 
with relations between groups but also more spe- 
cifically with those between rulers and ruled. The 
latter problem is an aspect of the former, but in a 
society with groups that differ ethnically, whose 
interests are in fact opposed to one another, the 
ensuing conflict may be phrased in racial terms 
and thus provoke more bitter hostility than in other 
struggles of the ruled against their rulers. 

Societies without a single administrative system. 
In the examples considered so far, ethnic or cul- 
tural groups have formed part of an organized ad- 
ministrative and political system. Understanding 
the significance and range of such groups and their 
economic, political, religious, and cultural impor- 
tance has revolved around the problem of ascertain- 
ing the exact position that they occupy in the sys- 
tem. In northern Thailand, Burma, and New 
Guinea, where small ethnic and cultural groups are 
dispersed and mingled over a wide area without 
traditionally being under the direct control of a 
single administrative system, it is also necessary to 
determine their exact relationship to one another. 

A number of such “tribal” groups have been 
studied, but because anthropological fieldwork 
tends toward village studies and because the lin- 
guistic difficulties of examining such a hetero- 
geneous system are very great, students have sel- 
dom made the study of the wider system the main 
focus of their attention. Groups of this kind appear 
to be linked to one another in a network of polit- 
ical, economic, ritual, and marriage alliances about 
which little information is available. But here, as 
in more politically unified systems, the balance 
struck in one place is seldom exactly the same as 
that in another, where the weighting of economic, 
political, cultural, and ideological forces may be 
different. The use of models of the kind that have 
been discussed in this article is essential in the 
analysis of any society; and where useful models 
do not exist, as is probable in the study of such 
tribally mixed areas as those just noticed, then they
must be constructed. But their usefulness must not 
be misunderstood. It is highly unlikely that any 
model will correspond in detail with the complexity 
and variety of real life, especially that of multiracial 
and multicultural societies. 


Specialization of ethnic groups 


This article is concerned mainly with the theo- 
retical problems of describing and analyzing the 
place of ethnic and cultural groups within social 
systems of different types. Most studies of societies 
of this kind have dealt less with theoretical prob- 
lems than with the consequences of economic, po- 
litical, or religious specialization of such groups 
within the wider society. Often these consequences 
are the result of the structural position of the 
group, but this in turn may be the result of the 
specialized tasks that it performs. 

When Europeans first began to govern east 
Africa the difficulties of setting up an administra- 
tion and of stimulating the trade needed to pro- 
duce a revenue gave an opening to Indian immi- 
grants, who were ethnically and culturally very 
different from the African and Arab inhabitants of 
the region. Even where common beliefs in Islam 
were held, this fact did not submerge ethnic or cul- 
tural differences or the hostility to, and suspicion 
and fear of, the immigrants, whose interests as 
middlemen and skilled workers brought them into 
conflict with all other ethnic categories of the popu- 
lation. Such a conflict tends to make a structural 
alignment even more rigid and to confirm and 
perpetuate associated attitudes. 

The point can be illustrated in many parts of 
the world. Studies of Jewish ethnic groups have 
long been concerned with the political and other 
social results of economic specialization and the 
ways in which specialized minority groups, once 
established, are modified and maintained. Similarly, 
in all parts of southeast Asia, economic and polit- 
ical developments have produced ethnic specializa- 
tion with a wide range of conflict. 

The political consequences of the specialization 
of ethnic groups by occupation, and therefore of 
the kinds of power that they hold in society, are a
problem of which all historians of colonial empires, 
from that of the Romans and the imperial Chinese 
to the sixteenth-century Spanish and the modern 
Europeans, are well aware. But it is also a problem 
that needs even closer attention in postcolonial so- 
cieties, where, although the structural alignment of 
groups within them may have altered, the problems 
of cultural and ethnic diversity remain. 

With the growth of good communications and 
the spread of travel, ethnically and culturally diverse
societies are likely, in the short term, to increase
in number rather than diminish. As the
sociological study of society ceases to be solely a 
Western discipline, the need to find appropriate 
conceptual tools for analyzing ethnic and cultural 
variation will undoubtedly become a major pre- 
occupation of the discipline. 

H. S. Morris 


[See also MINORITIES; PEASANTRY; RACE RELATIONS; 
SOCIAL STRUCTURE; TRIBAL SOCIETY. Other relevant 
material may be found in ASIAN SOCIETY, article on
SOUTH ASIA; CARIBBEAN SOCIETY; LINGUISTICS, article
on THE SPEECH COMMUNITY.]
ETHNOCENTRISM


See CULTURE, article on CULTURAL RELATIVISM. 


ETHNOGRAPHY 


The data of cultural anthropology derive ulti- 
mately from the direct observation of customary 
behavior in particular societies. Making, reporting,
and evaluating such observations are the tasks of 
ethnography. Although the successful carrying out 
of these tasks is intimately related to the validity 
of cultural and social anthropological interpreta- 
tions, ethnography itself has received little serious 
attention. However, as the social sciences have be- 
come more critical of their source materials, more 
concerned with how data are recorded, verified, and 
analyzed, interest has developed in ethnographic 
method and theory and in the more technical and 
personal aspects of conducting ethnographic re- 
search. 

While the scope and definition of ethnography 
have varied considerably and opinions differ on 
many details, contemporary usage does permit a 
few general terminological distinctions and impli- 
cations. An ethnographer is an anthropologist who 
attempts—at least in part of his professional work 
—to record and describe the culturally significant 
behaviors of a particular society. Ideally, this de- 
scription, an ethnography, requires a long period 
of intimate study and residence in a small, well- 
defined community, knowledge of the spoken lan- 
guage, and the employment of a wide range of 
observational techniques including prolonged face- 
to-face contacts with members of the local group, 
direct participation in some of that group’s activi- 
ties, and a greater emphasis on intensive work with 
informants than on the use of documentary or
survey data. Used nonspecifically, ethnography refers
to the discipline concerned with producing
such cultural descriptions. With a regional reference
(e.g., “Polynesian ethnography”), the term
designates either the way in which ethnography is 
conceived and practiced in the area or the collec- 
tive or comparative treatment of the ethnographies 
written about the peoples living in the region. This 
last usage is frequently referred to as comparative 
ethnology, or simply ethnology [see ETHNOLOGY].


History of ethnography 


Although the roots of ethnographic description 
are lost in antiquity and most observations and 
interpretations (or misinterpretations) of human 
societies have continued to be transmitted orally, 
some early written accounts have been preserved. 
Permanent documentation of such observations in- 
creased markedly with European voyages of dis- 
covery and exploration. Despite organizational and 
stylistic differences, it is possible to discern in the 
literature the transition from the curious recount- 
ings of strange, exotic, or bizarre practices to the 
present attempts at producing valid cultural de- 
scriptions. In comparing the successive steps in 
this transition, one should note changes not only 
in the content and purpose of ethnography but also 
in the preparation and background of the investi- 
gators and in the circumstances under which field 
work is conducted. (It is impossible to treat in de- 
tail many individual and institutional differences, 
such as the varying views of ethnographic work 
among American, British, and European anthro- 
pologists; see, for example, Eggan 1961; Firth 
1957; Gluckman 1961; Griaule 1957; Kroeber 
1957; Lowie 1953; Richards 1939.) 

Early ethnography. Beginning in the late fif- 
teenth century and continuing for several hundred 
years, descriptions of unfamiliar cultural practices 
were written largely as a result of explorations, 
missionizing, and the establishment of colonial gov- 
ernments and outposts (see Howell 1642; Rowe 
1964). Although there were some exceptional re- 
ports such as Pigafetta’s observations on Cebu, 
included in his chronicle of the Magellan voyage 
(1525), and the extensive Mexican texts recorded 
in the sixteenth century by Sahagún (see Sahagún, 
General History of the Things of New Spain), the 
dominant form of early ecclesiastic and govern- 
mental records was ethnographically unimpressive. 
In the nineteenth century, as territorial exploration 
intensified and the writings of natural historians, 
travelers, and museum collectors began to augment 
missionary and official documents, ethnographic 
inquiry became a somewhat more organized pro- 
cedure. Many questionnaires, lists, instructions, 
and regional guides were written (see Lewis 1814;
British Association 1852; Neumayer 1875). In 
Europe and the United States, professional anthro- 
pological societies were supported at first by 
travelers, government employees, and other amateurs
and later encouraged by museums. Institutionalization
stimulated the publication of monograph
series devoted largely to cultural descriptions
(notably, for example, the publications of the 
Bureau of American Ethnology, the American 
Ethnological Society, and the larger natural history 
museums). Landmarks include Morgan's account 
of Seneca culture (1851), such turn-of-the-century 
reports of field research as Rivers’ study of the 
Toda (1906), and a few refreshingly innovating 
works like Barton's study of Ifugao law (1919), in 
which the value of the case-method approach is 
demonstrated. Up to the end of World War I vast
quantities of published ethnographic material had 
accumulated from many regions, but although 
some scholars (such as Boas) had begun to work 
in depth with informants on particular linguistic 
and other cultural problems (see Jakobson 1959; 
Smith 1959), most of this literature had been pro- 
duced by nonanthropologists (e.g., Morgan was a 
lawyer, Barton a schoolteacher and dentist), who 
for a variety of reasons had become attracted to 
the subject matter, and in the course of short visits, 
surveys, or by accidental association, had acquired 
sufficient field experience to write interesting ac- 
counts of their observations. Toward the end of 
this period, museums provided most of the support 
for field research. In general, ethnographic inquiry 
was correspondingly dominated by object-centered 
interests, a standardized topical format for observation
and recording, and extensive use of interpreters.
[See BOAS; MORGAN, LEWIS HENRY; RIVERS.]

Ethnography before World War II. By 1925
ethnographic field work had become an established 
professional activity. There was a distinct shift 
away from mere acceptance of field work toward a 
more critical and craftsmanlike attention to its 
execution, a shift from a dominant concern with 
data accumulation to the deeper analysis of par- 
ticular cultural patterns. Many of these changes 
resulted directly from the influence of Malinowski’s 
reports (especially 1922; 1935) based on his 
lengthy and detailed observations in the Trobriand 
Islands. His insistence on using the local language, 
residing for a long time with the group being 
studied, and delineating functionally related cul- 
tural phenomena in specifiable contexts spurred 
serious rethinking of many aspects of ethnographic 
research [see MALINOWSKI]. Increased interest in
cultural contexts led to a concern with the ethnographer’s
role in the field situation and a more careful
assessment of the way in which field data are
recorded (Mauss 1947; Mead 1947; Osgood 1940). 
Partly as a result of developments in linguistics, 
sociology, and psychology, ethnographers began to 
show greater interest in general theory and descrip- 
tive methods, as well as to take advantage of an 
expanded range of research techniques such as the 
recording of life histories, the administering of pro- 
jective tests, and the extensive use of film. Field 
inquiry was increasingly guided by an interest in 
general problems of cultural variability and of the 
nature of cultural universals. In the 1930s, 
attempts to provide needed ethnographic informa- 
tion for trait analysts and hypothesis testers led to 
various forms of standardization, such as the Out- 
line of Cultural Materials (Yale University 1938), 
to help organize the recording and cross-indexing 
of field observations. This development aided 
quantitative and comparative studies and expanded 
significantly the existing inventories of cultural 
detail. 

The detail, however, often lacked contextual 
specification, and these efforts thus drew attention 
to the inherent weakness of relying on prepared 
formats to guide functionally oriented field research.
Similar criticism and experimentation with
many field methods and techniques helped to meet 
the new demands of higher research standards. 
During this phase the influence of museum spon- 
sorship and amateur ethnography declined. Field 
workers were mostly trained as anthropologists in 
graduate university departments and were sup- 
ported by private foundation and government 
grants.

Ethnography after 1950. Following World War
II, ethnography began to attract more theoretical
and methodological attention. Of particular note 
was the renewed and expanded interest in classi- 
fication, which is crucially important (Needham 
1963, pp. vii-ix). There was also an increased
emphasis on communications systems and struc- 
tural models (e.g., Lévi-Strauss 1958); on the 
extension of principles developed in structural lin- 
guistics to ethnographic descriptions (e.g., Goode- 
nough 1951); on the detailed study of cultural sub- 
systems (e.g., Conklin 1957; Frake 1964b; Pospisil 
1958); on contrasts between qualitative and quanti- 
tative aspects of field observations (Leach 1961); 
and on the development of more effective means 
of accounting for both cultural and personal varia- 
bles in actual field situations (Condominas 1966). 
At the beginning of this phase there was a marked 
increase in the numbers of professional ethnog-
raphers, in sources of support, and in opportunities
for field work. 


Theory and method 


The cumulative efforts of ethnographers to go 
beyond the uncritical narrative and rambling pres- 
entation of assumed cultural detail have focused 
on determining what constitutes a valid cultural 
description, on developing a theory that permits 
evaluation of alternative descriptions, and on 
formulating methods that may be most effective in 
deriving general statements from recorded ob- 
servations. For example, it has been suggested that 
ideally an ethnography constitutes a cultural gram- 
mar, an abstract theory which provides the rules 
for producing, anticipating, and interpreting ap- 
propriate cultural behaviors in given settings 
(Conklin 1964; Frake 1964a; Goodenough 1957). 
Ethnographic theory, in this view, is concerned 
with evaluative criteria such as completeness (in 
both depth and breadth), conciseness, and ac- 
curacy. New approaches for providing reliable, 
valid, and more revealing field analyses have in- 
cluded (1) the formal treatment of cultural sub- 
systems in which the relevant cultural phenomena 
are discretely organized or relationally describable 
in terms of a small number of dimensional con- 
trasts or processes—componential definitions and 
formulaic reduction and rearrangement of terms 
by specified rules have been applied most fre-
quently to kinship analysis (see Lounsbury 1964);
and (2) the intracultural analysis of folk classi- 
fications, especially those of natural phenomena. 
The study of folk science (see Colby 1964) has led 
to a number of developments such as the more 
specific analysis of folk taxonomies (Berlin et al. 
1966; Conklin 1962). Together with other types of 
contrast-set, subsegregate, and network linkage 
analysis, these efforts have sometimes been re- 
ferred to as ethnoscience (Sturtevant 1964). 
Problems of alternative methodological procedures 
(Burling 1963) and of multiple contexts and 
code channels (Hymes 1962) are also being ex- 
amined. The principles guiding many recent efforts 
reflect influences from linguistics, logic, mathe- 
matics, and systematic biology. Early results of 
their application in ethnography are, in turn, stimu- 
lating developments in such fields as sociology 
(Cicourel 1964) and archeology (Chang 1967). 
Furthermore, it is generally agreed, even where 
opinions on the nature of valid evidence differ 
widely (e.g., Metzger 1965), that theory and meth- 
od as well as technique must be constantly tested 
in the field. [See COGNITIVE THEORY; COMMUNICA- 
TION; COMPONENTIAL ANALYSIS.] 


Techniques 


Instruments for gathering, storing, retrieving, 
rearranging, expressing, and using field data while 
still in the field have multiplied with technical de- 
velopments (Kano & Segawa 1945; Rowe 1953). 
Tape recording, cinematography, photogrammetry, 
aerial mapping, and the use of computers in text 
and demographic analysis are only a few of the 
frequently employed technical developments in the 
treatment of ethnographic data. Selecting from
among these many devices and varied interactional 
techniques those most appropriate for keeping 
ethnographic records is a complex task. The eth- 
nographer tries not to rely upon published outlines 
and questionnaires; he shuns interviews with in- 
formants carried out in artificial settings; and he 
avoids premature quantification or overdifferenti- 
ated measurement. Initially, at least, flexibility, 
curiosity, patience, and experimentation with many 
alternative devices and procedures are desirable. 
In everyday conversations between field worker and 
informant, for example, attention to and use of 
such verbal techniques as the following have been 
used profitably, although not always with equal 
success: recording and using natural question and 
comment frames (i.e., the ways in which in- 
formation is normally solicited and transmitted in 
the local language); noting and using question- 
response sequences and implications; testing by 
intentional substitution of acceptable and incon- 
gruent references; testing by paraphrase; testing 
by reference to hypothetical situations; testing by 
experimental extensions of reference; and testing 
by switching styles, channels, code signals, mes- 
sage content, and roles (by reference or impersona- 
tion). Similarly, in the making of visual and non- 
verbal observations initial experimentation and 
flexibility help to determine focuses and bound- 
aries of scenes, scheduled events, key roles, etc. 
Graphic and plastic modeling media have pro- 
vided additional dimensions for the exploration of 
actual or hypothetical situations otherwise not 
easily investigated. Furthermore, ethnomodels, often 
ignored or treated only anecdotally, may clarify 
and facilitate field observation. When local systems 
have been qualitatively established, other pro- 
cedures such as scaling techniques may be applied 
to increase the range of observations and provide 
some basis for quantification by various kinds of 
discrete, direct, or indirect measures. As available 
technology makes more elaborate manipulation of 
field data possible, greater attention can be given 
to informant-ethnographer interaction, not only in 
terms of eliciting routines but also with reference
to critical and probabilistic changes in the micro-
sociological environment. [See INTERVIEWING.]

Personnel. Because ethnographers interact per-
sonally and socially with informants, they find
themselves carrying on a unique type of natural
history, in which the observer becomes a part of 
(and an active participant in) the observed uni- 
verse. The extent of this involvement and its im- 
portance for ethnographic recording depend on 
many situational considerations, including the per- 
sonalities of the ethnographer and his informants. 
In some types of field inquiry the ethnographer’s 
practical success or failure may depend as much 
on those impressions he makes locally (Goff- 
man 1956) as on the cultural events being ob- 
served. Informal recognition of these variables is 
frequently reflected in the nontechnical literature 
and in humorous anecdotes circulated among col- 
leagues. More systematic accounts of these per- 
sonal background factors and their consequences 
have appeared with increasing frequency (e.g., 
Berreman 1962; Casagrande 1960). Especially 
where long-term investigation of intimate personal 
relationships is concerned, most anthropologists 
would agree with Condominas (1965, p. 35) in 
stressing the “nécessité d’ethnographier les ethnog- 
raphes.” Methods of assessing such contextual in- 
formation are not yet well developed, but more 
careful and sensitive reporting of the kinds of 
transactions involved in ethnographic inquiry 
(Oliver 1958) and of the total spectrum of social 
involvements affecting these transactions (Junker 
1960; Mintz 1960) may lead to the desired aware- 
ness, and thus to appropriate adjustments in con- 
tinued research. The possibility of combining such 
sensitivity with technical mastery of ethnographic 
analysis has been dramatically illustrated in recent 
contributions by Paul Friedrich (see Tagari 1964) 
and Laura Bohannan (1966). [See OBSERVATION.]

Translation. The problems of ethnography are
in the largest sense those of translation. Eventu- 
ally, all observations must be “translated” into the 
ethnographer’s descriptive code. Thus, linguistic 
theory, and translation theory in particular, has 
special relevance for ethnography (Gumperz & 
Hymes 1964; Nida 1964). And although ethnog- 
raphy and linguistics are not identical, they are to 
some extent mutually dependent (Hockett 1954). 
Furthermore, in spite of the fact that much eth- 
nographic research concerns nonverbal behaviors, 
observations of even the most inarticulate cultural 
process are often identified, tallied, and even quanti- 
fied by means of informant-expressed judgments. 
General linguistic and anthropological interest in 
semantic theory has been very responsive to dis-
cussions of ethnographic problems (e.g., Colby
1966; Conklin 1962; Lamb 1966; Romney & 
D’Andrade 1964; cf. Malinowski 1935). In par- 
ticular, attention has been drawn to the diversity 
of semiotic relations, the multiplicity of contexts 
and related communication systems, and the im- 
portance of contrastive analysis of complete ter- 
minological sets [see SEMANTICS AND SEMIOTICS]. 


Since 1950 the critical re-evaluation of theory 
and practice has led to greater appreciation of the 
technical and human problems that are inherent 
in ethnographic research. Intellectual excitement 
and controversy have intensified efforts to refine 
methods for reducing apparent cultural complexity 
and indeterminacy to clear, systematic, and effec-
tive statements.

HAROLD C. CONKLIN 


[Directly related are the entries ANTHROPOLOGY; COM-
PONENTIAL ANALYSIS; FIELD WORK; LANGUAGE, arti-
cle on LANGUAGE AND CULTURE; LINGUISTICS;
SEMANTICS AND SEMIOTICS.]


ETHNOLOGY


Ethnology is generally regarded as one of the 
major subdivisions of cultural anthropology, the 
others being anthropological archeology and an- 
thropological linguistics. Anthropology is prefixed 
to the latter two terms because they refer to the
archeology and linguistics largely of preliterate and 
preindustrial peoples. The archeology of classical 
Greece and the linguistics of contemporary France 
would rarely, if ever, be taught in an anthropology 
department. The claim that anthropology embraces 
all peoples past and present has been exaggerated 
by some anthropologists, although the present 
trend, especially in ethnology, is toward giving at- 
tention to a wider range of peoples. In the United 
States today, but less so in Europe, ethnology is 
joined to social anthropology. [See ANTHROPOLOGY, 
articles on CULTURAL ANTHROPOLOGY and SOCIAL 
ANTHROPOLOGY.]

Interesting insight into the scope of ethnology 
can be gained by looking at the names of early 
anthropological societies. In 1843 the Ethnological 
Society was founded in England, and in that same 
year it published the first edition of its guide to field 
work, an inventory of data to be obtained. This 
guide included some material on all fields of an- 
thropology but gave the most space to social 
anthropology. In 1863 the Anthropological Society 
was founded in England; this was a group of 
former members of the Ethnological Society who 
wanted to stress political issues, such as slavery, 
more heavily. In 1871 the two societies joined to 
form the Anthropological Institute of Great Britain 
and Ireland, to which name the word “Royal” was 
prefixed in 1907. In the United States, the Ameri- 
can Ethnological Society was founded in 1842, the 
Anthropological Society of Washington in 1879, 
and the American Anthropological Association in 
1902. In France, the Société Ethnologique de Paris 
was established in 1838 and the Société d’Anthro-
pologie de Paris in 1858. In Germany, the Gesell-
schaft für Anthropologie, Ethnologie, und Urge-
schichte was founded in 1869.

It is clear from these examples that “ethnology” 
was used as a blanket term to cover the entire 
range of the subject we now label “anthropology” 
and that the latter term came to be used in the 
wider sense at a later date. Through much of the 
nineteenth century the concepts of biological race, 
language, and culture were confused; one was in- 
ferred from the other, and reconstructions of hu- 
man development combined all three aspects. Eth- 
nology was historically oriented from the start and 
attempted to account for extant races, languages, 
and cultures in terms of migration, diffusion, and 
other historical processes [see DIFFUSION]. 

In the twentieth century, “ethnology” has come 
to mean the comparative study of documented and 
contemporary cultures and has largely excluded 
their bioanthropology, archeology, and linguistics. 
“Ethnography,” in contrast, is best used to describe
the study of the culture of a single tribe or society; 
but because almost all ethnographies make com- 
parisons at least with neighboring peoples, the 
distinction between ethnography and ethnology is 
not sharp and may be compared to that between 
geography and geology. This article is limited al- 
most completely to comparative ethnology. 

Oscar Lewis (1956) gives an excellent idea of 
the contemporary scope of comparative ethnology. 
Comparisons may range from two ethnic units 
(societies) to hundreds, the largest sample so far 
being Murdock’s “Ethnographic Atlas” (Murdock 
et al. 1962-1966), which is approaching one thou-
sand ethnic units. They may deal with a few ad-
jacent peoples, or a larger number in a culture 
area, a continent, a hemisphere, or the entire 
world. The content to be compared may vary from 
a single culture element to a long list of elements 
and assemblages of them covering practically every 
aspect of culture. Verne Ray's list of 7,633 culture 
elements for the plateau area of North America 
(1942) is the longest enumerated list so far, and 
Murdock’s world-wide “Ethnographic Atlas” has 
reached nearly one thousand culture trait cate- 
gories. Data for comparison may be based on li- 
brary research, field research, or a combination of 
the two. The purpose of comparisons may be lim- 
ited to uncovering the range and kind of variations 
for the subject at hand and to locating them in 
space and time; or it may be aimed at establishing 
culture area groupings, cross-cultural regularities, 
evolutionary trends, or other hypotheses. The re- 
search design may range all the way from a few 
illustrations of loose-jointed generalizations to a 
rigid statistical method. 

The subject coverage of ethnology includes that 
of social anthropology and sociology, but it is much 
broader. For instance, ethnology also includes tech- 
nology and crafts, plastic and graphic arts, music, 
dancing, oral literature, dream analysis, religion, 
world view, ethics, and ethnomedicine. For a much 
longer list of the subject content of ethnology, see 
the Outline of Cultural Materials (Yale Univ. 1938). 

The dominant trend in nineteenth-century eth-
nology and social anthropology was an evolutionary
explanation of how things came to be as they are. 
In its most extreme form, unilinear evolution, it 
was assumed that culture change came about 
largely from causes operating within single soci- 
eties and that all peoples would, sooner or later, 
evolve through a half-dozen or more stages of 
development in the same sequence if their progress 
were not interrupted by some catastrophe, such as 
military invasion by an alien power [see EVOLU-
TION, article on CULTURAL EVOLUTION].


Historical ethnology


Toward the end of the nineteenth century two 
schools of ethnology were founded, one by Boas in 
the United States and the other by Ratzel and 
Frobenius in Germany. Both schools emphasized 
the historical processes of diffusion and migration. 
Boas’ best demonstrations of the process of relay 
diffusion are to be found in his comparative studies 
of North American Indian folklore (1895; 1916). 
By tracing motifs and tale types among groups of 
contiguous tribes, he showed the overwhelming 
tendency of these phenomena to cluster into areal 
types which crosscut language-family boundaries, 
thus suggesting diffusion. If such material were 
independently invented over and over again, tribes 
separated by great distances would exhibit as many 
resemblances in folklore as neighboring tribes do; 
but because contiguous tribes shared much more 
folklore inventory than distant tribes did, diffusion 
was the obvious explanation. 

Boas’ diffusion emphasis was most fully devel- 
oped by A. L. Kroeber and by Clark Wissler, trained 
as a psychologist but long curator of anthropology 
in the American Museum of Natural History. Al- 
though the earliest American culture area scheme 
was that of Livingstone Farrand (1904), his work 
had less impact than that of Kroeber and Wissler. 
In 1904 Kroeber was the first to classify California 
Indian cultures, and in 1906 Wissler was the first 
to mention major North American areas. Wissler 
(1917) published the first map of culture areas 
for the hemisphere, and Kroeber (1923) followed
Wissler closely in a parallel scheme. In these and 
later works both authors postulated that the most 
significant aspects of culture in each area arose at 
the center and tended to diffuse outward toward 
the margins. On the assumption that all aspects of 
culture diffused at about the same rate, the age- 
area hypothesis, by which the age of a culture 
trait or complex was determined by the extent of 
its geographical distribution, was employed by both 
men. 
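
The age-area inference just described can be sketched in a few lines of Python. The trait names and tribe labels below are invented for illustration, and the number of tribes possessing a trait stands in for the extent of its geographical distribution; this is a toy rendering of the hypothesis under those assumptions, not a reconstruction of the procedure Wissler or Kroeber actually followed.

```python
# Hypothetical data: each trait maps to the set of tribes where it is recorded.
distribution = {
    "basketry": {"A", "B", "C", "D", "E"},
    "pottery": {"B", "C", "D"},
    "loom weaving": {"C"},
}


def age_area_ranking(distribution):
    """Order traits from widest to narrowest distribution.

    Under the age-area assumption (all traits diffuse at about the same
    rate), wider spread is read as greater age, so the first trait in the
    result is the inferred oldest.
    """
    return sorted(distribution, key=lambda t: len(distribution[t]), reverse=True)


ranking = age_area_ranking(distribution)
print(ranking)
```

The ranking is only as good as the equal-rate assumption behind it, which is the point on which the hypothesis was later criticized.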

In addition to the scheme of 15 culture areas 
for the two American continents endorsed by both 
men, each also regarded the area from Mexico to 
Peru as the culture center of a vast Pan-American 
culture area. Kroeber (1923, fig. 35) presented a 
large histogram in which the supposedly oldest 
traits occurred at the bottom, as in archeological 
stratification, and the youngest at the top. Age was 
determined largely by the extent of geographical 
distribution, but typological complexity was also 
taken into account, as well as a little direct 
sequential evidence from archeology. Although
the horizontal dimension represented dispersal of
the culture elements by diffusion and migration, the 
vertical dimension clearly showed an evolution 
from the simple to the complex. Thus, in Mexico 
the sequence included basketry, shamanism, and 
family groups in the earliest level; then patrilineal 
clans, simple weaving frame, domesticated plants, 
pottery, solstitial calendar, stone buildings, town 
life, cotton growing and loom weaving, matrilineal 
clans, textile clothing, priesthood, confederacy, 
sculpture, metallurgy, markets, human sacrifice, 
temples, empire, mathematics, astronomy, cycle 
calendar, writing, and books, in that order. Kroeber 
rejected nineteenth-century unilinear evolution and 
the theory of many independent origins of cultural 
resemblances, but he constructed a new kind of 
evolution with few independent origins and many 
diffusions and migrations from tribe to tribe and 
area to area. The age-area hypothesis demands a
sequential arrangement of the material, and when 
this shows a temporal progression in complexity, 
it becomes evolutionary. 

A major weakness of culture area and age-area
theory is that culture areas are of varying sizes. 
Local developments may originate in the “centers” 
of small areas and tend to spread toward the mar- 
gins, but at the same time new culture elements 
and assemblages may arise in the “centers” of 
larger culture areas and spread by diffusion or mi- 
gration to the smaller ones, thus mixing elements 
of internal and external origin; and still more in- 
vention may occur in “centers” of each hemisphere 
and spread widely throughout many culture areas 
of different sizes in the same hemisphere. The hope 
of unscrambling this sort of mixture without the 
help of archeology and historical linguistics is dim. 
[See ARCHEOLOGY; HISTORY, article on CULTURE
HISTORY; LINGUISTICS, article on HISTORICAL LIN- 
GUISTICS.] 

Wissler’s derivation of historical inferences from 
geographical distributions was so bold an attempt 
that he was heavily criticized, especially by Dixon 
(1928). Kroeber, in contrast, modified his views 
as new evidence came to light, and in 1939 he 
published his well-received “Cultural and Natural 
Areas of Native North America.” This was funda- 
mentally an intuitive scheme of culture areas with- 
out detailed supporting geographical or historical 
evidence. Between the time this book was finished 
—1931—and the time it was published—1939— 
Kroeber supervised the University of California 
Culture Element Survey, which, with the help of 
13 field workers, collected responses to question- 
naires from old informants in 254 localities in 
western North America, from the continental di-
vide to the Pacific and from the Mexican border
to Alaska. This was aimed at collecting enough 
data to produce a definitive taxonomy of the nine- 
teenth-century cultures, which Kroeber hoped 
would lead to a fuller set of interpretations. After 
all this effort, interest in areal classification ebbed 
to the extent that no one has yet used this vast 
quantity of data to produce a much superior areal 
scheme based on a wealth of specific detail. The 
largest comparative study incorporating this cul- 
ture element material is still that of Driver (1941), 
who limited his subject coverage to girls’ puberty 
rites [see CULTURE AREA; KROEBER; WISSLER]. 

In Germany, Friedrich Ratzel (1887) intro- 
duced the “criterion of form,” which argued that 
all specific resemblances in the form of two or 
more museum objects, other than those determined 
by the material from which they were made or the 
use to which they were put, must be explained by 
a single origin and subsequent diffusion to the 
localities from which the museum specimens were 
obtained in the field, no matter how widely sep- 
arated these localities might be. This was a much 
more extreme diffusionist position than that of 
Boas, Boas’ pupils, or even Wissler. Frobenius
(1898) was the first to use the term Kulturkreis,
best translated as “culture area” or “culture region”; 
he also introduced the criterion of quantity, which 
argued that the larger the number of arbitrary 
resemblances not due to the nature of the material 
or the use to which the object was put, the stronger 
the case for diffusion. 

Two early applications of the Kulturkreis rules 
for determining areal clusters, time sequences, and 
dispersal, those of Graebner (1905) and Anker- 
mann (1905), were fairly well received and not 
much less tenable than the works of Wissler. They, 
too, emphasized material culture and arranged their 
data in a series of temporal strata, or Schichten. 
Neither author gave any explicit technique for 
packaging culture elements into Kreise or Schichten, 
but Czekanowski (1911) showed clearly that the 
reality of Ankermann’s two African Kreise could be 
demonstrated with a correlation technique. Using 
Yule’s Q coefficient, Czekanowski intercorrelated 
Ankermann’s 17 traits of material culture among 
47 African tribes and arranged the correlations in 
a single matrix which clearly showed two distinct 
intertrait clusters. When mapped, these clusters 
yielded a twofold areal classification, which con- 
formed to Ankermann’s intuitive grouping. 
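
As a rough illustration of this kind of intertrait correlation, the following Python sketch computes Yule's Q, (ad - bc) / (ad + bc), for every pair of traits in a small presence/absence table. The four traits and six tribes are invented, not Ankermann's 17 traits or 47 tribes, and the code makes no claim to reproduce Czekanowski's actual computations.

```python
def yules_q(x, y):
    """Yule's Q for two equal-length presence/absence (1/0) sequences."""
    a = sum(i * j for i, j in zip(x, y))              # both traits present
    b = sum(i * (1 - j) for i, j in zip(x, y))        # first only
    c = sum((1 - i) * j for i, j in zip(x, y))        # second only
    d = sum((1 - i) * (1 - j) for i, j in zip(x, y))  # both absent
    denom = a * d + b * c
    if denom == 0:
        return None  # Q is undefined for this 2x2 table
    return (a * d - b * c) / denom


# Hypothetical table: each trait is scored 1/0 across the same six tribes.
traits = {
    "trait_1": [1, 1, 1, 0, 0, 0],
    "trait_2": [1, 1, 0, 0, 0, 0],
    "trait_3": [0, 0, 0, 1, 1, 1],
    "trait_4": [0, 0, 1, 1, 1, 0],
}

names = list(traits)
q_matrix = {r: {c: yules_q(traits[r], traits[c]) for c in names} for r in names}

for r in names:
    print(r, [q_matrix[r][c] for c in names])
```

In this toy table trait_1 and trait_2 are positively associated with each other and negatively with trait_3 and trait_4, which is the pattern of two intertrait clusters that a matrix arrangement is meant to expose. Note that Q reaches +1 or -1 whenever one cell of the 2x2 table is empty, so small samples produce extreme values.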

Although no one today subscribes to the idea of 
single origin and subsequent world-wide dispersal 
by migration and diffusion of any of the Kreise or 
Schichten of, say, Schmidt and Koppers (1924),
some of the correlations and functional associa- 
tions of the Kreise and Schichten have been con- 
firmed or repostulated by later researchers of dif- 
ferent schools. For example, the correlation between 
moieties and matrilineal descent, challenged by 
Lowie (1937, p. 182), has been confirmed by 
Murdock (1949, p. 49). The functional complex 
of hoe farming by women, matrilocal residence, 
matrilineal descent, monogamy, and bride service, 
and the temporal sequence from division of labor 
to residence to descent has been confirmed or 
postulated by Murdock (1949), Driver (1956), 
and Aberle (1961). 

The world-wide inferred temporal strata of the 
Kulturkreis school produce an evolution which 
differs from that of the nineteenth century in 
calling for single origins or a very small number of 
independent origins and subsequent dispersal of 
the phenomena by migration and diffusion; but 
since such dispersals are multiple, they overlap 
each other geographically and produce a “layer 
cake” of temporal stages. Kroeber and Wissler 
confined their postulated dispersals to a culture 
area or a hemisphere for the most part, while 
Schmidt and Koppers more often included the en- 
tire world [see GRAEBNER; KOPPERS; RATZEL; 
SCHMIDT].

Kroeber’s interest in culture areas and diffusion 
stimulated Clements (Clements et al. 1926; Clem- 
ents 1928; 1931) and Driver (see Driver & Kroe- 
ber 1932) to determine areal groupings of ethnic 
units by intercorrelating their inventories of cul- 
ture traits. These papers were read by three young 
men in Europe who were familiar with Kulturkreis 
theory and the intertrait correlations of Czeka- 
nowski (1911). Almost simultaneously they pub- 
lished four papers which combined the intertrait 
correlations of Czekanowski with the intertribe 
correlations of Kroeber, Clements, and Driver 
(Fürer-Haimendorf 1934; Klimek 1935; Milke
1935; Klimek & Milke 1935). They computed the 
coefficient Q for three sets of traits: intertribe,
intertrait, and tribal cluster with trait cluster. All 
three assembled their coefficients in rectangular 
matrices and converted them to shades of gray (as 
Czekanowski had done in 1911) for quick compre- 
hension. They also mapped their clusters. Inter- 
pretation of the resemblances and the groupings 
was limited to historical factors, but because all of 
these studies were confined to small regions of 
culture area size, this wholesale historical explana- 
tion was probably not far from the truth although 
incomplete. A simplified explanation in English of 
their technique may be found in Driver (1961). 
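
The combination of intertribe and intertrait correlations can be sketched as follows, assuming a single presence/absence table read in both directions. The tribes, traits, and shading thresholds are hypothetical, text characters stand in for shades of gray, and the third relation (tribal cluster with trait cluster) is not attempted here.

```python
def yules_q(x, y):
    """Yule's Q for two equal-length presence/absence (1/0) sequences."""
    a = sum(i * j for i, j in zip(x, y))
    b = sum(i * (1 - j) for i, j in zip(x, y))
    c = sum((1 - i) * j for i, j in zip(x, y))
    d = sum((1 - i) * (1 - j) for i, j in zip(x, y))
    denom = a * d + b * c
    return None if denom == 0 else (a * d - b * c) / denom


def q_matrix(vectors):
    """Square matrix of Q values for every pair of vectors."""
    return [[yules_q(u, v) for v in vectors] for u in vectors]


def shade(q):
    """Map a coefficient to a printable 'shade' (thresholds are arbitrary)."""
    if q is None:
        return "?"
    if q >= 0.5:
        return "#"  # dark: strong positive association
    if q >= 0.0:
        return "+"  # medium
    return "."      # light: negative association


# Hypothetical table: rows are tribes, columns are four traits.
table = {
    "tribe_A": [1, 1, 0, 0],
    "tribe_B": [1, 1, 1, 0],
    "tribe_C": [0, 0, 1, 1],
    "tribe_D": [0, 1, 1, 1],
}

tribe_vectors = list(table.values())
trait_vectors = [list(col) for col in zip(*tribe_vectors)]  # transpose

intertribe = q_matrix(tribe_vectors)  # tribes compared by shared traits
intertrait = q_matrix(trait_vectors)  # traits compared by shared tribes

for row in intertribe:
    print("".join(shade(q) for q in row))
```

Reading the same table by rows and then by columns is what allows one body of data to yield both an areal grouping of tribes and a grouping of traits.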

Knowledge of statistical mechanics was so scant
among ethnologists of both schools that few at the
time understood that these studies integrated the ap- 
proaches of the American culture area and the 
German Kulturkreis schools. They offered an ob- 
jective method for determining both intertribal 
(culture area) and intertrait (Kulturschicht-Kul- 
turkreis) groupings. If every writer in these schools 
had empirically demonstrated his intertribal and 
intertrait groupings in this manner, the differences 
between the schools would have been less marked 
and much useless polemic would not have been 
written. Differences in interpretation of the data 
grouped in these ways still exist, but the reality 
of the groupings themselves could have been estab- 
lished objectively. 

Driver, in his “Girls’ Puberty Rites in Western 
North America” (1941), used a multiple clustering 
technique parallel to that of the Europeans, but his 
interpretation of the results went beyond any of 
that group. His area was larger and posed more 
problems of interpretation. He distinguished sev- 
eral kinds of resemblances: universals, cultural 
heritages spread by migrations, relayed diffusions, 
and convergences. He pointed out (1) that ele- 
ments of universal or near universal occurrence 
should not be used to establish historical connec- 
tion between ethnic units in limited areas; (2) that 
elements closely associated with a language family 
might be regarded as a cultural heritage from the 
protoculture associated with the protolanguage of 
the group; (3) that continuously distributed re- 
semblances which crossed over language-family 
boundaries were best regarded as diffusions; and 
(4) that the group ceremony for pubescent girls 
among the Apacheans represented an independent 
origin and a convergence with that held in south- 
ern California. He also wrote a chapter on the psy- 
chological aspects of menstrual taboos, described 
the functional position and significance of the 
girls’ puberty rite in each of the subareas into 
which the entire area had been divided, and as- 
sessed the influence of geographical environment 
on the data. 


Functionalism 

In addition to the American and German histori- 
cal schools just described, there arose in the early 
part of this century the functional schools, which 
rebelled against not only the nineteenth-century 
unilinear evolutionists but also the culture histori- 
calists. Malinowski and Radcliffe-Brown are both 
identified as functionalists, although some signifi- 
cant divergences exist. Malinowski was generally 
both antihistorical and anticomparative, while 
Radcliffe-Brown was antihistorical (except in his
later years) but never anticomparative. For this
reason the latter figures more prominently in 
comparative ethnology and will be singled out for 
brief appraisal here. 

Radcliffe-Brown discovered a number of gen- 
eralizations that would now be called correlations. 
One of the earliest (1913) was his discovery that 
preferred marriage to a first cross-cousin was as- 
sociated with kinship terminology of one type, 
while preferred marriage to a second cross-cousin 
was found with kinship terminology of another 
type. This and other perfect or high correlations 
are given in his important The Social Organization 
of Australian Tribes (1931), where he used the 
term “correlation” but did not compute any coeffi- 
cients. In 1935 he wrote: “. . . we can expect to 
find, in the majority of human societies, a fairly 
close correlation between the terminological classi- 
fication of kindred or relatives and the social classifi- 
cation,” as revealed “in the attitudes and behavior 
of relatives to one another” (Radcliffe-Brown 1935, 
p. 531). His greatest contribution lies in the em- 
phasis on this relationship. 

The principal opponent of Radcliffe-Brown was 
Kroeber, who said: 


Kin-term systems, . . . are subject to modification from 
within and without. There is always a sufficient num- 
ber of such “accidents” to disguise the basic patterns 
more or less. . . . the essential features of the pattern 
are . . . likely to be the ones which have the greatest 
historic depth. The search for them therefore implies 
a willingness and ability to view data historically. 
Without such willingness, it is as good as impossible 
to separate the significant from the trivial . . . and the 
work done becomes merely sociological, an affair of 
schemes, ... (Kroeber 1934, pp. 21-22) 


This position is echoed by E. W. Gifford, a col- 
league of Kroeber, who wrote: “. . . kinship sys- 
tems are first of all linguistic phenomena . . . and 
only secondarily social phenomena. As such they
. . . constitute an archaic and highly refractory
nucleus, which yields unevenly and only here and 
there to influences from . . . social structure” (Gif- 
ford 1940, pp. 193-194). Kroeber (1936) modified 
his view in the direction of that of Radcliffe-Brown 
in a conciliatory paper, and a year later (1937) 
made the first reconstruction of a protokinship 
terminology for a language family. 

No historian of ethnological thought has yet 
pointed out that the opposing views of Radcliffe- 
Brown and Kroeber stem directly from the areas 
with which each was most familiar. The kinship 
terminologies and social organization of native 
Australia are among the most highly integrated 
of any in the world. High correlations are the rule 
rather than the exception. California native cultures, 
in contrast, are among the least integrated 
in this respect, and correlations are low or non- 
existent, as Tax (1937), a pupil of Radcliffe-Brown, 
showed in an excellent and very thorough study. 
Australia remained one of the most isolated areas 
in the world, with little contact with the outside, 
while California, in contrast, has been exposed to 
contact on all sides except the Pacific, and the 
multiplicity of language families and phyla sug- 
gests much migration into and out of the area. 
Australian social organizations and languages had 
centuries and millennia to simmer down and be- 
come integrated, while those of California were 
constantly being disturbed by intrusions from with- 
out. Radcliffe-Brown and Kroeber both failed to 
understand the limitations of their samples, and it 
was not until Murdock’s Social Structure (1949) 
that an adequate sample and statistical technique 
were combined to produce more tenable generali- 
zations on this subject [see KINSHIP]. 


Cross-cultural studies 


In the United States cross-cultural studies were 
founded by G. P. Murdock and carried on by 
his pupils, including J. W. M. Whiting, Whiting’s 
pupils, and others. Murdock has concentrated on 
kinship and social organization, and his Social 
Structure (1949) is a monumental work in its 
field. He studied the association between rules of 
marriage, residence, and descent, as well as kinship 
terminology, in 250 societies in all the major areas 
of the world. He used no explicit sampling tech- 
nique, but his selection was large and widely dis- 
tributed, and his results have not been seriously 
challenged to date. Using functional theory, he ran 
off a list of hypotheses and then confirmed most 
of them with the Q coefficient of association and 
chi-square. His general conclusions were that the 
semantic categories of kinship terms are the result 
of social organization rather than the cause, and 
that they are determined principally by forms of 
marital residence and rules of descent. Marriage 
prescriptions showed practically no correlation with 
kinship terminology, while residence and descent 
yielded many significant correlations. 
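
The two measures named above can be illustrated for a single fourfold (2 x 2) table. The sketch below is not Murdock's own computation: the counts are invented for illustration and the function names are hypothetical. It assumes the usual textbook forms, Q = (ad - bc)/(ad + bc) and the uncorrected chi-square with one degree of freedom.

```python
def yules_q(a, b, c, d):
    """Yule's Q coefficient of association for the table [[a, b], [c, d]].

    Undefined (division by zero) when a*d + b*c == 0.
    """
    return (a * d - b * c) / (a * d + b * c)


def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square (1 degree of freedom) for a 2x2 table."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical counts of societies (NOT taken from Murdock's sample):
# rows    = patrilocal residence present / absent
# columns = a given kinship terminology present / absent
a, b, c, d = 40, 10, 15, 35

q = yules_q(a, b, c, d)
chi2 = chi_square_2x2(a, b, c, d)
print(f"Q = {q:.3f}, chi-square = {chi2:.2f}")
```

A chi-square above the tabled value of 3.841 would conventionally be called significant at the 5 per cent level, provided the cases are independent of one another.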

Murdock further postulated three kinds of de- 
velopmental cycles, two of which began with the 
dominance of one sex in the economy, followed 
by corresponding forms of residence, descent, and 
kinship terminology. Thus a patri-dominated econ- 
omy would give rise to patrilocal residence, patri- 
lineal descent, and Iroquoian or Omaha kinship 
categories. In a similar fashion a matri-dominated 
economy would produce matrilocal residence, ma- 
trilineal descent, and Iroquoian or Crow kinship 
classification. A sexually balanced economy, in 
turn, would give rise to bilocal residence, bilateral 
descent, and Eskimo or Hawaiian kinship termi- 
nology. This cyclical theory was confirmed statisti- 
cally by Driver (1956), who found that the cor- 
relations based on about 250 North American 
peoples could be arranged in a matrix which could 
be explained in this way. However, Driver failed 
to measure the potency of genetic factors at this 
time, although he did so in 1966 in a more meth- 
odologically rigorous study (Driver 1966). 

Whiting and his followers have centered their 
interests in the socialization process, the ways and 
means by which a child acquires the culture in 
which he is born and reared. The basic work in 
this field is Whiting and Child (1953). Before 
their work, good field studies in this subject were 
so rare that the authors were able to assemble 
fewer than fifty societies for their comparisons. 
They used no sampling technique because the 
total number of tribes was so small, but since 
every continental area in the world, plus Oceania, 
was represented, their sample is a rough approxi- 
mation to a random one. Three “judges” carefully 
read the same field reports and coded the various 
societies independently on multistep rating scales. 
The published ratings are the scores of all three 
judges combined by summation. 

Whiting and Child tested a number of neo- 
Freudian hypotheses with this method. For in- 
stance, they divided severity of socialization into 
the following five aspects, each of which was rated 
separately by each of the “judges”: anal, oral, 
sexual, dependence, aggression. The ratings on 
these five aspects were intercorrelated, and the 
highest positive correlation turned out to be that 
between the oral and dependence aspects. These 
ratings were also correlated with other aspects of 
culture; for instance, the amount of oral socialization 
anxiety showed a high degree of relationship with the 
presence of oral explanations of illness. On the other 
hand, they found a zero correlation between anal 
socialization anxiety and anal explanations of illness. 
[See SOCIALIZATION.] 

A later and more transparent study by Whiting, 
Kluckhohn, and Anthony (1958) revealed a posi- 
tive correlation between male initiation ceremonies 
at puberty, patrilocal residence, exclusive mother-infant 
sleeping arrangements, and a long post-partum 
sex taboo. These variables are also positively 
correlated with a long lactation period and 
other kinds of long and close association of the 
infant with the mother. The authors’ explanation 
was that the resulting strong attachment to the 
mother had to be broken by an initiation rite which 
separated the boy from his mother entirely and 
prepared him for an adult masculine role. Societies 
which lacked the long and close association of 
mother and young son did not need an initiation 
because there was no strong attachment to sever. 
Although the results achieved so far in this difficult 
field of psychological ethnology are less impressive 
than those in the field of social organization, the 
difference is due to the inherent complexities of 
the problems and the scarcity of field material 
rather than to methodology. 

These cross-cultural studies of world-wide scope 
stem from the evolutionary interests of the nine- 
teenth century. When Tylor (1888) read his now 
famous paper in which he anticipated correlation 
methods, Francis Galton challenged the historical 
independence of Tylor’s 350 cases (societies). Ty- 
lor’s use of probabilities and his conclusion that, 
for instance, avoidance between mother-in-law and son-in-law 
was caused by matrilocal residence implied that 
this form of residence occurred first in each society 
and that it gave rise to this form of avoidance 
independently over and over again in each society 
where the avoidance was found. This is a 
functional-causal-evolutionary explanation. When such 
explanations include the Oedipus complex or the 
incest taboo, they may be called 
psychofunctional-causal-evolutionary. The contrasting explanation 
has been called geographical-historical, historical, 
or genetic; it holds that once a custom becomes 
established, it may be relayed from society to so- 
ciety by means of intermarriage and other kinds 
of contact. There is no need to postulate any 
necessary antecedent because a behavior may 
spread like a fashion. Continuity of geographical 
distribution is generally regarded as evidence of 
such diffusion. Tylor failed to provide an answer 
to the question Galton raised, and it plagued cross- 
cultural research until the 1950s. 

Stephens in 1959 paired his 56 societies on the 
basis of membership in the same genetic language 
family and geographical proximity, and he concluded 
that geographical-historical factors determined 
about as much association as did psychofunctional 
ones. Landauer and Whiting (1964) 
compared, in a similar way, associations found 
within culture areas with those found across cul- 
ture areas and concluded that the latter were rela- 
tively free of historical factors. Raoul Naroll (1961) 
and Naroll and D’Andrade (1963) have developed 
other specific techniques to show the effect of 
genetic versus psychofunctional-evolutionary factors 
on correlations. Their general conclusion is 
that both kinds of explanations must be used to 
account for most correlations and that they are 
of about equal potency. Cultural behaviors with 
functional or causal relationships, such as unilateral 
descent, cross-cousin marriage, and corresponding 
types of kinship terminology, tend to 
diffuse as a unit; or if part of such an assemblage 
is already present, the other members will diffuse 
more readily because they are compatible with it. 
Thus both internal and external factors determine 
the cultural inventory of societies. 

Driver (1966) employed still another method 
and found that genetic factors were a little more 
powerful than psychofunctional-evolutionary ones 
but that both were at work. He used a sample of 
277 peoples from native North America alone and 
postulated only four or five historically independ- 
ent origins of the kin avoidances, which formed 
the subject of his study. This was determined by 
a combination of areal clustering, culture-area 
membership, and language-family affiliation. If 
other areas show a similar number of origins, this 
would add up to no more than about twenty for 
the world. With only twenty cases for a test of the 
significance of correlations involving kin avoid- 
ances, it would require a rather high correlation 
to achieve significance. Many of the cross-cultural 
correlations so far computed, for which significance 
is claimed on the assumption of the historical 
independence of every positive instance, would by 
this criterion be judged to be not significant at all. 
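
The arithmetic behind this remark can be sketched for the simplest case, a fourfold presence-absence table, where chi-square equals n times the square of the phi coefficient. Assuming the conventional 5 per cent level (tabled chi-square of 3.841 for one degree of freedom), the smallest phi that reaches significance is the square root of 3.841/n. The function name is hypothetical, and the choice of test is an assumption made for illustration, not Driver's stated procedure.

```python
import math

# Standard tabled critical value of chi-square, 5 per cent level, 1 degree of freedom.
CHI2_CRIT_05_DF1 = 3.841


def min_significant_phi(n_independent_cases, chi2_crit=CHI2_CRIT_05_DF1):
    """Smallest |phi| significant in a 2x2 table, using chi-square = n * phi**2."""
    return math.sqrt(chi2_crit / n_independent_cases)


# Compare twenty independent origins with Driver's 277 peoples counted one by one.
for n in (20, 277):
    print(n, round(min_significant_phi(n), 3))
# 20  -> 0.438
# 277 -> 0.118
```

On these assumptions a correlation of about 0.12 passes the test when all 277 peoples are treated as independent, whereas about 0.44 is needed when only twenty independent cases are admitted.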

The principal weakness of most cross-cultural 
studies so far is that their instances hop, skip, and 
jump across the map in such a manner that con- 
tinuity of geographical distribution and other clues 
to genetic explanations are missing. A world-wide 
study of a well-reported subject, such as kinship 
behavior, would require data from at least a thousand 
societies in order to ensure sufficient geographical 
continuity to permit valid inferences 
about the number of independent origins of the 
phenomena. This has not yet been achieved. 

Although significant positive correlations in 
cross-cultural research are relatively easy to find, 
causal relationships are more difficult to establish, 
and the direction of causation is still more elusive. 
Nevertheless, progress has been made in this re- 
spect. The direction of causation and sequence 
of stages in evolution have been determined by 
constructing a Guttman cumulative scale (Carneiro 
& Tobias 1963), and the direction of cycling by 
arranging correlations in a temporal matrix (Driver 
1956; Ascher & Ascher 1963; Blalock 1960). 
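
The idea of a cumulative scale can be illustrated with a small sketch. In a perfect Guttman scale a society that has a rarer trait also has every more widespread one, so that the traits fall into a single order. The code below counts departures from that ideal and reports a coefficient of reproducibility. The societies, traits, and function name are invented, and the error-counting rule (comparing each row with the ideal pattern for its score) is only one of several in use; it is not offered as the procedure of Carneiro and Tobias.

```python
def guttman_reproducibility(table):
    """Coefficient of reproducibility for a presence/absence table.

    table maps each society to a list of 0/1 entries, one per trait,
    with the traits in the same order for every society.
    """
    rows = list(table.values())
    n_traits = len(rows[0])
    # Order traits from most to least widespread.
    totals = [sum(row[j] for row in rows) for j in range(n_traits)]
    order = sorted(range(n_traits), key=lambda j: -totals[j])
    errors = 0
    for row in rows:
        score = sum(row)
        # Ideal pattern: the `score` most widespread traits are present.
        ideal = [1] * score + [0] * (n_traits - score)
        observed = [row[j] for j in order]
        errors += sum(o != i for o, i in zip(observed, ideal))
    return 1 - errors / (len(rows) * n_traits)


# Hypothetical data: five societies, four traits (A, B, C, D).
societies = {
    "s1": [1, 1, 1, 1],
    "s2": [1, 1, 1, 0],
    "s3": [1, 1, 0, 0],
    "s4": [1, 0, 0, 0],
    "s5": [1, 0, 1, 0],  # has C without B: departs from the scale
}
cr = guttman_reproducibility(societies)
print(round(cr, 2))  # 0.9
```

A coefficient near 1 is what licenses reading the trait order as a sequence; values around 0.90 are conventionally treated as the lower bound of an acceptable scale.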

Naroll (1964) has drawn attention to the many 
problems surrounding the nature of the ethnic unit 
used in cross-cultural research. Because this is the 
unit counted in all intertrait cross-cultural correlations, 
its definition is crucial to such studies. Although 
there were more differences than agreements in the 
comments on this article, the major 
issues are now aboveboard, and refinements of 
definitions of ethnic units will surely follow. The 
nature of culture traits or variables also needs to 
be re-evaluated. 

Coult and Habenstein (1965) give over 500 
pages of cross tabulations of raw frequencies, phi- 
coefficients, and tests of significance for the 210 
culture categories and 565 ethnic units of Mur- 
dock’s 1957 sample. 

Textor (1966) offers a still more massive pack- 
age of similar measures computed largely from the 
data of Murdock’s 1962-1966 sample. These two 
compilations provide important sources of refer- 
ence to tens of thousands of relationships which 
can test many hypotheses, but they are so myopic 
that they may obscure some of the broader rela- 
tionships within the data. 

Sawyer and Levine (1966) have reduced Mur- 
dock’s 1957 sample to thirty variables, intercor- 
related and factor-analyzed these variables, and 
produced some compact generalizations about the 
whole sample which run only to article length. 
They have also run the same correlations sepa- 
rately for each of the six areas into which Murdock 
divided the world and have found rather marked 
areal differences. Some correlations are signifi- 
cantly positive in one area and significantly nega- 
tive in another, or zero in one area and signifi- 
cantly positive or negative in another. Such areal 
differences can only be explained by ecological and 
historical factors. They cast doubt on the im- 
portance of universal “laws” or regularities but do 
not demolish such concepts entirely. What is 
needed next is a series of correlation studies inter- 
mediate between the highly particularized com- 
puter print-outs and the grosser generalizations. 

It is thus apparent that recent studies have made 
considerable gains in understanding both genetic 
and evolutionary relationships and in the statistical 
rigor with which these relationships have been 
demonstrated. Although few ethnologists claim that 
their explanations of relationships among the cul- 
tures of nonliterate peoples are of timeless infalli- 
bility, validation of hypotheses has reached a 
respectable level which compares favorably with 
that of other behavioral sciences. 

HAROLD E. DRIVER 


[See also ANTHROPOLOGY, article on THE COMPARATIVE 
METHOD IN ANTHROPOLOGY. ] 


BIBLIOGRAPHY 
ABERLE, DAVID F. 1961 Matrilineal Descent in Cross-cultural 
Perspective. Pages 655-727 in David M. 
Schneider and Kathleen Gough (editors), Matrilineal 
Kinship. Berkeley: Univ. of California Press. 




ANKERMANN, B. 1905 Kulturkreise und Kulturschichten 
in Afrika. Zeitschrift für Ethnologie 37:54-84. 

ASCHER, MARCIA; and ASCHER, ROBERT 1963 Chronological 
Ordering by Computer. American Anthropologist 
New Series 65:1045-1052. 

BLALOCK, H. M. JR. 1960 Correlation Analysis and 
Causal Inferences. American Anthropologist New 
Series 62:624-631. 

BOAS, FRANZ 1895 Indianische Sagen von der nordpacifischen 
Küste Amerikas. Berlin: Asher. 

BOAS, FRANZ 1916 Tsimshian Mythology. Smithsonian 
Institution, Bureau of American Ethnology, Annual 
Report No. 31. Washington: Government Printing 
Office. 

CARNEIRO, ROBERT L. 1962 Scale Analysis as an Instru- 
ment for the Study of Cultural Evolution. Southwest- 
ern Journal of Anthropology 18, no. 2:149-169. 

CARNEIRO, ROBERT L.; and TOBIAS, STEPHEN F. 1963 The 
Application of Scale Analysis to the Study of Cultural 
Evolution. New York Academy of Sciences, Transac- 
tions Second Series 26:196-207. 

CLEMENTS, FORREST E. 1928 Quantitative Method in 
Ethnography. American Anthropologist New Series 30: 
295-310. 

CLEMENTS, FORREST E. 1931 Plains Indian Tribal Correlations 
With Sun Dance Data. American Anthropologist 
New Series 33:216-227. 

CLEMENTS, FORREST E.; SCHENCK, SARA M.; and BROWN, 
T. K. 1926 A New Objective Method for Showing 
Special Relationships. American Anthropologist New 
Series 28:585-604. 


COULT, ALLAN D.; and HABENSTEIN, ROBERT W. 1965 
Cross Tabulations of Murdock’s “World Ethnographic 
Sample.” Columbia: Univ. of Missouri Press. > See 
Murdock 1957. 

CZEKANOWSKI, JAN 1911 Objektive Kriterien in der 
Ethnologie. Deutsche Gesellschaft für Anthropologie, 
Ethnologie, und Urgeschichte, Korrespondenz-Blatt 
42:1-5. 

DIXON, ROLAND B. 1928 The Building of Cultures. New 
York: Scribner. 

DRIVER, HAROLD E. 1941 Girls’ Puberty Rites in Western 
North America. California, University of, Anthropological 
Records 6:21-90. 

DRIVER, HAROLD E. 1956 An Integration of Functional, 
Evolutionary, and Historical Theory by Means of Cor- 
relations. Indiana University, Publications in Anthro- 
pology and Linguistics Memoir No. 12. 

DRIVER, HAROLD E. 1961 Introduction to Statistics for 
Comparative Research. Pages 303-331 in Frank W. 
Moore (editor), Readings in Cross-cultural Methodol- 
ogy. New Haven: Human Relations Area Files. 

DRIVER, HAROLD E. 1962 The Contribution of A. L. 
Kroeber to Culture Area Theory and Practice. Indiana 
University, Publications in Anthropology and Linguis- 
tics Memoir No. 18. 

DRIVER, HAROLD E. 1966 Geographical-Historical Versus 
Psycho-Functional Explanations of Kin Avoidances. 
Current Anthropology 7:131-148. 

DRIVER, HAROLD E.; and KROEBER, A. L. 1932 Quantitative 
Expression of Cultural Relationships. California, 
University of, Publications in American Archaeology 
and Ethnology 31:211-256. 

DRIVER, HAROLD E.; and MASSEY, WILLIAM C. 1957 
Comparative Studies of North American Indians. 
American Philosophical Society, Transactions 47:165- 
456. 




FARRAND, LIVINGSTON (1904) 1964 The Basis of American 
History. New York: Harper. 

FROBENIUS, LEO 1898 Der Ursprung der afrikanischen 
Kulturen. Berlin: Gebrüder Borntraeger. 

FÜRER-HAIMENDORF, C. von 1934 Völker- und Kultur- 
gruppen im westlich Hinterindien, dargestellt mit 
Hilfe des statistischen Verfahrens. Anthropos 29:421- 
440. 

GIFFORD, E. W. 1940 A Problem in Kinship Terminology. 
American Anthropologist New Series 42:190-194. 

GRAEBNER, Fritz 1905 Kulturkreise und Kulturschichten 
in Ozeanien. Zeitschrift für Ethnologie 37:28-53. 

GRAEBNER, Fritz 1911 Methode der Ethnologie. Heidel- 
berg (Germany): Winter. 

Hymes, DELL H. (editor) 1965 The Use of Computers 
in Anthropology. The Hague: Mouton. 

KLIMEK, STANISLAW 1935 The Structure of California 
Indian Culture. California, University of, Publications 
in American Archaeology and Ethnology 37: 1-70. 

KLIMEK, STANISLAW; and MILKE, WILHELM 1935 An 
Analysis of the Material Culture of the Tupi Peoples. 
American Anthropologist New Series 37:71-91. 

KLUCKHOHN, CLYDE (1953) 1961 Universal Categories 
of Culture. Pages 89-105 in Frank W. Moore (editor), 
Readings in Cross-cultural Methodology. New Haven: 
Human Relations Area Files Press. > First published 
in International Symposium on Anthropology, 1952, 
Anthropology Today. Edited by A. L. Kroeber. 

KROEBER, ALFRED L. 1904 Types of Indian Culture in 
California. California, University of, Publications in 
American Archaeology and Ethnology 2:81-103. 

KROEBER, ALFRED L. (1923) 1948 Anthropology: Race, 
Language, Culture, Psychology, Prehistory. New ed., 
rev. New York: Harcourt. > First published as An- 
thropology. 

KROEBER, ALFRED L. 1934 Yurok and Neighboring Kin 
Term Systems. California, University of, Publications 
in American Archaeology and Ethnology 35: 15-22. 

KROEBER, ALFRED L. 1936 Kinship and History. Ameri- 
can Anthropologist New Series 38:338-341. 

KROEBER, ALFRED L. 1937 Athabascan Kin Term Sys- 
tems. American Anthropologist New Series 39:602- 
608. 

KROEBER, ALFRED L. 1939 Cultural and Natural Areas 
of Native North America. California, University of, 
Publications in American Archaeology and Ethnology 
38: 1-242. 

KROEBER, ALFRED L. 1940 Stimulus Diffusion. American 
Anthropologist New Series 42:1-20. 

KROEBER, ALFRED L. 1962 A Roster of Civilizations and 
Culture. Viking Fund Publications in Anthropology, 
No. 33. New York: The Fund. 

LANDAUER, THOMAS K.; and WHITING, JOHN W. M. 1964 
Infantile Stimulation and Adult Stature of Human 
Males. American Anthropologist New Series 66:1007- 
1028. 

LEWIS, OSCAR 1956 Comparisons in Cultural Anthropology. 
Pages 259-292 in Yearbook of Anthropology, 
1955, Current Anthropology: A Supplement to Anthro- 
pology Today. Edited by William L. Thomas, Jr. Univ. 
of Chicago Press. 

LOWIE, ROBERT H. 1937 The History of Ethnological 
Theory. New York: Farrar & Rinehart. 

MILKE, WILHELM 1935 Südostmelanesien: Eine ethnostatistische 
Analyse. Würzburg (Germany): Triltsch. 

MURDOCK, GEORGE P. 1945 The Common Denominator 
of Cultures. Pages 123-142 in Ralph Linton (editor), 
The Science of Man in the World Crisis. New York: 
Columbia Univ. Press. 

MURDOCK, GEORGE P. 1949 Social Structure. New York: 
Macmillan. > A paperback edition was published in 
1965 by the Free Press. 

MURDOCK, GEORGE P. 1957 World Ethnographic Sample. 
American Anthropologist New Series 59:664-687. 

MURDOCK, GEORGE P. et al. 1962-1966 Ethnographic 
Atlas. Ethnology 1-4: last article in every issue. 

NAROLL, RAOUL 1961 Two Solutions to Galton’s Problem. 
Philosophy of Science 28:15-39. 

NAROLL, RAOUL 1964 On Ethnic Unit Classification. 
Current Anthropology 5:283-312. 

NAROLL, RAOUL; and D’ANDRADE, ROY G. 1963 Two 
Further Solutions to Galton’s Problem. American An- 
thropologist New Series 65: 1053-1067. 

RADCLIFFE-BROWN, A. R. 1913 Three Tribes of Western 
Australia. Journal of the Royal Anthropological Insti- 
tute of Great Britain and Ireland 43:143-194. 

RADCLIFFE-BROWN, A. R. (1931) 1948 The Social Organization 
of Australian Tribes. Glencoe, Ill.: Free 
Press. > First published in Volume 1 of Oceania. 

RADCLIFFE-BROWN, A. R. 1935 Kinship Terminologies 
in California. American Anthropologist New Series 37: 
530-535. 

RATZEL, FRIEDRICH 1887 Die geographische Verbreit- 
ung des Bogens und der Pfeile in Afrika. Akademie 
der Wissenschaften, Leipzig, Berichte über die Verhandlungen 
Philologisch-historische Klasse 39:233-252. 

Ray, VERNE F. 1942 Culture Element Distributions: 
XXII Plateau. California, University of, Anthropological 
Records 8:99-258. 

SAWYER, JACK; and LEVINE, ROBERT A. 1966 Cultural 
Dimensions: A Factor Analysis of the World Ethno- 
graphic Sample. American Anthropologist New Series 
68: 708-731. 

SCHMIDT, WILHELM (1937) 1939 The Culture Historical 
Method of Ethnology: The Scientific Approach to 
the Racial Question. New York: Fortuny. > First 
published as Handbuch der Methode der kulturhi- 
storischen Ethnologie. 

SCHMIDT, WILHELM; and KOPPERS, WILHELM 1924 
Völker und Kulturen. Regensburg (Germany): Habbel. 

TAX, SOL (1937) 1955 Some Problems of Social Organization. 
Pages 3-34 in Fred Eggan (editor), Social 
Anthropology of North American Tribes. 2d ed. Univ. 
of Chicago Press. 

TEXTOR, ROBERT B. 1966 A Cross-cultural Summary. 
Unpublished manuscript, Human Relations Area Files. 

TYLOR, EDWARD B. (1888) 1961 On a Method of Investigating 
the Development of Institutions: Applied 
to Laws of Marriage and Descent. Pages 1-28 in Frank 
W. Moore (editor), Readings in Cross-cultural Method- 
ology. New Haven: Human Relations Area Files. > 
First published in Volume 18 of the Journal of the 
Royal Anthropological Institute of Great Britain and 
Ireland. 

WHITING, JOHN W. M.; and CHILD, IRVIN L. 1953 Child 
Training and Personality: A Cross-cultural Study. New 
Haven: Yale Univ. Press. > A paperback edition was 
published in 1962. 

WHITING, JOHN W. M.; KLUCKHOHN, RICHARD; and ANTHONY, 
ALBERT 1958 The Function of Male Initiation 
Ceremonies at Puberty. Pages 359-371 in Society 
for the Psychological Study of Social Issues, Readings 
in Social Psychology. 3d ed. New York: Holt. 


WISSLER, CLARK 1906 Ethnic Types and Isolation. Science 
23:147-149. 

WISSLER, CLARK (1917) 1957 The American Indian: An 
Introduction to the Anthropology of the New World. 
3d ed. Gloucester, Mass.: Smith. 

WISSLER, CLARK (1923) 1938 Man and Culture. New 
York: Crowell. 

YALE UNIVERSITY, INSTITUTE OF HUMAN RELATIONS 
(1938) 1961 Outline of Cultural Materials. 4th ed., 
revised by George P. Murdock et al. New Haven: Hu- 
man Relations Area Files. 


ETHNOMEDICINE 
See under MEDICAL CARE. 


ETHNOMUSICOLOGY 
See under Music. 


ETHNOSCIENCE 
See ETHNOGRAPHY. 


ETHOLOGY 


Ethology has existed as a concept since 1762 
when it was defined in France as the study of ani- 
mal behavior. In this sense it carries the same 
meaning as the Greek word “ethos,” from which 
the modern term ethology is derived. However, a 
separate meaning of the word ethology, related to 
the term “ethics,” has been used in the Anglo-Saxon 
literature to define the “science of character.” 

The founder of modern ethology is Konrad Z. 
Lorenz, physician, zoologist, and comparative anat- 
omist. By systematic application of biological re- 
search methods to the analysis of animal behavior, 
he provided the initial impetus in the 1930s. The 
first modern ethology textbook, The Study of In- 
stinct, was written by Nikolaas Tinbergen in 1951, 
and E. H. Hess (1962) and Eibl-Eibesfeldt (1966) 
recently produced summaries of the modern con- 
cepts of behavior. The observations of a number 
of pioneers, including Spalding (1873), Darwin 
(1872), Whitman (1898), Altum (1868), Hein- 
roth (1911), and Craig (1918), awakened scien- 
tific interest in animal behavior, and ethology came 
to be considered an independent branch of zoology 
around 1910. As with every young science, ethology 
inevitably suffers, on the one hand, from the in- 
corporation of concepts whose meaning has oscil- 
lated or has already become too specialized (such 
as “instinct”) and, on the other hand, from the 
application of provisional concepts, which may 
alter in meaning with advances in knowledge, to 
contemporary working hypotheses. 

The term “ethology” is now attached to the sci- 
entific investigation of the behavior of animals and 


of some aspects of human behavior. Pronounce- 
ments about inaccessible psychic phenomena are 
avoided; the term “animal psychology” is still occa- 
sionally used but on purely historical grounds. 
Ethology is concerned with the investigation of 
animals, whether these be single cells—either as 
individual protozoans or as parts of metazoans— 
or more complex animal structures, that is, indi- 
viduals, groups, or so-called animal colonies (e.g., 
ants, bees, and termites). 

The behavior of an animal is equated with 
changes brought about by effectors (e.g., move- 
ments, sounds, scent production, color changes). 
Such effector responses are temporal events. For 
this reason only effector responses which repeatedly 
and identifiably occur are open to scientific analy- 
sis; they are then termed “fixed action patterns.” 
It is important to note that these temporal events 
can be recorded by tape and film, except in the 
case of chemical or tactile signals. The locomotion 
of an animal can be subdivided into the movement 
of the extremities, of antagonistic muscle groups, 
of single muscles, and ultimately of the muscle 
fibers. The smallest identifiable effector compo- 
nents, occurring either singly or in combination 
with other components, are chosen as the units of 
ethological study. 

The aim of ethology is to explain both phylo- 
genetically and physiologically the functional rela- 
tionships of all factors involved in behavior. This 
is evident in the modern definition of instinct 
suggested by Tinbergen: “. . . a hierarchically 
organized nervous mechanism which is susceptible 
to certain priming, releasing and directing impulses 
of internal as well as of external origin, and which 
responds to these impulses by coordinated move- 
ments that contribute to the maintenance of the 
individual and the species” (1951, p. 112). The 
touchstone for ethological hypotheses is the reli- 
able prediction of the behavior of a living system 
in any given situation. 

Ethologists are zoologists; they are thus inter- 
ested in the biology of a species, and their prime 
interest is behavior as it occurs under natural con- 
ditions. The ethologist always begins by compiling 
an “action catalogue,” or ethogram of the species in 
question, that is, as complete a description as pos- 
sible of the behavior throughout the animal's life 
cycle. This simply describes what the animal does, 
not why it does it. 

The various behavior patterns are then classified 
and compared with those of other species, espe- 
cially with closely related species. It is important 
that the animals should be observed in their natural 
habitats or in surroundings which closely resemble 
them. Additional observations in captivity are often 
necessary. A very useful expedient, first known to 
have been practiced by Baron Ferdinand Adam von 
Pernau in 1702, is to rear the animals to be both 
tame and unconfined. 

Learning, maturation, and genetics. Although 
learning is considered to be very important in ani- 
mal behavior, the first concern of the ethologist is 
with behavior patterns typically performed by all 
animals of a species, because it is necessary for 
him to know the basic predetermined responses be- 
fore proceeding to study changes brought about by 
learning. This is important, since not every change 
of form or effectiveness of a given behavior pattern 
occurring during the life of an individual involves 
learning in the form of acquisition by experience. 
As early as 1760 a professor in Hamburg, Hermann 
Samuel Reimarus, discovered the phenomenon of 
maturation of instincts and pointed out the differ- 
ence between innate and acquired skills. The in- 
nate skills, for example, the collection of food or 
the performance and “understanding” of the dance 
language in bees, are present from the time of birth 
or of hatching from the egg or pupa. Without in- 
volving a definition of learning, the problem can be 
formulated as follows: the majority of behavior 
patterns in most animals are adapted (adjusted) to 
special situations in their respective environments. 
Since this fact cannot be explained as a chance 
phenomenon and since it is not a self-evident phe- 
nomenon, an explanation must be provided. In 
order to behave adaptively, the animal must have 
at its disposal information about the environment. 
This information can be stored either in the chro- 
mosomes or in memory; that is, it can be either 
innate or acquired. In complex behavior patterns, 
there is often an interaction between innate and 
acquired elements. However, although we know of 
perceptual and motor skills in which learning plays 
no part, it is impossible to postulate a completely 
learned element of behavior that is not based on 
genetically determined and, therefore, delimited 
capabilities. Further, no one has so far been able 
to demonstrate the infinite modifiability of any 
arbitrarily chosen, innately determined element of 
behavior or the possibility that learning could be 
the function of a nonorganized aggregate of neural 
elements. In learning, the fact that the organism 
selects “good” and not “bad” behavioral responses 
or stimuli logically implies a built-in mechanism 
which is able to direct learning toward survival 
value. 

A particularly good method for distinguishing be- 
tween the learned and innate elements of behavior 
is contained in the deprivation experiment: the 
animal, usually isolated from members of its own 
species, is deprived of certain experiences and 
later tested in the situation to which the behavior 
pattern in question is normally adapted. As a con- 
trol a normal animal must be tested in the same 
situation. (This is one of several safety precautions 
which are necessary in the evaluation of the 
deprivation experiment.) The majority of behavior 
patterns do not follow the all-or-none law but can 
occur at varying intensities. The lowest intensities, 
where it is just possible to recognize which pattern 
has been activated, are referred to as intention 
movements. The intensity with which a behavior 
pattern is performed depends upon both internal 
and external factors. 

The appearance of a particular fixed action pat- 
tern in animals isolated from their own species is 
clear evidence of genetic fixity. It is a constant 
characteristic of the species concerned and is based 
upon a specific central nervous mechanism that 
is inherited just as are morphological and physio- 
logical characteristics. (This had already been 
stated by the English naturalist Spalding in 1873.)
A particularly good example is provided by many 
bird songs which develop into the species-specific 
pattern even in completely isolated animals. Re- 
search into the genetic basis of behavior patterns 
is developing as an important part of ethology. For 
example, crossing two duck species which differ in 
their courtship behavior can give rise to hybrids 
exhibiting courtship motor sequences not evident 
in any known species of duck or sometimes to 
hybrids possessing behavior patterns absent in both 
parent species but present in some presumed an- 
cestral type (Wall 1963). However, it is still not 
clear what changes in the complex physiological 
basis of such behavior patterns are responsible for 
these differences. Dilger investigated the carrying
of nest material in F1 hybrids of Agapornis roseicollis
(male) x A. personata fischeri (female).
Both parent species cut strips of paper or leaves 
and carry them to the nest. Females of the first 
species carry the strips in their bills, while females 
of the second species tuck the strips under special 
feathers on the lower back. The hybrids attempted
to perform this latter pattern but failed for various
reasons. For example, some were unable to let go
of the strip of paper and tried to carry the strips in 
the bill and under the feathers at the same time. 
Within two years the behavior of the hybrids im- 
proved through learning, but they continued to 
perform ineffective tucking movements (Dilger 
1962).

The genetic fixity of elements of behavior and 
the fact that they are nearly always to be found in
more than one species prove their taxonomic value.
In fact they are often characteristic of genera, 
families, or even higher taxonomic categories. For 
this reason it is possible to employ behavior pat- 
terns in the investigation of the relationships be- 
tween animals. Indeed, Whitman (1898) and Heinroth
(1911) investigated the behavior of doves and
ducks respectively in the hope of finding character- 
istics useful for a more systematic analysis of their 
interrelatedness. In some grasshopper and toad 
species, species-specific calls or songs are used for 
species recognition; thus they represent barriers to 
interspecific reproduction. On the other hand, it is 
possible to reconstruct the phylogeny of behavior 
patterns on the basis of variations in the form of 
the same basic pattern between closely related 
species, as was pointed out by Darwin in The Ex- 
pression of the Emotions in Man and Animals 
(1872). Exactly the same method is used in com- 
parative anatomy and morphology. Although no 
behavioral fossils exist, more transitional forms 
exist between different behavioral types than is the 
case with morphological characteristics; behavioral 
characteristics occur repeatedly and at different 
intensities, while a leg is formed only once. The 
individual elements of various behavior patterns 
are, however, more open to formation of novel 
combinations (e.g., in contrast to the bones of the 
skull in vertebrates). For this reason, the phylogeny 
of behavior patterns must be based on the simplest 
possible elements.

We know that no behavioral characteristic is 
dependent upon only one gene, that each heredi- 
tary component affects several characteristics, and 
that there are not two separate sets of hereditary 
material governing body construction and behavioral
features. The interaction of a behavior pattern
with its effector organ is thus just as labile as the 
coadaptation of several functionally correlated 
organs. 

Evolution and selection. Reimarus pointed out 
that in many instances behavior patterns adapted 
to the use of certain organs are performed, at 
times, even before these organs are developed. 
Apart from differences in speed of development of 
behavior patterns and their effector organs, it is 
also possible that one survives when the other is
lost: a cerambycid beetle will continue to preen its
antennas after they have been removed by dissection;
mutant fruit flies (Drosophila) with no wings
still perform the wing-preening movements typical
of the species; and stump-tailed monkeys, when they
run along a branch, still show the balancing movements
once effective in their long-tailed ancestors.

Such historical carry-overs can also be observed
in behavior patterns originally adapted to certain
environmental conditions which have since
changed. Ground-breeding birds regularly use the 
beak to perform specific behavior patterns for roll- 
ing the eggs back into the nest if they are found 
outside. Tree-breeding birds do not show this, since 
the eggs that fall disappear. However, Poulsen 
(1953) was able to show that some birds which 
have recently evolved from ground-living stock to 
a tree-living habit still exhibit egg-rolling patterns, 
while some recently evolved ground-breeding birds 
lack this pattern. There are other examples which 
show that fixed action patterns can be extremely 
conservative in the evolution of a species. On the 
other hand, closely related species occupying the 
same area exhibit rapid phylogenetic changes in 
the sexual behavior patterns serving for sexual 
recognition. In fact, in some such species greater 
differences in species recognition signals are seen 
where two species occur together than where either 
species occurs in isolation. This phenomenon has 
been called character displacement. 

Intraspecific signals usually undergo selection 
for better recognition (to avoid “misunderstandings”)
and tend to become more and more conspicuous
and outlandish. The behaviors involved
are performed more conspicuously and are empha- 
sized by morphological characteristics of color, 
form, or odor. In addition such behaviors are often 
rhythmically repeated. This fixed patterning often 
ceases to show different degrees of intensity. The 
level of motivation is no longer expressed in the 
intensity of the behavior but in how often it is 
repeated at one and the same fixed intensity 
(Morris 1957), much as the urgency of a telephone 
call is indicated by how often the bell rings and 
not how loud it rings. Finally such a recently 
formed fixed action pattern may become motiva- 
tionally autonomous of the situation in which it 
was originally aroused by a process that Tinbergen 
has called emancipation. These and other changes 
in signal behavior patterns, leading to improved 
communication between signal sender and signal 
receiver, are referred to as ritualization. We still 
know very little regarding the physiological and 
neuroanatomical basis of both nonritualized and 
ritualized behavior patterns. [See COMMUNICATION, 
ANIMAL.]

It is commonplace to say that no animal per- 
forms the behavior patterns in its repertoire in 
random order. An animal responds to signals according
to set principles. It is the task of the behavioral
physiologist to analyze this phenomenon.
This task involves large-scale studies of sensory 
physiology, since the animal receives the stimuli 
with its sense organs; of hormone physiology, since
hormones can decide whether the sight of the fe- 
male elicits courtship by the male, and so on; and 
of the physiology of the central nervous system, 
which is responsible for the analysis of the stimuli 
and for the coordination of the requisite behavior 
patterns. 

Releasing mechanisms. The carnivorous water 
beetle Dytiscus marginalis does not react to the
sight of prey (e.g., a tadpole in a glass tube), al- 
though it has perfectly developed eyes, but it does 
react to the chemical stimuli emanating from the 
prey. If some prey-extract solution is added to the 
water, the beetle will clasp even inanimate objects 
immersed in the water. A male robin will attack a 
bundle of red feathers but not a perfect dummy of 
a male lacking the characteristic red breast. Such 
examples show that animals respond with quite 
specific reactions to quite specific stimuli among 
the many perceived from the environment. These 
relevant stimuli are called sign stimuli, or releasers. 

Sign stimuli act upon specific functional units of 
the central nervous system, the so-called releasing 
mechanisms. The specific properties of these units 
may likewise be either genetically determined, in 
which case they are termed innate releasing mech- 
anisms (IRM), or partially determined by learning. 
The releasing mechanism filters out the sign 
stimuli and thereby triggers off specific behavior 
patterns. Some behavior patterns can be elicited
by more than one stimulus (e.g., an odor or a 
vibration). The vigor of the reaction generally de- 
pends upon the strength of the stimulus, and 
heterogeneous stimuli may summate (the same 
intensity of a reaction may be shown toward a 
strong odor or a strong vibration or a weak odor together
with a weak vibration). Sometimes it is possible
to present the animal with an abnormally
strong stimulus and obtain a response stronger 
than that released by the naturally occurring stimu- 
lus; Magnus (1958) has shown that the males of 
the silver-washed fritillary butterfly react with 
courtship toward the orange and black color pat- 
tern of the female’s fluttering wings. By placing 
orange and dark stripes on a cylinder and rotating 
it, he proved that more rapid color-dark alternation
than the rate characteristic of the female was more 
effective in eliciting the male's reaction. The 
greater the speed of rotation of the cylinder, the 
greater were the courtship responses, right up to 
the physiologically demonstrated flicker-fusion fre- 
quency for the species concerned. [See SEXUAL 
BEHAVIOR, article on ANIMAL SEXUAL BEHAVIOR.]
This susceptibility of animals to supernormal re- 
leasers provides us with an insight into the reason 
for the development of bizarre morphological sig-
nal structures such as the feathers of the peacock. 
Some parasitic birds even capitalize on this phe- 
nomenon when their young are larger and more 
babyish than, and therefore preferred to, the host’s 
own young.
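
The summation of heterogeneous stimuli described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that stimuli of different kinds add linearly. The function name, the additive rule, and the numerical units are hypothetical and are not taken from the studies cited.

```python
def response_strength(odor: float, vibration: float) -> float:
    """Toy additive model (an assumption): stimuli of different kinds simply sum."""
    return odor + vibration


# Arbitrary illustrative units, not measured values.
STRONG, WEAK = 2.0, 1.0

strong_odor_only = response_strength(STRONG, 0.0)
strong_vibration_only = response_strength(0.0, STRONG)
weak_odor_and_weak_vibration = response_strength(WEAK, WEAK)

# Under this assumption all three situations give the same reaction intensity.
print(strong_odor_only, strong_vibration_only, weak_odor_and_weak_vibration)
```

Real releasing mechanisms need not sum linearly; the sketch shows only that one and the same reaction intensity can be reached by different combinations of stimuli.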

Motivation and drives. It has been observed 
that one individual will sometimes respond to a 
weak stimulus with a strong response, while at 
other times respond only at the same intensity or 
not at all to a much stronger releasing stimulus. 
It is therefore necessary to measure independently 
the specific “readiness” of the animal to react, 
apart from the strength of the stimulus. The 
strength of a reaction often decreases sharply with 
repeated equivalent stimulation, as is the case with 
escape attempts in aquarium fish in response to 
tapping on the glass pane or with gaping in young 
birds when the nest is lightly shaken. The readi- 
ness of an animal to perform certain patterns ex- 
hibits extensive and independent variation; an 
animal which is not prepared to eat may never- 
theless exhibit readiness to flee. The readiness to 
perform a certain pattern is referred to as the 
motivation. Motivation (e.g., in hunger) often in- 
creases with the time interval from the last elicita- 
tion of the type of behavior concerned (“damming 
effect,” an effect which is related to the corre- 
sponding stimulus threshold). In the extreme case 
the action pattern can occur without any evident 
external elicitation—as vacuum activity. However, 
an animal with high motivation to perform specific 
behavior patterns (where the “drive” is under re- 
straint) usually performs certain behavior patterns 
suitable for attainment of a stimulus situation 
appropriate to the motivated patterns. In simple 
terms, the animal “searches.” Craig (1918) ob- 
served the occurrence of restlessness, varied move- 
ments, and searching behavior as symptoms of a 
physiological state of appetite for specific stim- 
uli and labeled such behavior appetitive behavior, 
as distinct from consummatory behavior, which 
lowers the degree of motivation when performed 
and leads to a state of satisfaction. It is important 
to note that the animal does not attempt to achieve 
the biological effect associated with the consum- 
matory act but merely the performance of the 
consummatory behavior itself. The state of satis- 
faction can also be achieved by abreaction in re- 
sponse to models. In the simplest case, appetitive 
behavior consists of undirected locomotion, but 
many animals (especially higher-developed forms)
learn from experience and modify the appetitive 
behavior, so that it more rapidly leads to success. 
They learn when, where, and how they can attain
the releasing situation. Briefly, appetitive behavior
is typically variable (plastic), whereas the consummatory
act is relatively fixed (stereotyped).
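
The relation between motivation, stimulus threshold, and vacuum activity described above can be sketched as a toy model. It assumes that the releasing threshold falls linearly with the time since the behavior was last performed. The linear form, the function names, and all parameter values are hypothetical and serve only as illustration.

```python
def releasing_threshold(hours_since_last: float,
                        base: float = 10.0,
                        drop_per_hour: float = 1.0) -> float:
    """Assumed linear 'damming' model: the threshold falls as time passes."""
    return max(0.0, base - drop_per_hour * hours_since_last)


def is_released(stimulus_strength: float, hours_since_last: float) -> bool:
    """The pattern occurs when the stimulus reaches the current threshold.

    When the threshold has fallen to zero, the pattern occurs even with
    no external stimulus at all, which corresponds to vacuum activity.
    """
    return stimulus_strength >= releasing_threshold(hours_since_last)


print(is_released(3.0, 2.0))   # weak stimulus shortly after the last performance
print(is_released(3.0, 8.0))   # same stimulus after a longer interval
print(is_released(0.0, 12.0))  # no stimulus after a very long interval
```

With these assumed values the same weak stimulus fails to release the pattern after two hours but succeeds after eight, and after twelve hours the pattern occurs with no stimulus.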

Motivational analysis attempts to demonstrate 
how many behavior patterns are dependent upon 
the same motivational source and how many 
partially or completely independent motivational 
centers are present in a given animal species. It
is taken as axiomatic that there are fewer inde- 
pendent sources of motivation than observably dis- 
tinct behavior patterns and that behavior patterns 
which regularly occur within short intervals from 
one another are thus commonly motivated—the 
motivational state of the animal oscillates more 
slowly than the alternation of behavior patterns.
Further, it is known that behavior patterns exist 
which are characteristic for specific conflict situa- 
tions; in the conflict between attack and flight these 
are represented by threat behavior. Such patterns 
are certainly motivated from different sources, 
which may vary independently from one another. 
It is not known from the outset how many of the 
behavior patterns observed have mixed motivation, 
but for the purposes of analysis it is assumed that 
it is the minimum possible. Wiepkema, a Dutch 
ethologist, carried out the following model experi- 
ment (1961) with the European bitterling: First 
he recorded the occurrence of the behavior pat- 
terns which he had identified for this species 
(ramming the flank of a conspecific with the 
head, scouring of the substrate, tail-beating, swim- 
ming before the female, etc.) over a long period 
of time. In this process typical locomotory se- 
quences are found to occur regularly, while some 
behavior patterns are seen to be mutually exclu- 
sive within a given time interval. Wiepkema 
computed the minimum number of independent 
variables (i.e., motives) necessary to account for 
the observed distribution of action patterns. Mixed 
motivation was taken into account, but it was 
assumed that one given motive was predominant 
in each case. For the reproductive period of the 
bitterling it was found that three independent mo- 
tivational sources are necessary and that each 
source governs a group of motor patterns which 
are totally or predominantly dependent upon the 
source concerned. These groups are comprised of 
(1) behavior patterns directed at the rival, ob- 
jectively termed “fight,” (2) behavior patterns di- 
rected away from the rival, termed “flight,” and 
(3) the patterns carried out in combination with 
the female in association with spawning. Accord- 
ingly it is possible to refer to the predominant 
motivation in each case as (1) fight drive, (2) 
flight drive, and (3) sex drive. Some behavior
patterns lie between these groups, however, and
are thus more or less equally dependent upon more 
than one motive. For example, spreading of the 
fins combined with an undulating movement of 
the entire body, which we refer to as “threat,” is 
motivated by both fight and flight drives; a specific 
courtship pattern is motivated by both fight and 
sex drives. Using factor analysis it is even possible
to rank the action patterns according to a scale 
of ratios between different drives. [See DRIVES; 
MOTIVATION.]

Various phenomena occur in conflict situations; 
the animal may combine two behavior patterns 
(e.g., warding off a rival and eating), it may 
oscillate between the intention movements of dif- 
ferent action patterns (e.g., oscillation between 
motions toward attack and flight without actually 
attacking or fleeing), it may exhibit abreaction of 
an inhibited behavior pattern by transferring the 
direction to a neutral object (e.g., gulls which do 
not dare to attack a stronger rival may tear and 
pluck at tufts of grass), or it may exhibit a be- 
havior pattern which does not belong to either of 
the motivational sources directly involved in the 
conflict. This last pattern is referred to as dis- 
placement activity. For example, domestic cocks 
will start to eat when they are involved in a 
conflict between attack and flight, while avocets 
will assume a sleeping posture. The physiological 
foundations of displacement activities have been 
investigated only in a few cases and appear to 
vary from case to case. Some behavior patterns 
may be dependent upon the same releasing stimuli 
as well as upon the same motivational sources. 
[See CONFLICT, article on PSYCHOLOGICAL ASPECTS.]

Sequential and hierarchical organization. The 
fact that some elements of behavior can give rise 
to conflict at corresponding integrational levels, 
while others are mutually exclusive, indicates that 
groups of elements are governed by superior sys- 
tems which can similarly show mutual interfer- 
ence, promotion, inhibition, or exclusion. In this 
way, we arrive at the concept of a hierarchical 
system of dominant and subordinate drives. The 
same concept emerges from the comparison of 
releasing situations and appetitive behavior. 

A hungry squirrel (1) climbs (2) trees (3) look- 
ing for cones; when motivated to build a nest, it 
(1) climbs (2) trees (3) looking for twigs. Thus 
different motivations may employ the same “lower” 
motor and orientation components. The latter are 
called taxes, a taxis being defined as orientation 
of the whole body or parts of it with respect to the 
source of stimulation. Further, the distinction between
appetitive behavior and consummatory behavior
is a relative one. Normally, certain appetitive
behavior leads to a stimulus situation which
initiates another, more specific, appetitive be- 
havior. This fact has been carefully worked out 
by Baerends (1941) and Tinbergen (1951). The 
three-spined stickleback is brought into reproduc- 
tive motivation by the gradual increase in day 
length in spring and begins migration inland into 
shallow fresh-water habitats. This factor, together 
with the rise in water temperature and the visual 
stimulation of heavily vegetated sites, is a releasing 
mechanism for the establishment of a suitable 
territory by the males. A territory is necessary for 
the male to acquire its characteristic red belly. 
Only then does it begin to react to particular stimuli 
which previously had no effect. The male will build 
a nest with suitable material, fight against rival 
males (where the releasing stimulus is the red 
belly of the male intruding into his territory), and 
court passing females, which present their silvery, 
swollen, egg-filled bellies to the male in a charac- 
teristic manner. Thus, the stimuli emanating from 
a territory will activate the fighting, building, and 
mating drives, which must then be elicited by 
special releasers. Fighting itself consists of a num- 
ber of behavior patterns (chasing, threatening, 
tail-beating, biting), each dependent upon still 
further, highly specific stimuli emanating from 
the intruder’s behavior. The behavioral sequences 
of male and female form an alternating chain of 
reactions, each action of one partner releasing the 
following appropriate reaction of the other partner 
until the female spawns and the male fertilizes 
the eggs. The act of fertilization initiates brood 
care in the male; he now fans fresh water onto the 
eggs and continues to drive off rivals but does not 
exhibit further courtship until the young hatch. 
It is thus clear that there are chains of behavioral 
tendencies connected at higher and lower levels 
of integration and that these different levels are 
organized into a hierarchical system. The ad- 
vantage of a hierarchy, as opposed to a stereotyped 
series of single fixed actions, lies in its adapta- 
bility to unpredictable sequences of events. 

Neurophysiological aspects. It seems evident
that some structural organization must exist within 
the central nervous system, paralleling the ob- 
served organization of behavioral responses—in 
particular the hierarchical organization. Neuro- 
physiological investigation of fixed action patterns 
has therefore become an important branch of etho- 
logical research. Extending the earlier experiments 
of W. R. Hess (1949), Holst and Saint Paul (1959) 
demonstrated the existence of structural hierarch- 
ical organization of the mechanisms underlying 
behavior by electrically stimulating specific areas
of the brain in chickens. Well-coordinated, com- 
plete sequences of movements identical with those 
observed in normal behavior were elicited. All these 
sequences of behavior were composed of single 
actions, each of which could be obtained in isola- 
tion by stimulation of specific brain areas. Holst 
and Saint Paul combined brain stimulation with 
the normal releasing stimuli, electrically changed 
the “mood” (motivational state) of the animal and 
studied artificial conflict between drives by pro- 
ducing interaction of different behavior patterns 
with simultaneous elicitation. 

Neurological research has substantiated the con- 
clusion, derived from ethological field studies, that 
the coordination of many locomotive patterns 
arises from impulses generated in the central nervous
system. Potent support has also been provided
for Lorenz’ hypothesis postulating constant produc- 
tion of action-specific potentials in the central 
nervous system. 

Habituation and imprinting. Most of the origi- 
nal schemata postulated for the functional struc- 
ture of behavior have been shown to be simplifica- 
tions, though correctly describing special cases. In 
order to arrive at a generalized schema, it is still 
necessary to modify repeatedly such hypothetical 
schemata. Even the hierarchical system must be 
altered to a multidimensional network. The con- 
cept of habituation likewise increases in complex- 
ity with time. A behavior pattern repeatedly elicited 
by a particular sign-stimulus will cease to occur 
after a given time. The sense organ can neverthe- 
less be demonstrated to be fully capable of func- 
tioning, and it is not even necessary to presume 
that the motivation of the animal to perform this 
pattern is entirely extinguished; it is often possible 
to elicit the same behavior pattern immediately 
afterward with another sign-stimulus. In such 
cases, it is necessary to assume that central cut-off 
systems are involved. Such systems are capable of 
very complex functions; mechanisms of this type 
in turkeys extinguish flight behavior in response to 
all relatively slow-flying objects which occur fre- 
quently, and the adult animal flees only in response 
to uncommon flying objects. In fact, the most in- 
frequently occurring objects which (relative to 
their own size) are slow-flying are birds of prey. 
The adult turkey thus shows a well-adapted flight 
reaction in response to predatory birds—but also 
to advertising balloons. It is a question of definition
whether or not one refers to this effect as learning; 
it takes place without marked exogenous reward or 
punishment. The same is true of imprinting, which 
was described by Spalding as early as 1873. In
imprinting, a specific reaction of the animal
(which need not be functional at the time of im- 
printing) becomes attached to an object which 
later functions as the releasing agent. This occurs 
within a limited sensitive period, usually at a young 
age. In contrast to learning by association, there is 
no reward (as in the previous example), and even 
punishing stimuli have a reinforcing effect. If 
ducks or doves are reared by other species they 
will later show a pairing preference toward a part- 
ner belonging to the foster species, even when a 
conspecific partner is available. [See IMPRINTING.] 
Phenomena similar to imprinting have been dis- 
covered in many fields. Some juvenile birds learn 
the song entirely from the “father” and will learn 
the song of a foster-father even in the midst of 
conspecifics. Since later offspring learn from these 
birds, the possibility of “speech dialects” arises. In 
different mammal species there have been cases of 
traditions which largely concern food preferences 
or forms of food acquisition, although traditions 
may also arise in the avoidance of enemies. 
See CULTURE, article on CULTURAL RELATIVISM;
VALUES, article on VALUE SYSTEMS; WORLD VIEW; 
and the biography of REDFIELD. 


EUGENICS 


Eugenics is an applied science that seeks to 
maintain or to improve the genetic potentialities of 
the human species. In practice, eugenics is con- 
cerned with any qualities that parents confer on 
their children, because genetic and cultural traits 
are often correlated or indistinguishable, and be- 
cause measures that improve one at the expense of 
the other are to be avoided. Genetics provides the 
core of eugenic theory, while any implementation 
must be broadly based on demography, medicine, 
psychology, and sociology. 

Founded by Sir Francis Galton at the end of 
the nineteenth century, before the rediscovery of 
Gregor Mendel’s laws of heredity, the early eugenics 
movement had an insecure scientific foundation 
and soon became contaminated with class and race 
prejudice. In the first half of the twentieth century,
eugenics was challenged by the growth of equalitarian
sentiments and suffered especially from the
demonstration of perverted eugenics in Nazi Ger- 
many. Since mid-century, the movement has gained 
respectability by repudiating its early errors and by 
assimilating scientific advances. 

False concepts are still widely propounded in the 
name of eugenics, and there is disagreement even 
among geneticists as to the desirability and the 
urgency of eugenic measures. While some deteriora- 
tion of hereditary capacities is sure to result from 
preservation of physically or mentally handicapped 
persons and from increased irradiation of the popu- 
lation, the rate of this deterioration may be exceed- 
ingly slow. Furthermore, medical procedures that 
ameliorate genetic handicaps do not usually restore 
complete physical and social normality, so that 
natural selection still operates against the defects. 
Although intelligent parents have in the recent past 
tended to have fewer children than the less intelli- 
gent, early predictions of declining general intel- 
ligence have not been borne out. Analysis of present 
trends in reproduction indicates that the increased 
use and effectiveness of contraception may reverse 
this fertility difference (Osborn 1963a; 1963b). 

The case for eugenics. Three lines of reasoning 
support the case for eugenics. First, individual 
families can be spared suffering and disruption if 
severe hereditary defects are identified and the par- 
ents given medical advice or assistance in prevent- 
ing further births. 

Second, the high correlation between parents and 
children in mental characteristics has some of the 
same implications whether attributed mainly to 
heredity or mainly to cultural transmission. Men- 
tally handicapped persons, whose children stand 
the greatest risk of mental handicap, should at 
least be assisted if they wish to limit their offspring. 
More intelligent parents who can provide healthy 
home environments should be encouraged to bear 
as many children as they can support. 

Third, even a very small improvement in the 
intelligence of a large civilized population may be 
expected to increase available leadership signifi- 
cantly. The argument for eugenics does not depend 
on proving that present trends are downward but 
on evidence that in important respects the popula- 
tion would be better off with eugenic measures than 
without them (Shapiro 1959). The only assur- 
ance against genetic deterioration is demonstrable 
genetic improvement. 

Modern governments and institutions cannot 
avoid taking action in the sphere of eugenics. Every 
large-scale social or economic measure alters the 
distribution of births among segments of the popu- 
lation, and this distribution determines the genetic
potentials of the next generation. 


Present problems and applications 


Radiation effects. In the past two decades pub- 
lic attention has been focused on possible genetic 
effects of radiation, namely, gene mutations. Chil- 
dren born to survivors of the atomic bombs in 
Japan have not shown significant genetic effects, 
and this supports the belief that to date the dele- 
terious effects of atomic bursts on human heredity 
are small in comparison with natural mutations al- 
ready accumulated in concealed form. Yet in abso- 
lute numbers, the new induced mutations are prob- 
ably numerous and destined to take a proportionate 
toll in death and suffering spread over many 
generations.

In medically advanced countries the population 
receives radiation from medical X rays that may 
greatly exceed the present dosage from fall-out of 
atomic tests. The debt incurred for future genera- 
tions by use of X rays or atomic testing must be 
weighed against the supposed immediate gains in 
health or national security. Some drugs also induce 
mutations, and these may need to be controlled in 
the future. Among the consequences of induced 
mutations, it is likely that physical defects would 
be less of a threat to man’s survival under civiliza- 
tion than innumerable small mental impairments. 

Race mixture. There is a broad consensus 
among geneticists that race mixture in man is no 
eugenic hazard. In plants and animals, the crossing 
of races may have unpredictable results. Especially 
if the parent populations were closely inbred, the 
new generation may show “hybrid vigor”; some 
other hybrids may show serious disharmonies. In
the case of man there is little evidence for either 
hybrid vigor or hybrid disharmony, and if they 
occur they are overshadowed by social and cultural 
phenomena (Chung & Morton 1961). If some small 
hybrid populations appear to be genetically and 
culturally inferior to surrounding peoples, this may 
be due to isolation, inbreeding, or the quality of 
people who originate or who join such groups. Gene 
frequencies are not altered by race crossing but 
only by mutations and selection. 

Population policies. The immediate problem of 
curbing world population growth overshadows prob- 
lems of population quality. Nevertheless, popula- 
tion policies adopted now are likely to exert qualita- 
tive effects for a long time in the future, and these 
effects should be weighed. 

Birth rate differences between educational classes 
or between occupations appear in most civilized 
countries to be unfavorable to social and genetic
progress. Eugenically sound population policies
would therefore reduce or reverse these differen- 
tials at the same time that they accomplish their 
main purpose. A program to reduce births might, 
for example, facilitate voluntary family limitation 
and at the same time extend and advertise the eco- 
nomic benefits of such limitation to the more fer- 
tile and less secure economic classes. A program 
to increase births might best achieve this by assist- 
ing with higher education and other goals that 
deter reproduction by persons of high ability. 

Genetic counseling. Genetic factors undoubtedly
play some role in nearly all diseases and abnormalities,
but avoidance of reproduction is genetically
indicated only in conditions for which the risk
is known to be high. Even in these instances the
advisability of reproduction depends on the nature 
of the disease, on other genetic variables, on socio- 
economic circumstances, and above all on the par- 
ents’ willingness to sustain the risk. People who 
avoid childbearing for fear of hereditary defects 
sometimes have negligible genetic risks and are 
potentially superior parents. Yet those who seek 
genetic advice often fail to make use of it. Many 
universities now have centers for genetic counsel- 
ing, but hereditary diseases will remain unchecked 
until family planning becomes general and effec- 
tive. Whether it will ever be possible to transform 
harmful genes to the normal form by artificial 
means is a subject of speculation (Hotchkiss 1965). 


Theoretical aspects of eugenics 


Heredity and behavior. Galton believed that 
human behavior could be improved by genetic as 
well as by cultural progress (1883). Is there today 
any secure basis for such a belief? On theoretical 
grounds, the answer to this question is clearly yes. 
Over the past million years man’s forebears made 
rapid genetic progress with respect to intelligence 
and social behavior. The only known mechanism 
of genetic progress is natural or artificial selection 
of favorable variations, and selection produces 
permanent gains only if the selected variations are 
hereditary. Therefore, hereditary variation must 
have existed as long as mental capacities were 
evolving, and there is no theoretical reason to sup- 
pose that it has now disappeared. Such hereditary 
variation in behavior provides the necessary basis 
for further genetic improvement. 

Many psychologists and geneticists have adduced 
what they regard as evidence for hereditary psy- 
chological differences. The most important varia- 
tion seems to be that between individuals, whereas 
the observed differences between races and social 
classes are mainly cultural in origin. Even the concept
of good and bad “family stock” is erroneous;
both good and bad may occur in any family, and 
psychological traits shared with grandchildren or 
cousins must be attributed in large part to common 
environmental factors or to assortative (selective) 
mating. 

Natural selection. Natural selection is the only 
known guiding influence in organic evolution and 
is a necessary correlate of reproduction in a genet- 
ically variable organism. Individuals who contrib- 
ute more progeny than others to the next generation 
have, by definition, greater biological fitness. So far 
as the differences in fitness may be hereditary and 
the environment stable, the next generation will 
have a higher average ability to survive and reproduce.
At equilibrium the gain in each generation
is offset by new harmful mutations. The first test 
of fitness is survival; but even among the survivors, 
natural selection operates through differences in 
fertility. 

Although it acts on whole organisms, natural 
selection produces evolutionary effects by changing 
the relative frequencies of single genes. In each 
generation, sexual reproduction recombines genes 
so that all are tested in new combinations. The 
ultimate frequency of a gene depends on its aver- 
age effect on survival and reproduction; if it is to 
become frequent, a new gene must be consonant 
with the major existing adaptations of the species. 
Natural selection is therefore conservative, tending 
usually to stabilize established norms. 

Natural selection is inefficient for at least three 
reasons: (1) Many deaths or reproductive failures 
are due to chance. (2) Individual differences in fit- 
ness are often not hereditary. (3) Most hereditary 
differences in fitness are small. Some of this inef- 
ficiency can be overcome under artificial selection, 
but at a price. When it entails breeding from a 
few selected individuals, artificial selection may 
sacrifice both genetic stability and reserve varia- 
bility. The artificial reproductive success conferred 
by the breeder on animals with commercially 
valuable traits may result in feeble or sterile strains. 
On the other hand, natural selection adapts a 
species only to its present environment or set of 
environments, and when the environment changes 
radically the species may become extinct. Man can 
foresee some changes in the environment and can, 
if he will, select for long-term fitness. In the last 
analysis this is probably a crucial advantage of 
scientific eugenics over natural selection. 

The genetic stability of populations. Evolution 
is slow not only because natural selection is inef- 
ficient but because natural populations have a great 
inherent genetic stability. This stability is considerable
even in small populations, for example,
with a few hundred breeding individuals. Because 
of large numbers and small individual contribu- 
tions to the next generation (compared with most 
species), civilized human populations should be 
very resistant to change. 

Genetic stability resides primarily in the “gene 
pool” of a population. Each gene has a character- 
istic frequency, usually determined by a stable equi- 
librium between opposing forces of selection and 
mutation. Sudden changes in selective pressures 
may have more effect on complex traits than on the 
underlying gene frequencies, but even such traits 
are not likely to change fast. Thus, if lactation 
ceases to have survival value, its dependence on 
several physiological systems may make it vulnera- 
ble to mutations in any of a great many genes. 
Before any single genetic defect reached a fre- 
quency of 2.0 per cent, most women might have 
one or two rarer defects that prevented lactation. 
But an increase from 0.1 per cent to 2.0 per cent 
by mutation would take at least a hundred genera- 
tions. Defects of lactation that were positively se- 
lected because of effects on other traits might in- 
crease ten times this fast, but they would be few 
and represent coincidence only. 
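
The arithmetic behind the hundred-generation estimate can be sketched as follows. This is a minimal illustration and not the author's own calculation: it assumes that recurrent mutation is the only force acting and that normal genes mutate to the defect at a hypothetical rate u per generation, so that the frequency q becomes q + u(1 - q) in the next generation. The function name and the rates tried are assumptions made for the example.

```python
import math


def generations_by_mutation(q_start, q_end, mutation_rate):
    """Generations needed for a defect to rise from q_start to q_end
    under recurrent mutation alone (no selection, no back mutation).

    Recurrence: q' = q + u * (1 - q), which has the closed form
    (1 - q_t) = (1 - q_0) * (1 - u) ** t.
    """
    return math.log((1 - q_end) / (1 - q_start)) / math.log(1 - mutation_rate)


# Hypothetical per-generation mutation rates, chosen only for illustration.
for u in (1e-4, 1e-5, 1e-6):
    t = generations_by_mutation(0.001, 0.02, u)
    print(f"u = {u:.0e}: about {t:,.0f} generations")
```

Under this simple model a rate as high as 1 in 10,000 per generation still requires roughly two hundred generations, and lower rates require proportionally more, which agrees with the estimate of at least a hundred generations.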

Probably all human behavioral traits, except 
some grossly pathological ones, are controlled by 
multiple genetic factors and by the environment. 
Most of these traits are graded or continuous. 
Natural selection tends to eliminate both extremes 
in such a continuum and produces evolutionary 
change mainly from slight differences in fitness 
among the more numerous intermediate indi- 
viduals. Thus, the gene pool retains a large reserve 
of variability and accommodates any new variations 
that do not have extreme effects. 

Sources of variation. It has been suggested that 
ionizing radiation may speed human evolution by 
inducing more mutations. While mutations are the 
raw material of evolution, increased mutation rates 
now would probably do much more harm than 
good. First, nearly all mutations are harmful and 
must be eliminated by impaired fitness of the car- 
riers, often extending over many generations. Sec- 
ond, nearly all of the possibly beneficial mutations 
have occurred before and are already so frequent 
that rare additions will not help. Third, the few 
new useful mutations would increase so slowly 
under moderate selective pressures that they would 
remain very rare for centuries. 
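
The third point can be illustrated with a small calculation. The sketch below is an assumption-laden simplification, not a model taken from the article: it treats a single gene whose carriers leave (1 + s) times as many offspring as non-carriers, ignores chance loss and dominance, and uses hypothetical starting and target frequencies. The function name and parameter values are invented for the example.

```python
def generations_under_selection(q_start, q_end, s):
    """Count generations for a favored gene to rise from q_start to q_end.

    Simplified genic selection: each generation the frequency becomes
    q' = q * (1 + s) / (1 + s * q), where s > 0 is the selective advantage.
    Requires 0 < q_start < q_end < 1.
    """
    q, generations = q_start, 0
    while q < q_end:
        q = q * (1 + s) / (1 + s * q)
        generations += 1
    return generations


# Hypothetical case: a new gene at 1 in 100,000 with a 1 per cent advantage.
print(generations_under_selection(1e-5, 0.01, 0.01))
```

With a 1 per cent advantage the loop runs for several hundred generations before the gene reaches even 1 per cent of the gene pool, so a new useful mutation would indeed stay rare for a very long time.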

Variations already present in human populations 
would suffice to carry human evolution forward a 
very long distance. New useful mutations that have 
arisen in the past few millennia are still in the early
stages of response to natural selection. The great
majority of variable genes, kept at intermediate 
frequencies by conflicting or inefficient selection, 
should have an immense potential for improving 
the species. Some useful genes may be mainly re- 
stricted to small populations or to certain races; 
intermarriage between nationalities and between 
races will make these generally available. Finally, 
man’s constantly changing environment under 
civilization changes the survival value of genes, 
making some useful that were previously neutral 
or harmful. 

The role of an optimum environment. Early 
Darwinian enthusiasts supposed that natural selection
was inevitably cruel and that human evolution
could proceed only in a harsh environment. Diverse 
environments are desirable since they develop di- 
verse potentials and, under natural selection, main- 
tain genetic variability. But present human environ- 
ments are often restricting and largely beyond 
control of the individual and even of the family. 

Man’s future environment will probably be closer 
to the best than to the worst of present living stand- 
ards. Eugenic planning for long-term adaptation 
should seek not only to equalize opportunity but to 
equalize it at a high economic and educational 
level. This would have no eugenic effect by itself 
but would enhance whatever selection was opera- 
tive, either negative or positive. 

Direction and choice in programs. A number of 
experts have warned that man may become dan- 
gerously dependent on medical technology. Such 
dependence, on a genetic basis, may develop gradu- 
ally over centuries or millennia, but it is evident that 
throughout human evolution innate physical ca- 
pacities have tended to deteriorate as compensatory 
mental abilities increased. In subhuman evolution, 
adaptability has repeatedly proved superior to spe- 
cialized adaptation, and intelligence in its many 
aspects opens the way to almost unlimited adapta- 
bility. In any eugenic program, therefore, intelli- 
gence, broadly defined, should take precedence 
over physical fitness. 

The above implied choice is an example of problems
that would continually arise in eugenics. It is
sometimes contended that the planning of human
evolution would require superhuman wisdom. In
practice, eugenics need not imply detailed fore- 
sight or genetic planning but would select for 
adaptability and diversity among variations already 
discernible. If the problems were faced one by one 
and periodically reviewed, they should be no more 
grave or insoluble than present political, educa- 
tional, and social problems. 


Dysgenic and eugenic environments. The en- 
vironment is the instrument of natural selection 
since it sets the conditions for individual biological 
fitness, that is, the ability to survive and reproduce.
For the continuation of our culture, and perhaps 
ultimately of the species, some other traits are as 
important as fertility. A society in which culturally 
important traits are positively correlated with fertility
may be defined as a eugenic environment; the
opposite is a dysgenic environment. One society
may be eugenic within some social strata and dysgenic
in others. Any discussion of a eugenic environment
requires specification of the traits considered
culturally important. One might assume
that the greatest need in our culture is for traits
leading to superior achievement in one’s chosen 
occupation. A thorough treatment of the subject 
would give weight to more specific traits such as 
intelligence, social maturity, and parental re- 
sponsibility. 


Proposed systems of selection 


Promoting optimum expression of each person’s 
genetic capacities would not by itself constitute 
genetic progress. Eugenic selection, either natural 
or artificial, is also needed. Any deliberate program 
to promote eugenic selection would require much 
research before it could be instituted and careful 
monitoring of its progress. 

Breeding programs. Next to preoccupation with 
race, the advocacy of artificial breeding programs 
has probably done most to make eugenics unac- 
ceptable. Early proposals were incompatible with 
concepts of conventional marriage and families as 
social units. Since the introduction of artificial in- 
semination, the possibility of multiplying the prog- 
eny of selected men has become real. If artificial 
insemination from donors is practiced, all eugenists 
would agree that the donors should be carefully 
selected. But proposals for large-scale use of semen 
from a few great men (Muller 1960) may attach 
too much importance to fortunate combinations of 
genes that have little general value in the gene 
pool or in ordinary environments. 

Assistance to selected families. British eugen- 
ists, especially, have advocated financial assistance, 
resembling scholarships, to parents with good 
eugenic prognosis (Blacker 1952, p. 307). This 
would enable them to have as many children as 
they wanted. Economically dependent families, 
with generally poorer prognosis, would be assisted 
in limiting their progeny. 

With exceptions for grossly pathological heredity, 
eugenic prognosis in such a program ought to be
based upon a couple’s social and cultural attainments
and on physical and mental health of their
earliest progeny. Discrepancies between attain- 
ments and genetic potential might be compensated 
by social transmission of traits, so that this eugenic 
program would achieve limited cultural improve- 
ment if not also genetic improvement. If future 
environments achieve near equality of opportuni- 
ties, attainments will more closely reflect genetic 
capacities. 

Automatic selection. Some eugenists have been 
dissatisfied with the potential for error or misuse 
in any arbitrary system of selection. Others see 
eugenic processes as requiring both broad application
across all families and stability beyond that
of most political systems. From these concerns have
come suggestions for automatic selection: economic
measures or social conditions under which natural 
selection will favor qualities of greatest value to 
society. 

If the newest methods of contraception become 
widely available, they may eliminate nearly all 
unwanted pregnancies. This in itself would bring 
family size more in line with parents’ capacities for 
education and achievement (Osborn 1963a). The 
rewarding of fertility with uniform family allow- 
ances is not eugenically effective, but some coun- 
tries offer other benefits. Highly effective automatic 
selection would require a more carefully planned 
eugenic environment (Osborn 1940). This might 
include high degrees of (1) social mobility, (2) in- 
dividual opportunity, and (3) voluntary assortative 
marriage, that is, between persons with similar 
abilities. It might also require special educational, 
economic, and social measures to make child rear- 
ing more acceptable to socially competent persons 
and less so to the socially inadequate. 

GORDON ALLEN 


[Other relevant material may be found in EVOLUTION;
GENETICS; RADIATION; and in the biographies of
DARWIN and GALTON.]

Ours is an age of social-action programs, where
large organization and huge expenditures go into 
the attempted solution of every conceivable social 
problem. Such programs include both private and 
public ventures and small-scale and large-scale 
projects, ranging in scope from local to national 
and international efforts at social change. When- 
ever men spend time, money, and effort to help 
solve social problems, someone usually questions 
the effectiveness of their actions. Sponsors, critics, 
the public, even the actors themselves, seek signs 
that their program is successful. Much of the as- 
sessment of action programs is irregular and, often 
by necessity, based upon personal judgments of 
supporters or critics, impressions, anecdotes, testi- 
monials, and miscellaneous information available 
for the evaluation. In recent years, however, there 
has been a striking change in attitudes toward 
evaluation activities and the type and quality of 
evidence that is acceptable for determining the 
relative success or failure of social-action programs. 

Two trends stand out in the modern attitude 
toward evaluation. First, evaluation has come to be 
expected as a regular accompaniment to rational 
social-action programs. Second, there has been a 
movement toward demanding more systematic, 
rigorous, and objective evidence of success. The
application of social science techniques to the appraisal
of social-action programs has come to be
called evaluation research. 

Examples of the applications of evaluation re- 
search are available from a wide variety of fields. 
One of the earliest attempts at building evaluation 
research into an action program was in the field 
of community action to prevent juvenile delin- 
quency. The 1937 Cambridge-Somerville Youth 
Study provided for an experimental and a control 
group of boys, with the former to receive special 
attention and advice from counselors and other 
community agencies. The plan called for a ten- 
year period of work with the experimental group 
followed by an evaluation that would compare the 
record of their delinquent conduct during that 
decade with the record of the control group. The 
results of the evaluation (see Powers & Witmer 
1951) showed no significant differences in con- 
duct favorable to the program. A subsequent long- 
term evaluation of the same program failed to find 
new evidence of less criminal activity by persons 
in the experimental group but added a variety of 
new theoretical analyses to the evaluation (McCord 
et al. 1959).

Several evaluations of programs in citizenship 
training for young persons have built upon one 
another, thus providing continuity in the field. 
Riecken (1952) conducted an evaluation of sum- 
mer work camps sponsored by the American 
Friends Service Committee to determine their im- 
pact on the values, attitudes, and opinions of the 
participants. His work was useful in specifying 
those areas in which the program was successful 
or unsuccessful as well as pointing up the impor- 
tance of measuring unsought by-products of action 
programs. Subsequently, Hyman, Wright, and Hopkins
carried out a series of evaluations of another
youth program, the Encampment for Citizenship 
(1962). Their research design was complex, in- 
cluding a comparison of campers’ values, attitudes, 
opinions, and behavior before and after a six-week 
program of training; follow-up surveys six weeks 
and four years after the group left the program; 
three independent replications of the original study 
on new groups of campers in later years; and a 
sample survey of alumni of the program. These 
various studies demonstrated the effectiveness of 
the program in influencing campers’ social atti- 
tudes and conduct; they also examined the dy- 
namics of attitudinal change. 

Evaluations have been made in such varied 
fields as intergroup relations, induced technological 
change, mass communications, adult education, 
international exchange of persons for training or
good will, mental health, and public health. Additional
examples of applications of evaluation research,
along with discussions of evaluation techniques,
are presented by Klineberg and others in a
special issue of the International Social Science
Bulletin (1955) and in Hyman and Wright (1966). 


Defining characteristics 


A scientific approach to the assessment of a 
program's achievements is the hallmark of modern 
evaluation research. In this respect evaluation re- 
search resembles other kinds of social research in 
its concern for objectivity, reliability, and validity 
in the collection, analysis, and interpretation of 
data. But it can be distinguished as a special form 
of social research by its purpose and the conditions 
under which the research must be conducted. Both 
of these factors affect such components of the 
research process as study design and its transla- 
tion into practice, allocation of research time and 
other resources, and the value or worth to be put 
upon the empirical findings.

The primary purpose of evaluation research is 
“to provide objective, systematic, and comprehen- 
sive evidence on the degree to which the program 
achieves its intended objectives plus the degree to 
which it produces other unanticipated conse- 
quences, which when recognized would also be 
regarded as relevant to the agency” (Hyman et al. 
1962, pp. 5-6). Evaluation research thus differs
in its emphasis from such other major types of 
social research as exploratory studies, which seek 
to formulate new problems and hypotheses, or ex- 
planatory research, which places emphasis on the 
testing of theoretically significant hypotheses, or 
descriptive social research, which documents the 
existence of certain social conditions at a given 
moment or over time (Selltiz et al. 1959). Since 
the burden is on the evaluator to provide firm 
evidence on the effects of the program under study, 
he favors a study design that will tend toward 
maximizing such evidence and his confidence in 
conclusions drawn from it. Although good evalua- 
tion research often seeks explanations of a pro- 
gram’s success or failure, the first concern is to 
obtain basic evidence on effectiveness, and there- 
fore most research resources are allocated to this 
goal. 

The conditions under which evaluation research 
is conducted also give it a character distinct from 
other forms of social research. Evaluation research 
is applied social research, and it differs from other 
modes of scholarly research in bringing together 
an outside investigator to guarantee objectivity and 
a client in need of his services. From the initial
formulation of the problem to the final interpretation
of findings, the evaluator is duty-bound to
keep in mind the very practical problem of assess- 
ing the program under study. As a consequence 
he often has less freedom to select or reject certain 
independent, dependent, and intervening variables 
than he would have in studies designed to answer 
his own theoretically formulated questions, such 
as might be posed in basic social research. The 
concepts employed and their translation into meas- 
urable variables must be selected imaginatively 
but within the general framework set by the nature 
of the program being evaluated and its objectives 
(a point which will be discussed later). Another 
feature of evaluation research is that the investi- 
gator seldom has freedom to manipulate the pro- 
gram and its components, i.e., the independent 
variable, as he might in laboratory or field experi- 
ments. Usually he wants to evaluate an ongoing or 
proposed program of social action in its natural 
setting and is not at liberty, because of practical 
and theoretical considerations, to change it for 
research purposes. The nature of the program be- 
ing evaluated and the time at which his services 
are called upon also set conditions that affect, 
among other things, the feasibility of using an ex- 
perimental design involving before-and-after meas- 
urements, the possibility of obtaining control 
groups, the kinds of research instruments that can 
be used, and the need to provide for measures of 
long-term as well as immediate effects. 

The recent tendency to call upon social science 
for the evaluation of action programs that are 
local, national, and international in scope (a trend 
which probably will increase in future years) and 
the fact that the application of scientific research 
procedures to problems of evaluation is compli- 
cated by the purposes and conditions of evaluation 
research have stimulated an interest in methodo- 
logical aspects of evaluation among a variety of 
social scientists, especially sociologists and psy- 
chologists. Methodological and technical problems 
in evaluation research are discussed, to mention 
but a few examples, in the writings of Riecken 
(1952), Klineberg (1955), Hyman et al. (1962), 
and Hayes (1959). 

While it is apparent that the specific translation 
of social-science techniques into forms suitable for 
a particular evaluation study involves research de- 
cisions based upon the special nature of the pro- 
gram under examination, there are nonetheless 
certain broad methodological questions common to 
most evaluation research. Furthermore, certain 
principles of evaluation research can be extracted 
from the rapidly growing experience of social scientists
in applying their perspectives and methods
to the evaluation of social-action programs. Such 
principles have obvious importance in highlighting 
and clarifying the methodological features of eval- 
uation research and in providing practical, if lim- 
ited, guidelines for conducting or appraising such 
research. The balance of this article will discuss 
certain, but by no means all, of these compelling 
methodological problems. 


Methodological steps and principles 


The process of evaluation has been codified into 
five major phases, each involving particular meth- 
odological problems and guiding principles (see 
Hyman et al. 1962). They are (1) the conceptuali- 
zation and measurement of the objectives of the 
program and other unanticipated relevant out- 
comes; (2) formulation of a research design and 
the criteria for proof of effectiveness of the pro- 
gram, including consideration of control groups 
or alternatives to them; (3) the development and 
application of research procedures, including pro- 
visions for the estimation or reduction of errors in 
measurement; (4) problems of index construction 
and the proper evaluation of effectiveness; and 
(5) procedures for understanding and explaining 
the findings on effectiveness or ineffectiveness. 
Such a division of the process of evaluation is 
artificial, of course, in the sense that in practice the 
phases overlap and it is necessary for the re- 
searcher to give more or less constant consideration 
to all five steps. Nevertheless it provides a useful 
framework for examining and understanding the 
essential components of evaluation research. 

Conceptualization. Each social-action program 
must be evaluated in terms of its particular goals. 
Therefore, evaluation research must begin with 
their identification and move toward their specifi- 
cation in terms of concepts that, in turn, can be 
translated into measurable indicators. All this may 
sound simple, perhaps routine, compared with the 
less structured situation facing social researchers 
engaged in formulating research problems for the- 
oretical, explanatory, descriptive, or other kinds 
of basic research. But the apparent simplicity is 
deceptive, and in practice this phase of evaluation 
research repeatedly has proven to be both critical 
and difficult for social researchers working in such 
varied areas as mental health (U.S. Dept. of 
Health, Education & Welfare 1955), juvenile delin- 
quency (Witmer & Tufts 1954), adult education 
(Evaluation Techniques 1955), and youth pro- 
grams for citizenship training (Riecken 1952; Hy- 
man et al. 1962), among others. As an example, 
Witmer and Tufts raise such questions about the
meaning of the concept “delinquency prevention”
as: What is to be prevented? Who is to be de- 
terred? Are we talking only about “official” delin- 
quency? Does prevention mean stopping misbe- 
havior before it occurs? Does it mean reducing 
the frequency of misbehavior? Or does it mean 
reducing its severity? 

Basic concepts and goals are often elusive, 
vague, unequal in importance to the program, and 
sometimes difficult to translate into operational 
terms. What is meant, for example, by such a goal 
as preparing young persons for “responsible citizen- 
ship”? In addition, the evaluator needs to consider 
possible effects of the program which were un- 
anticipated by the action agency, finding clues 
from the records of past reactions to the program 
if it has been in operation prior to the evaluation, 
studies of similar programs, the social-science lit- 
erature, and other sources. As an example, Carlson 
(1952) found that a mass-information campaign 
against venereal disease failed to increase public 
knowledge about these diseases; nevertheless, the 
campaign had the unanticipated effect of improv- 
ing the morale of public health workers in the 
area, who in turn did a more effective job of com- 
bating the diseases. The anticipation of both 
planned and unplanned effects requires consider- 
able time, effort, and imagination by the researcher 
prior to collecting evidence for the evaluation itself. 

Research design. The formulation of a research 
design for evaluation usually involves an attempt 
to approximate the ideal conditions of a controlled 
experiment, which measures the changes produced 
by a program by making comparisons of the de- 
pendent variables before and after the program and 
evaluating them against similar measurements on 
a control group that is not involved in the program. 
If the control group is initially similar to the group 
exposed to the social-action program, a condition 
achieved through judicious selection, matching, 
and randomization, then the researcher can use 
the changes in the control group as a criterion 
against which to estimate the degree to which 
changes in the experimental group were probably 
caused by the program under study. To illustrate, 
suppose that two equivalent groups of adults are
selected for a study on the effects of a training 
film intended to impart certain information to the 
audience. The level of relevant information is 
measured in each group prior to the showing of 
the film; then one group sees the film while the 
other does not; finally, after some interval, informa- 
tion is again measured. Changes in the amount of 
information held by the experimental group cannot 
simply be attributed to the film: they may also 
reflect the influence of such factors in the situation
as exposure to other sources of information in the
interim period, unreliability of the measuring in- 
struments, maturation, and other factors extra- 
neous to the program itself. But the control group 
presumably also experienced such nonprogram- 
matic factors, and therefore the researcher can 
subtract the amount of change in information 
demonstrated by it from the changes shown by the 
experimental group, thereby determining how much 
of the gross change in the latter group is due to 
the exclusive influence of the program.
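
The subtraction described above can be written out as a short sketch. The scores are hypothetical information-test results invented for the example, and the function name is an assumption; the calculation simply takes the gross change in the experimental group and removes the change that the control group shows over the same interval.

```python
from statistics import mean


def net_program_effect(exp_before, exp_after, ctrl_before, ctrl_after):
    """Estimate the change attributable to the program itself.

    Gross change in the experimental group minus the change in the
    control group, which stands in for extraneous (nonprogram) factors.
    """
    gross_change = mean(exp_after) - mean(exp_before)
    extraneous_change = mean(ctrl_after) - mean(ctrl_before)
    return gross_change - extraneous_change


# Hypothetical scores on an information test (invented data).
exp_before, exp_after = [4, 5, 6, 5], [8, 9, 9, 8]      # saw the film
ctrl_before, ctrl_after = [5, 4, 6, 5], [6, 5, 7, 6]    # did not see it

print(net_program_effect(exp_before, exp_after, ctrl_before, ctrl_after))
```

In this invented case the experimental group gains 3.5 points and the control group gains 1 point, so 2.5 points of the gross change would be credited to the film. The estimate is only as good as the initial equivalence of the two groups.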

So it is in the ideal case, such as might be 
achieved under laboratory conditions. In practice, 
however, evaluation research seldom permits such 
ideal conditions. A variety of practical problems 
requires alterations in the ideal design. As ex- 
amples, suitable control groups cannot always be 
found, especially for social-action programs involv- 
ing efforts at large-scale social change but also for 
smaller programs designed to influence volunteer 
participants; also ethical, administrative, or other 
considerations usually prevent the random assign- 
ment of certain persons to a control group that 
will be denied the treatment offered by the action 
programs. 

In the face of such obstacles, certain methodol- 
ogists have taken the position that a slavish in- 
sistence on the ideal control-group experimental 
research design is unwise and dysfunctional in 
evaluation research. Rather, they advocate the in- 
genious use of practical and reasonable alternatives 
to the classic design (see Hyman et al. 1962; and 
Campbell & Stanley 1963). Under certain condi- 
tions, for example, it is possible to estimate the 
amount of change that could have been caused by 
extraneous events, instability of measurements, and 
natural growth of participants in a program by 
examining the amount of change that occurred 
among participants in programs similar to the one 
being evaluated. Using such comparative studies 
as “quasi-control” groups permits an estimate of 
the relative effectiveness of the program under 
study, i.e., how much effect it has had over and 
above that achieved by another program and as- 
sorted extraneous factors, even though it is im- 
possible to isolate the specific amount of change 
caused by the extraneous factors. Another proce- 
dure for estimating the influence of nonprogram- 
matic factors is to study the amount of change 
which occurs among a sample of the population 
under study during a period of time prior to the 
introduction of the action program, using certain 
of the ultimate participants as a kind of control 
upon themselves, so to speak. Replications of the 
evaluation study, when possible, also provide
safeguards against attributing too much or too little
effect to the program under study. Admittedly, all
such practical alternatives to the controlled experi- 
mental design have serious limitations and must 
be used with judgment; the classic experimental 
design remains preferable whenever possible and 
serves as an ideal even when impractical. Never- 
theless, such expedients have proven useful to 
evaluators and have permitted relatively rigorous 
evaluations to be conducted under conditions less 
perfect than those found in the laboratory. 

Error control. Evaluation studies, like all social 
research, involve difficult problems in the selection 
of specific research procedures and the provision 
for estimating and reducing various sources of 
error, such as sampling bias, bias due to non- 
response, measurement errors arising in the ques- 
tions asked or in recording of answers, deliberate 
deception, and interviewer bias. The practices 
employed to control such errors in evaluation 
research are similar to those used in other forms 
of social research, and no major innovations have 
been introduced. 

Estimating effectiveness. To consider the fourth 
stage in evaluation, a distinction needs to be made 
between demonstrating the effects of an action 
program and estimating its effectiveness. Effective- 
ness refers to the extent to which the program 
achieves its goals, but the question of just how 
much effectiveness constitutes success and justifies 
the efforts of the program is unanswerable by 
scientific research. It remains a matter for judg- 
ment on the part of the program’s sponsors, ad- 
ministrators, critics, or others, and the benefits, of 
course, must somehow be balanced against the 
costs involved. The problem is complicated further 
by the fact that most action programs have multi- 
ple goals, each of which may be achieved with 
varying degrees of success over time and among 
different subgroups of participants in the program. 
To date there is no general calculus for appraising 
the over-all net worth of a program. 

Even if the evaluation limits itself to determin- 
ing the success of a program in terms of each 
specific goal, however, it is necessary to introduce 
some indexes of effectiveness which add together 
the discrete effects within each of the program's 
goal areas. Technical problems of index and scale
construction have been given considerable atten- 
tion by methodologists concerned with various 
types of social research (see Lazarsfeld & Rosen- 
berg 1955). But as yet there is no theory of index 
construction specifically appropriate to evaluation 
research. Steps have been taken in this direction, 
however, and the utility of several types of indexes 
has been tentatively explored (see Hyman et al. 
1962). One type of difficulty, for example, arises 
from the fact that the amount of change that an
action program produces may vary from subgroup 
to subgroup and from topic to topic, depending 
upon how close to perfection each group was be- 
fore the program began. Thus, an information pro- 
gram can influence relatively fewer persons among 
a subgroup in which, say, 60 per cent of the people 
are already informed about the topic than among 
another target group in which only 30 per cent 
are initially informed. An “effectiveness index” has 
been successfully employed to help solve the prob- 
lem of weighting effectiveness in the light of such 
restricted ceilings for change (see Hovland et al. 
1949; and Hyman et al. 1962). This index, which
expresses actual change as a proportion of the 
maximum change that is possible given the initial 
position of a group on the variable under study, 
has proven to be especially useful in evaluating the 
relative effectiveness of different programs and the 
relative effectiveness of any particular program 
for different subgroups or on different variables. 
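
The index just described, actual change divided by the maximum change possible from the group's starting position, can be sketched in a few lines of Python. The function name and the percentages are hypothetical, chosen only to show how the index corrects for different ceilings.

```python
def effectiveness_index(pct_before, pct_after):
    """Actual change as a proportion of the maximum possible change.

    Both arguments are percentages (0-100) of a group holding the
    desired position, for example being informed about a topic.
    """
    ceiling = 100.0 - pct_before
    if ceiling <= 0:
        raise ValueError("no room for change: group already at 100 per cent")
    return (pct_after - pct_before) / ceiling


# Hypothetical subgroups: one starts at 60 per cent informed, one at 30.
well_informed = effectiveness_index(60, 80)    # gains 20 of a possible 40 points
poorly_informed = effectiveness_index(30, 65)  # gains 35 of a possible 70 points
print(well_informed, poorly_informed)  # 0.5 0.5
```

In this invented case the raw gains differ (20 points against 35), yet the index rates the program as equally effective in both subgroups, because each realized half of the change open to it.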

Understanding effectiveness. In its final stage, 
evaluation research goes beyond the demonstration 
of a program’s effects to seek information that will 
help to account for its successes and failures. The 
reasons for such additional inquiry may be either 
practical or theoretical. 

Sponsors of successful programs may want to 
duplicate their action program at another time or 
under other circumstances, or the successful pro- 
gram may be considered as a model for action by 
others. Such emulation can be misguided and even 
dangerous without information about which as- 
pects of the program were most important in bring- 
ing about the results, for which participants in the 
program, and under what conditions. Often it is 
neither possible nor necessary, however, to detect 
and measure the impact of each component of a 
social-action program. In this respect, as in others 
noted above, evaluation research differs from ex- 
planatory survey research, where specific stimuli 
are isolated, and from experimental designs, where 
isolated stimuli are introduced into the situation 
being studied. In evaluation research the independ- 
ent variable, i.e., the program under study, is 
usually a complex set of activities no one of which 
can be separated from the others without changing 
the nature of the program itself. Hence, explana- 
tions of effectiveness are often given in terms of 
the contributions made by certain gross features 
of the program, for example, the total impact of 
didactic components versus social participation in 
a successful educational institution. 

Gross as such comparisons must be, they never- 
theless provide opportunities for testing specific 
hypotheses about social and individual change, 
thereby contributing to the refinement and growth
of social science theories. It is important to remem- 
ber, however, that such gains are of secondary 
concern to evaluation research, which has as its 
primary goal the objective measurement of the 
effectiveness of the program.

Certain forms of research design promise to 
yield valuable results both for the primary task 
of evaluation and its complementary goal of en- 
larging social knowledge. Among the most promis- 
ing designs are those that allow for comparative 
evaluations of different social-action programs, rep- 
lication of evaluations of the same program, and 
longitudinal studies of the long-range impact of 
programs. Comparative studies not only demon- 
strate the differential effectiveness of various forms 
of programs having similar aims but also provide 
a continuity in research which permits testing the- 
ories of change under a variety of circumstances. 
Replicative evaluations add to the confidence in the 
findings from the initial study and give further 
opportunity for exploring possible causes of change.
Longitudinal evaluations permit the detection of 
effects that require a relatively long time to occur 
and allow an examination of the stability or loss 
of certain programmatic effects over time and 
under various natural conditions outside of the 
program's immediate control. 


Viewed in this larger perspective, then, evalua- 
tion research deserves full recognition as a social 
science activity which will continue to expand. It
provides excellent and ready-made opportunities to 
examine individuals, groups, and societies in the 
grip of major and minor forces for change. Its 
applications contribute not only to a science of 
social planning and a more rationally planned so- 
ciety but also to the perfection of social and psy- 
chological theories of change. 

CHARLES R. WRIGHT 


[See also EXPERIMENTAL DESIGN; SURVEY ANALYSIS.]
I
THE CONCEPT OF EVOLUTION 


There are few concepts that appear in the his- 
tory of ideas that are common to many realms of 
thought in social and natural science and to phi- 
losophy in general. Evolution is such a concept, 
and its origin as a doctrine was deeply embedded 
in the social and economic conditions of the
industrial West. Like all such world views, it
embodies many principles, not all of which are
admitted in its various uses, so that even those 
concerned with organic evolution are unable to 
agree on the essence of the idea of evolution. 


Toward a definition 


There is a hierarchy of principles in the evo- 
lutionary world view: change, order, direction, 
progress, and perfectibility. Evolutionary theories 
are distinguished by how many of these are suc- 
cessively included as essential. Some evolutionists 
include only change and order, others add direc- 
tion, and some few, like Teilhard de Chardin, 
believe in perfectibility as well. 

Change. The idea of evolution, in its simplest 
form, is that the current state of a system is the 
result of a more or less continual change from its 
original state. The qualification that the change 
be continual, or at least frequent or regular, is an 
essential one and distinguishes evolutionary from 
static world views. For example, many opponents 
of organic evolution in the nineteenth century ac- 
cepted the authenticity of fossils but regarded them 
as antediluvian, as evidence of one or several 
floods that caused a total replacement of the 
world’s fauna. Diluvianism, like the closely related 
vulcanism, which postulated a series of inunda- 
tions of the earth by lava, was a theory of cata- 
strophic and irregular change. There is no funda- 
mental difference between a theory of special 
creation that populates the world once with un- 
changing beings and one that populates it several 
times. They are both theories of special interven- 
tion of unique forces in an otherwise normally 
static system. The distinction between such a world 
view and an evolutionary one is important for our 
understanding of the social and economic origins 
of evolutionism. 

Closely related to the idea that change is a 
characteristic of a system is the principle of 
uniformitarianism, the principle that the forces 
causing change are themselves unchangeable gen- 
eral laws that govern the system. Thus, geological 
evolution is seen as the result of processes of 
mountain-building, sedimentation, and erosion that 
have gone on throughout the history of the earth, 
at least since the time when liquid water was 
present in appreciable quantities. In like manner 
the processes of natural selection and mutation 
that can be seen occurring in the organic world 
today are assumed to have been the operative 
forces in all the past history of life. Moreover, since 
such forces are operating at present, it must be 
concluded that evolution is still going on. A
commitment to an evolutionary viewpoint represents
a commitment to the instability of the present 
order as well as the past. In its simplest and irre- 
ducible form, evolutionism is the doctrine that 
change of state is an unvarying characteristic of 
natural systems and human institutions and that 
such change follows immutable laws. 

Order. While all evolutionary thought assumes 
continual change, there is some problem of dis- 
tinguishing “real” change from a stasis that has 
only the appearance of change. Let us imagine a 
deck of cards being shuffled over and over again. 
In one sense it is obvious that the state of the 
deck is undergoing continual change, since the 
cards are being rearranged. But is the deck evolv- 
ing? For Bergson and Whitehead it is not, because 
only alternate states of chaos are succeeding each 
other. Plus ça change, plus c'est la même chose. 
Out of this chaos some organization must appear 
to be true evolution, and the appearance of new 
organization, in the view of Whitehead and his 
followers, is the characteristic of an evolutionary 
process. In Science and the Modern World White- 
head says: “Evolution, on the materialistic theory, 
is reduced to the rôle of being another word for 
the description of changes of the external relations 
between portions of matter. There is nothing to 
evolve, because one set of external relations is as 
good as any other set of external relations. There 
can merely be change, purposeless and
unprogressive” ([1925] 1960, p. 157).

But how can we know order as against chaos, 
except that we have a preconception of order and 
purpose? To return to the analogy of the deck of 
cards, if after repeated shuffling, the deck turned 
out to be grouped by suits, we would certainly say 
that order had been created. An even higher de- 
gree of order would be ascribed to the deck if, in 
addition, the cards within the suits were arranged 
in ascending sequence. After all, in poker a royal 
flush wins and a mixed hand loses. Yet, on any 
objective criterion a royal flush has exactly the 
same probability as any given mixed hand, and a 
completely ordered deck is not less probable than 
any other given arrangement. The appearance of 
order is the correspondence between the arrange- 
ment of objects and a preconception. 
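
The point about equal probabilities can be checked by counting. The short Python sketch below is an illustration added to the argument, not part of it; the variable names are arbitrary.

```python
from fractions import Fraction
from math import comb, factorial

# Number of distinct five-card hands that can be dealt from 52 cards.
hands = comb(52, 5)

# Any one fully specified hand, whether a particular royal flush or a
# particular mixed hand, is exactly one of these equally likely hands.
p_specific_hand = Fraction(1, hands)

# Only when four hands are grouped under the label "royal flush"
# does a class probability appear; the grouping is the preconception.
p_any_royal_flush = Fraction(4, hands)

# Likewise each complete ordering of the deck, sorted or not, is one
# of 52! equally likely arrangements.
p_specific_deck = Fraction(1, factorial(52))

print(hands)              # 2598960
print(p_any_royal_flush)  # 1/649740
```

A royal flush seems improbable only because the countless mixed hands are lumped together as one undistinguished class while the four royal flushes are singled out by the rules of the game.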

The demand that an evolutionary process create 
order, or at least that there be a change from one 
order to a different order, shows clearly that evo- 
lution, in this sense, is neither a fact nor a theory, 
but a way of organizing knowledge. 

In contrast to Whitehead, some modern evolu- 
tionists are willing to accept any rearrangement 
of the parts of a structure as evolution. Thus 
Dobzhansky (1937) defines organic evolution as
“a change in the genetic composition of popula- 
tions,” and he speaks for most students of organic 
evolution when he says that there are no differ- 
ences between organisms that cannot be accounted 
for in this way. But a change in the genetic com- 
position of populations is not different in essence 
from a reshuffling of cards in that it is for the 
most part only a change in the relative frequency 
of elements, all of which are already present. The 
question of order marks the separation between the 
completely positivistic evolutionism inherent in 
Dobzhansky’s definition and the creative evolution- 
ism of Bergson and Teilhard. For, once it is pro- 
posed that order is the natural outcome of an evo- 
lutionary process, ideas of direction, progress, and 
perfectibility follow swiftly. 

Direction. By direction in evolution we mean 
the concept that there is some natural linear order 
of states of the system and that an evolutionary 
process can be described as passing through suc- 
cessive states in this linear order. That is, evolu- 
tion can be described by a line on a two-dimensional 
graph with time on one axis and some description 
of the system on the other. Moreover, this line is 
supposed to be always ascending or descending. 
But this description limits evolution to those attri- 
butes for which the human mind can make a 
sensible linear order. It must be possible to describe 
an evolutionary process as one in which something 
or other “tends to increase.” Thus, it is insufficient 
to describe the evolution of human culture in terms 
of a change from hunting and gathering to agri- 
culture, from agriculture to industry. These modes 
of organization must somehow be placed on a 
graded scale as, for example, the degree of division 
of labor (Durkheim) or the degree of complexity 
(Spencer). 

The attempt to find the proper scale on which 
such a directionality can be measured has been a 
preoccupation of nonpositivist evolutionists and is 
the chief point at issue among them. Complexity 
is the scale most appealed to in organic and social 
evolution. For organic evolution it is supposed that 
modern organisms have a more complex structure 
than primitive ones, just as mammals are thought 
to be more complex than bacteria. Coupled with 
this idea of increase in structural complexity is the 
theory that the information content of modern 
organisms is greater than for past forms. Evolu- 
tion is, on this theory, a process of accumulation of 
information about the environment in the complex 
structure of organisms. Finally, the supposed ac- 
cumulation of information is thought to be a re- 
versal of the second law of thermodynamics, which 
prescribes an increase of entropy with time and
thus an increase in the randomness of the uni- 
verse. Evolutionists sometimes talk of the accumu- 
lation of “negentropy” in organic evolution, mark- 
ing off life from the inorganic cosmos. 

This view of organic evolution, which is sup- 
posed to apply not only to the structure of organ- 
isms but to the interrelations between organisms 
in the total biosphere, suffers from a number of 
serious difficulties. First, it would be difficult to 
show exactly in what sense mammals are more 
complex than bacteria. There is no doubt that there 
are many more kinds of tissues in mammals, but 
bacteria are capable of carrying out many syn- 
thetic reactions not possible for mammals. At the 
level of cell physiology and metabolism, bacteria— 
bringing a greater synthetic repertory—must be 
regarded as more complex. Moreover, even if we 
assume that modern organisms are more complex 
structurally than those of the Cambrian, no cri- 
terion of complexity can distinguish between mam- 
mals and bony fish, although there are 270 million 
years between the first appearance of each. Second, 
the relation between structural complexity and in- 
formation about the environment is not perfectly 
clear. No one knows exactly how to measure the 
information contained in any organism. It might
be done, as in the Shannon-Wiener solution, by
regarding the genes as a code made up of three-letter
words with a four-letter alphabet, corresponding to 
what is known about the molecular basis of he- 
redity. If this is done, however, many invertebrates 
turn out to have more information than many 
vertebrates, and among bacteria there is a very 
great variation. The real difficulty is that the equa- 
tion between complexity and information has been 
chiefly a metaphorical rather than an exact one. 
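
The measure mentioned here can be made concrete under a simplifying assumption. The Python sketch below treats every letter of the four-letter alphabet as equally likely and independent of its neighbors, which gives an upper bound rather than a true measurement; the two sequence lengths are hypothetical and stand for no real organism.

```python
from math import log2

ALPHABET_SIZE = 4  # four-letter alphabet
WORD_LENGTH = 3    # three-letter words

possible_words = ALPHABET_SIZE ** WORD_LENGTH  # 64 distinct words
bits_per_letter = log2(ALPHABET_SIZE)          # 2.0 bits
bits_per_word = WORD_LENGTH * bits_per_letter  # 6.0 bits


def max_information_bits(n_letters):
    """Upper bound on information, assuming equiprobable independent letters."""
    return n_letters * bits_per_letter


# Hypothetical sequence lengths, in letters.
short_sequence = max_information_bits(1_000_000)
long_sequence = max_information_bits(3_000_000)
print(possible_words, bits_per_word)   # 64 6.0
print(long_sequence / short_sequence)  # 3.0
```

On this reckoning the "information" of an organism is simply proportional to the length of its hereditary material, which is why the measure ranks organisms in ways that need not agree with any intuitive scale of complexity.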

Finally, the equation of information and com- 
plexity with a thermodynamic measure of entropy 
is based on a misunderstanding of the kinetic the- 
ory of gases. The second law of thermodynamics, 
in the early nineteenth century, represented the 
beginning of modern evolutionary cosmology. The 
term “entropy,” used for a property of the universe 
that always increases, has the same meaning, 
etymologically, as “evolution.” Originally, the in- 
crease in entropy only signified that different parts 
of a physical system became more and more alike 
in their energy content, so that less and less useful 
work could be obtained from an interaction
between them. The kinetic theory of gases provided
a picture of molecules moving and colliding, and 
thus explained the gross observation of heat and 
work. This, in turn, led to a new interpretation of 
the second law as guaranteeing that a collection of 
molecules in any region of space would eventually
have the same distribution of kinetic energies as 
in any other region. Two confusions have arisen 
about the kinetic theory of gases that have had an 
important effect on evolutionary thinking. First, it 
is supposed that individual molecules will all have 
the same kinetic energy rather than that assem- 
blages of molecules will have the same statistical 
distribution of energies. Second, there is a con- 
fusion of kinetic energy of molecules with general 
kinetic and potential energies, especially gravita- 
tional and electromagnetic potential. These two 
confusions give rise to an erroneously derived gen- 
eralized second law stating that all the molecules 
in the universe will eventually be equally spaced 
out from each other. Given such a formless and 
orderless end, the evolution of life does indeed seem 
to go in the opposite direction. 

The tendency to turn the second law of thermo- 
dynamics into a generalized evolutionary world 
view (with life as an exceptional countercurrent)
has been further encouraged by confusion of 
thermodynamics with yet another evolutionary 
cosmology, the “expanding universe.” According 
to this cosmogony, the material universe came into 
being on the order of 10 thousand million years 
ago in a small region of space, the matter exploded 
outward, and the material cosmos will continue to 
expand forever from this original point in space. 
A consequence of a fixed amount of matter occupy- 
ing a larger and larger volume is that matter is 
becoming more thinly spread globally but not 
necessarily locally. The theory of the expanding 
universe does not demand, for example, that the 
earth break up and its pieces spread apart. 

In its own way the theory of the expanding uni- 
verse is another example of the search for direc- 
tionality in evolutionary systems. More recently, 
Bondi, Gold, Hoyle, and others have given cur- 
rency to nondirectional theories of the cosmos. One 
is a steady-state theory that allows for expansion 
but holds density everywhere in dynamic equi- 
librium by continual creation of new matter. The 
other is an oscillation theory, which postulates a 
cyclic expansion and contraction of the material 
cosmos. 

Homeostasis, introduced by Cannon as a prin- 
ciple of physiology and evolution, is related to com- 
plexity. Homeostasis is the property of a system 
to hold constant certain of its elements despite 
external disturbing forces. What is held constant 
are those qualities of the system that are necessary 
for its maintenance, such as body temperature in 
a mammal or ionic strength of the blood. This 
property of homeostasis is then extended by
evolutionists to include communities of organisms
occupying different but coordinated positions in 
the natural economy. Thus, the relation between 
numbers and efficiency of carnivores that prey on 
herbivores and herbivores that crop the grass is 
thought to be stabilized by the process of evolution, 
so that fluctuation in the abundance of any of 
these organisms is compensated by changes in the 
others. The result is a stable community structure. 
The notion of stability is appealing to modern evo- 
lutionists, who see evolution as self-fulfilling, as a 
stabilization of life in a capricious universe. For 
the nineteenth century it was quite another matter. 
Is Nature 


“So careful of the type?” But no. 
From scarped cliff and quarried stone 
She cries, “A thousand types are gone: 
I care for nothing, all shall go.” 
Tennyson, In Memoriam, Part 56, Stanza 1 


Perhaps the only evolutionist doctrine that con- 
tains no important element of direction is evolu- 
tionary geology, which is entirely a cyclic theory. 
Mountain-building revolutions are followed by 
erosion, the formation of featureless peneplains, 
the deposition of sediments in the seas, followed by 
new uplift and new mountain-building. Of course, 
it is supposed that the final cooling of the earth’s 
core will at last put an end to the cycle, but this 
cooling is the only vestige of directionality that is 
apparent. Geology remains, among the historical 
sciences, obdurately materialistic and positivist. 

Progress. It is not always easy to differentiate 
evolutionist doctrines of simple direction from those 
with an element of progress. I distinguish them 
by the moral or, better, moralistic tone of pro- 
gressivism, but moralism is sometimes well hidden. 
For example, the doctrine that homeostasis gives 
direction to evolution is sometimes arrived at be- 
cause man is assumed a priori to be the measure 
of evolution, and it is fairly easy to make the case 
that man, the rational mammal, is most
homeostatic. “L'Homme, seul paramètre absolu de l'Évolution”
is even more anthropocentric than it seems
at first sight, for Teilhard de Chardin (1956) is 
referring not simply to the history of life but cosmic 
evolution! But this is simply Whitehead brought 
up to date, Whitehead who, in Modes of Thought, 
divides occurrences in nature into six types: “The 
first type is human existence, body and mind. The 
second type includes all sorts of animal life, insects, 
the vertebrates, and other genera. . . . The third
type includes all vegetable life. . . . The sixth type 
is composed of the happenings on an infinitesimal 
scale, disclosed by the minute analysis of modern 
physics” (1938, p. 214). Man leads all the rest. The
shibboleths of progressivism are the superiority of 
man in the cosmos, industrial man in the world 
economy, and liberal democratic man in world 
society. 

Spencer makes extensive use of the term “prog- 
ress,” but in a way that seems not to have a norma- 
tive or moralistic overtone: “From the earliest 
traceable cosmical changes down to the latest 
results of civilization, we shall find that the trans- 
formation of the homogeneous into the heterogene- 
ous is that in which progress essentially consists” 
([1857] 1915, p. 10). In “Progress: Its Law and 
Cause,” Spencer shows that this transformation has 
occurred in music, poetry, society, government, 
manufacturing, commerce, language, and so on. He 
cautions against normative definition of progress. 
“Leaving out of sight concomitants and beneficial 
consequences, let us ask what progress is in itself” 
(ibid., p. 9). But this is a very curious question 
to ask, what progress is in itself, for does not
progress, as opposed to simple change, imply a moral
direction? What Spencer has done is to equate 
progress with change, to say that change, whatever 
its direction may turn out to be, is progressive by 
its very nature. We come again to that nineteenth- 
century belief that change is good, in Spencer’s 
words, “a beneficent necessity.”

Most modern students of natural evolution, both 
organic and social, have taken a step toward mate- 
rialism in omitting the idea of progress from their 
systems. An exception is B. Rensch (1947), who 
distinguishes higher and lower forms of life and 
devotes a special category of evolutionary change, 
anagenesis, to evolution of higher from lower. While 
he includes under this rubric increases in stability, 
homeostasis, and complexity that are discussed 
here simply as directional rather than progressive, 
Rensch clearly regards man and especially human 
freedom as the highest and best product of evolution. 

Perfectibility. With the exception of the philos- 
opher Teilhard, modern evolutionism does not con- 
tain a utopian element. On the contrary, evolution 
is generally envisaged as an endless process with 
no particular perfect end or goal. There is some 
logical difficulty, however, in maintaining that evo- 
lution leads to greater homeostasis, greater cerebral- 
ization, greater adaptation, while ignoring the 
possibility of perfect homeostasis, complete cer- 
ebralization, or absolute adaptation. It is not at all 
obvious how homeostasis of individuals or com- 
munities can continue to increase forever. Never- 
theless, this issue is generally ignored or, as in the 
case of Spencer, deliberately set aside. The single
important exception is in evolutionary economics,
especially various utopian socialisms. Marxism, es- 
pecially as interpreted by Lenin and Trotsky, is a 
straightforward progressivist, perfectionist evolu- 
tionary theory. A stage of primitive capitalist accu- 
mulation through exploitation of the workers and 
colonies enriches the society. This is accompanied 
by the bourgeois revolution that leads to liberal 
bourgeois democracy. In turn comes the proletarian 
revolution, proletarian democracy, a breakdown of 
national interests in the face of class interests, and 
a final total leveling of the class structure. In this 
utopian scheme there will still be division of labor, 
but the “entropy” of the social order will be at a 
maximum. The parallel is with thermodynamic 
evolution, which is also a leveling theory and is in 
contrast with those views of organic evolution that 
depend upon an increase in differentiation, com- 
plexity, storage of information, and a decrease in 
entropy. 


Evolution and history 


There are close parallels between the methods 
and statements of evolutionist doctrines in the 
natural sciences and the methods and statements 
of historiography. Geology, cosmology, and organic 
evolution are historical sciences in that they are 
descriptions of, and attempts to explain, past events 
in the light of present occurrences. The problems 
of making laws or lawlike statements about the 
past and prediction of the future are the same 
whether the focus of interest is human history or 
the history of all organic life. Karl Popper, in 
The Poverty of Historicism (1957), asks the ques- 
tion, “Can there be a law of evolution?” and an- 
swers, “‘No,’ the search for the law of the ‘unvary- 
ing order’ in evolution cannot possibly fall within 
the scope of scientific method, whether in biology 
or in sociology. . . . The evolution of life on earth, 
or of human society, is a unique historical process”
(pp. 107, 108).

The chief difficulty of the historical sciences is 
that they fail to meet what is widely accepted as 
the norm for a science, Popper's criterion of falsifi- 
ability. For Popper, scientific laws are universal 
statements (“All swans are black, all planets move 
in ellipses,” etc.) and therefore are really prohibi- 
tions (“A white swan cannot be found, no planet 
moves in a circle,” etc.). Such prohibitions provide
a program for testability, for if one wishes to test 
the universal law about swans, he does not look for 
black ones but white ones. If a white swan is found, 
the law is disproved. A white swan is a potential 
falsifier of the law, and any statement that has no 
potential falsifier (any existentially quantified
statement falls in this category) is metaphysical and
ought to be excluded from a science. 
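
The asymmetry between the two kinds of statement can be shown in a small Python sketch. It is an illustration of the logic only; the function names and the sample of swans are hypothetical.

```python
def refuted_by(observations, law):
    """Return the first observation violating a universal law, or None."""
    for obs in observations:
        if not law(obs):
            return obs
    return None


def confirmed_by(observations, claim):
    """Return True if some observation satisfies an existential claim."""
    return any(claim(obs) for obs in observations)


def is_black(colour):
    return colour == "black"


def is_white(colour):
    return colour == "white"


# Universal law "all swans are black": one white swan refutes it.
print(refuted_by(["black", "black", "white"], is_black))  # white

# Existential claim "there exists a white swan": a sample without one
# fails to confirm it but cannot refute it, however large the sample.
print(confirmed_by(["black", "black", "black"], is_white))  # False
```

A universal law can be overturned by a single finite observation, whereas failure to find an instance of an existential statement never shows that no instance exists.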

The trouble with historical sciences like organic 
evolution is that they are almost entirely made up 
of existential rather than universal statements. We 
may take modern Darwinism as an example. It 
asserts that the organisms now living have evolved 
from ancestral organisms of a different nature and 
offers the fossil record as direct evidence. More- 
over, it asserts that the mechanism of this change 
is embodied in three principles: (1) different indi- 
viduals in a species have different morphologies, 
physiologies, behaviors, that is, there is variation; 
(2) there is a correlation between the form of the 
parents and the offspring, that is, the variation is 
heritable; and (3) different variants have different 
rates of survival and reproduction in different 
environments. 

Let us now examine these assertions in the light 
of Popper's criterion. The evidence that evolution 
has in fact occurred is contained in the succession 
of fossils found in different geological strata. 
From the fossil record we can state with confidence 
that there are many kinds of animals and plants 
that, having once existed, no longer exist. But that 
statement of itself, far from being a universal state- 
ment, is an existential one; in fact it is a historical 
statement, exactly corresponding to the assertion 
that Napoleon once lived or that Martin Luther died 
on February 18, 1546, at Eisleben. 

Can we push this observation further and say 
that all animals and plants in the fossil record are 
of a kind no longer represented? No, because that 
does not happen to be true. But can we at least 
make the much more interesting and important 
hypothesis that all kinds of animals and plants 
eventually will be supplanted by other forms? This 
also is not falsifiable and is really identical to the 
assertion that every man has his price. It says, in 
fact, that every species is mortal, that there exists 
a time in the future at which any given species will 
no longer exist. If statements about the univer- 
sality of evolution are historicist rather than scien- 
tific, it still might be that the principles underlying 
the mechanism of evolution are falsifiable in ex- 
plaining any particular case of evolution. But that 
is clearly not the case. The statement of natural 
selection is that there exists an environment—a 
combination of temperature, humidity, food, soil, 
competition of other forms—in which different var- 
iants will have different relative reproductive rates. 
But applied to the past or the future, such a state- 
ment has such vast explanatory and predictive 
power that it is empirically empty. To say that the
dinosaurs became extinct because some change in 
environment caused their rate of reproduction to 
be lowered below the replacement point, or to say 
that certain amphibia gave rise to reptiles because 
some environment existed which favored heritable 
variation in that direction, is, by Popper's criterion, 
to say nothing. 

An example of this difficulty is the argument of 
the selectionist in explaining the observed differ- 
ences among populations of present-day organisms. 
Why are the frequencies of blood types different 
in different human races? The selectionist says 
these have arisen by natural selection, and even if 
no differential survival can presently be discovered 
(as none can), things used to be different and, at 
one time, the different races lived in such different 
environments that they were differentially selected. 
Such an argument bears more than a little similar- 
ity to the claim of Hegel that one cannot act on 
principles deduced from history, because “each 
period is involved in such peculiar circumstances, 
exhibits a condition of things so strictly idiosyn- 
cratic, that its conduct must be regulated by con- 
siderations connected with itself, and itself alone.” 

Both historical explanation and evolutionary sci- 
ences can be concerned only with offering sufficient 
explanations for past events and with prescribing 
possible future events on the basis of the observa- 
tion of present processes. 


Evolutionism—social and economic matrix 


While there is no doubt that the publication of 
Darwin's On the Origin of Species (1859) led to 
an almost immediate explosion of evolutionary 
thought, it was only the percussion cap for a charge 
already set. Because a theory of organic evolution 
touched upon man’s origin and presented a materi- 
alistic challenge to his preordained primacy in the 
universe, it was bound to excite great interest. 
Nevertheless, the Origin of Species appeared in the 
middle of a period of rampant evolutionism and 
radical political and social change. It served as the 
issue over which the battle between stasis and 
change could be fought, a battle for the final su- 
premacy of a world view that had been making 
steady gains since the beginning of the eighteenth 
century. 

Darwin was the inheritor, not the creator, of the 
general preoccupation with evolutionism. This is 
made clear by Spencer in his Principles of Biology 
(1864-1867), when he argues that one of the chief 
evidences for organic evolution is that, after all, 
everything else evolves. 
It is now universally admitted by philologists, that
languages, instead of being artificially or supernaturally 
formed, have been developed. And the histories of reli- 
gion, of philosophy, of science, of the fine arts, of the 
industrial arts show that these have passed through 
stages. . . . If, then, the recognition of evolution as the
law of many diverse orders of phenomena, has been 
spreading, may we not say that there thence arises 
the probability that evolution will be recognized as the 
law of the phenomena we are considering? ([1857] 
1915, pp. 432-433) 


The theory of organic evolution will be in its 
proper historical perspective if it is remembered 
that evolutionary cosmology had been founded in 
Kant’s Metaphysical Foundations of Natural Sci- 
ence of 1786 and in Laplace’s nebular hypothesis 
of 1796. At about the same time, Hutton was form- 
ing modern geology by his rejection of the cata- 
strophic theories of the origin of geological forma- 
tions and his introduction of the principle of 
uniformitarianism. Although the term “entropy” 
was not introduced until 1865 by Clausius, the 
second law of thermodynamics was formulated by 
Sadi Carnot thirty years earlier. By the time of the 
publication of the Origin of Species, the physical 
sciences were already thoroughly evolutionist in 
outlook. Moreover, the fact of the evolution of living 
forms, although not a mechanism for that evolu- 
tion, was accepted widely in scientific and literary 
circles. Darwin’s grandfather, Erasmus Darwin, in 
the Temple of Nature (1803, p. 3) invokes the 
Muse to say “How rose from elemental strife, or- 
ganic forms, and kindled into life.” And his Muse 
reports that even “imperious man, who rules the 
bestial crowd, . . . arose from rudiments of form 
and sense” (p. 28). Less romantic students of nat-
ural history, like Buffon and especially Lamarck,
had, by the beginning of the nineteenth century, 
fully developed theories of the transformation of 
species. Even Diderot in 1769 in Le rêve de d'Alem-
bert asks: “Qui scait les races d’animaux qui nous
ont précédés? Qui scait les races d’animaux qui
succéderont aux nôtres? Tout change, tout passe,
il n'y a que le tout qui reste” ([1830] 1951, p. 56,
my italics). Seventy years later we hear the echo 
in Tennyson: “The old order changeth, yielding 
place to new” (“Morte d'Arthur,” l. 408). 

It is often thought obvious that scientific dis- 
covery influences the direction of social and eco- 
nomic change, or at least its rate. But what must 
be even more true is that social and economic 
world views must permeate science. No appeal to 
a Zeitgeist is implied by such a relationship, for 
the meaning of Zeitgeist is that science and other 
social activities respond equally to some spirit of
the age whose source and power are unknown. To
appeal to Zeitgeist is to reject any legitimate theory 
of historical causation. On the other hand, there is 
nothing mystical about the way in which notions 
of cause and effect, choice and chance, determinacy 
and freedom, spread from one science to another.
Equally, it is entirely within the normal picture of 
historical causation that general social attitudes 
and economic relationships between social classes 
should have a profound effect upon the accept- 
ability and apparent reasonableness of scientific 
hypotheses. Science is, after all, a social activity. 

Prior to the eighteenth century, European social 
systems were characterized by a determinist world 
view. A man was born to his estate and occupied 
it by divine providence. Fixity and static stability 
were the mark of society, and radical changes in 
position could occur only as exceptional with- 
drawals or extensions of divine grace. Although 
Charles I was king of England Dei gratia, he could
be deposed because, as Cromwell said, divine grace 
had been removed from him. The fact of his 
severed head was sufficient proof of that. Occa- 
sionally a man might rise from low estate to be 
the counselor of kings, but again only an extraor- 
dinary grace made this possible. Species were fixed 
as was the position of the earth in the universe. 
Galileo’s heresy was not that the earth was not at 
the center of the cosmos but that it moved. Men 
reason by analogy from the condition of their lives 
to the condition of the universe, and a static society 
could hardly believe in a dynamic cosmos. 

In the eighteenth century a change became felt 
in the condition of society as the influence of the 
industrial revolution spread. Social mobility be- 
came more common, and classes of parvenus 
acquired political and social power. 


“And it is a remarkable example of the confusion into 
which the present age has fallen” . . . says Sir Leices-
ter, . . . “that Mrs. Rouncewell’s son has been invited
to go into Parliament.” 

Miss Volumnia utters a little sharp scream. . . .
“Good gracious, what is the man?” 

“He is called, I believe—an—Ironmaster.” 

Charles Dickens, Bleak House, 1853 


The phase of bourgeois revolution had begun and 
from it developed bourgeois revolutionary science. 
As change became the rule and characteristic of 
society, catastrophism lost ground in natural sci- 
ence, and a uniformitarian principle of change took 
its place. It is surely no coincidence that Josiah 
Wedgwood, who began as a potter’s apprentice and 
ended as one of the great eighteenth-century mag- 
nates, was Charles Darwin’s maternal grandfather. 
Darwin's paternal grandfather, Erasmus, belonged
to the circle of new Midland industrialists: James
Watt, James Keir, Matthew Boulton, and, of course,
Wedgwood. Although from the middle class, Eras- 
mus was a self-made man and his son, Robert 
(Charles's father), emulated him by accepting 40 
pounds of his father’s money and building it into a 
respectable fortune with no further aid. 

The bourgeois revolution not only established 
change as the characteristic element of the cosmos 
but added direction and progress as well. A world 
in which a man could rise from humble origins 
must have seemed, to him at least, a good world. 
Change per se was a moral quality. In this light, 
Spencer's assertion that change is progress is not 
surprising. Moreover, for those still rising or hope- 
ful of improvement, there is a vested interest in 
the perpetuity of change, in a uniformitarian prin- 
ciple of replacement of the old by the new. 

The bourgeois revolution reached its peak in 
England in the Reform Bill of 1832 based on 
Bentham’s principle of the “greatest happiness for 
the greatest number,” while on the Continent it 
took the more violent form of the revolutions of 
1848. By the time Darwin published the Origin of 
Species, the ascent of the middle classes was com- 
plete and the supremacy of change and progress 
established in all the natural sciences except biol- 
ogy. The furor against Darwinism was only the last 
hopeless struggle of an already fatally wounded 
adversary. It is, of course, true that the principle 
of natural selection, converted by Spencer to the 
“survival of the fittest,” was used in the last half 
of the nineteenth century as a justification for 
laissez-faire practices. But it was only the borrow- 
ing of a metaphor to further justify a system al- 
ready in full operation. It would be quite wrong 
to propose that Darwinism was an effective agent 
promoting unlimited economic competition. 

Like all revolutions the bourgeois revolution gave 
way slowly to a period of consolidation, a period 
in which we still find ourselves. Once the new 
classes had gained power, it was clearly to their 
advantage to prevent the revolution from going 
further. The static hereditary society could hardly
be reconstituted, but in its place a system of dy- 
namic stability was erected. Change and social 
mobility are still accepted as characteristic of soci- 
ety, but it is a running-in-place rather than an 
overturn of the existing order. Liberal democracy 
of the twentieth century has a vested interest in 
maintaining the world social order but allowing in- 
dividuals, on the basis of relative competitive ability, 
to find their own place in the social structure. 

It is not remarkable, then, that evolutionary 
theories of the twentieth century are marked by a
concern for equilibrium condition and dynamic 
stability, a playing down of progressivist and per- 
fectionist elements, and a general reliance on the 
principle that plus ça change, plus c'est la même 
chose. In cosmogony there has been the rise of the 
steady-state theory of perpetual creation and also 
of the cyclic expansion-contraction model. Both 
are characterized by constant movement and 
change, but neither allows that the universe is 
going anywhere in particular. In thermodynamics 
and statistical mechanics there has been emphasis 
on the local rather than global nature of the law 
of increase of entropy. It is now admitted that 
entropy may decrease in other parts of the universe 
or at other times and that a global statement of 
the second law of thermodynamics may be too 
strong. In the realm of organic evolution, pro- 
gressivism has been entirely abandoned except for 
a few metaphysical writers. The direction in which 
evolution is supposed to lead, when a direction is 
admitted, is that of greater complexity and greater 
integration leading to greater stability. Modern 
students of evolution are preoccupied with dynamic 
stability and equilibrium in a global sense. The 
technical literature of evolutionary genetics is filled 
with reference to and studies of stable equilibria. 
This preoccupation would have seemed strange to 
the evolutionists of the nineteenth century, who 
saw, reflected in the process of organic evolution, 
the tendency toward a better world. The evolution- 
ist of the twentieth century presumably sees, in his 
view of evolution, “the best of all possible worlds.” 


R. C. LEWONTIN 
II
PRIMATE EVOLUTION 


When Darwin published the Origin of Species in 
1859, only a handful of fossil primates had been 
found and recognized as such. But during the last 
three decades of the nineteenth century a con- 
siderable series of fossil primates of Tertiary age 
were recovered in Europe and the Americas. The 
earliest scholarly attempts to analyze the course of 
primate evolution from the evidence of these fossils 
were made by Schlosser in Munich (1887), by 
Osborn in New York (1902), and by Wortman at 
Yale (1903-1904). Among the earliest monographic 
studies of early Tertiary primates were those of 
William K. Gregory of the American Museum of 
Natural History, who best summarized his views 
on primate evolution in a detailed review of the 
North American lemurlike primate Notharctus 
(1920). Only a little earlier Stehlin (1912-1916) 
at Basel had published a complementary review of 
European Eocene prosimians. 

Inadequacies of material and methods. There 
have been several factors holding back full analysis 
of the course of primate evolution. A major prob- 
lem results from their presumed early emergence 
in the tropical forests of the equatorial zones. 
Because Tertiary vertebrate fossils from these re- 
gions are poorly known, the early history of pri- 
mates remains relatively obscure. Primates of the 
past, like those of today, were apparently restricted 
to relatively warm climates, often to tropical for- 
ests, and prosimian remains dating after warm, 
early Tertiary times are not generally found in the 
northern continents. Exploration for sites in the 
tropics where Tertiary land vertebrate fossils do 
occur is being actively carried out today, and it is 
from these paleontologically little-known regions
that most important future additions to knowledge
of primate evolution may be expected. 

Most of the primate fossils from the early Ter- 
tiary (Paleocene and Eocene epochs) consequently 
are not found in the equatorial regions, where pre- 
sumably the mainstream of primate evolution has 
always been located. Therefore, reconstruction of 
primate history must, at present, be extrapolated 
in part from fossil evidence from marginal areas.
This gap in knowledge of ancient tropical faunas 
is somewhat lessened because we know that many 
extinct primates were wide-ranging species and 
that the higher Old World primates, particularly 
apes and pre-men, never diversified into a host of 
separate lineages, as did some groups of mammals, 
such as bats and rodents. 

Thus, differentiation of the main groups and 
their interrelationships can be reasonably well 
understood, even from the occasional sampling 
typical of our imperfect paleontological finds. This 
is particularly true for the late Tertiary relatives 
and ancestors of man. However, two theories, origi- 
nating outside the realm of paleontological evi- 
dence, appear to have been responsible for creating 
a contrary impression. Although the outlook is sel- 
dom specifically articulated, some students seem 
to believe that in order to produce the brain capac- 
ities of modern man there must have been many 
competing early hominid species. With this goes the 
assumption that the mental capacities of apes and 
those presumed for pre-men would have made them 
near masters of their environment and, therefore, 
so successful that many species would have arisen. 
These views, together with an overeagerness to
create new species and genera, led some early an- 
thropologists and a few modern students to the 
false conclusion that a great diversity of fossil apes 
and prehuman species once existed. A confusing 
welter of ill-founded names exists in the literature 
on such extinct primates. Those applied to Tertiary 
apes and hominids have recently been revised by 
Simons and Pilbeam (1965). Although over a thou- 
sand individual specimens of extinct great apes 
(pongids) and early hominids are now known from
European, Asian, and African Miocene-Pleistocene
deposits, only a few genera, Pliopithecus, Oreopith- 
ecus, Dryopithecus, Ramapithecus, and Giganto- 
pithecus, can be shown to be distinct and valid. 
Thus, the main types of close fossil relatives and 
forerunners of man (Hominoidea) that can con- 
vincingly be demonstrated to have existed in the 
past are about the same in number as the genera 
of this group that exist today: Hylobates (gib-
bon), Symphalangus (siamang), Pan (chimpanzee),
Pongo (orangutan), Gorilla (gorilla), Homo (man). 


As a consequence of the foregoing problems of 
approach, the study of primate evolution has been 
hampered. First, the early history of primates is 
obscured by inadequate knowledge of equatorial 
vertebrate faunas; and second, understanding of 
the later history of the particular group that in- 
cluded man’s ancestors has been confounded by 
the tendency to proliferate invalid genera and 


species. 


Origin of primates 

Undoubted primates first appear in the fossil 
record in North American deposits of early Paleo- 
cene age, which are more than sixty million years 
old. By middle Paleocene, the order had already 
diversified into three or four different main groups, 
or families. This would suggest that the initial 
separation of the order from primitive, insectivore- 
like stock was considerably earlier, perhaps in the 
late Cretaceous. Indeed, a species possibly primate, 
but based only on one tooth, has been reported 
from late Cretaceous beds in Montana. 

The best-known primates of Paleocene times 
appear to represent specialized side branches, 
which did not long survive the beginning of the 
Eocene epoch. Although not directly ancestral, 
these early species do indicate the starting point of 
the basic primate arboreal adaptation and give evi- 
dence of the gradual transition from nonprimate 
to primate. It may be that some of the less com- 
pletely known American Paleocene primates, such 
as Palaechthon and Plesiolestes, will eventually 
prove to be near the ancestry of one or more of the 
primate families of initial early Eocene appearance. 


Origin of major surviving groups 

Lemurs. At the beginning of the Eocene, pro- 
simians of lemurlike and tarsierlike aspect appear, 
nearly simultaneously, in the vertebrate assem- 
blages of Europe and North America but are as yet 
unknown elsewhere. Three major sorts of primates 
can be characterized. 

The most generalized of these early Eocene pro- 
simians are the loosely defined Adapidae, including 
such well-known forms as Pelycodus, and later 
Notharctus and Smilodectes in North America, 
and Adapis in Europe. Species of these genera, 
together with allied forms, apparently did not ad-
vance during the Eocene beyond the “lemuroid”
condition represented today by the Malagasy 
lemurs, whom they resemble both in limb-bone 
structure and in the structure of the ear region of 
the skull. However, dental patterns in notharctines 
and adapines do not show trends evolving toward 
those of modern lemurs; this appears to indicate 
that known Eocene species of Adapidae were not
directly ancestral to the living varieties of lemurs. 
On the other hand, the family Adapidae—now 
rather broadly defined—could well represent the 
family from which modern lemurs and possibly the 
lorises differentiated (Simons 1962a).

How and when the true lemurs reached their 
present limited range of distribution in the island 
of Madagascar is a most intriguing and nearly in- 
soluble problem on the basis of present evidence. 
If their introduction to this island was by way of 
the African continent, it might be expected that 
some evidence of true lemurs would by now have 
been recovered from the Egyptian Oligocene or east 
African Miocene deposits, both of which contain 
warm-climate forest faunas. Although small, evi-
dently arboreal primates have been found in some
abundance in these deposits in Egypt and east 
Africa, none show any significant similarities to 
the Malagasy lemurs. On the other hand, similar- 
ities between lorises and two European late-Eocene 
primates, Pronycticebus and Anchomomys, at pres- 
ent classified as Adapinae, suggest that the intro- 
duction of this group into Africa could have been 
by way of Europe. 

Tarsiers. Enough is now known of the crani- 
ology and dentition of the so-called “tarsioid” pri- 
mates to demonstrate the falsehood of the extrava- 
gant opinion of Wood-Jones that Hominidae are to 
be derived directly from Tarsius-like forms through 
an ancestral line not shared by the Old World apes 
and monkeys. What is perhaps more significant is 
that the Eocene tarsioids so far discovered could 
not have given rise to the common stem of mon- 
keys, apes, or men either, for they possess non- 
catarrhine specializations, and all had apparently 
lost one pair of lower incisors. All surviving mem- 
bers of Anthropoidea have retained these teeth. 
Nevertheless, these early tarsioids are of great in- 
terest because they presumably represent the gen- 
eral level of organization the unknown forerunners 
of the living higher primates must have reached 
as far back as the beginning of the Eocene epoch. 

The first Anthropoidea. Another major group 
of primates appeared in the early Eocene, in Europe, 
Asia, and North America, and is, with one or two ex- 
ceptions, restricted in temporal range to this epoch. 
(Gazin 1958 has reviewed the distribution of mid- 
dle-Eocene members of this family, together with 
other contemporary North American primates.)
Collectively, members of Omomyidae show greater 
resemblances to Anthropoidea than do other early 
prosimians. They could well represent the taxon 
from which all true Anthropoidea differentiated, a 
probability that is also sound zoogeographically, in 
view of their Holarctic distribution. Certainty on
this point must wait, however, until better knowl- 
edge of cranial and postcranial anatomy is avail- 
able for members of this family. Partial evidence, 
derived mainly from remains of a species of the 
North American middle-Eocene omomyid genus 
Hemiacodon gracilis, shows that this species can- 
not be regarded as “tarsioid” postcranially (Simp- 
son 1940). This observation, in turn, implies that 
Anthropoidea in its earliest differentiation may not 
have passed through a definably tarsioid grade of 
organization. Rather, the anatomy of both Eocene 
Necrolemur and Holocene Tarsius clearly indicates 
that so-called “tarsioids” and the Anthropoidea 
probably originated in the same segment of early 
Prosimii. Although serious gaps in the geologic 
record of primates prevent any very definite attach- 
ment of early Tertiary species successions to those 
of the later Cenozoic, it is likely that some known 
Paleocene and Eocene types, at least at the generic 
level, do pertain to the ancestry of living forms. 
Because climates were much more equable through- 
out the early Tertiary, with regions as far north as 
Montana and England supporting a subtropical
flora, we can suppose that many of these
Paleocene-Eocene genera contained species having a broad
north-south range of distribution. Some of the spe- 
cies we now know from northern areas must have 
had very close allies in the unknown southern 
faunas. An example of this sort of distribution is 
provided by one of the large Eocene herbivores. 
Skulls and jaws of species of the Wasatch panto- 
dont genus Coryphodon found in the Big Bend 
region of Texas, near the Mexican border, cannot 
be distinguished from remains of this animal found 
in northern Wyoming. This represents a north-south
separation of about one thousand miles; if
paleontological information on Wasatch faunas 
were available from Canada to Guatemala, this 
distribution could probably be extended even far- 
ther. Nevertheless, as with most attempts to trace 
phyletic lineages in given groups of fossils, the 
need to discover a great many more connecting 
links remains, and the search in the more nearly 
equatorial regions for early Cenozoic faunas con- 
taining primates is one of our major objectives. 

For the Oligocene two sets of primate data from 
equatorial regions have become available through 
recent research projects. 

The first of these projects has provided new evi- 
dence on the question of differentiation of cerco- 
pithecoid monkeys and hominoids. Discoveries 
made in Oligocene deposits in Egypt between 1961 
and 1966 by Yale expeditions under the writer's 
direction have added new data on the nature of the
initial appearance of these two superfamilies. A
jaw fragment of a new small primate, Oligopithe-
cus savagei (Simons 1962b), shows the typical low-
er dental formula of Miocene-Holocene Old World
Anthropoidea, in combination with what are the 
most primitively constituted lower molars known 
in this suborder. Premolar heteromorphy and 
slightly bilophodont molars, plus certain other 
characters, suggest relationships, on the one hand, 
to Eocene Omomyidae, and on the other, to Mio-
cene-Holocene cercopithecoids. Further finds of
this small mammal could strengthen the possi-
bility, which now rests almost entirely on the evi-
dence of this single fragmentary jaw, that cerco-
pithecoids arose from Old World omomyids. Miocene
monkeys are known from deposits in Egypt and 
east Africa. The early history of Old World mon- 
keys remains very poorly understood, and much of 
the known earliest material has not been described. 


The emergence of Hominoidea 


To date no significant information whatever on 
upper tooth structure of earliest (Eocene-Oligo- 
cene) African or Eurasian Anthropoidea has been 
published. This has been a serious gap in our 
knowledge because upper premolars and molars, 
being somewhat more complex than lower-cheek 
teeth, allow for more accurate appraisal of taxo- 
nomic affinity. Restudy of the previously poorly 
known upper molars of Pondaungia cotteri from 
the late Eocene of Burma, together with discovery 
by the recent Yale expeditions of several partial 
upper dentitions of Apidium and of isolated upper 
teeth of Aegyptopithecus and Parapithecus, has 
greatly added to the data available for considera- 
tion of the prosimian sources of the higher primates 
of the Eastern Hemisphere. Briefly, the dental evi- 
dence of these three early species bears on the 
crucial point of emergence of Hominoidea from 
Prosimii: the upper molars of Apidium and Pon- 
daungia show the three primary cusps of the trigon 
unconnected by ridges. The hypocone is large, and 
a pitted lingual cingulum is usually present. In 
both these genera several small accessory cusps 
occur in the trigon between the three main cusps. 
In their over-all upper molar morphology, species 
of these two genera equally resemble prosimians 
and higher primates. Cheek teeth of Apidium, par-
ticularly, also show similarities to those of Oreo-
pithecus. This may be due to retention of a com-
paratively primitive molar cusp pattern in the
otherwise advanced latter hominoid. Upper molars
of Propliopithecus, on the other hand, show the 
typical pattern characteristic of much later apes, 
near men, and men. However, the cingulum on the
inner side of the upper molars is large, which
strengthens the view that this is a primitive char- 
acter of ape dentitions. 

Pondaungia cotteri was described by Pilgrim 
(1927). This specimen was recovered from the 
Pondaung sandstone of Burma together with a 
small mammal fauna, which indicates a late 
Eocene age for this as well as for another primate, 
Amphipithecus. Comparisons based primarily on 
the anthracotheres of the Pondaung sandstone and 
those of the Fayum early Oligocene of Egypt sug- 
gest that the Burmese fossils may be earlier, but 
the question is by no means definitely settled. At 
present it seems best to regard Amphipithecus and 
Pondaungia as of late Eocene age, and since both 
appear to be hominoids, this indicates the upper 
limit of differentiation for this major group of pri- 
mates. As Colbert (1937) has already pointed out 
for Amphipithecus, both appear to show close ties 
with Pongidae. Tentative assignment of Amphi- 
pithecus and Pondaungia to this family does not 
seem questionable on grounds of their antiquity, 
inasmuch as a considerable number of presently 
existing mammalian families have now been traced 
back to the late Eocene. 

Pilgrim’s illustrations and comments on Pon- 
daungia left much to be desired, and in fact, on 
the basis of the information he provided, it was 
hardly possible to accept the species as belonging 
to the primates. As he mentioned, part of the 
source of his weak case lay in the fact that a web- 
like erosion of the enamel of the lower teeth in the 
type of Pondaungia had obscured their crown pat- 
terns. He supposed this to be also the case with 
the upper molars. Microscopic examination of this 
material indicates that, while the lower molars do 
appear to have suffered some erosive damage, the 
crenulations on the upper teeth represent the nat- 
ural surface except in one or two broken areas. 
This sort of crenulation is not unusual among pri- 
mates, being of frequent occurrence in Eocene 
Omomyidae and in Miocene-Holocene apes. In
conclusion, these features of the upper molars, 
together with some distinguishable details of the 
lower teeth and mandibles, suggest an assignment 
to the Pongidae. Indeed Pondaungia may not be 
far from the direct ancestry of such forms as 
Propliopithecus and its less well-known allies from 
the Oligocene of Egypt. A less likely possibility is 
that Pondaungia is an advanced omomyid primate 
with teeth paralleling those of early pongids. 

All finds of primates from the Old World Oligo- 
cene are distributed throughout some six or seven 
hundred feet of mainly continental sediments of 
the Qatrani formation, Fayum province, Egypt,
generally regarded as being of early Oligocene age.
The classic finds of Parapithecus, Propliopithecus, 
and Moeripithecus are from the lower part of this 
series, in the “Fossil Wood Zone,” while the type of 
Apidium came from about five hundred feet higher. 
The new species of Apidium and one of Proplio- 
pithecus have been found stratigraphically between 
the two earlier known levels. There are then three 
primate-yielding levels, of unknown age separa- 
tion. Enough time had elapsed between their suc- 
cessive depositions, however, to bring about evo- 
lutionary changes among the respective primate 
lineages represented. In December 1963 the Yale 
expedition recovered from the upper Fayum de- 
posits two new genera and species of primates. One 
of these, Aegyptopithecus, appears to be a good 
candidate for the ancestry of Dryopithecus, of 
Miocene-Pliocene age, and was possibly a fore- 
runner of all subsequent apes. The second new 
find, Aeolopithecus, resembles gibbons, living and 
fossil. 

Turning to a consideration of primates of the 
Miocene epoch, which began perhaps about 25 mil- 
lion years ago, there is more abundant information. 
From South American deposits, located mainly in 
Argentina and Colombia, a series of monkey species 
is known. These have most recently been discussed 
by Stirton (1951). At least partial skulls and jaws 
are known for species of three genera, Homunculus,
Dolichocebus, and Cebupithecia. These materials
show that by Miocene times South American
monkeys were structured much as they are today, 
and therefore they do not provide much evidence 
as to the origins of this group. Old World monkeys 
of Miocene age are known only from sites in east 
Africa and Egypt. The east African Miocene mon- 
key has not yet been named, but that from Egypt 
was given the rather unfortunate generic name of 
Prohylobates by Fourtau (Egypt 1918). 

Broadly contemporary with these monkeys are 
species of the Dryopithecus group of apes (includ- 
ing subgenera Sivapithecus and Proconsul), which 
are apparently close to the ancestry of the living 
African apes. Also represented in Miocene faunas 
of Europe and Africa are species of Pliopithecus, 
an ancient relative of the present-day gibbons and 
siamangs of southeast Asia. 

Toward the end of the Miocene epoch in Eurasia 
and Africa, perhaps about fourteen million years 
ago, species of Dryopithecus are contemporary 
with the oldest undoubtedly manlike primate, 
Ramapithecus punjabicus. Originally discovered 
in the Siwalik Hills of north India in beds of 
Miocene-Pliocene age, this species was named by 
Pilgrim in 1910, but he did not recognize that it 
belonged to a major new variety of primate and
so considered it a species of Dryopithecus. Lewis 
(1934) defined the genus Ramapithecus, to which 
Pilgrim’s species has subsequently proved to be- 
long. Lewis initially pointed out that Ramapithecus 
has many manlike features in the upper tooth 
series. Thus, it can reasonably be placed in Hominidae,
the taxonomic family of man (Simons 1963;
1964). To date, however, only parts of upper and
lower jaws of this animal have been found. Ap- 
parently Ramapithecus was a successful and wide- 
ranging primate. Outside north India, teeth and 
jaws just like those of Ramapithecus punjabicus
have been found at Fort Ternan in Kenya,
east Africa; in Yunnan, China; and just possibly 
in Europe. 

Apart from the fact that facial displays and diet
in Ramapithecus must have been more similar to
those of true man than to those of the apes (in view
of the reduction of the incisors, canines, and premolars
relative to the cheek teeth), little can be inferred about this
earliest hominid. It cannot be called man or human,
because there is no evidence that R. punjabicus
manufactured tools. Moreover, nothing is
known of its brain size and limb or body skeleton. 
After this we know little of fossil primates until 
the earlier Pleistocene. 


The Pleistocene and hominid evolution 


The Pleistocene was a time of rapid hominid 
evolution, during which toolmaking first appeared 
among the ancestors of modern man. In 1925 Dart 
described an infant skull that he named Australo- 
pithecus africanus. In spite of the name, which 
means “African southern ape,” Dart recognized 
Australopithecus to be a hominid belonging to the 
same taxonomic family as modern man, Homo 
sapiens. Nevertheless, his views were not generally 
accepted until a large number of fossil men or near 
men of similar age and structural type were re- 
covered by Dart, Broom, and Robinson in south 
Africa.

More recently, Australopithecus has apparently 
been identified in Java and in north and east 
African deposits. The proposed name Homo habilis, 
recently coined by Leakey and his associates 
(1964) on the basis of specimens from Olduvai 
Gorge in Tanganyika, covers mixed materials, some 
assignable to the prior south African species Aus- 
tralopithecus africanus and some to Telanthropus 
capensis and to Homo erectus. In the view of many 
students (Campbell 1964a), even the first two 
species are not distinguishable, both belonging to
the same small and gracile variety of early man. 
In addition to the species A. africanus, there is
a large and more specialized early hominid form
in east and south Africa, which should most cor- 
rectly be called Australopithecus robustus. Both of 
these species appear to have been habitual bipeds; 
the postcranial skeletons are similar to that of 
Homo sapiens and little like those of apes. Their 
jaws and teeth, although relatively large, are 
strongly reminiscent of teeth of later and better- 
known fossil men. However, in both species, known 
brain size was apparently little more than a third 
that of the average present-day Homo sapiens. The 
small brain size initially caused many students to 
place Australopithecus close to or with the apes. 
This early and erroneous placement of Australopithecus
has continued to affect balanced understanding
of its relationship to living man. Several
poorly known or juvenile Australopithecus
specimens have been made the “types” of new taxa, 
said to be more advanced and more like Homo 
sapiens. Both Telanthropus capensis and Homo 
habilis fall in this category. As Campbell (1964b) 
demonstrates, they have not been shown to be dif- 
ferent enough from each other or from members 
of Australopithecus africanus to warrant species 
identification. 

Clarification of these taxonomic problems is 
crucial to discussion of the course of the main- 
stream of the early evolution of man and is one of 
the prime areas for future advance in this science. 
Definition and delineation of species populations 
is requisite to understanding of the ancestral line- 
age of modern man. Unfortunately, most anthro- 
pologists and anatomists who have been the namers 
of fossil men and near men were not adequately 
trained to define taxa in harmony with the con- 
cepts of modern systematics and population ge- 
netics. That these inadequacies are still leading to 
confusion in understanding of human evolution 
was confirmed in the technically incorrect diag- 
nosis of Homo habilis. Such neglect of the new 
taxonomy and systematics has seriously affected 
understanding of primate and human history. In all 
discussion of earliest men and pre-men, mam- 
malian taxonomy is an area that needs much more 
sober scientific attention than it has been given 
so far. 

ELWYN L. SIMONS 


BIBLIOGRAPHY

CAMPBELL, BERNARD G. 1964a Just Another “Man-ape”?
Discovery 25, no. 6:37-38.

CAMPBELL, BERNARD G. 1964b Science and Human Evolution.
Nature 203:448-451.

CLARK, WILFRID E. LE GROS (1955) 1964 The Fossil
Evidence for Human Evolution. Rev. ed. Univ. of Chicago
Press.

CLARK, WILFRID E. LE GROS 1960 The Antecedents of
Man. Chicago: Quadrangle Books.

COLBERT, EDWIN H. 1937 A New Primate From the
Upper Eocene Pondaung Formation of Burma. American
Museum of Natural History Novitates 951:1-18.

EGYPT, SURVEY DEPARTMENT (1918) 1920 Contribution
à l'étude des vertébrés miocènes de l'Égypte, by René
Fourtau. Cairo: Government Press.

GAZIN, C. LEWIS 1958 A Review of the Middle and
Upper Eocene Primates of North America. Smithsonian
Miscellaneous Collections, Vol. 136. Washington:
The Institution.

GREGORY, WILLIAM K. 1920 On the Structure and Relations
of Notharctus, an American Eocene Primate.
American Museum of Natural History, Memoirs New
Series 3:49-243.

GREGORY, WILLIAM K. 1922 The Origin and Evolution
of the Human Dentition. Baltimore: Williams & Wilkins.

LEAKEY, L. S. B.; TOBIAS, P. V.; and NAPIER, J. R. 1964
A New Species of the Genus Homo From Olduvai
Gorge. Nature 202:7-9.

LEWIS, G. EDWARD 1934 Preliminary Notice of New
Man-like Apes From India. American Journal of Science
Fifth Series 227:161-179.

OSBORN, HENRY F. 1902 American Eocene Primates
and the Supposed Rodent Family Mixodectidae. American
Museum of Natural History, Bulletin 16:169-214.

PILGRIM, GUY E. 1910 Notices of New Mammalian
Genera and Species From the Tertiaries of India.
India, Geological Survey, Records of the Geological
Survey of India 40, no. 1:63-71.

PILGRIM, GUY E. 1927 A Sivapithecus Palate and Other
Primate Fossils From India. India, Geological Survey,
Palaeontologia indica New Series 14:1-24.

SCHLOSSER, MAX 1887 Die Affen, Lemuren, Chiropteren
usw. des europäischen Tertiärs. Beiträge zur Paläontologie
und Geologie Österreich-Ungarns 6:1-162.

SCHLOSSER, MAX 1911 Beiträge zur Kenntnis der Oligozänen
Land-Säugetiere aus dem Fayum: Ägypten.
Beiträge zur Paläontologie und Geologie Österreich-Ungarns
24:51-167.

SIMONS, ELWYN L. 1962a A New Eocene Primate Genus,
Cantius, and a Revision of Some Allied European
Lemuroids. British Museum of Natural History, Bulletin
7, no. 1:1-36.

SIMONS, ELWYN L. 1962b Two New Primate Species
From the African Oligocene. Postilla (Yale Peabody
Museum) 64:1-12.

SIMONS, ELWYN L. 1963 Some Fallacies in the Study
of Hominid Phylogeny. Science 141:879-889.

SIMONS, ELWYN L. 1964 On the Mandible of Ramapithecus.
National Academy of Sciences, Proceedings 51,
no. 3:528-535.

SIMONS, ELWYN L.; and PILBEAM, DAVID R. 1965 Preliminary
Revision of the Dryopithecinae (Pongidae,
Anthropoidea). Folia primatologica 3:81-152.

SIMPSON, GEORGE G. 1940 Studies on the Earliest Primates.
American Museum of Natural History, Bulletin
77:185-212.

STEHLIN, H. G. 1912-1916 Die Säugetiere des schweizerischen
Eocaens. Schweizerische Paläontologische
Gesellschaft, Abhandlungen 38:1165-1298; 41:1299-1552.

STIRTON, R. A. 1951 Ceboid Monkeys From the Miocene
of Colombia. California, Univ. of, Publications of the
Department of Geological Sciences, Bulletin 28, no.
11:315-355.

WORTMAN, J. L. 1903-1904 Studies of Eocene Mammalia
in the Marsh Collection, Peabody Museum,
Part 2: Primates. American Journal of Science Fourth
Series 15:163-176, 399-414, 419-436; 16:345-368;
17:23-33, 133-140, 203-214.


III
HUMAN EVOLUTION 


The modern or synthetic theory of evolution con- 
siders evolution to be the result of changes in the 
gene frequencies in populations. Gene frequencies 
are altered by means of mutation, selection, migra- 
tion, and certain chance factors such as drift. When 
mutations are considered relative to their useful- 
ness to the organism and to the population, muta- 
tions occur at random. They are not at random 
relative to the chemical structure of the organism, 
and mutations are commoner at some locations on 
the chromosomes than at others. Mutations are 
mostly disadvantageous, and their frequency is 
increased by radiation and by certain chemicals. 
Selection is the primary agent for bringing order 
to this process. Over the course of time, those 
genetic combinations that favor reproductive suc- 
cess become more common; and, conversely, those 
combinations that are less viable decline. It should 
be stressed that selection is for reproductively suc- 
cessful populations, and, from the viewpoint of evo- 
lution or of the species, the individual is only 
important insofar as he helps or hinders the suc- 
cess of the population. Variability itself is very 
important, as it permits the species to adapt to 
changing conditions; and selection is often for variability
rather than for homogeneity, which demands
a single, limited environment. Migration from one 
population to another may introduce new genes or 
change gene frequencies, and this means that genes 
that are favored by selection may spread through- 
out the species. Partial reproductive isolation per- 
mits adaptation to local conditions and the forma- 
tion of races, but in the long run it is the species 
that is the evolutionary unit. Especially when the 
number of individuals in the breeding population 
is small—as, in fact, was the case throughout most 
of human history—chance may cause the gene fre- 
quencies of one generation to differ from those of 
the preceding one. This has been called drift, and 
it can be a source of variability between small 
populations of a single species. If populations are 
founded by very few individuals, on the basis of 
statistical probabilities their gene frequencies are 
unlikely to be the same as those of the parent 
population. In summary, evolution is the result of 
changing gene frequencies. These changes origi- 
nate by chance and are ordered by selection. They 
are modified by migration and certain chance factors.
The process of natural selection has both
directly and indirectly favored highly variable pop- 
ulations, and there has been no tendency for evolu- 
tion to produce genetically uniform types. 
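
The interplay of selection and drift described above can be illustrated with a small simulation. What follows is only a minimal sketch under simplifying assumptions that are not part of the article: a single gene with two alleles, non-overlapping generations, and a next generation formed by drawing a fixed number of gene copies at random. The function names and the parameters `pop_size` and `fitness_advantage` are hypothetical and chosen for illustration.

```python
import random


def next_generation(freq, pop_size, fitness_advantage=0.0, rng=random):
    """Return the frequency of allele A in the next generation.

    freq: current frequency of allele A (between 0 and 1)
    pop_size: number of gene copies drawn to form the next generation
    fitness_advantage: relative reproductive advantage of allele A
    """
    # Selection: copies of A are weighted by (1 + advantage) before sampling.
    weighted = freq * (1.0 + fitness_advantage)
    expected = weighted / (weighted + (1.0 - freq))
    # Drift: the next generation is a finite random sample of gene copies,
    # so its frequency usually differs a little from the expected value.
    copies = sum(1 for _ in range(pop_size) if rng.random() < expected)
    return copies / pop_size


def simulate(freq, pop_size, generations, fitness_advantage=0.0, seed=0):
    """Follow the allele frequency over a number of generations."""
    rng = random.Random(seed)
    history = [freq]
    for _ in range(generations):
        freq = next_generation(freq, pop_size, fitness_advantage, rng)
        history.append(freq)
    return history


if __name__ == "__main__":
    # Same starting frequency, no selection: compare small and large populations.
    for size in (20, 20000):
        finals = [simulate(0.5, size, 50, seed=s)[-1] for s in range(5)]
        print(size, [round(f, 2) for f in finals])
```

In a sketch of this kind the small populations tend to wander far from the starting frequency while the large ones stay close to it, which is the sense in which drift matters chiefly when the breeding population is small. A founding event can be imitated by a single call such as `next_generation(0.5, 10)`, in which a handful of gene copies is drawn from a large parent population.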


Biological history 


The synthetic theory of evolution has many important
implications for social science, but it must
be remembered that the concepts most frequently 
borrowed by the social scientists long antedate mod- 
ern evolutionary theory. The greatest misunder- 
standing comes from the notion that evolution is 
directed toward a particular goal, frequently man 
himself, and that its course is determined by trends 
(orthogenesis). The reproductive success of popu- 
lations is determined by the conditions of the 
moment, not by ultimate desirability or by the re- 
mote future. Reptiles succeeded the amphibians 
because of structural, physiological, and behavioral 
advantages that led to numerical and adaptive suc- 
cess, not because reptiles were to give rise to mam- 
mals, one of which was to become man. The trends 
seen in evolution are descriptions of what has actu- 
ally taken place, and if a trend continues it is 
because selection in each generation has favored a 
particular course of events. When there are only 
a few fossils, the course of evolution may appear 
simple; but when the record is rich, biological his- 
tory appears as an incredibly complex web of adapt- 
ing organisms. 

These points are made particularly incisively by 
Simpson (1964). As he emphasizes, evolution is 
biological history. The mechanism is revealed in 
the geneticist’s laboratory, and the record is dis- 
covered in the rocks. But there are no laws in the 
usual scientific sense determining the course of evo- 
lution. In biological history there are no ultimate 
goals, inevitable trends, or vitalistic explanations. 

The contrast between thinking about evolution 
in terms of inevitable trends and as the result of 
selection may be illustrated by consideration of the 
length of the toes of man and his ancestors. Since 
human toes have become much shorter compared 
to those of any possible ancestral primate, it is 
often asked whether this trend will continue; and 
men of the future are pictured as having still 
smaller toes. But the length of human toes has not 
been determined by a trend which is continuing 
into the future, regardless of the circumstances.
When our ancestors became terrestrial animals to a 
greater extent, selection favored more efficient bi- 
pedal locomotion. Selection for shorter toes was a 
part of this process, but the fact that selection was 
for different proportions in the foot does not mean
that selection continues to be for even shorter toes.
Probably an equilibrium was reached more than a 
million years ago, and selection for shorter toes has 
not continued. There is no trend determining the 
foot proportions of the future: the notion of an 
inevitable man of the future with a huge brain, 
tiny face, small limbs, and so forth finds no support 
in modern evolutionary theory. Future gene fre- 
quencies will be determined by selection (i.e., by 
reproductive success), and man will become more 
intelligent only if those individuals with the com- 
binations of genes favoring the development of 
high intelligence leave more offspring. Evolutionary 
trends are statements of what has actually taken 
place in the history of forms of life, but there is no 
biological momentum that carries past trends into 
the future. 
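
The absence of biological momentum can be made concrete with a simple calculation. The sketch below is an illustration only and rests on assumptions not made in the article: a trait governed by a single allele, a population large enough for chance to be ignored, and a hypothetical selective advantage of 5 per cent that is later removed. The function names are invented for the example.

```python
def select(freq, advantage):
    """One generation of selection on an allele with the given relative advantage."""
    favored = freq * (1.0 + advantage)
    return favored / (favored + (1.0 - freq))


def trajectory(freq, advantages):
    """Apply a generation-by-generation schedule of selective advantages."""
    history = [freq]
    for advantage in advantages:
        freq = select(freq, advantage)
        history.append(freq)
    return history


if __name__ == "__main__":
    # 200 generations in which the allele is favored, then 200 with no selection.
    schedule = [0.05] * 200 + [0.0] * 200
    history = trajectory(0.01, schedule)
    print("after selection:", round(history[200], 3))
    print("after selection has ceased:", round(history[400], 3))
```

Under these assumptions the frequency rises only while the advantage is present; once the advantage is set to zero the frequency stays where it is, however long and steady the preceding rise may have been. The past trend describes what selection did and predicts nothing by itself.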

The study of human evolution is particularly 
beset with the notions of goal and trend and of evo- 
lution as a magic process. It might clarify thinking 
on the subject to omit the word “evolution” for the 
moment and to consider the history of the forms of 
life as revealed by the fossil record, the mechanism 
of change as shown in the laboratory, and the inter- 
pretations of these two kinds of data. The study 
of man’s biological history means precisely the 
same thing as the study of human evolution, and 
it does not carry the suggestion of inevitable trend 
or progress toward some desired goal. The state- 
ment that we may study human biological history, 
human cultural history, and the interrelation of the 
two should mean precisely the same thing as that 
we may study human biological and cultural evo- 
lution. Understanding comes from the study of the 
data, and no information is added by using the 
word “evolution” in preference to the word “history.” 

The importance of using modern concepts is 
well illustrated by the change in the meaning of 
the word “origin.” In human evolution “origin” has 
usually meant a relatively restricted time and place 
in which a new type arose. Thus one theory held 
that Neanderthal man arose in Europe, another 
that he evolved in central Asia. But if the popula- 
tions of the genus Homo were not reproductively 
isolated during the latter half of the Pleistocene 
period, mutations and gene combinations favored by 
selection might have spread throughout the whole 
species. The origin of the Neanderthal populations 
that inhabited Europe during the last interglacial 
period was not limited to Europe but depended on 
the extent to which European populations were in 
contact with populations in other parts of the Old 
World. In the traditional typological, local sense 
there really is no “origin” of Neanderthal but many 
origins at different times and places which became
incorporated in the European gene pool some 50 to
150 thousand years ago. The origins of Neanderthal 
were going on steadily over large areas for long 
periods of time, and the populations named Nean- 
derthal were in part the result of events (mutations, 
drift, selection, migration) taking place in other 
races of the species. 
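
The argument that a favored gene need not have a single local "origin" can be sketched numerically. The example below is a rough illustration under assumptions that go beyond the article: a row of local populations exchanging genes only with their immediate neighbors, a single favored allele that first appears in one population, and no chance effects. The function name and the `migration` and `advantage` values are hypothetical.

```python
def spread(demes, generations, migration=0.05, advantage=0.02):
    """Frequency of a favored allele in a row of local populations."""
    freqs = [0.0] * demes
    freqs[0] = 0.01  # the allele first appears in one local population only
    for _ in range(generations):
        # Selection acts within each local population.
        selected = []
        for f in freqs:
            favored = f * (1.0 + advantage)
            selected.append(favored / (favored + (1.0 - f)))
        # Migration: each population receives a fraction of its genes
        # from its neighbors (end populations have only one neighbor).
        mixed = []
        for i, f in enumerate(selected):
            left = selected[i - 1] if i > 0 else f
            right = selected[i + 1] if i < demes - 1 else f
            mixed.append((1.0 - migration) * f + migration * (left + right) / 2.0)
        freqs = mixed
    return freqs


if __name__ == "__main__":
    print("isolated:", [round(f, 3) for f in spread(5, 3000, migration=0.0)])
    print("in contact:", [round(f, 3) for f in spread(5, 3000, migration=0.05)])
```

With migration set to zero the allele remains confined to the population in which it arose. With even a small rate of exchange it comes to predominate in every population, so that the composition of any one local population depends in part on events that took place elsewhere in the species.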


Stages in human evolution 


The characteristics of the main stages of human 
evolution are based on the fossils and on the struc- 
ture and behavior of the primates that exist today. 
Both kinds of evidence are necessary to reconstruct 
the course of human evolution, and it is important 
to keep in mind the nature of the understanding 
that each kind of evidence can give. The fossils are 
the only direct clue to what the ancestors were like, 
the only evidence on many forms that have no close 
living relatives, and the only evidence as to the 
actual time of appearance of the various groups of 
primates. But the fossils are limited to those hard 
parts that do not easily deteriorate, largely jaws and 
teeth. Study of the contemporary primate forms 
supplements this record in two ways. The changes 
that are seen in the fossils may be interpreted more 
meaningfully if knowledge of the living animals is 
used. For example, changes in the form of the 
ethmoid bone may be directly related to changes in 
the sense of smell; or a particular form of limb 
bone may be interpreted as showing a special loco- 
motor pattern. In a quite different way the contem- 
porary forms may indirectly suggest characteristics 
that probably existed in extinct forms. For example, 
primitive mammals and contemporary prosimians 
have tactile hairs, scent glands, and a well-devel- 
oped sense of smell. It is highly probable that these 
conditions were general for all the primates of the 
first half of the Age of the Mammals, but there is 
direct evidence only on the sense of smell. It is 
probable that this whole complex was reduced at 
the end of the Eocene or the beginning of the 
Oligocene, but this may have happened much 
earlier or later, and different parts of the complex 
may have changed at different times. If one is in- 
terested in general statements about what hap- 
pened, the evidence is very good. But the more 
detailed questions one asks, the more the answers 
are limited to those that may be derived from teeth 
and bones. 

For the first half of the Age of the Mammals the 
ancestors of man were small, long-snouted pro- 
simians, not distinguished from the other primates 
of their time in any remarkable way. The prosim- 
ians of the Eocene were successful and highly 
diversified, and the group underwent a major adaptive
radiation resulting in the formation of many
distinct families. Many of the most successful 
groups evolved elongated incisor teeth; and if one 
examined only the Eocene primates, it might be 
concluded that these were to be the ultimately suc- 
cessful forms. But the majority of these Eocene 
primates became extinct after the true rodents ap- 
peared, and it may be that competition with the 
rodents was a major factor in their extinction. Un- 
fortunately, the fossil record is particularly scant 
at the end of the Eocene and at the beginning of 
the Oligocene, so it is impossible to determine just 
which of the families of the early primates are 
ancestral to the later ones. But the main changes, 
as outlined below, would be similar regardless of 
just which lineage is eventually proved to be the 
correct one. While maintaining hands and feet 
adapted for climbing by grasping and a primitive 
quadrupedal posture, some primates in both the 
New World and the Old World independently 
evolved a new organization of the special senses 
and the brain. The primitive sense of smell was 
reduced, and binocular, stereoscopic color vision 
evolved. The reduction of the sense of smell is 
directly reflected in the fossil bone. The changes in 
vision are inferred from the conditions found in 
contemporary primates and are mirrored, at least 
to some extent, in the structure of the bony orbit. 
The brain increased in size at least four or five 
times and changed in organization from a primitive 
dependence on the sense of smell to an organiza- 
tion based on vision as the dominant sense. The 
change from prosimian to monkey is primarily in 
the brain and special senses, but in addition the 
face became shorter and deeper. The special senses 
and the brain determine what aspects of the ex- 
ternal world can be perceived and appreciated and 
how sensations are organized. In a very real sense 
the world that we think of as normal (a stereo- 
scopic world of color, in which activity is diurnal) 
began with the monkeys of the end of the Eocene 
or the beginning of the Oligocene. Judging from 
contemporary prosimians, the increase in the size 
of the brain made a great difference in intelligence. 
But it must be remembered that this reorganization 
of the structural basis of experience was an adapta- 
tion to a particular kind of arboreal life which took 
place independently in both the New World and 
the Old World. Although this organization forms 
the basis for human experience, it initially evolved 
because it was useful to monkeys, not to prepare 
the way for man. 

Monkeys (Cercopithecidae) have remained quadrupedal,
but the apes (Pongidae) evolved a different
locomotor pattern. Apes climb by reaching far
up above the head and may hang by one arm, swing
below a branch, or reach to the side. This manner 
of locomotion in the trees probably evolved first as 
a way of feeding out near the ends of small 
branches. The structures that make this mode of 
living possible are complex and include a shallow, 
wide chest, a long clavicle, a special shoulder joint 
that involves modification of all bones near the 
joint, and changes in the elbow, wrist joint, and 
hand. The fossil record is exceedingly scanty, but 
this whole complex probably evolved during the 
Miocene, long after the separation between monkeys
and apes. At least, many monkeylike features
persist in the arm bones of such forms of early 
Miocene ape as Proconsul and Pliopithecus. Man is 
similar to the apes in this structure of the trunk 
and arms (including the form and numbers of ver- 
tebrae, the disposition of the viscera, the form of 
joints and muscles, and the proportions of the trunk 
and limbs). Again it must be remembered that this 
complex evolved for a special life in the trees, and 
it is shared by all the contemporary apes. Man has 
these features because his ancestors were arboreally 
adapted apes, not because there was a trend toward 
man. 

By the beginning of the Pleistocene, possibly two 
million years ago, the family of which man is the 
sole living representative (Hominidae) was repre- 
sented by a genus of bipedal, small-brained crea- 
tures (Australopithecus). The direct fossil evidence 
shows that the Hominidae must be at least two mil- 
lion years old; but Australopithecus already had 
small canine teeth of human form and a pelvis 
closely approximating that seen in man (Homo), 
and the foot differed from that of Homo only in 
details. Prior to the Pleistocene, the Hominidae 
must have been separated from the Pongidae for 
some substantial period of time during which these 
characters evolved. Fragments of jaws dating from 
the end of the Miocene in India (Ramapithecus)
and Africa (Kenyapithecus) suggest that the line 
leading to the Hominidae may have been distinct 
at that time; but since the rate of evolution may be 
very different for different functional complexes, 
there is no way to tell if these forms were begin- 
ning to be bipedal. It has been repeatedly shown 
that conclusions drawn from such fragmentary 
remains are likely to be wrong. [See EVOLUTION, 
article on PRIMATE EVOLUTION.]

Stone tools and animal bones have been found 
with the remains of Australopithecus; and it is 
probable that these creatures made the tools and 
hunted small animals, just as from their anatomy 
it appears that the behavior of Australopithecus was 
far more human than apelike. There were at least
two species, A. africanus and A. robustus, a small
one and a large one. Judging from the evidence of 
the teeth and from the associated archeological 
remains, both probably made tools. The small spe- 
cies may have been directly ancestral to the genus 
Homo of the Middle Pleistocene. The large form
lived at the same time as the small one and con- 
tinued on to be a contemporary of Homo before 
becoming extinct. As man is approached, there is 
a tendency to label each specimen as distinct and 
to emphasize differences instead of similarities. The
result is a multiplication of species and genera the 
more the forms resemble man. Actually, the reverse 
should be the case. The more a primate is bipedal, 
tool-using, and hunting, the less likely the form is 
to speciate and the more likely it is to occupy wide 
areas with only racial differences. 

By the Middle Pleistocene men of the genus 
Homo were fully bipedal, and there is every indica- 
tion that their locomotor system had evolved to 
virtually its present form. Brains were approxi- 
mately twice the size of those of Australopithecus. 
Tools were made according to complex traditions 
of manufacture, which were widely distributed geo- 
graphically. Large animals were hunted, and fire 
was used. This fully human way of life appears to 
have been established by half a million years ago, 
but it changed very slowly. Approximately fifty 
thousand years ago men of modern form appeared. 
Undoubtedly evolution continued, and the popula- 
tions adapted to local conditions by biological as 
well as cultural means. But the main events of 
human evolution had taken place before that time, 
and culture became increasingly more important 
in human adaptation. With agriculture and particu- 
larly with modern science the whole pattern shifted, 
numbers increased from a few millions to billions, 
and recently it is man who has altered the world. 
Mutation, selection, and migration are changed by 
the human way of life and at least in part may be 
brought under human control.

Since the fossil record is so fragmentary, it is 
important to note that the latest biochemical and 
cytological evidence supports the classification of 
the primates and the general stages of human evolution
noted above. Immunochemical studies (especially
Williams 1964 and Goodman 1963) show
that man’s closest living relatives are the great apes, 
especially the chimpanzee and the gorilla. The small 
apes, the gibbons, are much less similar both in 
their immune reactions and in chromosome num- 
ber and types. The Old World monkeys are still 
further removed, and the prosimians are both very 
different and highly diversified. For the first half 
of the Age of the Mammals man’s ancestors were 
prosimians whose primary adaptation was climbing
by grasping. At the end of the Eocene a monkey- 
ape group evolved, characterized by changes in the 
brain and special senses. This group diversified in 
the Oligocene, and the special ape locomotor and 
feeding patterns evolved in the Miocene. A human 
bipedal group separated later and was fully evolved 
in locomotor and dental characters by the begin- 
ning of the Pleistocene, some two million years ago. 

It should be stressed that the Eocene prosimians 
evolved into many different forms, most of which 
became extinct. Subsequently, monkeys and apes 
evolved into dozens of different genera. No general 
trend dominates the evolution of the primates, and 
most of primate evolution has no relation to that 
of man. 


Evolution of behavior 


The direct evidence for the course of human evo- 
lution outlined above is fragmentary and limited 
primarily to teeth and jaws. Changes in locomotor 
patterns are directly reflected in the bones, and in 
a general way the stages outlined above are prob- 
ably correct; but so few limb bones are preserved 
that a very wide variety of interpretations is pos- 
sible. Turning to the evolution of other behaviors, 
the evidence is even more indirect and is based pri- 
marily on the contemporary forms. The general 
logic in the use of information from the contem- 
porary primates is that if a structure or behavior is 
common in a group of living primates it was prob- 
ably present in closely allied fossil forms. Both 
parallel evolution and convergence may render such 
conclusions invalid; and since it is highly adapt- 
able, behavior is particularly liable to these sources 
of error. An example may make the situation 
clearer. Primitive mammals probably had a litter 
of several young. In the primates the number of 
young is usually reduced to one at a time, and that 
infant is carried by the mother. The general kind 
of change is clear, but the details cannot be deter- 
mined from the record. Marmosets do have twins 
and galagos more than one young. It is probable 
that the reduction took place several times, and it 
is possible that the ancestral apes had more than 
one young. But the most probable interpretation is 
that in arboreal primates a single infant carried by 
the mother was more likely to survive, and that this 
behavioral adaptation had evolved in the prosimians 
prior to the evolution of the later primate forms. 

The human infant is remarkable in being un- 
able to hold onto its mother, and this not only alters 
the pattern of human mother-child relations but 
introduces a division of labor and many social 
problems that are unique to man. George B. Schaller 
(1963) observed that the female gorilla also must
help her infant for the first six weeks. Thus 
man is a little less different than had been thought, 
and perhaps the particular group of apes from 
which we are descended were more like gorillas in 
this regard than like all the other apes and mon- 
keys. The matter cannot be proved one way or the 
other, but it is most likely that the main difference 
between man and the other primates came with the 
evolution of large brains in the Pleistocene, long 
after the human lineage had separated from the 
other apes. 

A problem exists in the examples that follow 
which is similar to that encountered in our con- 
sideration of the number of young in a litter and 
the degree of dependency of the newborn: that is, 
comparison of the living forms highlights a situa- 
tion of importance in the behavior of man, but the 
time of origin of the behavior can be suggested in 
only the most general way. However, many impor- 
tant aspects of human evolution can be appreciated 
only in this way. For example, length of life has 
increased greatly in man. In small primitive mam- 
mals maturity is a matter of months and old age 
of two or three years. Comparable figures for mon- 
keys are maturity in three or four years and a life 
span of well over twenty years. In chimpanzees 
full maturity is in approximately eight to ten years, 
and the life span is more than forty years. Com- 
parable figures for man are nearly twice as long. 
Although there is much variation, the time during 
which the human young enjoy a protected and priv- 
ileged position has increased, and a human of eight 
or ten is still learning and playing at an age when 
most primates would be fully adult. Clearly, selec- 
tion has favored the long period of development and 
learning, even at a great biological cost. It might 
appear that a process based on reproductive success 
would favor several young and rapid growth, but 
in the human way of life selection has been for a 
single infant, growing slowly. Prolonged depend- 
ency and the presence of experienced adults have 
been major factors in the evolution of human 
society. 

In monkeys and apes there is a menstrual cycle 
of approximately a month in duration. Sexual ac- 
tivity is concentrated in a period of estrus close to 
the time of ovulation. Females do not come into 
estrus for a period of some months during the later 
part of pregnancy and the first part of lactation. 
These physiological facts have the effect of con- 
centrating sexual activity when conception is most 
probable and of spacing the infants. The spacing 
may be further reinforced by a breeding season, or 
at least a much greater frequency of births in one
part of the year. In the human female the basic
cycle continues, but estrus behavior has been lost. 

Human females experience a period of lowered 
fertility during early lactation which is probably 
comparable to that in many monkeys. However, this 
period is not long enough for the requirements of 
human infant spacing; the human infant cannot 
feed itself when it is six months or even a year old. 
In man the spacing of infants must be extended by 
taboos and customs which supplement the physio- 
logical mechanisms of birth spacing in the non- 
human primate. The loss of the estrus cycle in the 
human female may be related to the economic di- 
vision of labor and the interdependency of the 
sexes, which is unique to man. This loss may be
due to the need of the human female to keep the 
interest of the food-sharing male; but there may 
well have also been selective pressure against estrus 
behavior, which would have been disruptive to the 
stable, interdependent relationships of a family 
unit. 

The monkeys and apes are almost entirely vege- 
tarian, although most will eat birds’ eggs and in- 
sects. Hunting of small mammals has been ob- 
served rarely in chimpanzees and baboons. The 
range in which monkeys and apes forage is small, 
varying from much less than a square mile in gib- 
bons to some fifteen square miles in baboons and 
gorillas. From a human point of view it is remark- 
able that animals with keen eyesight, who are capa- 
ble of climbing into trees and surveying the scene 
and who are well adapted for locomotion on the 
ground, so restrict their normal activities. It is prob- 
able that hunting is the behavioral adaptation that 
caused the change in man’s relation to his physical 
environment. Intensive hunting would drive game 
from a small range, and the location of game and 
the pursuit of wounded animals would lead to the 
establishment of large territories. In most non- 
human primates ranges are not defended, and the 
areas occupied by groups of monkeys or apes 
usually overlap. Human defense of territory may 
also be the result of hunting. 

Tool use is one of man’s most distinctive attri- 
butes, and the skillful use of objects is unique to 
man. Manipulative skills depend on the brain as 
well as on the hand, and the large area in the 
human brain devoted to the hand is probably 
the result of the new selection pressures that came 
into being with the beginnings of tool using. Both 
the evidence of the teeth and of the associated tools 
and animal bones suggest that Australopithecus 
was a tool user and had been so for a long time. If 
this interpretation is correct, then the increase in 
size of the human brain came long after the use of
tools and probably in response to the new ways
of life that tools made possible. 

Intelligence and learning are not general but are 
related to specific abilities. For example, human 
children play with objects as well as with other 
children. They enjoy practicing using and throw- 
ing, and many games are built around objects. In 
marked contrast, monkeys and apes are tool-dumb, 
so to speak. Tasks that the human child easily 
masters are very difficult or impossible for non- 
human primates. Only the chimpanzee uses some 
minor tools (Goodall 1964). The same principle 
can be seen in language. In spite of major efforts, 
it has proved impossible to teach monkeys or apes 
to speak. This is because the nonhuman primates 
lack the neural mechanisms necessary for speech. 
The sounds of the nonhuman primates convey 
emotion (such as fear) and the location of the call- 
ing animal, and they are important especially when 
combined with gestures in social interaction. The 
sound systems of primates are not more complex 
than those of many other mammals; and, like other 
mammals, primates can be trained to respond to 
human sounds. The distinctive character of human 
speech is the naming of objects, and this requires 
the linking of visual and auditory parts of the cor- 
tex of the brain (Geschwind 1964). The necessary 
connections are not present in the brains of the 
nonhuman primates. Once naming of objects had 
commenced in even the most minor way, the suc- 
cess of this revolution changed selection pressures 
so that the course of evolution of the brain changed, 
and structures evolved that ultimately made pos- 
sible language as we now know it. It is likely that 
the situation which led to object naming was tool 
use, and it is the uniqueness of this combination 
of tools and language to man that accounts for 
why the other primates did not develop even the 
simplest languages. If language began at the time 
the brain was doubling in size between Australo- 
pithecus and Homo, this would give a minimum 
of two million years of stone toolmaking by small- 
brained bipeds as a time in which the first naming 
of objects might have occurred. It is tempting to 
attribute this great increase in brain size to lan- 
guage and all the ways of life that language made 
possible. But it must be remembered that it is not 
just increase in size that separates the brain of 
modern man from that of the contemporary apes. 
There have been changes of organization in the 
brain too, and without these changes skills in tool 
use, language, and social planning would be im- 
possible. Human intelligence is built on specific 
abilities which are the products of the evolutionary 
process. 


But the world in which man evolved was a very 
different one from that in which we are living to- 
day. Our bodies had evolved to practically their 
present form some fifty thousand years ago, and 
since then human adaptations to the environment 
have been increasingly by technology and custom. 
This does not mean that evolution has stopped, 
but it does mean that the direct interrelation that 
selection had forged between man and his ways of 
life is no longer functional. As stone tools improved 
over vast intervals of time, the biology of the users 
had time to evolve along with their way of life. In 
the human head we see the product of the inter- 
relations of biology and a succession of ways of life 
in which selection was for smaller faces and bigger 
brains. But since the agricultural and scientific 
revolutions customs have changed so rapidly that 
there has been no time for corresponding biological 
evolution to fit the human actors for the modern 
world. Human biology evolved to be adaptive in a 
world of small society, great hazards, and personal 
skill. The human actor in modern society is too 
aggressive, too dominance seeking, too acquisitive. 
The kind of planning necessary in the modern 
world is difficult for an organism built along the 
lines of Homo sapiens. Many acts which now 
would be judged undesirable (acts of selfishness, 
cruelty, and war) are easily learned because they 
are in accord with basic human biology (Hamburg 
1963). 


Natural selection can bring about adaptation 
between biology and a way of life only in very long 
periods of time, and there is no orthogenesis carry- 
ing trends of the past into the future. The fit be- 
tween organism and society must now be deter- 
mined by science, and for the first time in all of 
biological evolution both biology and social life can 
be planned. But planning can be more efficient if 
planners remember that the actors in modern tech- 
nical society are products of the past, of times and 
ways of life long gone. 


S. L. WASHBURN AND JANE B. LANCASTER 
IV
CULTURAL EVOLUTION 


The grand movement of origin, transformation, 
and differentiation of our universe, our earth, and 
life itself is called evolution. Within this total proc- 
ess we are concerned with the transformations that 
occurred when the biological, or organic, phase 
arose out of the inorganic and when the later, cul- 
tural phase arose from the organic. Despite their 
interconnections, however, each of these stages has 
its own characteristic mode and tempo of evolution. 
The cultural phase transcended the organic and
inorganic when populations of men created new 
ways of adapting to each other and to the environ- 
ment. These adaptations occurred after certain 
gradual changes in the size and complexity of the 
hominid forebrain made symbolic thought and 
communication possible. The capacity for, and use
of, symbolic manipulation brought forth unprece- 
dented kinds of social behavior. These new ways of 
behaving were suprabiological in the sense that 
such natural primate characteristics as jealousy, 
fear, sex and food appetites, and so on, were so 
often channeled, sublimated, or otherwise altered 
by means of social rules. In a few striking re- 
spects, in fact, the new modes of social behavior 
were contrabiological inasmuch as they actually 
repressed such powerful urges as the sexual, for 
example, and required sharing rather than com- 
peting for scarce food. 

The sum total of the social and political rules, 
technological inventions and economic institutions, 
the arts, shared beliefs and practices—that is, the 
culture—tends to persist through time because any 
particular society maintains these parts integrated 
with each other and with its environment. Evolu- 
tionary changes, therefore, do not correspond to a 
single world-wide pattern, and each society main- 
tains a certain distinctiveness in the course of 
change. Thus the culture of mankind generally is 
an evolutionary stage in the universal process, 
while particular societies differentiate into cultural 
genera and species, creating heterogeneity. Some- 
times this adaptive process brings forth striking 
advances that permit greater dominance, all-round 
adaptability, and growth. 

All anthropologists would agree that the earliest 
human societies must have been small and simple 
in social organization, poor in technological equip- 
ment, without formal legal or governmental insti- 
tutions, and with an ideology based more on the 
supernatural than on science. Since these char-
acteristics contrast greatly with modern industrial
states, we think of evolution as directional: gen-
erally from small to large societies, from simple to
complex organizations, from informal to formal 
political institutions, and so on. The idea of direc- 
tionality is important because it provides the cri- 
teria for classifying separate societies into general 
stages of the evolution of culture as a totality.

A second characteristic of evolution is the re-
latedness of the sequence of forms. An important
aspect of the interpretation of any particular unit 
is the investigation of the ancestral forms from 
which it “unfolded”—its phylogeny. 

Not many social scientists are evolutionists, and
even the anthropologists who agree that there has
been general evolutionary growth disagree about 
whether there has been enough orderliness in the 
process for the theory to be useful in classifying 
cultures or in interpreting a purely historical suc- 
cession of discrete events. But then, of course, even 
if the orderliness is agreed upon, it is natural to 
want to know what causes it, and here further dis- 
agreement arises. Most of the different conceptions 
of the nature and causes of cultural evolution 
arose in the eighteenth and nineteenth centuries, 
and a brief historical sketch is useful in describing 
them. 


History of the concept 


As primitive societies in various parts of the 
world became known to Europeans during the age 
of discovery, two different explanations of their 
primitiveness were offered. The most widespread 
belief was theological: they had “degenerated” 
further from an original state of grace than had 
civilized peoples. The other, the rationalist explana- 
tion that became usual among intellectuals in the 
seventeenth and eighteenth centuries, was that 
civilization had evolved from earlier primitive types 
that must have been similar to the culture of con- 
temporary savages and barbarians. There had been, 
to be sure, evolutionary notions held by philoso- 
phers of the classical traditions—by the Greek 
Epicurus and the Roman Lucretius, for example— 
but modern ideas about cultural evolution were 
propounded most influentially by Turgot, Montes- 
quieu, Rousseau, Condorcet, Helvétius, Diderot, 
and others in France; Kant and Herder in Ger- 
many; Vico in Italy; and Hume, Hobbes, and Fer- 
guson in Britain. 

As opposed to degenerationism, all of these 
writers held to a theory of progress. Although 
modern anthropologists have commonly thought 
that this theory was merely a happy giddiness in- 
duced by the great economic and political advances 
of the period, most European scholars were actu- 
ally pessimistic about progress in the future. While 
all agreed that progress had taken place in the past, 
and that discernible, orderly stages of its evolution 
could be demarcated, only Kant, Turgot, and Con- 
dorcet thought that progress was inevitable. There 
were variations in the names and numbers of 
stages that were proposed, but most accepted either 
Turgot’s stages of hunting, pastoralism, and farm- 
ing, or Montesquieu’s similar typology of savagery, 
barbarism, and civilization. 

Although the modern holistic concept of culture 
was lacking, the elements that were seen to be 
evolving were the human institutions of which cul-
ture is composed. There was one emphasis in the
evolutionary theory that was peculiar to the times:
rationalism. Human institutions were viewed as
products of the human mind (as indeed in some 
sense they must be), a mind that was mistaken or 
irrational in the past, but increasingly less super- 
stitious and more reasonable as time went on. Since 
civilized man had literally thought himself out of a 
“state of nature,” the degree of orderliness that lay 
in human history was due to the progressive 
improvement of mentality. 

In the nineteenth century, evolutionary thought 
became less philosophical and more influenced by 
empirical aspirations. Tylor in England and Mor- 
gan in the United States were prominent in the 
creation of ethnological evolutionism; Spencer in 
England, Saint-Simon, Comte, and Durkheim in 
France were pioneers in sociology; the reworking of 
Hegel by Marx and Engels was a creative contribu- 
tion to political-revolutionary theory. They differed 
from their predecessors and from one another 
mainly with respect to the following problems: 
(1) What is it that evolves—culture in general or 
only the institutions of specific societies? (2) How 
does it evolve—by orthogenetic, inevitable progress, 
by rational thought and intention, survival of the 
fittest, or an unconscious dialectical struggle? (3) 
Where is the locus of the evolutionary impulse— 
in the improvements of technology and the ma- 
terial aspect alone, in the division of labor, the 
political or ideological aspect, or as a force in the 
cosmos? 

The nature of culture. The ethnologists of the 
late 1800s, convinced of orderliness in evolution, 
directed themselves to the delineation of stages of 
culture, which E. B. Tylor defined in the anthro- 
pological sense in 1871. L. H. Morgan (1877) did 
not use that word, but in his usage, “society” and 
“ethnical periods” (for stages) referred essentially 
to the subject matter of Tylor’s concept. Tylor and 
Morgan, and most ethnologists of their time as well, 
were concerned with the world-wide manifestation 
of cultural stages, not with the culture of a par- 
ticular society. A specific primitive tribe was of 
interest only as an illustration of aspects of the 
culture of an entire stage and of a large geo- 
graphical area. 

This concern with culture in its most general 
sense rather than with particular societies caused a 
misunderstanding of the ethnological evolutionists 
by later commentators. Modern anthropologists fre- 
quently criticize the nineteenth-century evolution- 
ists as “unilinear,” meaning that the latter believed 
that all societies inevitably progress through the 
same stages. This would be a powerful blow at 
nineteenth-century evolutionary theory, if it were
true, but it seems doubtful that any of the evolu- 
tionists believed such manifest nonsense. In the 
statement that seems best to serve modern critics, 
Morgan said: 


Since mankind were one in origin, their career has 
been essentially one, running in different but uniform 
channels upon all continents, and very similarly in all 
the tribes and nations of mankind down to the same 
status of advancement. It follows that the history and 
experience of the American Indian tribes represent, 
more or less nearly, the history and experience of our 
own remote ancestors when in corresponding con- 
ditions. ([1877] 1964, pp. 6-7) 


This sounds “unilinear” to a modern ethnologist, 
whose concern has been restricted to the structure 
and functioning of unit systems, but inasmuch as 
ethnologists were not making such studies in the 
nineteenth century, it seems apparent that Morgan 
must have meant nothing more than that wherever 
barbarism (defined by the traits of horticulture or
pastoralism) was found, a general stage of
hunting-gathering society (savagery) had preceded it and
that stages of both had preceded civilization on 
continents that had achieved civilization. Such a 
judgment is attested by archeology now as well as 
by common sense and should evoke no comment, 
but in Morgan’s day it was worth stating because 
theories of degeneration and catastrophe were still 
commonly opposed to evolutionism. 

The sociologists tended toward the organismic 
model for society. A society was thought of as a 
contained unit made up of interdependent parts, 
each subserving the others. Evolution was seen as 
the development of more parts and greater differ- 
entiation of them. The “parts” are individuals, 
groups, and specialized persons and groups. Re- 
ligions, morals, and political, social, and economic 
institutions function largely to bolster, integrate, 
and smooth the relations between the social parts. 
This early model, refined as nonevolutionary struc- 
tural functionalism, became characteristic of mod- 
ern American sociology and British social anthro- 
pology but had its roots in the eighteenth-century 
concern with progress. 

The third group, the Marxists, was closer to 
ethnology than to sociology, at least in its begin- 
ning phases under Marx and Engels. In fact, Engels’ 
Origin of the Family, Private Property and the 
State (1884) was inspired by Morgan’s Ancient 
Society (1877) and borrowed heavily from it. The 
theory of general developmental stages was the 
same, and the conception of the evolving unit did 
not have the organismic particularism of the soci- 
ologists. As in Morgan’s case, the concept of culture 
was absent but would have been appropriate, for
institutions (especially technological and economic) 
were not merely subserviently integrative in func- 
tion but had more of an initiating “prime mover” 
status than the sociologists believed. 

How does culture evolve? The eighteenth-cen- 
tury evolutionists thought of the progressive im- 
provement of the human condition as a mentalistic 
evolution and thus took an idealist view of the evo-
lutionary process. Some of the language of nine-
teenth-century ethnological evolutionism reflects
this, so that we find such expressions as Morgan’s 
“growth of the idea of government” and Tylor’s 
frequent use of “mind” and “mental life” as near 
synonyms of many aspects of culture. But a very 
important change occurred in the nineteenth- 
century view of evolution. As it became more scien- 
tifically oriented it posited causal and functional 
connections between different aspects of culture. 
Tylor, in his greatest book, Primitive Culture 
(1871), devoted much of the introductory chapter 
(“The Science of Culture”) to describing not only 
cause-effect relations in culture but also the de- 
termining of these cultural relations and the 
thought and will of the individual. The ethnological 
school of evolution thus made significant moves 
toward determinism and against assumptions of 
free will in human affairs. 

The “how,” the mechanics of the evolutionary 
process, was not explicitly described, beyond the 
suggestion that it was unconscious. We are told 
that new elements tend to replace older ones if they 
are better, sometimes, but beyond that one has the 
impression that evolution was taken as a “given,” 
that orthogenetic forces had moved mankind ever 
upward, however fitfully. 

There is a recurrent note reminiscent of the 
eighteenth-century rationalist ancestry: “Now that 
we understand evolution we can more consciously 
control it.” Tylor called anthropology a “reformer’s 
science,” and Morgan said, “The time will come . . . 
when human intelligence will rise to the mastery 
over property... .” This was the deterministic para- 
dox: We can scientifically analyze the evolution 
of culture because it is orderly (because it is de-
termined); knowing this we can somehow influence
the future as we pass, as Tylor put it, “from the age 
of unconscious to that of conscious progress.” 

The sociological wing of nineteenth-century evo- 
lutionism pursued the implications of the biological 
analogy. The rationalist optimism of Condorcet was 
outdated by the obvious attendant evils of indus- 
trialization, especially in England, as illustrated by 
Malthus’ pessimistic Essay on the Principle of Popu-
lation. After the great intellectual success of Dar-
win’s theory of selection by survival (itself sug-
gested by Malthus’ essay), a theory of “social
Darwinism” arose, which, whatever its demerits, at 
least provided a “how” for the evolutionary process: 
as a result of conflict between societies superior 
ones replace the inferior. Some added an “internal 
conflict” aspect: out of the struggle between classes, 
groups, and even individuals within the society 
comes the improvement of the society. Walter 
Bagehot, Auguste Comte, Herbert Spencer, and 
Ludwig Gumplowicz were the leaders in this mode 
of thought. 

Marx and Engels were even more consistently 
deterministic and materialistic than the ethnolo- 
gists and had a much more definite theory of the 
mechanics of evolution. This theory was ortho- 
genetic in that the impetus for change came from 
within the society, from the “dialectic” of the class 
struggle, the resolution of contradictions in terms 
of either failure or a higher unity. It should be 
noted that this internal-conflict theory is not like 
social Darwinism: the ruling or propertied class is 
not superior. Marx and Engels, like the ethnolo- 
gists, were insistent on the lawful, determined 
nature of evolution, and they also believed that evo- 
lution could be oriented by the conscious action of 
man once he understood its processes. Then, by 
abolishing the capitalist form of production, “Man, 
at last the master of his own form of social organ- 
ization, becomes at the same time the lord over 
nature, his own master—free” (Engels [1882] 
1935, p. 75). 

Most of the sociologists became less interested 
in evolution itself than in the more immediate 
problems of the organismic nature of a society, 
particularly that of integration. What holds a so- 
ciety together? Following the social contract theo- 
ries of the eighteenth century, there were psycho- 
logical theories, mental interaction theories, and 
imitation theories, all of which took society to be 
an organismic entity somehow mentally consti- 
tuted. This was the time when sociology became 
nonevolutionary, as it mostly remains to this day. 

The locus of the evolutionary impulse. Morgan 
thought that cultural evolution consisted of two 
distinct aspects—“inventions and discoveries” (the 
technical order), which evolve in connected, pro- 
gressive, cumulative relations to one another, and 
“institutions” (the forms of the family, of govern-
ment, religion, architecture, property), which stand
in “unfolding relations.” By this expression he 
meant that social institutions originate in a few 
“primary germs of thought” and thereafter inde- 
pendently change form as well as replace previous 
forms. Tylor also, although not so explicitly,
thought that technology, science, and other aspects
of material culture undergo evolution rather inde- 
pendent of religion and “intellectual and moral” 
progress. Nowhere is it demonstrated, however, 
that one of these aspects is the “prime mover” and 
the other a dependent variable or superstructure. 
But again it should be remembered that Morgan 
and Tylor were not talking about the process of 
systemic change in any particular society, hence 
the matter of functional priority of one part over 
another simply did not concern them. 

The sociologists, preoccupied with social integra- 
tion, psychology, and mentalism, did not see the 
initiating locus of evolutionary change in any 
aspect of culture at all. Spencer and his followers 
saw evolution as a grand cosmic force that gen- 
erated complexity out of simplicity and heteroge- 
neity out of homogeneity, aided somewhat by Dar- 
winian “conflict and survival.” A few Frenchmen, 
most notably Émile Durkheim (1893), posited that
the division of labor in society, like the functional 
specialization of organs in biological entities 
(again, the organismic model), is related to popu- 
lation increases, greater social density, and larger, 
stronger societies. But it is not clear what the 
causes of these developments are, and Durkheim 
explicitly denied that the division of labor is in- 
creased for utilitarian reasons like “the greatest 
happiness for the greatest number” or by any other 
kind of intention or plan. 

The Marxians, on the other hand, were firm in 
the conviction that the locus of evolutionary change 
lies in the technoeconomic (or material) sector, 
which then affects the nature of the social classes 
and their interrelations, and that ideology is mere 
superstructure. As such, it is the last part of culture 
to change. This form of the old materialist versus 
idealist philosophical argument persists strongly to 
this day. 


Twentieth-century evolutionism 


Most American ethnologists in the first half of 
the twentieth century repudiated an evolutionism 
that they misunderstood in favor of a raw ethno- 
graphic, “natural history” approach to the study of 
primitive culture. In Britain and France, and in 
sociology nearly everywhere, evolutionism suc- 
cumbed intellectually to a structural functionalism 
that had greater utility for the practical solution 
of social problems through political administration 
in the colonies and at home. 

A. G. Keller, an American sociologist, Leslie A. 
White and Julian H. Steward, American ethnolo- 
gists, and V. Gordon Childe, a British archeologist, 
were virtually alone in opposing the antievolution-
ary temper of the times. It was not until after
mid-century that there was any larger shift of opin- 
ion toward an evolutionary outlook again, but this 
took place only in America, only in anthropology, 
and there only in part. 

Twentieth-century evolutionism differs from pre- 
vious theories in two major respects. The first 
concerns the cultural adaptations through which 
evolutionary changes are believed to occur. The 
concept of cultural adaptation has supplanted the 
orthogenesis of earlier evolutionists, who had found 
the only generative impulse in the internal class- 
struggle dialectic proposed by Marx and Engels and 
in the social Darwinism of some sociologists. Second, 
a new theoretical synthesis has been made by the 
reworking and integration of some of the earlier 
perspectives that had been thought to be con- 
tradictory. 

The significance of the adaptation of culture to 
the natural environment as an important aspect of 
the evolutionary process was presented as “cultural 
ecology” by Julian H. Steward. Others have pro- 
posed further that in the process of adaptation, the 
environment includes not only the natural environ- 
ment but adjustments to other social systems as 
well. According to this view, inventions and dis- 
coveries, borrowings, unconscious historical “acci- 
dents,” changes from whatever source, are the raw 
materials for evolutionary change in culture. Some 
of these “fit” as improvements in the internal func- 
tional arrangements, while others solve external 
environmental problems with respect to nature or 
competition: thus they are selected simply because 
they are superior instruments. The advance of this 
perspective over eighteenth-century and nineteenth- 
century ideas of the “inevitability of progress” and 
orthogenesis is manifest: the evolutionary perspec- 
tive is not mystical and can accommodate and 
make more intelligible the variety of historical data 
we now possess. 

One of the historical facts of life that has plagued 
all orthogenetic schemes is that different societies 
manifest great variation in rates of evolution, from 
drastic revolution to the other extreme of non- 
evolution—i.e., stabilization. And in so many in- 
stances a society makes a very rapid rise only to 
reach a long-term plateau. The theory that evolu- 
tion proceeds by adaptation, however, allows for 
all of these eventualities, taking stabilization as 
much for granted as progress. Stabilization, after all, 
merely bespeaks the success of the adaptive proc- 
ess: when the culture is successfully adapted, it 
tends to reject subsequent possible changes. This 
can render explicable what might seem paradox- 
ical: that a culture “high” in one stage might fail 
to advance to further heights in the next simply
because of its earlier success. And, of course, the 
more specialized its form of adaptation, the more 
deeply entrenched and committed to its present 
environment it becomes. 

The perspective of cultural adaptation and selec- 
tion is particularly useful in reconciling opposing 
viewpoints derived from the nineteenth century. 

What is it that evolves? Is it culture in general, 
through grand stages, or only particular social 
systems? The reconciliation of these two views is 
easy: both are correct. The evolution of the totality 
is the product of the evolution of particular soci- 
eties. To be sure, there is but a single evolutionary 
process, the selection of traits and their functional 
adjustment via adaptation in particular systems. 
This is the way societies become differentiated 
one from the other, but it is also the way some 
become superior to others in measurable ways. 
Thus, two different theoretical perspectives are pos- 
sible with respect to the same data. These are what 
Sahlins (Sahlins & Service 1960) calls the specific 
evolutionary perspective as compared with the 
general evolutionary perspective. The former refers 
merely to the creation of diversity by adaptive mod- 
ification of related particular societies. The latter 
is the measurement of progress; some specific evo- 
lutionary changes are significant breakthroughs 
that can be measured by such objective directional 
criteria of progress as greater all-round adaptability, 
greater dominance, or greater complexity of or- 
ganization. In short, specific evolution refers to our 
concern with descent-with-modification or adaptive 
variation; general evolution refers to the progressive 
emergence of superior forms, stage by stage, which 
can be related to the directional evolution of the 
total culture of the human species. 

How does culture evolve? Is it in some measure 
intentionally planned, or is it an unconscious and 
nonrational process, determined by events outside 
human awareness? Surely an improvement in ideas 
has something to do with it, and sometimes ideas 
must be conscious and rational; this is most obvi- 
ous in science and engineering but also holds true 
in the institutional realm. Many political institu- 
tions, for example, result from attempts to solve 
social or economic problems purposefully. But, of 
course, there are often latent and unintended con- 
sequences of even the most manifest political 
expediency. 

Again, it would seem that a reconciliation of the 
opposed views can be made by means of the adap-
tation-selection perspective. New culture traits or
modifications can have any number of sources: 
inventions, purposeful borrowings, accidents, un-
conscious functional shifts, and so on. The selection
or rejection of any of these could also involve 
conscious intentionality, even if but rarely. The 
selective process in cultural evolution is only roughly 
analogous to natural selection in biology; certainly 
the capacity of a person to analyze his own be- 
havior, predict future events, and rearrange his 
affairs on that basis is a distinctively human trait. 
It is more difficult to plan and arrange things on a 
social or political basis, and the greater the demo- 
graphic scope the more difficult it is; but it does 
happen. The adaptation-selection perspective has 
the great virtue of not prescribing either conscious 
intention or unconsciousness; it can accommodate 
either and still lead to greater comprehension of 
cultural change. And further, determinism in hu- 
man affairs is not equated with unawareness, in- 
determinism with awareness. Determinism is a 
perspective that the analyst takes, not a property 
of the subject matter under investigation. 

Where is the locus of the evolutionary impulse? 
Does it lie in the mode of production, in technology, 
in the relations of production, in the class struggle, 
in the division of labor, in man’s view of destiny; 
or is it a mystical force in the cosmos? It would 
seem that those who posited mode of production, 
class struggle, technology, or division of labor were 
much influenced by the industrial revolution, which 
has been, of course, a most striking evolutionary 
prime mover for the past century and a half and 
promises even more wondrous cultural transforma- 
tions almost immediately. But has the material, 
technoeconomic aspect always been the prime 
mover? The change from primitive chiefdoms to 
early states and then to empires in Mesoamerica, 
Peru, and probably elsewhere seems to have been 
first in the political sector; even the important in- 
ventions of writing and mathematics could have 
originated in the occult mumbo jumbo of priests. 
The modern evolutionist accordingly wants to know 
more about particular instances of change and 
finds no need to insist that the initial loci must be 
always in the same sector of culture. 

Evolutionists of the nineteenth century were 
more empirical than their predecessors, but their 
use of ethnographic data was often mere illustra-
tion rather than proof of hypotheses. The “com-
parative method” of some, such as Sir James Frazer,
was simply an uncritical, but energetic, “clip and 
paste.” E. B. Tylor (1888) was a notable exception: 
he originated a method of statistical correlations in 
his comparative study of marriage and descent 
rules. Hobhouse, Wheeler, and Ginsberg’s work 
(1915) was another important application of statis-
tics to problems of cultural evolution. Otherwise,
both evolutionary theory and the comparative meth-
od nearly perished from inattention until mid-
twentieth century.
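
The logic of correlating culture traits across societies can be illustrated with a minimal sketch. It assumes hypothetical presence/absence codings for two traits in eight societies; the data and the trait labels are invented for illustration and are not Tylor's. The phi coefficient used here is a later, standard measure of association for such two-by-two tabulations, not the computation Tylor himself performed.

```python
from math import sqrt


def phi_coefficient(trait_a, trait_b):
    """Phi coefficient for two equal-length lists of 0/1 trait codings."""
    if len(trait_a) != len(trait_b):
        raise ValueError("codings must cover the same societies")
    pairs = list(zip(trait_a, trait_b))
    # Cells of the 2 x 2 table of co-occurrence.
    n11 = sum(1 for a, b in pairs if a and b)
    n10 = sum(1 for a, b in pairs if a and not b)
    n01 = sum(1 for a, b in pairs if not a and b)
    n00 = sum(1 for a, b in pairs if not a and not b)
    denom = sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    if denom == 0:
        raise ValueError("a trait is constant across the sample")
    return (n11 * n00 - n10 * n01) / denom


# Hypothetical codings for eight societies (1 = trait present).
residence_rule = [1, 1, 1, 0, 0, 0, 1, 0]
avoidance_rule = [1, 1, 0, 0, 0, 0, 1, 1]
phi = phi_coefficient(residence_rule, avoidance_rule)
```

A value near zero would suggest that the two traits occur independently of each other in the sample; values approaching 1 or -1 would suggest that they tend to occur together or to exclude each other.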

The use of the comparative method began to 
revive in America during the 1950s, particularly 
stimulated by George P. Murdock’s creation of the 
Human Relations Area Files and later the “Ethno- 
graphic Atlas.” Formal graphical means of showing 
correlations and sequences of culture traits in the 
course of evolutionary changes have attracted at- 
tention recently. (See Naroll 1956; Freeman & 
Winch 1957; Gouldner & Peterson 1962; and Car- 
neiro & Tobias 1963.) The results of these efforts 
are meager so far, and several unsolved difficulties 
attend them. The problems of how to define the 
significant cultural units to be counted and how to 
select a random sample of them in the absence of 
an adequate number of ethnographies of unaccul- 
turated societies are serious. 

The gravest difficulty of all is caused by the tend- 
ency of a culture to become specialized as it adapts 
to its environment, for to the extent that it is 
special it is incommensurable. Walter Goldschmidt 
has aptly called this “the Malinowskian dilemma”: 
Malinowski argued (and successfully demon- 
strated) that every cultural institution must be 
understood as a unique product of the cultural 
whole within which it developed. It would seem to 
follow, therefore, that the comparative method is 
wrong, comparing incomparables. Yet, paradox- 
ically, Malinowski often extrapolated from his in- 
sights into Trobriand culture to the primitive world 
in general. Goldschmidt argues that the solution is
to compare functions, not institutions: “What is 
consistent from culture to culture is not the insti- 
tution; what is consistent are the social problems. 
What is recurrent from society to society is solu- 
tions to these problems” (1966, p. 31). 

Julian H. Steward’s studies (1955) of “multi- 
linear evolution” show an awareness of these diffi- 
culties. He recommends comparative studies of 
specific holocultures in evolution, rather than com- 
parisons of isolated traits, in order to find the 
“regularities” of evolution. 

There is a different test of theory that is bound 
to be used more frequently—the test of fruitful- 
ness. Since one of the main purposes of evolution- 
ary theory is to provide intelligibility to historical 
data, then the better it fulfills this function, the 
greater must be its empirical as well as logico- 
didactic virtues. Guy Swanson’s study of religion 
(1960) is a good example: light is cast on the 
development of religion, and at the same time 
evolutionary theory proves to be useful. 

Such empirical applications of evolutionary the-
ory can result in its refinement only to the extent
that evolutionists maintain an empirical orienta- 
tion, willing to change the theory in the service of 
its intellectual functions. Some of the older evolu- 
tionary philosophies, particularly those of Marx 
and Spencer, were too grand in scope and too 
schematic to be useful. They also became stultified 
dogmas as they were used by political parties and 
academic “schools of thought.” A better fate may 
be expected of recent evolutionism, judging from 
the evidence of new empirical attitudes, particu- 
larly if its proponents remain guarded against un- 
necessary and untested preconceptions that can so 
easily impede a true evolutionary science of 
culture. 

ELMAN R. SERVICE 


[See also ANTHROPOLOGY, articles on THE FIELD and 
THE COMPARATIVE METHOD IN ANTHROPOLOGY; 
ARCHEOLOGY, article on THE FIELD; CULTURE; ECOL-
OGY. Also related are the entries INTEGRATION;
SOCIAL DARWINISM; SOCIOLOGY, article on THE DE-
VELOPMENT OF SOCIOLOGICAL THOUGHT; and the
biographies of CHILDE; MORGAN, LEWIS HENRY; 
SPENCER; TYLOR.] 
v
SOCIAL EVOLUTION 


Evolutionary theory dominated sociological 
thought in the nineteenth and early twentieth cen- 
turies, but since about 1920 interest in it has, on 
the whole, given way to preoccupation with sys- 
tematic analysis of social systems, analysis of 
broad social and demographic trends, and investi- 
gation of the social determinants of behavior 
(Ginsberg 1932). The recent tentative revival of 
interest in an evolutionary perspective is closely 
related to growing interest in historical and com- 
parative studies. It does not, however, denote a 
mere return to the assumptions of the classical 
evolutionists, but rather it implies revision and 
reappraisal of evolutionary theory in the light of 
recent advances in sociological theory and research. 

From the point of view of sociological analysis, 
the older evolutionary models broke down mainly 
over two stumbling blocks. The first was the as- 
sumption that the development of human societies 
is relatively cumulative and unilinear and that the 
major “stages” of development are universal—even 
if there are many differences in detail and even if 
not all societies reach every stage of evolution. The 
second stumbling block was the failure to specify 
fully the systemic characteristics of evolving soci- 
eties or institutions or the mechanisms and proc- 
esses of change through which the transitions from 
one “stage” to another were effected. Most of the 
classical evolutionary schools tended, rather, to 
point out general causes of change (economic, 
technological, spiritual) or some general trends 
(for example, the trend to complexity) inherent 
in the development of societies. Very often they 
confused such general tendencies with the causes 
of change or assumed that these general tendencies 
explained concrete instances of change (Bock 
1963). 

Attempts to reappraise evolutionary perspectives, 
therefore, must address themselves to several basic 
problems inherent in the new analytical develop- 
ments in sociological theory, on the one hand, and 
in the general setting of the evolutionary problem, 
on the other (Wolf 1964; Parsons 1964; Eisen- 
stadt 1963a). The first crucial problem concerns 
the extent to which change from one type of soci-
ety to another is not accidental or random but,
rather, evinces over-all evolutionary or develop- 
mental trends. Second is the question of the extent 
to which such changes are cumulative within any 
given society and within any given institutional 
sphere in different societies (Wolf 1964). Third 
is the problem of the extent to which such changes 
do indeed enhance the adaptive potential of a soci- 
ety in relation to its cultural and natural environ- 
ment—however such adaptation and environment 
are defined (White 1959; Sahlins & Service 1960). 

Furthermore, even if some such common char- 
acteristics or trends can be found within different 
and disparate societies, the question remains as to 
the validity of talking about the evolution of human 
society or culture as a whole. Here three sub- 
problems exist: the first is the extent to which other 
societies constitute the “environment” of any soci- 
ety—that is, the environment to which any given 
single society has to adapt and which can enhance 
the general reservoir of its techniques of adapta- 
tion. The second is a question of the extent to 
which institutions and forms of organization that 
have adaptive value can be borrowed and trans- 
planted from one society to another, thus enhanc- 
ing their adaptive potential. And, finally, we must 
ask to what extent human society is a “system of 
points” with some common adaptive and integra- 
tive mechanisms. As distinct from the general 
theory of cultural evolution, which in a way as- 
sumes the unity of mankind and hence also the 
internal transferability of institutions or tech- 
niques, the focus of sociological analysis is on the 
relations between the systemic characteristics of 
societies in interaction with their natural, social, 
and cultural environments, on the one hand, and 
some possible broader trends of changes and trans- 
formations in their “transbiological” or superor- 
ganic abilities and traditions, on the other (Mead 
1964). 

The starting point of all these discussions, espe- 
cially from the point of view of the relation between 
the transformative capacities of any single society 
and any possible general trends in the develop- 
ment of human societies and human society in 
general, is the problem of the extent to which such 
changes may be envisaged as crystallizing into 
developmental “stages”—a key concept in classical 
evolutionary thought (Ginsberg 1932). In the old- 
er evolutionary school such stages were construed 
mostly in terms of “specialization” and “com- 
plexity,” whereas in recent works these concepts 
have been to a large extent replaced by that of 
“differentiation.” 
Differentiation and institutional growth


Differentiation, like complexity or specialization, 
is first of all a classificatory concept. It describes 
the ways through which the main social functions 
or the major institutional spheres of society be- 
come dissociated from one another, attached to 
specialized collectivities and roles, and organized 
into relatively specific and autonomous symbolic 
and organizational frameworks within the confines 
of the same institutional system. In broad evolu- 
tionary terms, such continuous differentiation has 
usually been conceived as a continuous develop- 
ment starting from the “ideal” type of the primitive 
society or band, in which all the major roles are 
allocated on an ascriptive basis and the division of 
labor is based primarily on family and kinship 
units. Development then proceeds through various 
stages of specialization and differentiation. 

Specialization is first manifest when each of the 
major institutional spheres, through the activities 
of people placed in strategic roles within it, devel- 
ops its own organizational units and complexes 
and its specific criteria of action. The latter tend 
to be congruent with the basic orientations of a 
given sphere, facilitating the development of its 
potentialities: technological innovation, cultural 
and religious creativity, expansion of political 
power or participation, or development of complex 
personality structure. 

Second, different levels or stages of differentia- 
tion denote the degree to which major social and 
cultural activities as well as certain basic resources 
—such as manpower and economic resources— 
have been disembedded or freed from kinship, ter- 
ritorial, and other ascriptive units (Parsons 1964; 
Bellah 1964; Eisenstadt 1963a). Although these 
“free-floating” resources pose new problems of in- 
tegration, they may also become the basis for a 
more differentiated social order that is, at least 
potentially, better adapted to deal with a more 
variegated environment. Thus, a new set of prob- 
lems—those of integration—emerges as the very 
crux of the way in which such resources can be 
utilized for the crystallization of some general 
transformative potentials within a society. 

Problems of integration. As the more differen- 
tiated and specialized institutional spheres become 
more interdependent and potentially complemen- 
tary in their functioning within the same over-all 
institutionalized system, this very complementarity 
creates more difficult and complex problems of 
integration. The growing autonomy of each sphere 
of social activity, and the concomitant growth 
of interdependence and mutual interpenetration
among them, pose for each sphere ever more diffi- 
cult problems in crystallizing its own tendencies 
and potentialities and in regulating its normative 
and organizational relations with other spheres. 
And at each more “advanced” level or stage of dif- 
ferentiation, the increased autonomy of each sphere 
creates increasingly more complex problems of in- 
tegrating these specialized activities into one sys- 
temic framework. 

The growing autonomy of the different institu- 
tional spheres and the extension of their organi- 
zational scope not only increase the range and 
depth of social and human problems but also open 
up new possibilities for technological development, 
expansion of political power or rights, and cultural 
creativity. Growing differentiation also enhances 
systemic sensitivity to a much wider physical-technical
environment and to more comprehensive
intersocietal relations. But the growth of systemic 
sensitivity to new problems and exigencies does 
not necessarily imply a concomitant development 
of the ability to deal with these problems, nor does 
it indicate the ways in which these problems may 
be solved. At any given level of differentiation an
institutional sphere may or may not achieve an
adequate degree of integration, and the potential-
ities unfolded through the process of differentiation
may be “wasted”—that is, they may fail to become 
crystallized into an institutional structure. 

Recognition of the integrative problems that are 
attendant on new levels of differentiation consti- 
tutes the main theoretical implication of the con- 
cept of differentiation, and it is in the light of the 
analytical problems raised by this implication that 
the various questions pertinent to a reappraisal of 
the evolutionary perspective in social science have 
to be examined. We are as yet far from any defini- 
tive answers to these questions, but at least we can 
point out some of the most important problems of 
research in this direction. 


Responses to differentiation 


The passage of a given society from one stage 
of differentiation to another is contingent on the 
development within it of certain processes of 
change which create a degree of differentiation 
that cannot be contained within the pre-existing 
system. Growing differentiation and the consequent 
structural breakthroughs may take place through
a secular trend of differentiation, or through the
impact of one or a series of abrupt changes, or
both. These tendencies may be activated by the
occupants of strategic roles within the major insti-
tutional spheres as they attempt to broaden the
scope and develop the potentialities of their spheres.
The extent to which these changes are institution- 
alized and the concrete form they take in any given 
society necessarily depend on the basic institutional 
contours and premises of the pre-existing system, 
on its initial level of differentiation, and on the 
major conflicts and propensities for change within 
it (Eisenstadt 1964b). 

We need not assume that all changes in all soci- 
eties necessarily increase differentiation. On the 
contrary, the available evidence shows that many 
social changes do not give rise to over-all changes 
in the scope of differentiation but result, instead, 
mainly in changes in the relative strength and com- 
position of different collectivities or in the integra- 
tive criteria of a particular institutional sphere. 
Largely because the problem has not yet been fully 
studied, we do not know exactly what conditions 
facilitate or precipitate these different types of 
change in different societies and what makes for 
variations in innovative or transformative capaci-
ties among different societies (Eggan 1963; Sahlins
1964).

Even when social change increases differentia- 
tion, the successful, orderly institutionalization of 
a new, more differentiated social system is not 
always a necessary outcome. Moreover, at any 
level of development, response to the problems cre- 
ated by the process of differentiation may take one 
of several different forms (Weber 1922a; Eisen- 
stadt 1963a). The most extreme outcome is failure 
to develop any adequate institutional solution to 
the new problems arising from growing differenti-
ation. Aside from biological extinction, the conse-
quences may be total or partial disintegration of
the system, a semiparasitic existence at the margin 
of another society, or total submersion within 
another society. 

A less extreme type of response tends to lead to 
“regression,” that is, to the institutionalization of 
less differentiated systems within the more differ-
entiated system that has broken down. Examples
include the establishment of small patrimonial or 
semifeudal chiefdoms on the ruins of the Achae- 
menid Empire, the development of dispersed 
tribal-feudal systems at the downfall of the Roman
Empire, and similar developments on the ruins of 
Greek city-states. Many such regressive develop- 
ments are only partial, in the sense that within 
some parts of the new institutional structure some 
nuclei of more differentiated and creative orienta- 
tions may survive or even develop. Sometimes, but 
certainly not always, these nuclei “store” entre- 
preneurial ability for possible—but not inevitable 
—future developments. 


Another possibility, which perhaps overlaps with 
the last one but is not always identical with it, is 
the development of a social system in which the 
processes of differentiation and change go on rel- 
atively continuously in one part or sphere of a 
society without becoming fully integrated into a 
stable, wider framework. In such situations a con- 
tinuous process of unbalanced change may develop, 
resulting either in a breakdown of the existing 
institutional framework or in stabilization at a 
relatively low level of integration. Perhaps the best 
examples of such developments can be found in 
various “dual conquest” societies (for example, 
conquest of the sedentary population by nomads 
in the Mongol Empire) and especially in the pre- 
independence stages of modern colonial societies. 

A fourth, and perhaps the most variegated, type 
of response to growing differentiation consists of 
some structural solution that is on the whole con- 
gruent with the relevant problems. Within this 
broad type a wide variety of concrete institutional 
arrangements is possible. Such different solutions 
usually have different structural results and reper- 
cussions. Each denotes a different structure crys- 
tallized according to different integrative criteria 
and is interpenetrated in a different way by the 
other major social spheres. 

Thus, drawing on examples from the great cen- 
tralized empires of history, we see that although 
the initial stages of socioeconomic differentiation 
were relatively similar in Byzantium, in the later 
(Abbasside) caliphate, and in post-Han China, 
each of these societies developed different over-all 
institutional structures (Eisenstadt 1963a). The 
Byzantine Empire became a highly militarized and 
politically oriented system, whereas the caliphate 
developed a theocratic structure, which was based 
on continuous attempts to institutionalize a new 
type of universalistic politicoreligious community. 
China developed a centralized system based on the 
power of the emperor and the bureaucracy and, at 
the local level, on the relative predominance of 
the gentry; the selective channels of the examina- 
tion system and the elite formed by the literati were 
the major mechanisms for integrating the local 
and central levels. 

One very interesting structural solution is the 
development of a relatively stable system in 
which the major institutional spheres vary in de- 
gree of differentiation. One of the most important 
examples of such variation occurs in feudal sys- 
tems, which are characterized by a relatively high 
degree of differentiation in some of the central 
cultural roles as against a much smaller degree of 
differentiation in the economic and political roles. 


In cases of such uneven differentiation the more 
differentiated units of such related societies (for 
example, the church in feudal or patrimonial sys- 
tems) often tend to develop a sort of international 
system of their own, apart from that of their 
“parent” societies. 

The variety of integrative criteria and institu- 
tional contours at any level of differentiation is, 
of course, not limitless. The very notion of inter- 
dependence among major institutional spheres 
negates the assumption that any number of levels 
of differentiation in different institutional spheres 
can coalesce into a relatively stable institutional 
system. The level of differentiation in any one 
sphere necessarily constitutes, within broad limits, 
a precondition for the effective institutionalization 
of certain levels of differentiation in other social 
spheres. But within these broad limits of mutual 
preconditioning a great deal of structural variety
is possible. 

The intersocietal environment. The processes 
of change and of differentiation, on the one hand, 
and the development of different integrative re- 
sponses to them, on the other, do not take place 
within single, closed societies. They are closely 
related to the international system that constitutes 
the broader environment of any society. Each soci- 
ety is related to many others geopolitically, ecologi- 
cally, and socioculturally. These relations constitute 
the environment to which each society has to adapt 
and which may also influence its ability to evolve 
institutional responses to the processes of change. 

Such international geopolitical factors, in the 
broadest sense, not only provide the general setting 
for any given society but also give rise to many of 
the concrete pressures upon it, such as external 
pressures of population, problems of military se- 
curity, or adjustment to international trade. Such 
an intersocietal environment need not always con- 
sist of societies of the same type or level of differ- 
entiation; it may, indeed, contain many different 
types of societies. In general, it can be assumed 
that the more differentiated a society is, the greater 
is its systemic sensitivity—although not necessarily 
its ability to cope with these problems—to a wider 
and more variegated international setting. 

Thus, at any given level of differentiation, the 
crystallization of different institutional orders is 
shaped by the interaction between the broader 
structural features of the major institutional 
spheres, on the one hand, and, on the other, the 
development of elites or entrepreneurs in some of 
the institutional spheres of that society, in some 
of its enclaves, or even in other societies with 
which it is in some way connected. 
The variability in the concrete components of
such interaction helps to explain the great (but 
not limitless) variety of structural and integrative 
forms that may be institutionalized at any given 
level of differentiation. Although different societies 
may arrive at broadly similar stages of evolution in 
terms of the differentiation of their major institu- 
tional and symbolic spheres, yet the concrete insti- 
tutional contours developed at each such step, as 
well as the possible outcomes of such institutional- 
ization in terms of further development, break- 
down, regression, or stagnation, may differ greatly 
among them. 


Reappraising evolutionary theory 


The preceding analysis of processes of change 
and differentiation and of concomitant institution- 
alization of new structures indicates some of the 
problems that are posed by any attempt to re- 
appraise the evolutionary perspective in sociological 
theory. 

First is the exploration of the different mech- 
anisms of social change and the distinction be- 
tween those conditions and processes of change 
that create potentialities for transformation and 
those that do not. It is obvious, as indicated above, 
that not all processes of social change necessarily 
give rise to changes in over-all institutional sys- 
tems. Although the potentialities for such systemic 
changes (as distinct from changes in patterns of 
behavior, in the composition of subgroups, or in 
the contents of the major integrative criteria of 
different spheres) exist in all societies, the very 
actualization of these potentialities, as well as the 
tempo and direction of such changes, varies greatly 
among different societies (Eggan 1963; Eisenstadt 
1963a).

Second, and closely connected with the first, is
the problem of how cumulative the development
of different types of institutional organization is.
Here it seems that in some institutions there may 
indeed be a “scale” or “semiscale” order of develop- 
ment, although the application of scale analysis to 
this type of phenomenon has so far failed to detect 
any perfect regularities (Carneiro 1962; Good- 
enough 1963). Such scale order is probably to be 
found least in the sphere of kinship or family in- 
stitutions, whereas it is more pronounced in those 
institutional spheres, such as economics, politics, 
and law, that are most closely connected with 
technology or with organizational problems (Wolf 
1964; Schwartz & Miller 1964).

However, even the existence of such scale order 
does not necessarily imply that developments in
any institutional sphere are necessarily cumulative
in the sense that they can be transferred easily 
from one society to another at a similar general 
level of differentiation; neither does it imply that 
their development within any single society or their 
transfer from one to another must necessarily pro- 
ceed in a certain order or that “jumps” are not 
possible. Studies of the modernization of tradi- 
tional societies are especially relevant from the 
point of view of the possibilities of such jumps, 
although similar cases can probably be found in 
other types of societies also (Eisenstadt 1963b). 
At most, studies of such scale order indicate that 
a certain trait or organizational type may be a 
necessary, but certainly not a sufficient, condition 
for the emergence of another; even here, the find- 
ings of “quasi scales” indicate the possibility of 
many functional equivalents of any such trait in 
a sequential series—especially in cases of rapid 
social change (Schwartz & Miller 1964). 

Third is the question of the extent to which the 
problems arising from growing differentiation and 
the institutional solutions to these problems are 
indeed the same in different societies, thus creating 
some common trend of development. This problem 
is very close to that of the relation, to use Sahlins’ 
and Service's nomenclature (1960), between “spe- 
cific” and “general” evolution or that of the feasi- 
bility of the assumption, as put by Eggan (1963, 
p. 355), “that these particular developments neces- 
sarily add up to ‘the succession of culture through 
stages of overall progress,’ which is general evolu- 
tion.” But there appears to be no reason why all 
societies should reach certain stages of differenti- 
ation or why they should necessarily develop the 
same types of institutional contours once they 
attain such stages. The most that can be claimed 
at present is that the processes of differentiation 
in different societies exhibit similar formal and 
structural characteristics and that these create 
somewhat similar integrative problems. 

It is in these common characteristics and prob- 
lems that the fact of the common humanity of all 
human societies, as well as the possibilities of some 
common understanding and of intersocietal bor- 
rowing and transfer of institutions, is rooted; more- 
over, these characteristics indicate the existence 
of some “evolutionary universals” in the develop- 
ment of different human societies (Parsons 1964). 
However, the variety of possible “functional equivalents”
of institutionalized solutions to such problems,
as well as the possibilities of “regression,”
stress the fact that the paths of development of
different societies are neither necessarily common
nor given. In other words, there is no reason to
assume that there is a necessary relation and con- 
gruity between the mechanisms of genetic (here 
cultural or social) “transmission and change and 
the route of development of this or that organism 
or species” (Gellner 1965, p. 17). 

This discussion is closely related to the fourth, 
very crucial problem involved in the reappraisal 
of evolutionary perspectives: the explanation of 
the variability of institutionalized solutions to the 
problems arising from the development of a given 
level of structural differentiation. Here it should 
be recognized that the conditions giving rise to 
structural differentiation and to “structural sensi- 
tivity” to a greater range of problems do not neces- 
sarily create the capacity to solve these problems. 

Creative entrepreneurial elites. The crucial fac- 
tor is the presence or absence, in one or several 
institutional spheres, of an active group of special 
“entrepreneurs”—that is, an elite that is able to 
offer solutions to the new range of problems. Among 
modern sociologists Weber came closest to recog- 
nizing this factor when he stressed that the crea- 
tion of new institutional structures depends heavily 
on the “push” given by various “charismatic” groups 
or personalities and that the routinization of cha- 
risma is critical for the crystallization and con- 
tinuation of new institutional structures (Weber 
[1922b] 1963, chapters 4, 10, and 11). The devel- 
opment of such “charismatic” personalities or 
groups constitutes perhaps the closest social anal- 
ogy to genetic mutation. It is the possibility of 
such mutation that explains why, at any level of 
differentiation, a given social sphere contains not 
one but several, often competing, possible orienta- 
tions and potentialities for development. 

As yet, we know little about the specific condi- 
tions (as distinct from the more general trend 
toward structural differentiation) that facilitate the
rise of new elites—that is, the conditions which 
influence the nature of their basic orientations as 
well as their relations with broader groups, strata, 
and trends of development, and their ability to 
forge out and maintain a viable institutional order. 
There are indications, however, that factors beyond 
the general trend toward differentiation are im- 
portant. For example, various special enclaves, 
such as sects, monasteries, and sectarian intellec- 
tual groups or scientific communities, play an 
important role in the formation of such elites. 
Furthermore, a number of recent studies (see, for 
instance, McClelland 1961; Hagen 1962) have in- 
dicated the importance of certain familial, ideologi- 
cal, and educational orientations and institutions. 
Within this context, it is necessary to re-examine
the whole problem of the extent to which institu- 
tional patterns are crystallized through diffusion 
from other societies rather than through independ- 
ent invention within a society. Cases of diffusion 
might be partially due to successful importation, 
by entrepreneurial groups on the margins of a 
given society, of acceptable solutions to latent 
problems or needs within that society. 

Intersocietal borrowing. The problems of the 
interaction between processes of change and 
“mutative” elites are closely related to a set of 
problems bearing on the intersocietal nature of 
evolution. We have seen that the international 
setting not only constitutes the environment to 
which any single society has to adapt itself but 
also provides a reservoir of responses that may be 
available to it; for instance, the setting may pro- 
vide enclaves from which new elites or adaptive 
techniques and organizations can be borrowed. 

The existence of such interrelationships—in 
terms of both common problems and the possibility 
of “borrowing” solutions—necessarily underlies the 
basic mutual resemblance of human societies, and 
it is in turn closely related to the problem of the 
extent to which it is possible to talk about general 
“social” evolution or the evolution of human society 
as a total entity. However, the existence of such 
mutual resemblance certainly does not ensure that 
the development of “human society” as a unified 
system with common adaptive mechanisms neces- 
sarily increases its ability to deal with over-all 
problems of adaptation. Paradoxically, the very 
interrelatedness of societies may create problems 
with which they may not be able to deal. What 
may seem to be a positive accumulation of avail- 
able mechanisms and a repertoire of adaptations 
from the point of view of human society as a whole 
may yet, because of the lack of intersocietal inte- 
grative and adaptive mechanisms, constitute a very 
grave problem. 


Limitations of evolutionary theory 


The considerations presented above constitute 
the background for a reappraisal of the evolution- 
ary perspective within the framework of recent 
sociological theory. An evolutionary perspective, 
from the point of view of human societies, makes 
sense only so far as at least some of the processes 
of change that are inherent in any social system 
create the potentialities for the institutionalization 
of more differentiated social and symbolic systems. 
From the point of view of human society or cul- 
ture as a whole, such a perspective makes sense 
only insofar as there exist some mechanisms for
the transmission of various institutional and adap- 
tive techniques and for creating some common, 
intersocietal, adaptive and integrative capabilities 
and frameworks. 

With regard to all these areas, several problems 
for which there exist as yet no adequate solutions 
have been pointed out above. They have, in a way, 
all focused on the interaction between processes of 
social differentiation, on the one hand, and the 
formation and activities of different elites, on the 
other. It is this interaction that makes possible 
the institutionalization of different integrative prin- 
ciples and concrete structures at a given level of 
societal differentiation. Any search for solutions to 
these problems must concentrate on the various 
aspects of these processes. In this endeavor broad 
evolutionary considerations indicate ranges of pos- 
sibilities and types of potential breakthroughs but 
do not in themselves provide answers. 


SHMUEL N. EISENSTADT 


[Directly related are the entries on EMPIRES; FEUDAL- 
ISM; SOCIAL INSTITUTIONS. Other relevant material 
may be found in ANTHROPOLOGY; DIFFUSION.] 
VI
EVOLUTION AND BEHAVIOR 


The fundamental postulate of the modern bio- 
logical theory of evolution is that the guiding 
agency of evolutionary changes is adaptation to 
the environments that a species inhabits. This is 
equally true of changes in structural features, 
physiology, and behavior of organisms. Evolution- 
ary developments maintain the adaptedness of the 
species when the environments change, or they 
improve the adaptedness if the environments are 
more or less stationary. The environment does not, 
however, impose changes on the organism, as was 
believed by some early evolutionists, particularly by 
the adherents of the now almost completely abandoned
Lamarckian hypothesis. It is more accurate
to say that the environment presents challenges, to 
which a living species may respond by adaptive 
modification of its genetic endowment (genotype). 
If a response is elicited, the adaptedness is pre- 
served or improved; if the species fails to respond, 
its fitness declines and it may become extinct. The 
genetic raw materials from which evolutionary 
changes may be constructed are mutations, that is, 
alterations in the gene or chromosome structures. 
The effects that mutations produce vary in magni- 
tude all the way from alterations so drastic that 
the mutant is inviable (lethal, as a fatal hereditary 
disease causing death before sexual maturity) to 
changes so slight that the change in fitness, if any, 
that they produce can be detected only by means 
of refined statistical study. Mutant genes may be
favorable only in heterozygous carriers (hybrid
vigor, or heterosis) but unfavorable in double dose 
(in homozygous condition). Or a genetic change 
may be favorable in some environments and un- 
favorable in others. Or, finally, a mutant gene may 
be favorable in combinations with some genes but 
unfavorable in combination with others. 

The great evolutionary importance of sexual reproduction
lies in the fact that it constantly combines and
recombines the various genes present in the species 
population, enabling the favorable gene combina- 
tions to arise and to be tested by natural selection. 
In sexually reproducing and outbreeding species, 
such as man, no two individuals (except identical 
twins) have the same genotype. We inherit our 
genes from our parents and pass them to our chil- 
dren, but the gene constellation, the genotype, of 
every individual is unique. By and large, the greater 
the change a mutation produces, the greater the 
chance it will be harmful to the organism. 

The adaptive evolutionary changes are com- 
pounded almost exclusively of the slight mutational 
changes (sometimes termed polygenic changes). 
Changes of greater magnitudes are important 
rather as the source of the genetic pathology, in- 
capacitation, or weakness. Such diseases as phenyl- 
ketonuria, with its associated mental defects, and 
schizophrenia, to name only two, are examples of 
deleterious mutants affecting behavior. Apart from 
their negative importance for public health, muta- 
tions that produce strikingly visible alterations are 
also important as materials for genetic studies. 
Most of the pioneering work in genetics was done 
with genetic variants that must be classed as more 
or less pathological deviants and that play only 
negative roles in evolution, contributing to the 
“genetic load” the population carries. 

Natural selection. Mutational changes, those 
affecting behavior as well as those responsible for 
structural and physiological alterations, are mostly 
harmful to the organism. This is the reason why 
any increase of the frequency of mutations in 
human populations (for example, through expo- 
sure to X rays, other mutagenic radiations, or chem- 
ical mutagens) can result only in a reduction of 
the average fitness of the populations concerned. 
How, then, can mutations serve as building blocks 
for adaptive evolutionary changes? The answer is 
that mutational changes and their combinations 
are sorted out by natural selection. The majority 
of mutations that decrease the fitness of their car- 
riers in all environments and in all combinations 
are cast out of the populations by natural selection; 
those that are useful in at least some environments 
and in some combinations are preserved and
multiplied. 
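
The sorting of mutants described above can be illustrated with the standard one-locus, two-allele selection model of population genetics. The sketch below is only an illustration under stated assumptions: the function names, the fitness values, the starting frequency, and the number of generations are all hypothetical and are not taken from the text. It contrasts a mutant that is harmful in every genotype with one that is favorable only in heterozygous carriers.

```python
def next_freq(q, w_AA, w_Aa, w_aa):
    """Return the mutant-allele frequency after one generation of selection.

    q is the current frequency of the mutant allele a. The w_* arguments
    are relative fitnesses of the three genotypes (illustrative values,
    not measurements). Random mating is assumed.
    """
    p = 1.0 - q
    # Mean fitness of the population under random mating.
    mean_w = p * p * w_AA + 2 * p * q * w_Aa + q * q * w_aa
    # Share of the next generation's gene pool carrying allele a.
    return (q * q * w_aa + p * q * w_Aa) / mean_w


def run(q0, generations, w_AA, w_Aa, w_aa):
    """Iterate next_freq for a fixed number of generations."""
    q = q0
    for _ in range(generations):
        q = next_freq(q, w_AA, w_Aa, w_aa)
    return q


# Hypothetical mutant that lowers fitness in every genotype:
# selection steadily removes it from the population.
harmful = run(0.10, 200, w_AA=1.0, w_Aa=0.9, w_aa=0.8)

# Hypothetical mutant favorable only in heterozygotes: it is neither
# lost nor fixed but settles at an intermediate frequency.
heterotic = run(0.10, 200, w_AA=0.9, w_Aa=1.0, w_aa=0.8)

print(f"harmful mutant after 200 generations:   {harmful:.2e}")
print(f"heterotic mutant after 200 generations: {heterotic:.3f}")
```

With these assumed fitnesses the harmful mutant becomes vanishingly rare, while the heterotic mutant approaches the textbook equilibrium s/(s + t), where s and t are the fitness deficits of the two homozygotes relative to the heterozygote (here 0.1 and 0.2). The model is deterministic and ignores recurrent mutation, drift, and changing environments, all of which the text treats as important.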

The action of natural selection is sometimes 
compared to that of a sieve, separating the useful 
genetic variants from the harmful ones. This 
analogy is misleading if it is taken too literally. 
Human populations, and those of most sexually 
reproducing and outbreeding species, always carry 
great stores of genetic variants which arose by 
mutation in the immediate or remote past. The 
genetic reassortments, combined with natural se- 
lection in changing environments, become a cyber- 
netic process in which the genetic developments 
that occur at a given time depend upon the changes 
that have taken place earlier, and, in turn, they 
condition the developments that may take place in 
the future. 

Mutation, the process that supplies the raw ma- 
terials from which evolutionary changes can be 
constructed, is repeatable and reversible; it is a 
physiological and, in the last analysis, a mechanical 
process. But the evolution controlled by natural
selection becomes a creative process that is unlikely
to be reversed or to be repeated. Evolutionary
transformations, such as the transformations that 
have led to the emergence of the human species, 
are chains of unique, nonrecurrent events. 

Importance of environment. Another consider- 
ation, particularly relevant in relation to the evo- 
lution of behavior, is that heredity determines not 
fixed “characters” or “traits” but reactions of the 
developing organism to the environment. The trait 
“behavior” obviously cannot be transmitted in in- 
heritance, because it is not present in the sex cells, 
which are the only material bridge connecting the 
parents with their progeny. But neither is the skin 
color inherited in this sense, because no skin pigment
is present in the sex cells. What the sex cells
do carry are genes, and the genes determine the 
pattern, or path, that the development of an indi- 
vidual will follow in a given sequence of environ- 
ments. Another individual with similar genes, an 
identical twin, may develop differently if his en- 
vironments are different. A carrier of a different 
set of genes might also develop differently in 
similar environments. 

We observe that human beings vary with respect 
to a great many traits—skin color, height, weight, 
head shape, intelligence, temperament, and special 
abilities, among countless others. As a broad gen- 
eralization, it is fair to say that whenever the varia- 
tion in any trait has been adequately studied geneti- 
cally, it has been found to be influenced by both 
genetic and environmental factors. In the past,
investigators have often tried to determine which
traits are genetic or hereditary and which are envi- 
ronmental; such a dichotomy is now recognized as 
naive and misleading. All traits, or at any rate a 
great majority of them, are both genetic and en- 
vironmental. If everybody had the same genes, as 
identical twins do, people would look and behave 
more nearly alike than they actually do. Likewise, 
if the environments in which people grow and de- 
velop were made uniform, this would also result 
in a reduction of the observed structural and 
behavioral diversity. 

Fixity and conditioned plasticity. No living 
species inhabits an absolutely uniform and constant
environment. Organisms have to face many
environments, variable both in space and in time. 
For example, the inhabitants of the temperate and 
cold climates must survive in both summer and 
winter environments. Man achieves more and more 
effective control over his physical environments, 
but he must face a great and growing diversity of 
sociocultural environments. There are two ways to
become adapted to a diversity of environments, and 
both have actually been used in the evolutionary 
process, including human evolution. One is genetic 
fixity and genetic specialization; the other is genetically
conditioned developmental plasticity. In general,
genetic fixity is characteristic of traits whose
presence and precise form are indispensable for 
survival and reproduction. The developmental proc- 
esses giving rise to such traits are said to be 
homeostatically buffered, so that they can occur in 
all environments that the species normally encoun- 
ters in its habitats. Thus, with very few exceptions, 
all infants are born with two eyes, a four-chambered 
heart, physiological systems which digest food and 
maintain a constant body temperature, ability to 
learn a symbolic human language, etc. Genetic 
specialization makes the species polymorphic (con- 
sisting of two or more genetically distinct forms 
living and interbreeding in the same territory) or 
polytypic (consisting of races that inhabit differ- 
ent territories and are genetically adapted to the 
environments of their respective territories). For 
example, the darker and the lighter skin pigmenta- 
tion of some human races is plausibly supposed to 
fit them to the climatic conditions of the lands in 
which they originally lived. 

Genetically conditioned developmental plasticity 
is advantageous when the organism profits by hav- 
ing some traits shaped differently in the different 
environments that it encounters. The tanning of 
human skin on exposure to sunlight is an example 
of such a plasticity. It is important to realize that 
both fixity and plasticity are genetically determined.
Natural selection favored the spread and
establishment of mutant genes which in some 
populations make the skin permanently darkly pig- 
mented and in other races make the pigmentation 
contingent on sun exposure. 

Culture. The most significant product, and the 
paramount determining factor, of human evolution 
is culture. The relationships between the biological 
evolution and culture are frequently misunder- 
stood, and it is important to make them clear. Cul- 
ture is not transmitted biologically through some 
special genes; it is acquired anew in every genera- 
tion by learning and instruction, in large part 
through the medium of the symbolic language. 
However, the capacity to learn and to instruct and, 
most essential of all, the capacity to use the sym- 
bolic language, is biologically and genetically 
vouchsafed to every nonpathological human being. 
An individual whose genes deprive him of these 
capacities is an obvious misfit, and his genes are 
likely to be eliminated by natural selection. Con- 
versely, it is safe to assume that the genetic equip- 
ment that made the human species capable of 
developing and maintaining culture has been com- 
pounded by natural selection in the course of the 
prehuman, subhuman, and human evolution. 

Of the many existing forms of human culture, 
the particular one an individual acquires is deter- 
mined by the society in which this individual is 
brought up, rather than by his genes. And yet, not 
only the ability to acquire any culture at all but 
also the capacity and inclination to choose this or 
that occupation, role, or trade within a culture may 
well be genetically conditioned, facilitated, or hin- 
dered. The ability to speak is genetically determined, 
but what a person will actually say is largely 
independent of genetics. 

Human acquired, extrabiological culture is man’s 
most potent adaptive instrument; it is chiefly by 
brain, not by brawn, that man controls his environ- 
ments. Since, however, human environments are 
preponderantly created by culture, the possession 
of a genetic endowment that enables members of 
human populations to adapt themselves to these 
cultural environments becomes overwhelmingly 
important. The evolution of the biological basis of 
human behavior has been controlled by this fact. 
In all cultures, primitive as well as advanced, the 
vital ability is, and always was, for every individual
to be able to learn whatever is necessary to become 
a competent member of the culture of which the 
individual is a part. For this reason, natural selec- 
tion has favored in human evolution a remarkable 
plasticity of the behavioral development; man’s 
cardinal adaptive trait is his educability, that is, his
capacity to adjust his behavior to circumstances in
the light of experience. 

Most individuals can be trained, with a greater 
or lesser facility, for many or most of the occupa- 
tions and roles that a given culture requires to be 
filled. Almost everybody could become, if properly 
brought up, a fairly competent farmer, crafts- 
man, soldier, sailor, teacher, or priest. This is 
the valid premise of the tabula rasa theory, from 
which this theory draws an erroneous conclusion. 
First clearly stated by John Locke in 1690, this 
theory is still popular in many circles. It asserts 
that a human being at birth is a clean slate on 
which the environment will inscribe a collection of 
attributes and qualities. The genetically secured 
developmental plasticity of human behavior, how- 
ever, is not at all incompatible with genetic di- 
versity. It is eminently probable that an infant at 
birth is not a clean slate and that some individuals 
are, because of their genetic endowments, more 
easily trainable for some occupations than for 
others. This is certain for some specialized professions;
by no means does everybody have the genetic
wherewithal to become a fine singer or a first-class 
composer or performer of music, or to achieve peak 
performance in sports or in art. 

Biologically, this makes sense; the development 
of cultures and civilizations has not caused the 
diversity of vocations to become smaller; on the 
contrary, this diversity increases by leaps and 
bounds. The biologically adaptive response to this 
situation is obviously a combination of an educa- 
bility or trainability, with an underlying genetic 
diversity to facilitate the division of labor. 

The fallacy of racism. While the variants of 
the tabula rasa theory would make us believe that 
all the observed differences, especially all the dif- 
ferences in behavior, between people are the prod- 
ucts of upbringing and education, the even more 
pernicious fallacy of racism would claim that these 
differences are genetically fixed and largely inde- 
pendent of the environment. One superficially 
plausible argument often given in favor of racist 
views is worth discussing here, because it will en- 
able us to bring out clearly an important feature 
of the evolution of human behavior. It is claimed 
that the variation in psychic or behavioral traits 
among human individuals and races must be 
genetically fixed to about the same extent as it is 
among breeds of domestic animals. Different breeds 
of dogs, horses, or cattle are indeed clearly differ- 
ent in behavior, temperament, disposition, intelli- 
gence, trainability, etc. These differences are very 
largely genetically fixed, although by careful train- 
ing and discipline one can modify them to some 
extent. Why then, it is argued, should the differences
between humans be supposed to be anything
but genetic? 

This argument overlooks a profound dissimi- 
larity between the evolutionary histories of the 
human species and of the animals that man has 
domesticated. The behavior of a breed of a do- 
mestic animal is an essential part of the complex 
of characteristics that are selected to fit this breed 
for its intended use. A work horse should not be- 
have like a race horse, because this would make it 
dangerous and inefficient, and a race horse should 
not behave like a work horse if it is to win any 
races. The laboratory mouse and the laboratory rat 
are sluggish, unaggressive, and apparently dim- 
witted compared to the wild mouse and the wild 
rat. They would hardly survive under the conditions
in which their wild ancestors thrive; it has
been claimed that these species have degenerated 
when man has furnished their food and shelter. It 
is, however, obvious that wild mice and wild rats 
are inconvenient as laboratory animals, and that is 
why they are seldom used in laboratories. 

What would be rated as degenerate in the wild 
state is a desirable trait in an animal living in a 
laboratory cage. Man has seen to it that the genes 
for fixing and stabilizing desirable behavior are es- 
tablished and that the genes for undesirable behav- 
ior are bred out of the animals he has domesticated. 
Although some writers have seen fit to call man a 
“self-domesticated” animal (a designation accepted 
by few biologists), his evolutionary pattern is in 
many ways just the reverse. As previously stated, 
natural selection may favor mutant genes that con- 
fer a developmental fixity on some traits and de- 
velopmental plasticity on others; the latter is the 
case with human behavior. A person who is able 
to learn whatever modes of behavior fit various pro- 
fessions and vocations available in a human so- 
ciety, and to adjust himself to the ways of life that 
go with these professions and vocations, is likely to 
have both a social and a biological advantage. An 
individual set in his ways, always aggressive or 
always yielding, unable and unwilling to learn or 
to be trained, is likely to be discriminated against 
by natural selection. The great developmental plas- 
ticity of psychic traits in man is, thus, no biological 
accident but, on the contrary, a fundamental evo- 
lutionary adaptation that distinguishes man from 
nonhuman animals. 

Ethics. A considerable amount of speculation 
has been devoted to the problem of whether human 
ethics could have arisen through the action of 
natural selection in the evolutionary process. Natu- 
ral selection is obviously not a benevolent spirit 
guiding the evolution but a blind and opportunistic
process. It is opportunistic in the sense that it pro- 
motes the establishment of genes which confer an 
advantage for survival or reproduction when and 
where the selection acts, regardless of whether 
these same genes may be disadvantageous later on. 
The extinction of countless species of organisms 
has been due to such a narrow, overspecialized 
adaptation to the environments that did not endure. 
Now, it is conceivable that natural selection might 
encourage genes for altruistic behavior in a species 
broken up into numerous small colonies or sub- 
populations. “Altruistic” is in this case to be defined 
as a behavior benefiting the group (family, clan, 
tribe) to which the individual so behaving belongs 
and detrimental to that individual himself. A small 
population in which genes for such behavior occur 
may prosper and multiply, despite some of the car- 
riers of these genes sacrificing themselves for the 
sake of their fellows and thus not transmitting their 
genes to their own progeny. Conversely, in a large, 
undivided population, genes for “egotistic” or 
“criminal” behavior may secure an advantage, if 
their carriers survive and leave progeny at the 
expense of other members of the population. 

Another possibility is that altruism and egotism 
are not products of some kind of special genes but, 
rather, products of cultural developments transmit- 
ted not by genes but by learning. C. H. Waddington 
(1960) has argued that natural selection acts to 
make man an “ethicizing being” and an “authority 
acceptor.” Particularly in childhood but also dur- 
ing his entire life, a person is able and even eager 
to acquire, from his parents or from other persons, 
ideas about what is good and what is evil and to 
accept instruction or counsel concerning the desir- 
able ways of living in a society with other human 
beings. According to this view, man is not born 
virtuous or vicious but with a capacity for both 
virtue and vice. Biological evolution does not make 
man ethically better or worse, but it does promote 
intellectual alacrity and perhaps a sensitivity to 
ethical issues.

Future developments. Even more speculative 
and uncertain are the attempts to prognosticate the 
future evolutionary developments of human be- 
havioral traits and capacities. This is evidently a 
part of a more general problem of the evolutionary 
perspectives of the human species. An opinion 
often expressed is that the biological evolution of 
man has virtually completed its course, and from 
now on any further development will be in the cul- 
tural realm. This is true to the extent that cultural 
changes are more rapid than the genetic ones, and 
this is, in fact, the reason why the development of
the capacity for culture has conferred upon the
human species an unprecedentedly high biological 
fitness. It should, however, be kept in mind that the 
maintenance, not to speak of further expansion, of 
cultural capacities is possible only on the basis of 
sound human genetic endowments. Improvement, 
maintenance, and prevention of deterioration of 
these genetic endowments is the task of the applied 
science of eugenics. Many eugenists have been ex- 
tremely pessimistic about the genetic future of 
mankind, believing that the genetic processes 
which go on in human populations trend inexorably 
toward biological disaster. Others have urged vari- 
ous remedial schemes, such as sterilization of the 
unfit, or the artificial insemination of women by 
semen collected from biologically superior donors 
and preserved in frozen condition for extensive use 
over the years. During the first third of the twen- 
tieth century, eugenics was often used as a support 
of ultraconservative social philosophies and racist 
doctrines. All this has made many social scientists, 
and the public at large, properly skeptical and sus-
picious of eugenic schemes. Yet eugenics un-
doubtedly has a sound core; sooner or later man 
will be forced to take the management of his evo- 
lution in his own hands. 

THEODOSIUS DOBZHANSKY


[Other relevant material may be found in EUGENICS; 
GENETICS; PSYCHOLOGY, article on COMPARATIVE
PSYCHOLOGY; and in the biography of DARWIN.] 
See EXCHANGE AND DISPLAY; SPECIALIZATION
AND EXCHANGE. 


EXCHANGE, SOCIAL 
See under INTERACTION. 


EXCHANGE AND DISPLAY 


Exchange refers to the transaction of labor, re- 
sources, products, and services within a society. 
Exchange is not limited to the market economies 
of industrial societies. Market economies appeared 
late in world history and until recently were few 
in number. Even today most of the world’s societies 
do not have fully developed market economies; the 
range extends from those with no market institu- 
tions whatsoever, through those with peripheral 
markets, to those with important but by no means 
fully developed market institutions. For such econ- 
omies as these, it is necessary to consider modes 
of exchange other than market exchange. 

Karl Polanyi (1944; Polanyi et al. 1957) identi- 
fied and defined three modes of exchange: recipro- 
cal, redistributive, and market. The three modes of 
exchange are found singly or in combination in 
the economic organizations of the diverse societies 
of the world. They seem to be capable of clear and 
evident definition whatever the complexities re- 
quired in the analysis of their operation in an eco- 
nomic system. In brief, reciprocity is obligatory gift 
exchange; redistribution is obligatory payment to 
an allocative center; and market exchange is pur- 
chase and sale with reference to a price system. 
The three modes subsume all the types of exchange 
recognized in any society. Transfers or transac- 
tions that cannot be included in these categories 
are considered by a society to be illegitimate or 
wrong for that very reason. 

Most important of all, however, the three modes 
are the functioning aspects of the integrative struc- 
tures of various types of economies. A society can 
be structurally integrated by its social, political, or 
economic organization, or by some combination of 
these organizations. When one of these structures 
predominates or, more rarely, is the sole structure 
present in the society, a clear model type is dis- 
cernible. There are three such clear model types. 
For example, a social economy is one in which the 
social organization integrates economic life; here 
reciprocity is the prevailing mode of exchange. In a 
political economy, the political organization inte- 
grates economic life; redistributive exchange pre- 
vails. In a market-integrated economy, market 
exchange prevails. It follows that there will be 
mixed types of economy ranging through those in 
which two of the three possible structures of the 
society seem to be of about equivalent importance, 
but predominate over the third, to one in which all 
three have a roughly equivalent importance. 

In the comparative study of exchange, the sub- 
ject of gift giving has a historical and continuing 
place. It was Marcel Mauss who first investigated
the socioeconomic nature of gift giving, and his 
ideas form part of the present discussion of recipro- 
cal exchange. 

The subject of display, which has excited analyt- 
ical interest since Thorstein Veblen’s The Theory 
of the Leisure Class (1899), will be discussed in 
relation to the three modes of exchange. It will be 
shown that there are three types of display, each of 
which is specific to one of the three modes of ex-
change and its type of economy and social context. 


Reciprocity and social economy 


Marcel Mauss pointed out the completely obliga- 
tory character of gift exchange. He described it as 
one of the many forms of obligatory reciprocity 
that expressed and maintained the articulated in- 
dividual and group relationships of primitive social 
systems (1925). There existed no “natural,” or 
“pure,” market economy in primitive societies. 
Goods had social or moral value but not some sep- 
arate economic value. They were exchanged in the 
same social way as were courtesies and respects 
or as “entertainments, ritual, military assistance, 
women, children, dances, and feasts; and fairs in 
which the market is but one element and the cir- 
culation of wealth but one part of a wide and 
enduring contract.” Gift exchange, therefore, was 
part of a system of “total prestations,” as Mauss 
termed it, in which individuals in a society had both 
to give and receive in all social exchanges under 
the “sanction of private or open warfare.” If gift 
giving (or any other sort of social exchange) is 
often thought of as free and voluntary, it is because 
the members of a social group far more willingly 
receive and give than they act in any way that 
would threaten their membership in the group. The 
ideas of Mauss continue to be useful but require 
reworking. 

Reciprocity, the receiving and giving of goods 
and services, is built into the human life cycle and 
the social order. Without it the nonproducing young 
could not live and mature to provide the next gen- 
eration with its livelihood and the social order its 
continuity. There would be no cushioning of mis- 
fortune or infirmity, and the world would be with- 
out festivity, hospitality, and benefice. Reciprocity 
as a mode of exchange is a different matter. It is 
not universally an important mode of exchange ex- 
cept in those societies that are called primitive by 
the anthropologist. In nonprimitive folk, peasant, 
or state-organized societies, reciprocities involve 
one or both of the other two modes of exchange. 
For example, for reciprocal gift giving to take place 
in industrial society, it is necessary for the donors 
to go into the market for the gifts they will ex-
change; even the crudest Kindergarten Handarbeit 
will have required for its manufacture some tool 
or material obtained in the market. 

The distinctive character of primitive societies is 
that they are organized by their social structures. 
Primitive societies are without independent eco- 
nomic or political structuring. The economic order 
is “embedded” in the social order as is the political 
order (Polanyi 1944, p. 57). The rigorous study of 
such societies is the study of kinship and its exten- 
sions and affiliations and of the interrelations of 
the roles and social groupings formed on a kinship 
base. Economic aspects can be analyzed out of this 
social matrix, but such analysis will prove only that 
in so socialized a situation economic facts are social 
facts as well, and their ordering follows or is the 
social organization of the group concerned. It is 
proposed here that the economic systems of all such 
groups be called social economies. The use of this 
term sets out the distinguishing features of such 
systems and prevents the confusions that arise 
when the economy of such a group is spoken of as 
if it were a version, albeit a primitive version, of 
the separable and independent economic systems 
referred to by the economist. 

Reciprocity is the prevailing and characteristic 
mode of exchange in the world’s social economies, 
which is to say a majority of the world’s societies 
up to the industrial revolution. Nothing other than 
a social economy existed until the development in 
the Neolithic of some state organizations or politi- 
cal economies. It is only in the past two centuries 
that the intrusions of Western market economies 
or Western political power exerted along economic 
lines have brought about the extinction of almost 
all pure social economies save, perhaps, those in 
Amazonas and the interior of New Guinea. Today, 
therefore, there are few going social economies in 
which reciprocity is the sole mode of exchange; but 
the number in which it is still the dominant mode 
of exchange is not insignificant, containing as it 
does a large number of African tribal societies and 
some Oceanic and American Indian societies. 

A full illustration of how reciprocal exchanges 
follow the lines and groupings of the social struc- 
ture in a social economy cannot be presented here. 
The reader is referred to such classic studies as 
Malinowski’s study of the Trobriand Islanders 
(1922) and Firth’s study of the Tikopia (1939).
Certain famous institutions of primitive cultures, 
such as bridewealth, kula, and the potlatch have 
excited interest precisely because each demonstrates 
the essentially social nature of reciprocal exchange 
involving valued goods. Anthropologists and other 
social scientists have examined and re-examined
these three institutions as somehow exemplary
statements of some key part of the problem of the 
relation between the economic and the noneconomic 
in human society. 

Bridewealth comprises goods, and sometimes 
labor services, transferred from the groom’s social 
group to that of the wife. The institution of bride- 
wealth is widespread in Africa, where it is almost 
always a vitally important institution and quite 
often highly elaborated. The goods used as bride- 
wealth range from cattle to such items as hoes 
and pots. Elaborations occur in the amount of the 
bridewealth; in the number of installments in which 
it is transferred; in the presence, amount, and num- 
ber of reciprocal goods (dowry) the wife may bring
to the marriage; or in “cattle linking,” in which a 
given marriage and transfer of bridewealth in cattle 
is determined by past marriages and predetermines 
future ones. The earlier term used for the institu- 
tion was “brideprice,” but the substitution of the 
term “bridewealth,” suggested by Evans-Pritchard 
(1931, pp. 36-39), is now widely accepted. “Price” 
is a term that cannot be lifted out of the context 
of market economy free of the connotations of pur- 
chase and sale. The problem is therefore set: if 
there is an exchange in which goods of value and 
a woman are transferred, why is this not a matter 
of purchase and sale? The anthropologists’ answer, 
although some are not in agreement on this point 
(Gray 1960), is that this transfer occurs in the 
context of a rearrangement of the social structure. 
The marriage creates new reciprocal roles, with 
new obligations and expectations; the giving of 
bridewealth is more important symbolically than it 
is economically. It is thus part of a vast series of 
social reciprocities and reciprocal exchanges of, as 
Mauss would say, courtesies, respects, women, 
dances, and so on, as well as of goods of value. The 
difference lies precisely in the fact that reciprocal 
exchange is part of continuing social process and 
behavior, while market exchange is an isolated, 
one-time economic act. 

Kula is an institution of reciprocal exchange in 
which permanent partners give and receive recog- 
nized treasure items. There are two types of such 
treasure, which circulate in opposite directions 
around a kula ring made up of a number of island 
societies of the western Pacific. The institution is
best known from the description of the Trobriand 
Islands by Malinowski (1922). The treasure items 
are shell armlets and spondylus shell necklaces, 
which are surrounded by lore and myth and which 
cannot be exchanged for anything besides one an- 
other in the kula ring. Kula partners engage in 
reciprocal exchanges of second-grade items (Bo- 
hannan 1963, p. 236) as an earnest of the generos-
ity with which they will reciprocate gifts of the
treasure items of the greatest possible value in the 
lore and myth that attaches to them. The whole 
affair is fraught with the real and magical hazards 
of long sea voyages by canoe, with elaborate ritual 
and ceremonial attesting to its importance and 
assuring its proper conduct and success, and with 
thrill and wonder about the treasures that have 
been or might be received. There is no question 
about the noneconomic nature of the kula. It is a 
reciprocal exchange devoid of economic motive, 
basis, or gain. This noneconomic interpretation of 
kula itself has been generally accepted, even by 
those who have claimed that kula was a pretext for 
carrying on a higgling pedestrian trade in very 
ordinary goods on the side—never with kula part- 
ners—during kula voyages. The problem kula sets 
is the same whatever emphasis is given to the 
non-kula trading on the side. Why is the institution 
so elaborated, and why are its opportunities so en- 
thusiastically pursued? The only possible answer 
is that the reciprocal exchange of goods without 
economic value, no less than economic goods, ac- 
knowledges social worth by giving content to the 
playing of a social role. In kula, as is sometimes 
the case in bridewealth, there is feedback between 
the recognition of an individual’s social prestige 
within his own society and outside it with his kula 
partners of other societies. 

The potlatch of the Indian societies of the north- 
west coast of North America is an institution of 
reciprocal exchange in which the frequency, size, 
and system of exchanges became so great in some 
cases as to seem not only unique but also bizarre 
and incredible. It was among the Kwakiutl, in the 
first quarter of the twentieth century, that the pot- 
latch reached its most extreme development 
(Codere 1950; 1961). At a potlatch, goods were 
ceremoniously displayed and then distributed by 
the potlatcher to a number of individuals of higher 
and lower social rank. The recipients returned the 
gift with an increment when they potlatched at 
some later date. Although the individuals concerned 
received their potlatch positions through inheri- 
tance, it was only by potlatching that the social 
rank and worth of these positions was maintained 
or enhanced. Enormous inventories of goods were 
involved in the over-all system of potlatching and 
even on the occasion of a single potlatch. The 
Kwakiutl earned Canadian dollars, particularly in 
the commercial fishing industry. These earnings 
were converted into vast quantities of mass-pro- 
duced Western industrial goods, such as cheap 
blankets, clothing, housewares, and so on, all of 
which circulated in the potlatch system in amounts 
far above consumption needs and possibilities, 
along with such traditional non-European goods as
wooden canoes. The great season for potlatching 
was the wintertime, which was largely a time of 
vacation from economic effort. However, the only 
vacation from potlatching was just after giving one. 
The cycle of accumulating property by hard work 
and by making loans at interest began again as did 
the accumulation of new indebtedness through re- 
ceiving goods as new or as returned gifts from 
other potlatchers. As in the case of kula, an im- 
mense lore captured the interest of the people. 
Twenty-five years after the effective end of pot- 
latching, former participants were still recounting 
the details of past potlatches and the titles, rhetoric, 
and historical and mythical background associated 
with them. 

The problem set by the potlatch is by now a 
familiar one with a familiar answer. In a social 
economy, the ends of all efforts are social; and they 
are organized along the lines of the social system 
itself, according to the reciprocal form of all social 
exchanges. 


Redistribution and political economy 


Redistribution is the form of exchange that oc- 
curs when taxes or other exactions are collected 
and reallocated by an administrative center. The 
term is not of the same descriptive accuracy as 
Polanyi’s other two terms, since it does not make 
clear the nature of the exchange that takes place. 
Perhaps a better term would be “politically enforced 
exchange.” It is political power that is behind the 
exaction of goods and services in exchange for 
membership in good standing in the polity along 
with whatever reallocations might be received in 
return. Such reallocations may be in the form of 
benefit from internal and territorial policing, judi- 
cial institutions, and public works or distributions. 

Some redistributive exchange exists in many 
social economies. A rudimentary form of it would 
be the collection of foodstuffs by a headman for 
the giving of a feast in which all share. Redistribu- 
tion also occurs in the relatively few predominantly 
market economies of the mid-nineteenth-century 
industrial nations, since some taxation was present 
in all of them. However, redistribution is the dom- 
inant mode of exchange in the political economies 
that include all those centralized state organiza- 
tions, archaic and recent, from ancient Egypt, medi- 
eval Europe, and pre-Columbian Peru, to precolonial 
African and Asiatic states. In the political economy, 
the market is either relatively undeveloped or sec- 
ondary and subservient to the polity, which, in large 
part, controls distribution of goods. 

The precontact kingdom of Rwanda in east cen- 
tral Africa corresponds to the model of a political 
economy in which redistribution is the dominant
mode of exchange. In Rwanda social life as well as 
economic life was politicized, although it is the 
latter that is of concern here (Codere 1962). The 
masses of the people were peasants who were 
forced to contribute goods and services to the sup- 
port of a vast and complex political administration 
headed by the king, or mwami as he was called. 
Although they are said to have gloried in their 
poverty and subjugation, which is a matter in doubt, 
they received little beyond the minimum realloca- 
tions in return for almost the entirety of their pro- 
duction over and above what was needed for their 
own bare subsistence. From their feudal overlords, 
who were members of the governing caste and often 
simultaneously their local civil or military admin- 
istrators, they received the custody and usufruct 
of a cow. This was probably more important as a 
symbol of being in good standing with the ruling 
caste than it was as an economic asset. From their 
administrators they received military protection, 
courts, and whatever benefits were inherent in the 
idea that the entire fertility and welfare of the land 
and the people depended upon the going order of 
things with the mwami at the head of the central- 
ized state and its administration. The chief redis- 
tribution in Rwanda’s economy was, in fact, the 
contribution made by the masses to the support of 
the few. The ruling caste comprised 15 per cent 
of the total population and was supported at a level 
of living that was always higher, and often astro- 
nomically higher, than that of the masses. The use 
or threat of force perpetuated the system. 

The use of Rwanda as an example is not to sug- 
gest that all political economies have been similarly 
despotic or similarly one-sided in exacting much 
and reallocating very little in exchange. Rwanda, 
however, presents a type case of a political econ- 
omy in which redistribution is the predominant 
mode of exchange. Market exchange was virtually
nonexistent; and reciprocal exchange was cut back 
to a minimum, since the masses had neither the 
kinship organization development nor the economic 
wherewithal for it. In order to achieve such political 
ends as improving and maintaining their own 
power positions within the political hierarchy, the 
ruling caste engaged in a small number of recipro- 
cal exchanges and in some exchanges of a redis- 
tributive character. 


Market exchange and market economies 


In the comparison of modes of exchange in 
diverse societies and economies, the salient features 
of market exchange are purchase and sale at a 
money price determined by the impersonal forces
of supply and demand. It is important to emphasize
the abstract and impersonal nature of market ex-
change as compared to the other two modes.
Features that are the essence of reciprocal and 
redistributive exchange can be, and usually are, 
eliminated as irrelevancies in market exchange. 
These include, for example, the social and political 
roles of the exchangers, the obligatory sociability 
or ceremoniousness connected with the act of ex- 
change, and social and political restrictions and 
specifications concerning which goods are exchange- 
able. To be sure, mutual courtesies and recognition 
of social worth or political position may accompany 
any instance of purchase and sale; but they are not 
necessary. Where they are present, it is either as 
an irrepressible human element or as part of the 
amount on the price tag, adjusted to cover such 
costs as time-consuming courtesy and the elegance 
of the place of sale. 

Although economists have treated most indus- 
trial national economies as though they were 
analogs of the pure market economy model, it is 
those of the nineteenth century that fit the model 
best, since they entailed less regulation for any 
social or political end. Of them, Polanyi wrote 
(1944, p. 57): “Instead of economy being em- 
bedded in social relations, social relations are 
embedded in the economic system”; and he also 
stated that “a market economy is an economic sys- 
tem controlled, regulated and directed by markets 
alone” (p. 68). In a pure market economy, social 
and political relations are economized. It is not that 
social and political relations cease to exist; the 
former are necessary for the continuation of society 
and the latter for the policing of the market and 
enforcement of the principle of contract, by means 
of which it works. Where market exchange prevails, 
social and political organization give way to eco- 
nomic organization. It was in the dehumanizing 
and desocializing consequences of market auton- 
omy that such critics of the system as Marx, 
Engels, Veblen, and Polanyi found their target, just 
as critics of totalitarian regimes have centered on 
the human costs of the politicizing of society. 


Mixed modes of exchange 


Perhaps in no empirical economy in the contem- 
porary world is reciprocity, redistribution, or mar-
ket exchange the sole mode of exchange, although
typically one of the modes may predominate. In 
primitive societies, the usual situation today is a 
mixture of the three modes. This has sometimes 
come about through internal development but more 
often through the political and economic intrusions 
of the expanding industrial nations of the West. 


In primitive societies where developments, espe- 
cially of marketlike exchange, have been indigenous 
and contact influences slight, the three modes of 
exchange have frequently coexisted as separate 
spheres that have remained relatively compart- 
mentalized. These have been termed “multicentric” 
economies by Bohannan and Dalton (1962, p. 3). 
An example would be Trobriand society (Malinow- 
ski 1922; Bohannan 1963, pp. 233-240), in which 
reciprocity is found in kula and kinship relations, 
redistribution in the exchanges of chiefs and others, 
and marketlike exchange, or an imperfect and 
moneyless market exchange, is represented by the 
ordinary trade accompanying the kula and in inter- 
nal island trade. Another example is provided by 
Kwakiutl society (Codere 1950; 1961), as it existed 
from the turn of the century to the depression of 
the 1930s. Reciprocity characterized many ex- 
changes of subsistence goods between kin and vil- 
lagers, as well as exchange between individuals in 
the potlatch system. Market exchange in the Cana- 
dian market economy was the source of most of 
the goods used in potlatching, and some redistribu- 
tion occurred in the special type of potlatching 
involved in the purchase of “coppers.” 

When the three modes of exchange are linked 
and cross-linked in contemporary economies, they 
rarely appear as distinct spheres but are relatively 
integrated by market exchange and the use of 
money. Thus, many economies today, industrial or 
nonindustrial, seem to be genuinely mixed econ- 
omies. Africa furnishes many examples of social 
or social-political economies that have been per-
meated by market exchange. Western industrial 
nations exemplify the process in reverse as their 
nineteenth-century pure market exchange econ- 
omies have been transformed into mixed ones. In 
Africa the European imposition of a money head 
tax on tribesmen forced the African into market 
exchange. By the time a man worked for the Euro- 
pean to earn money to pay his own tax and that of 
his kinsmen, and returned with gifts purchased 
in the market and with money with which to buy 
a cow or cows for bridewealth, the three modes of 
exchange were linked and cross-linked. In the 
Western industrial nations, it was perhaps inevi- 
table that where political power became more 
widely distributed among the people they would use 
it to regulate and use the market economy for 
social ends. For example, the social reciprocity of 
supporting aged kinfolk is in part taken care of by 
the payment of taxes and the redistribution to them 
of money they use in market exchange for pur- 
chases to meet their own needs, for gifts to give
reciprocally, and for redistributive payments such
as property or sumptuary taxes. 


Display 

Veblen’s The Theory of the Leisure Class (1899) 
is the point of departure for all inquiries into the 
meaning and uses of display in human society. His 
terms “conspicuous consumption,” “conspicuous 
waste,” and “conspicuous leisure” are still telling, 
and useful in discussions patterned after his on the 
universal association of display with social position 
and prestige. Contemporary social scientists, how- 
ever, find universals of interest only if they can be 
fruitfully used in explanations of social behavior. 
They are not content to conclude, as Veblen did, 
that display is irrational from an economic point 
of view, or to leave the discussion of display, irra- 
tionality, prestige, and society on a generalized 
level. 

By centering attention on display as it is associ- 
ated with exchange, it seems possible to discern its 
essential role in society. This enables us to explain 
why it takes the forms it does and why it is associ- 
ated with social prestige, political power, or eco- 
nomic importance. 

The most frequent, most important, and most 
elaborate laying out of goods before the eyes of 
others will be connected with exchange and specific 
to the particular mode of exchange. To return to 
the model systems, in social economies, where re- 
ciprocal exchange predominates, the great displays 
are associated with those exchanges. Similar gen- 
eralizations can be made for the political and mar- 
ket economies. The great displays are also great 
events that give visible proof of the productive 
effort and exchange relations that can be mobilized. 
They reaffirm and celebrate the system, even while 
they are part of it. 

Displays connected with reciprocal exchange in 
social economies are frequently modest ones. There 
is not the production capacity in many cases to 
achieve anything much more impressive than suf- 
ficient food and drink for feasting and sufficient 
time to prepare for the festivities and participate 
in them. For example, the Papago Indians of south- 
western North America had a saguaro, or cactus- 
fruit, wine feast, for which everyone produced and 
set out as many pots of wine as possible and then 
got publicly, communally, and “beautifully” drunk.
In their desert environment and with their specific 
technology, this was probably the greatest abun- 
dance they could have produced; it was witnessed 
and enjoyed, and it was a great yearly event. 

At the other extreme, in the rich natural envi- 
ronment of the northwest coast of North America 
the displays connected with potlatching were prob-
ably the greatest of any ever associated with recip- 
rocal exchange in any primitive society. Wooden 
statues were often set up a hundred feet or so
apart, marking the boundaries of the beachfront
display of the goods that had been given away in a
potlatch. Of course, these statues then measured 
the impressiveness of subsequent potlatch displays. 
The eye of the witness was delighted as well as 
impressed by various devices of presentation. Gold 
and silver bracelets, suspended from small stakes 
planted on the beach, stirred in the breeze and 
caught the sunlight. Display was part of the actual 
distribution of goods; every item was publicly and 
often glowingly mentioned as it was given away, 
and every lot, of blankets, for example, was publicly 
counted out. Feasting and entertainment, special 
carved wooden feast dishes, and special costumes 
were all part of the show. 

Whether modest or pretentious, such displays 
showed the social worth of the individuals or 
groups who were responsible for them. Their ability 
and desire to support social relationships for social 
as well as private ends were on view. 

Display connected with redistributive exchange 
consists of processions, panoply, public works, and 
state festivals and entertainments: it is inspiring 
or intimidating depending upon the nature of the 
political power concerned and the onlooker’s or par- 
ticipant’s relation to it. 

In the African kingdom of Rwanda, the greatest 
displays were redistributive. The king’s retinue, 
court, and capital; the review of the royal cattle; 
the performances of the royal dancers; the magnifi- 
cence of the royal progress whenever the mwami 
traveled about; and the conspicuous bounty of 
goods, foodstuffs, milk pots, beer pots, and every- 
thing that poured into his household were all part 
of the pattern. It was repeated on a smaller scale 
for every lesser political figure in the system. 

Other examples of redistributive displays include 
the Panathenaic festival and the Parthenon; the 
Roman triumph; and the Coliseum and its circuses. 
In these societies, which were political economies, 
however different their polities, the displays con- 
nected with redistributive exchange were the most 
important of any displays that were made in the 
society. These displays were part of the redistribu- 
tive exchange itself. The thousands of citizens or 
subjects whose accumulated tax or tribute pay- 
ments made such displays possible may have re- 
ceived little in addition to spectacle. The function 
of such display was the demonstration of political 
power and its ability to organize all aspects of a 
society, including the economic aspect. 


Display connected with market exchange is so 
familiar it requires little discussion. Its means are 
the market place, the fair, the store, the showcase, 
and so on, not forgetting the ubiquitous extensions 
of the market place via printing and electronics.
While the general purpose of such display is to en- 
tice and facilitate purchase, each instance of dis- 
play has the particular purpose of enabling the 
particular goods concerned to compete, on as favor- 
able terms as possible, with all the other goods of 
the market. It is for this reason that its arts and 
artifices are so highly developed, for they must 
make up the difference needed for successful sales 
competition. Display in market economies can be 
institutional in nature and aim, like some of the 
great trade fairs. It can also proclaim the ramifica- 
tions of the system and the general wealth of goods 
it has made available. However, this is a small part 
of the total amount of display connected with 
market exchange; the typical appeal is not “Buy”, 
but “Buy such-and-such!” It can therefore be said 
of the display connected with market exchange that 
its functions and purposes in society are practically 
the same, that is, to maximize and to integrate the 
market. 


Research on exchange along comparative and 
behavioral lines will be increasingly developed by 
anthropologists, if not by economists, since it prom- 
ises a more profound understanding of the eco- 
nomic aspects of society. There is a need for more 
detailed and quantified field study of exchange in 
its full social and cultural context. When such a 
field as that of kinship and social organization has 
been developed to the point of rigorous and quasi- 
mathematical treatment, there seems no good 
reason why the field of exchange, with its clear 
factual basis (something being exchanged for some- 
thing with someone), cannot be reduced to similar 
clarity, detail, and order. 

HELEN CODERE 


[See also ECONOMIC ANTHROPOLOGY; TRADE AND MARKETS.
Other relevant material may be found in the
biographies of MALINOWSKI; MAUSS; POLANYI; VEBLEN.]
I. THE DESIGN OF EXPERIMENTS
II. RESPONSE SURFACES
III. QUASI-EXPERIMENTAL DESIGN


I 
THE DESIGN OF EXPERIMENTS 


In scientific research, the word “experiment” 
often denotes the type of study in which the in- 
vestigator deliberately introduces certain changes 
into a process and makes observations or measure- 
ments in order to evaluate and compare the effects 
of different changes. These changes are called the 
treatments. Common examples of treatments are 
different kinds of stimuli presented to human sub- 
jects or animals or different kinds of situations 
with which the investigator faces them, in order 
to see how they respond. In exploratory work, the 
objective may be simply to discover whether the 
stimuli produce any measurable responses, while 
at a later stage in research the purpose may be to 
verify or disprove certain hypotheses that have
been put forward about the directions and sizes 
of the responses to treatments. In applied work, 
measurement of the size of the response is often 
important, since this may determine whether a 
new treatment is practically useful. 

A distinction is often made between a controlled 
experiment and an uncontrolled observational 
study. In the latter, the investigator does not inter- 
fere in the process, except in deciding which phe- 
nomena to observe or measure. Suppose that it is 
desired to assess the effectiveness of a new teach- 
ing machine that has been much discussed. An 
observational study might consist in comparing the 
achievement of students in those schools that have 
adopted the new technique with the achievement 
of students in schools that have not. If the schools 
that adopt the new technique show higher achieve- 
ment, the objection may be raised that this in- 
crease is not necessarily caused by the machine, 
as the schools that have tried a new method are 
likely to be more enterprising and successful and 
may have students who are more competent and 
better prepared. Examination of previous records 
of the schools may support these criticisms. In a 
proper experiment on the same question, the in- 
vestigator decides which students are to be taught 
by the new machine and which by the standard 
technique. It is his responsibility to ensure that 
the two techniques are compared on students of 
equal ability and degree of preparation, so that 
these criticisms no longer have validity. 

The advantage of the proper experiment over 
the observational study lies in this increased ability 
to elucidate cause-and-effect relationships. Both 
types of study can establish associations between 
a stimulus and a response; but when the investi- 
gator is limited to observations, it is hard to find 
a situation in which there is only one explanation 
of the association. If the investigator can show by 
repeated experiments that the same stimulus is al- 
ways followed by the same response and if he has 
designed the experiments so that other factors that 
might produce this response are absent, he is in 
a much stronger position to claim that the stimulus 
causes the response. (However, there are many
social science fields where true experimentation is 
not possible and careful observational investiga- 
tions are the only source of information.) [See, for 
example, EXPERIMENTAL DESIGN, article on QUASI- 
EXPERIMENTAL DESIGN; OBSERVATION; SURVEY 
ANALYSIS.] 

Briefly, the principal steps in the planning of 
a controlled experiment are as follows. The treatments
must be selected and defined and must be
relevant to the questions originally posed. The ex- 
perimental units to which the treatments are to 
be applied must be chosen. In the social sciences, 
the experimental unit is frequently a single animal 
or human subject. The unit may, however, be a 
group of subjects, for instance, a class in compari- 
sons of teaching methods. An important point is 
that the choice of subjects and of the environ- 
mental conditions of the experiment determine the 
range of validity of the results. 

The next step is to determine the size of the 
sample—the number of subjects or of classes. In 
general, the precision of the experiment increases 
as the sample size increases, but usually a balance 
must be struck between the precision desired and 
the costs involved. The method for allocating treat- 
ments to subjects must be specified, as must the 
detailed conduct of the experiment. Other factors 
that might influence the outcome must be con- 
trolled (by blocking or randomization, as discussed 
later) so that they favor each treatment equally. 
Finally, the responses or criteria by which the 
treatments will be rated must be defined. These 
may be simple classifications or measurements on 
a continuous scale. Like the treatments, the re- 
sponses must be relevant to the questions origi- 
nally posed. 

History. The early history of ideas on the plan- 
ning of experiments appears to have been but little 
studied (Boring 1954). Modern concepts of ex- 
perimental design are due primarily to R. A. Fisher, 
who developed them from 1919 to 1930 in the plan- 
ning of agricultural field experiments at the 
Rothamsted Experimental Station in England. The 
main features of Fisher's approach are as follows 
(randomization, blocking, and factorial experimen- 
tation will be discussed later): 

(1) The requirement that an experiment itself 
furnish a meaningful estimate of the underlying 
variability to which the measurements of the re- 
sponses to treatments are subject. 

(2) The use of randomization to provide these 
estimates of variability. 

(3) The use of blocking in order to balance out 
known extraneous sources of variation.

(4) The principle that the statistical analysis 
of the results is determined by the way in which 
the experiment was conducted. 

(5) The concept of factorial experimentation, 
which stresses the advantages of investigating the 
effects of different factors or variables in a single 
complex experiment, instead of devoting a separate 
experiment to each factor. 
These ideas were stated very concisely by Fisher
in 1925 and 1926 but more completely in 1935. 


Experimental error 


Some sources of experimental error. A major 
problem in experimentation is that the responses 
of the experimental units are influenced by many 
sources of variation other than the treatments. For 
example, subjects differ in their ability to perform 
a task under standard conditions: a treatment that 
is allotted to an unusually capable group of sub- 
jects will appear to do well; the instruments by 
which the responses are measured may be liable 
to errors of measurement; both the applied treat- 
ment and the environment may lack uniformity 
from one occasion to another. 

In some experiments, the effects of subject- 
to-subject variation are avoided by giving every 
treatment to each subject in succession, so that 
comparisons are made within subjects. Even then, 
however, learning, fatigue, or delayed consequences 
of previously applied treatments may influence the 
response actually measured after a particular treat- 
ment. 

The primary consequence of extraneous sources 
of variation, called experimental errors, is a mask- 
ing of the effects of the treatments. The observed 
difference between the effects of two treatments is 
the sum of the true difference and a contribution 
due to these errors. If the errors are large, the 
experimenter obtains a poor estimate of the true 
difference; then the experiment is said to be of 
low precision. 

Bias. It is useful to distinguish between ran- 
dom error and error due to bias. A bias, or system- 
atic error, affects alike all subjects who receive 
a specific treatment. Random error varies from 
subject to subject. In a child growth study in 
which children were weighed in their clothes, a 
bias would arise if the final weights of all children 
receiving one treatment were taken on a cold day, 
on which heavy clothing was worn, while the chil- 
dren receiving a second treatment were weighed 
on a mild day, on which lighter clothing was worn. 
In general, bias cannot be detected in the analysis 
of the results, so that the conclusions drawn by 
statistical methods about the true effects of the 
treatments are misleading. 

It follows that constant vigilance against bias 
is one of the requisites of good experimentation. 
The devices of randomization and blocking, if used 
intelligently, do much to guard against bias. Ad- 
ditional precautions are necessary in certain types 
of experiments. If the measurements are subjective
evaluations or clinical judgments, the expectations
and prejudices of the judges and subjects
may influence the results if it is known which 
treatment any of the subjects received. Conse- 
quently, it is important to ensure, whenever it is 
feasible, that neither the subject nor the person 
taking the measurement knows which treatment 
the subject is receiving; this is called a “double 
blind” experiment. For example, in experiments 
that compare different drugs taken as pills all the 
pills should look alike and be administered in the 
same way. If there is a no-drug treatment, it is 
common practice to administer an inert pill, called 
a placebo, in order to achieve this concealment.

Methods for reducing experimental error. Several
devices are used to remove or decrease bias
and random errors due to extraneous sources of 
variation that are thought to be substantial. One 
group of devices may be called refinements of 
technique. If the response is the skill of the sub- 
ject in performing an unfamiliar task, a major 
source of error may be that subjects learn this 
task at different rates. An obvious precaution is 
to give each subject enough practice to reach his 
plateau of skill before starting the experiment. The 
explanation of the task to the subjects must be 
clear; otherwise, some subjects may be uncertain 
what they are supposed to do. Removal from an 
environment that is noisy and subject to distrac- 
tions may produce more uniform performance. The 
tasks assigned to the subjects may be too easy or 
too hard so that all perform well or poorly under 
any treatment, making discrimination between the 
treatments impossible. The reduction of errors in 
measurement of the response often requires pro- 
longed research. In psychometrics, much of the 
work on scaling is directed toward finding superior
instruments of measurement [see SCALING].

Blocking. In many experiments involving comparisons
between subjects, the investigator knows
that the response will vary widely from subject to 
subject, even under the same treatment. Often it 
is possible to obtain beforehand a measurement 
that is a good predictor of the response of the 
subject. A child’s average score on previous tests 
in arithmetic may predict well how he will per- 
form on an arithmetic test given at the end of a 
teaching experiment. Such initial data can be used 
to increase the precision of the experiment by 
forming blocks consisting of children of approxi- 
mately equal ability. If there are three teaching 
methods, the first block contains the three children 
with the best initial scores. Each child in this block 
is assigned to a different teaching method. The 
second block contains the three next best children,
and so on. The purpose of the blocking is to guar- 
antee that each teaching method is tried on an 
equal number of good, moderate, and poor per- 
formers in arithmetic. The resulting gain in pre- 
cision may be striking. 
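
The blocking procedure just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the pupil labels, the scores, and the function name `blocked_assignment` are hypothetical, and a seeded pseudorandom generator stands in for a printed table of random numbers.

```python
import random

def blocked_assignment(initial_scores, methods, seed=None):
    """Form blocks of pupils with similar initial scores and give each
    pupil in a block a different method, chosen at random.

    initial_scores: dict mapping pupil -> initial score (hypothetical data).
    methods: list of method labels; the number of pupils must be a
             multiple of the number of methods.
    Returns a dict mapping pupil -> (block number, method).
    """
    k = len(methods)
    if len(initial_scores) % k != 0:
        raise ValueError("number of pupils must be a multiple of the number of methods")
    rng = random.Random(seed)
    # Rank pupils from best to worst initial score.
    ranked = sorted(initial_scores, key=initial_scores.get, reverse=True)
    assignment = {}
    for start in range(0, len(ranked), k):
        block = ranked[start:start + k]      # k pupils of similar ability
        shuffled = list(methods)
        rng.shuffle(shuffled)                # random order of methods within the block
        for pupil, method in zip(block, shuffled):
            assignment[pupil] = (start // k + 1, method)
    return assignment

# Hypothetical initial arithmetic scores for nine pupils.
scores = {"p1": 91, "p2": 88, "p3": 85, "p4": 72, "p5": 70, "p6": 69,
          "p7": 55, "p8": 52, "p9": 50}
plan = blocked_assignment(scores, ["M1", "M2", "M3"], seed=1)
```

Every block receives each method exactly once, so each method is tried on one strong, one moderate, and one weak performer; only the allocation within a block is left to chance.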

The term “block” comes from agricultural ex- 
perimentation in which the block is a compact 
piece of land. With human subjects, an arrange- 
ment of this kind is sometimes called a matched 
pairs design (with two treatments) or a matched 
groups design (with more than two treatments). 

A single blocking can help to balance out the 
effects of several different sources of variation. In 
a two-treatment experiment on rats, a block com- 
prising littermates of the same sex equalizes the 
two treatments for age and sex and to some extent 
for genetic inheritance and weight also. If the 
conditions of the experiment are subject to uncon- 
trolled time trends, the two rats in a block can be 
tested at approximately the same time. 

Adjustments in the statistical analysis. Given 
an initial predictor, x, of the final response, y, an 
alternative to blocking is to make adjustments in 
the statistical analysis in the hope of removing the 
influence of variations in x. If x and y represent 
initial and final scores in a test of some type of 
skill, the simplest adjustment is to replace y by 
$y - x$, the improvement in score, as the measure
of response. This change does not always increase
precision. The error variance of $y - x$ for a subject
may be written $\sigma_y^2 + \sigma_x^2 - 2\rho\sigma_y\sigma_x$, where $\rho$ is the
correlation between $y$ and $x$. This is less than $\sigma_y^2$
only if $\rho$ exceeds $\sigma_x/(2\sigma_y)$.
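
The condition on the correlation follows in one step. As a brief worked derivation, writing $\sigma_y$ and $\sigma_x$ for the standard deviations of the final and initial scores and $\rho$ for their correlation (both standard deviations assumed positive):

```latex
\operatorname{Var}(y - x)
  = \sigma_y^2 + \sigma_x^2 - 2\rho\,\sigma_y\sigma_x < \sigma_y^2
\iff \sigma_x^2 < 2\rho\,\sigma_y\sigma_x
\iff \rho > \frac{\sigma_x}{2\sigma_y}.
```

In the common case in which the initial and final scores are equally variable ($\sigma_x = \sigma_y$), the improvement score is therefore more precise than the final score alone only when $\rho$ exceeds 1/2.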

A more accurate method of adjustment is given 
by the analysis of covariance. In this approach, the 
measure of response is $y - bx$. The quantity $b$,
computed from the results of the experiment, is an
estimate of the average change in $y$ per unit increase
in $x$. The adjustment accords with common
sense. If the average $x$ value is three units higher
for treatment A than for treatment B, and if $b$ is
found to be 2/3, the adjustment reduces the difference
between the average $y$ values by two units.
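
A minimal sketch of this adjustment for two treatments follows. It assumes that b is estimated as the pooled within-treatment slope of y on x, which is the usual choice in a one-way analysis of covariance; the function name and the scores are hypothetical. The correction subtracted from the raw difference is b times the difference between the treatment means of x.

```python
def covariance_adjusted_difference(xa, ya, xb, yb):
    """Return (b, raw_difference, adjusted_difference) for treatments A and B.

    b is the pooled within-treatment regression slope of y on x.
    adjusted = (mean yA - mean yB) - b * (mean xA - mean xB).
    """
    def mean(values):
        return sum(values) / len(values)

    sxy = 0.0  # pooled within-treatment sum of products
    sxx = 0.0  # pooled within-treatment sum of squares of x
    for xs, ys in ((xa, ya), (xb, yb)):
        mx, my = mean(xs), mean(ys)
        sxy += sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        sxx += sum((x - mx) ** 2 for x in xs)
    b = sxy / sxx
    raw = mean(ya) - mean(yb)
    adjusted = raw - b * (mean(xa) - mean(xb))
    return b, raw, adjusted

# Hypothetical scores: treatment A starts three units higher on x than B.
b, raw, adjusted = covariance_adjusted_difference(
    xa=[4, 6, 8], ya=[6.0, 7.0, 8.0],
    xb=[1, 3, 5], yb=[1.5, 2.5, 3.5],
)
```

With these invented data the slope is 0.5, so the raw difference of 4.5 is reduced by 0.5 x 3 = 1.5 units, leaving an adjusted difference of 3.0.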

If the relation between y and x is linear, the use 
of a predictor, x, to form blocks gives about the 
same increase in precision as its use in a covari- 
ance analysis. For a more detailed comparison in 
small experiments, see Cox (1957). Blocking by 
means of x may be superior if the relation between 
y and x is not linear. Thus, a covariance adjust- 
ment on x is helpful mainly when blocking has 
been used to balance out some other variable or 
when blocking by means of x is, for some reason,
not feasible. One disadvantage of the covariance
adjustment is that it requires considerable extra
computation. A simpler adjustment such as y - x
is sometimes preferred even at some loss of precision.

Randomization. Randomization requires the
use of a table of random numbers, or an equivalent
device, to decide some step in the experiment,
most frequently the allotment of treatments to 
subjects [see RANDOM NUMBERS]. 

Suppose that three treatments—A, B, C—are to 
be assigned to 90 subjects without blocking. The 
subjects are numbered from 1 to 90. In a two-digit 
column of random numbers, the numbers 01 to 
09 represent subjects 1 to 9, respectively; the num- 
bers 10 to 19 represent subjects 10 to 19, respec- 
tively, and so on. The numbers from 91 to 99 and 
the number 00 are ignored. The 30 subjects whose 
numbers are drawn first from the table are assigned 
to treatment A, the next 30 to B, and the remain- 
ing 30 to C. 
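
The same allocation can be sketched with a pseudorandom generator standing in for the printed table; that substitution, the function name `randomize_groups`, and the seed are assumptions made for illustration. Shuffling the subject numbers plays the role of drawing them from the table in random order.

```python
import random

def randomize_groups(n_subjects, treatments, seed=None):
    """Allot treatments to subjects numbered 1..n_subjects without blocking.

    The first n/k subjects in a random ordering receive the first treatment,
    the next n/k the second, and so on.
    """
    k = len(treatments)
    if n_subjects % k != 0:
        raise ValueError("n_subjects must be a multiple of the number of treatments")
    rng = random.Random(seed)
    order = list(range(1, n_subjects + 1))
    rng.shuffle(order)                      # random order of drawing
    size = n_subjects // k
    return {t: sorted(order[i * size:(i + 1) * size])
            for i, t in enumerate(treatments)}

allocation = randomize_groups(90, ["A", "B", "C"], seed=2024)
```

Fixing the seed makes the allocation reproducible, which is convenient for documenting how an experiment was actually laid out.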

In the simplest kind of blocking, the subjects or 
experimental units are arranged in 30 blocks of 
three subjects each. One in each block is to receive 
A, one B, and one C. This decision is made by 
randomization, numbering the subjects in any 
block from 1 to 3 and using a single column of 
random digits for the draw. 

Unlike blocking, which attempts to eliminate 
the effects of an extraneous source of variation, 
randomization merely ensures that each treatment 
has an equal chance of being favored or handi- 
capped by the extraneous source. In the blocked 
experiment above, randomization might assign the 
best subject in every block to treatment A. The 
probability that this happens is, however, only 1
in $3^{30}$. Whenever possible, blocking should be used
for all major sources of variation, randomization 
being confined to the minor sources. The use of 
randomization is not limited to the allotment of 
treatments to subjects. For example, if time trends 
are suspected at some stage in the experiment, 
the order in which the subjects within a block are 
processed may be randomized. Of course, if time 
trends are likely to be large, blocking should be 
used for them as well as randomization, as illus- 
trated later in this article by the crossover design. 

In his Design of Experiments, Fisher illustrated 
how the act of randomization often allows the in- 
vestigator to carry out valid tests for the treatment 
means without assuming the form of the frequency 
distribution of the data (1935). The calculations, 
although tedious in large experiments, enable the 
experimenter to free himself from the assumptions 
required in the standard analysis of variance. Indeed,
one method of justifying the standard methods
for the statistical analysis of experimental results
is to show that these methods usually give
serviceable approximations to the results of ran- 
domization theory [see NONPARAMETRIC STATISTICS; 
see also Kempthorne 1952]. 
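
As an illustration of the idea, the following is a minimal sketch of an exact randomization test for two treatments in a completely randomized experiment. It is not Fisher's own worked example: the data are invented, and the choice of the absolute difference in means as the test statistic is an assumption. The test enumerates every way the observed responses could have been divided between the two treatments and asks how often the division produces a difference at least as large as the one observed.

```python
from itertools import combinations

def randomization_test(group_a, group_b):
    """Two-sided exact randomization test for a difference in means.

    Returns the proportion of all possible allocations whose absolute
    difference in means is at least as large as the observed one.
    """
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    n_b = len(pooled) - n_a
    total = sum(pooled)

    def mean_difference(sum_a):
        return sum_a / n_a - (total - sum_a) / n_b

    observed = abs(mean_difference(sum(group_a)))
    count = 0
    extreme = 0
    for indices in combinations(range(len(pooled)), n_a):
        count += 1
        diff = abs(mean_difference(sum(pooled[i] for i in indices)))
        if diff >= observed - 1e-12:   # tolerance for floating-point ties
            extreme += 1
    return extreme / count

# Hypothetical responses for three subjects per treatment.
p_value = randomization_test([12, 14, 15], [8, 9, 10])
```

With six subjects there are only 20 possible allocations, and the smallest attainable two-sided probability is 2/20 = 0.1, which is the value obtained here. Complete enumeration quickly becomes impractical as the experiment grows, which is why approximations are used in larger experiments.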

Size of experiment. An important practical de- 
cision is that affecting the number of subjects or 
experimental units to be included in an experi- 
ment. For comparing a pair of treatments there 
are two common approaches to this problem. One 
approach is to specify that the observed difference 
between the treatment means be correct to within 
some amount +d chosen by the investigator. The 
other approach is to specify the power of the test 
of significance of this difference. 

Consider first the case in which the response is
measured on a continuous scale. If $\sigma$ is the standard
deviation per unit of the experimental errors
and if each treatment is allotted to $n$ units, the
standard error of the observed difference between
two treatment means is $\sqrt{2}\,\sigma/\sqrt{n}$ for the simpler
types of experimental design. Assuming that this
difference is approximately normally distributed,
the probability that the difference is in error by
more than $d = 1.96\sqrt{2}\,\sigma/\sqrt{n}$ is about 0.05 (from
the normal tables). The probability becomes 0.01
if $d$ is increased to $2.58\sqrt{2}\,\sigma/\sqrt{n}$. Thus, although
there is no finite $n$ such that the error is certain
to be less than $d$, nevertheless, from the normal
tables, a value of $n$ can be computed to reduce the
probability that the error exceeds $d$ to some small
quantity $\alpha$ such as 0.05. Taking $\alpha = 0.05$ gives
$n = 7.7\sigma^2/d^2 \approx 8\sigma^2/d^2$. The value of $\sigma$ is usually
estimated from previous experiments or preliminary
work on this experiment.
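
The calculation can be sketched as follows, assuming the normal approximation described above. The function name is hypothetical, and the normal deviate is taken from Python's `statistics.NormalDist` rather than from printed tables. Solving d = z * sqrt(2) * sigma / sqrt(n) for n gives n = 2 * (z * sigma / d)^2.

```python
from statistics import NormalDist

def n_for_margin(sigma, d, alpha=0.05):
    """Units per treatment so that the observed difference between two
    treatment means is within +/- d of the truth with probability 1 - alpha.
    """
    z = NormalDist().inv_cdf(1 - alpha / 2)   # 1.96 for alpha = 0.05
    return 2 * (z * sigma / d) ** 2

multiplier_05 = n_for_margin(sigma=1.0, d=1.0)              # multiplier of sigma^2/d^2
multiplier_01 = n_for_margin(sigma=1.0, d=1.0, alpha=0.01)
```

The first multiplier is about 7.7, in agreement with the text; tightening the probability to 0.01 raises it to about 13.3. In practice the result is rounded up to a whole number of units.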

If the criterion is the proportion of units that 
fall into some class (for instance, the proportion 
of subjects who complete a task successfully), the 
corresponding formula for $n$, with $\alpha = 0.05$, is

$$n = 4[p_1(1 - p_1) + p_2(1 - p_2)]/d^2,$$

where $p_1$, $p_2$ are the true proportions of success
for the two treatments and $d$ is the maximum tolerable
error in the observed difference in proportions.
Use of this formula requires advance estimates
of $p_1$ and $p_2$. Fortunately, if these lie between
0.3 and 0.7 the quantity $p(1 - p)$ varies only
between 0.21 and 0.25.

The choice of the value of d should, of course, 
depend on the use to be made of the results, but an 
element of judgment often enters into the decision. 

The second approach (specifying the power) is
appropriate, for instance, when a new treatment is
being compared with a standard treatment and 
when the investigator intends to discard the new 
treatment unless the test of significance shows that 
it is superior to the standard. He does not mind 
discarding the new treatment if its true superiority 
is slight. But if the true difference (new - standard)
exceeds some amount, $\Delta$, he wants the probability
of finding a significant difference to have
some high value, $\beta$ (perhaps 0.95, 0.9, or 0.8).

With continuous data, the required value of $n$
is approximately

$$n = 2\sigma^2(\xi_\alpha + \xi_{1-\beta})^2/\Delta^2,$$

where

$\xi_\alpha$ = normal deviate corresponding to the significance
level, $\alpha$, used in the test of significance,

and

$\xi_{1-\beta}$ = normal deviate for a one-tailed probability
$1 - \beta$.

For instance, if the test of significance is a one-tailed
test at the 5% level and $\beta$ is 0.9, so that
$\xi_\alpha = 1.64$ and $\xi_{1-\beta} = 1.28$, then $n = 17\sigma^2/\Delta^2$. The
values of $\Delta$, $\alpha$, and $\beta$ are chosen by the investigator.
With proportions, an approximate formula is

$$n = 2(\xi_\alpha + \xi_{1-\beta})^2\,\bar{p}\bar{q}/(p_2 - p_1)^2,$$

where $\bar{p} = (p_1 + p_2)/2$, $\bar{q} = 1 - \bar{p}$, and $p_2 - p_1$ is
the size of difference to be detected. One lesson
that this formula teaches is that large samples are
needed to detect small or moderate differences between
two proportions. For instance, with $p_1 = 0.3$,
$p_2 = 0.4$, $\alpha = 0.05$ (two-tailed), and $\beta = 0.8$, the formula
gives $n = 357$ in each sample, or a total of
714 subjects.
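
Both power calculations can be sketched together. The function names are hypothetical, the normal deviates come from `statistics.NormalDist`, and the proportion case is treated as the continuous case with variance equal to the product of the average proportion and its complement. The leading factor 2 is the one that reproduces the figure of 357 per sample quoted above.

```python
from statistics import NormalDist

def _deviate(prob):
    """Normal deviate exceeded with probability 1 - prob."""
    return NormalDist().inv_cdf(prob)

def n_for_power_continuous(sigma, delta, alpha, power, two_tailed=False):
    """Units per treatment needed to detect a true difference delta."""
    z_alpha = _deviate(1 - alpha / 2) if two_tailed else _deviate(1 - alpha)
    z_power = _deviate(power)
    return 2 * sigma ** 2 * (z_alpha + z_power) ** 2 / delta ** 2

def n_for_power_proportions(p1, p2, alpha, power, two_tailed=True):
    """Same calculation with variance p_bar * (1 - p_bar)."""
    p_bar = (p1 + p2) / 2
    sigma = (p_bar * (1 - p_bar)) ** 0.5
    return n_for_power_continuous(sigma, p2 - p1, alpha, power, two_tailed)

# One-tailed 5% test, power 0.9: multiplier of sigma^2 / delta^2.
multiplier = n_for_power_continuous(1.0, 1.0, alpha=0.05, power=0.9)

# Proportions 0.3 versus 0.4, two-tailed 5% test, power 0.8.
n_props = n_for_power_proportions(0.3, 0.4, alpha=0.05, power=0.8)
```

The multiplier comes out near 17.1 (17 when the deviates are rounded to 1.64 and 1.28), and `n_props` is about 357 per sample.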

More accurate tables for n, with proportions and 
continuous data, are given in Cochran and Cox 
(1950) and a fuller discussion of the sample size 
problem in Cox (1958). 

If the investigator is uncertain about the best 
values to choose for $\Delta$, it is instructive to compute
the value of $\Delta$ that will be detected, say with probability
80% or 90%, for an experiment of the size
that is feasible. Some experiments, especially with 
proportions, are almost doomed to failure, in the 
sense that they have little chance of detecting a 
true difference of the size that a new treatment is 
likely to produce. It is well to know this before 
doing the experiment. 
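
This check amounts to inverting the sample size formula for continuous data. A minimal sketch, with a hypothetical function name and the same normal approximation, returns the smallest true difference that an experiment with n units per treatment can be expected to detect.

```python
from math import sqrt
from statistics import NormalDist

def detectable_difference(n, sigma, alpha=0.05, power=0.8, two_tailed=True):
    """Smallest true difference detected with the given power when each
    treatment receives n units and the error standard deviation is sigma.
    """
    nd = NormalDist()
    z_alpha = nd.inv_cdf(1 - alpha / 2 if two_tailed else 1 - alpha)
    z_power = nd.inv_cdf(power)
    return sigma * (z_alpha + z_power) * sqrt(2 / n)

# Continuous response, 50 units per treatment, in units of sigma.
delta_50 = detectable_difference(n=50, sigma=1.0)

# Proportions near 0.35 with 100 subjects per treatment:
# sigma is approximated by sqrt(0.35 * 0.65).
delta_props = detectable_difference(n=100, sigma=sqrt(0.35 * 0.65))
```

With 50 units per treatment, only differences of about 0.56 standard deviations or more are likely to be detected. With 100 subjects per treatment and proportions near 0.35, the detectable difference is about 0.19, so a new treatment expected to raise the success rate by 0.05 or 0.10 would very probably go undetected.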

Controls. Some experiments require a control, 
or comparison, treatment. For a discussion of the 
different meanings of the word “control” and an
account of the history of this device, see Boring 
(1954). In a group of families having a prepaid 
medical care plan, it is proposed to examine the 
effects of providing, over a period of time, addi- 
tional free psychiatric consultation. An intensive 
initial study is made of the mental health and 
social adjustment of the families who are to re- 
ceive this extra service, followed by a similar in- 
ventory at the end. In order to appraise whether 
the differences (final — initial) can be attributed 
to the psychiatric guidance, it is necessary to in- 
clude a control group of families, measured at the 
beginning and at the end, who do not receive this 
service. An argument might also be made for a 
second control group that does not receive the 
service and is measured only at the end. The rea- 
son is that the initial psychiatric appraisal may 
cause some families in the first control group to 
seek psychiatric guidance on their own, thus dilut- 
ing the treatment effect that is to be studied. 
Whether such disturbances are important enough 
to warrant a second control is usually a matter of 
judgment. 

The families in the control groups, like those in 
the treated group, must be selected by randomiza- 
tion from the total set of families available for 
the experiment. This type of evaluatory study presents
other problems. First, it is difficult to conceal the
treatment group to which a family belongs from 
the research workers who make the final measure- 
ments, so that any preconceptions of these workers 
may vitiate the results. Second, the exact nature 
of the extra psychiatric guidance can only be dis- 
covered as the experiment proceeds. It is impor- 
tant to keep detailed records of the services ren- 
dered and of the persons to whom they were given. 


Factorial experimentation 


In many programs of research, the investigator 
intends to examine the effects of several different 
types of variables on some response (for example, 
in an experiment on the accuracy of tracking, the 
effect of speed of the object, the type of motion of 
the object, and the type of handle used by the 
human tracker). In factorial designs, these vari- 
ables are investigated simultaneously in the same 
experiment. The advantages of this approach are 
that it makes economical use of resources and 
provides convenient data for studying the interrelationships
of the effects of different variables.

These points may be illustrated by an experiment
with three factors or variables, A, B, and C,
each at two levels (that is, two speeds of the object,
etc.). Denote the two levels of A by $a_1$ and
$a_2$, and similarly for B and C. The treatments consist
of all possible combinations of the levels of
the factors. There are eight combinations:


(1) $a_1b_1c_1$    (3) $a_1b_2c_1$    (5) $a_1b_1c_2$    (7) $a_1b_2c_2$
(2) $a_2b_1c_1$    (4) $a_2b_2c_1$    (6) $a_2b_1c_2$    (8) $a_2b_2c_2$


Suppose that one observation is taken on each
of the eight combinations. What information do
these give on factor A? The comparison (2) - (1),
that is, the difference between the observations for
combinations (2) and (1), is clearly an estimate
of the difference in response, $a_2 - a_1$, since the
factors B and C are held fixed at their lower levels.
Similarly, (4) - (3) gives an estimate of $a_2 - a_1$,
with B held at its higher level and C at its lower
level. The differences (6) - (5) and (8) - (7)
supply two further estimates of $a_2 - a_1$. The average
of these four differences provides a comparison
of $a_2$ with $a_1$ based on two samples of size four and
is called the main effect of A.

Turning to B, it may be verified that (3) - (1),
(4) - (2), (7) - (5), and (8) - (6) are four
comparisons of $b_2$ with $b_1$. Their average is the
main effect of B. Similarly, (5) - (1), (6) - (2),
(7) - (3), and (8) - (4) provide four comparisons
of $c_2$ with $c_1$.

Thus the testing of eight treatment combina- 
tions in the factorial experiment gives estimates of 
the effects of each of the factors A, B, and C based 
on samples of size four. If a separate experiment 
were devoted to each factor, as in the “one vari- 
able at a time” approach, 24 combinations would 
have to be tested (eight in each experiment) in 
order to furnish estimates based on samples of 
size four. The economy in the factorial approach 
is achieved because every observation contributes 
information on all factors. 

In many areas of research, it is important to 
study the relations between the effects of different 
factors. Consider the following question: Is the 
difference in response between $a_2$ and $a_1$ affected
by the level of B? The comparison


$$(a_2b_2 - a_1b_2) - (a_2b_1 - a_1b_1),$$


where each quantity has been averaged over the
two levels of C, measures the difference between
the response to A when B is at its higher level and
the response to A when B is at its lower level. This
quantity might be called the effect of B on the
response to A. The same expression rearranged as
follows,


$$(a_2b_2 - a_2b_1) - (a_1b_2 - a_1b_1),$$


also measures the effect of A on the response to B. 
It is called the AB two-factor interaction. (Some 
writers introduce a multiplier, 1/2, for conventional
reasons.) The AC and BC interactions are com- 
puted similarly. 

The analysis can be carried further. The AB 
interaction can be estimated separately for the two 
levels of C. The difference between these quanti- 
ties is the effect of C on the AB interaction. The 
same expression is found to measure the effect of 
A on the BC interaction and the effect of B on the 
AC interaction. It is called the ABC three-factor 
interaction. 
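
The main effects and interactions of a three-factor experiment with two levels per factor can be computed directly from the eight observations. The sketch below follows the definitions given in the text (without the optional multiplier of 1/2); the function name and the response values are hypothetical, the responses being constructed so that the correct answers are known in advance.

```python
from itertools import product

def effects_2x2x2(y):
    """y maps (a, b, c), each level 1 or 2, to a single observation."""
    def mean(values):
        values = list(values)
        return sum(values) / len(values)

    def main_effect(position):
        # Average of the four observations at the higher level minus
        # the average of the four at the lower level.
        high = mean(v for k, v in y.items() if k[position] == 2)
        low = mean(v for k, v in y.items() if k[position] == 1)
        return high - low

    def ab_interaction(c_levels):
        # Response to A at the higher level of B minus the response to A
        # at the lower level of B, averaged over the given levels of C.
        def cell(a, b):
            return mean(y[(a, b, c)] for c in c_levels)
        return (cell(2, 2) - cell(1, 2)) - (cell(2, 1) - cell(1, 1))

    return {
        "A": main_effect(0),
        "B": main_effect(1),
        "C": main_effect(2),
        "AB": ab_interaction((1, 2)),
        "ABC": ab_interaction((2,)) - ab_interaction((1,)),
    }

# Hypothetical responses: A adds 2, B adds 3, A and B together add 1 more,
# C adds 0.5, and nothing else interacts.
y = {(a, b, c): 10 + 2 * (a == 2) + 3 * (b == 2)
                + 1 * (a == 2 and b == 2) + 0.5 * (c == 2)
     for a, b, c in product((1, 2), repeat=3)}
effects = effects_2x2x2(y)
```

For these data the main effect of A is 2.5 (the direct effect of 2 plus half of the extra unit that appears only when B is at its higher level), the AB interaction is 1, and the ABC interaction is 0.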

The extent to which different factors exhibit 
interactions depends mostly on the way in which 
nature behaves. Absence of interaction implies 
that the effects of the different factors are mutu- 
ally additive. In some fields of application, main 
effects are usually large relative to two-factor inter- 
actions, and two-factor interactions are large rela- 
tive to three-factor interactions, which are often 
negligible. Sometimes a transformation of the scale 
in which the data are analyzed removes most of 
the interactions [see STATISTICAL ANALYSIS, SPE- 
CIAL PROBLEMS OF, article on TRANSFORMATIONS 
OF DATA]. There are, however, many experiments 
in which the nature and the sizes of the interac- 
tions are of primary interest. 

The factorial experiment is a powerful weapon 
for investigating responses affected by many stimuli.
The number of levels of a factor is not restricted
to two and is often three or four. The chief
limitation is that the experiment may become too 
large and unwieldy to be conducted successfully. 
Fortunately, the supply of rats and university stu- 
dents is large enough so that factorial experiments 
are widely used in research on learning, motiva- 
tion, personality, and human engineering [see, for 
example, TRAITS]. 

Several developments mitigate this problem of 
expanding size. If most interactions may safely be 
assumed to be negligible, good estimates of the 
main effects and of the interactions considered 
likely to be important can be obtained from an 
experiment in which a wisely chosen fraction (say
1/2 or 1/4) of the totality of treatment combinations
is tested. The device of confounding (see Cochran 
& Cox 1950, chapter 6, esp. pp. 183-186; Cox 
1958, sec. 12.3) enables the investigator to use a 
relatively small sized block in order to increase 
precision, at the expense of a sacrifice of informa- 
tion on certain interactions that are expected to 
be negligible. If all the factors represent continuous
variables ($x_1, x_2, \ldots$) and the objective is to
map the response surface that expresses the response,
$y$, as a function of $x_1, x_2, \ldots$, then one of
the designs specially adapted for this purpose may
be used. [For discussion of these topics, see EXPERIMENTAL
DESIGN, article on RESPONSE SURFACES;
see also Cox 1958; Davies 1954.]

In the remainder of this article, some of the 
commonest types of experimental design are out- 
lined. 

Randomized groups. The randomized group 
arrangement, also called the one-way layout, the 
simple randomized design, and the completely ran- 
domized design, is the simplest type of plan. Treat- 
ments are allotted to experimental units at random, 
as described in the discussion of “Randomization,” 
above. No blocking is used at any stage of the 
experiment; and, since any number of treatments 
and any number of units per treatment may be 
employed, the design has great flexibility. If mis- 
haps cause certain of the responses to be missing, 
the statistical analysis is only slightly complicated. 
Since, however, the design takes no advantage of 
blocking, it is used primarily when no criteria for 
blocking are available, when criteria previously 
used for blocking have proved ineffective, or when 
the response is not highly variable from unit to 
unit. 
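
As a minimal sketch of the allotment just described, the following Python fragment assigns treatments to units completely at random, with an arbitrary number of units per treatment. The function name, the treatment labels, the group sizes, and the fixed seed are illustrative assumptions, not part of the article.

```python
import random

def randomized_groups(units, counts, seed=None):
    """Allot treatments to units completely at random.

    `counts` maps each treatment label to the number of units it receives.
    """
    labels = [t for t, n in counts.items() for _ in range(n)]
    if len(labels) != len(units):
        raise ValueError("treatment counts must add up to the number of units")
    rng = random.Random(seed)
    rng.shuffle(labels)  # every arrangement of labels is equally likely
    return dict(zip(units, labels))

# Twelve units, three treatments with unequal group sizes (hypothetical).
plan = randomized_groups(list(range(1, 13)), {"A": 5, "B": 4, "C": 3}, seed=1)
```

Because no blocking is involved, unequal group sizes need no special handling here.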

Randomized blocks. If there are v treatments 
and the units can be grouped into blocks of size v, 
such that units in the same block are expected to 
give about the same final response under uniform 
treatment, then a randomized blocks design is 
appropriate. Each treatment is allotted at random 
to one of the units in any block. This design is, in 
general, more precise than randomized groups and 
is very extensively used. 
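
The within-block allotment can be sketched in the same way. In this hypothetical example each block holds exactly $v$ units, and a fresh random order of the treatments is drawn for every block; the block and unit names are invented for illustration.

```python
import random

def randomized_blocks(blocks, treatments, seed=None):
    """Within every block, allot each treatment to one unit at random."""
    rng = random.Random(seed)
    plan = {}
    for block_id, units in blocks.items():
        if len(units) != len(treatments):
            raise ValueError("each block must hold one unit per treatment")
        order = list(treatments)
        rng.shuffle(order)  # independent randomization for each block
        plan[block_id] = dict(zip(units, order))
    return plan

# Three blocks of four units each, four treatments (hypothetical labels).
blocks = {b: [f"b{b}u{u}" for u in range(1, 5)] for b in range(1, 4)}
block_plan = randomized_blocks(blocks, ["A", "B", "C", "D"], seed=7)
```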

Sometimes the blocks are formed by assessing 
or scoring the subjects on an initial variable related 
to the final response. It may be of interest to exam- 
ine whether the comparative effects of the treat- 
ments are the same for subjects with high scores 
as for those with low scores. This can be done by 
an extension of the analysis of variance appropriate
to the randomized blocks design. For example,
with four treatments, sixty subjects, and fifteen
blocks, the blocks might be classified into
three levels, high, medium, or low, there being five
blocks in each class. A useful partition of the degrees
of freedom (df) in the analysis of variance
of this “treatments × levels” design is as follows:

Source                                  df
Between levels                           2
Between blocks at the same level        12
Treatments                               3
Treatments × levels interactions         6
Treatments × blocks within levels       36
Total                                   59

The mean square for interaction is tested against
the mean square for treatments × blocks within
levels by the usual F-test. Methods for constructing
the levels and the problem of testing the over-all
effects of treatments in different experimental
situations are discussed in Lindquist (1953). [See
LINEAR HYPOTHESES, article on ANALYSIS OF VARIANCE.]
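
The partition of degrees of freedom follows from simple counting, and a short sketch makes the rule explicit for other numbers of treatments, levels, and blocks. The function and key names are assumptions introduced for illustration.

```python
def treatments_by_levels_df(treatments, levels, blocks_per_level):
    """Degrees of freedom for a 'treatments x levels' randomized blocks design."""
    t, l, b = treatments, levels, blocks_per_level
    df = {
        "between levels": l - 1,
        "between blocks at the same level": l * (b - 1),
        "treatments": t - 1,
        "treatments x levels": (t - 1) * (l - 1),
        "treatments x blocks within levels": (t - 1) * l * (b - 1),
    }
    # One observation per treatment per block gives t * l * b observations.
    df["total"] = t * l * b - 1
    return df

# Four treatments, three levels, five blocks per level (sixty subjects).
df = treatments_by_levels_df(treatments=4, levels=3, blocks_per_level=5)
```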

The crossover design. The crossover design is 
suitable for within-subject comparisons in which 
each subject receives all the treatments in succes- 
sion. With three treatments, for example, a plan 
in which every subject receives the treatments in 
the order ABC is liable to bias if there happen to 
be systematic differences between the first, second, 
and third positions, due to time trends, learning, 
or fatigue. One design that mitigates this difficulty 
is the following: a third of the subjects, selected 
at random, get the treatments in the order ABC, 
a third get BCA, and the remaining third get CAB. 
The analysis of variance resembles that for ran- 
domized blocks except that the sum of squares 
representing the differences between the over-all 
means for the three positions is subtracted from 
the error sum of squares. 

The Latin square. A square array of letters 
(treatments) such that each letter appears once 
in every row and column is called a Latin square. 
The following are two 4 × 4 squares.

    (1)          (2)
  C A B D      A B C D
  A B D C      B C D A
  B D C A      D A B C
  D C A B      C D A B


This layout permits simultaneous blocking in two 
directions. The rows and columns often represent 
extraneous sources of variation to be balanced out. 
In an experiment that compared the effects of 
five types of music programs on the output of factory
workers doing a monotonous job, a 5 × 5
Latin square was used. The columns denoted days
of the week and the rows denoted weeks. When 
there are numerous subjects, the design used is 
frequently a group of Latin squares. 

For within-subject comparisons, the possibility 
of a residual or carry-over effect from one period 
to the next may be suspected. If such effects are 
present (and if one conventionally lets columns 
in the above squares correspond to subjects and 
rows correspond to order of treatment) then square 
(1) is bad, since each treatment is always pre- 
ceded by the same treatment (A by C, etc.). By 
the use of square (2), in which every treatment is
preceded once by each of the other treatments, the
residual effects can be estimated and unbiased 
estimates obtained of the direct effects (see Coch- 
ran & Cox 1950, sec. 4.6a; Edwards 1950, pp. 274- 
275). If there is strong interest in the residual 
effects, a more suitable design is the extra-period 
Latin square. This is a design like square (2), in 
which the treatments C, D, A, B in the fourth 
period are given again in a fifth period. 
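
The two properties discussed here, the Latin square property and balance for residual effects, are easy to check mechanically. The sketch below is illustrative only: the two squares are examples chosen to have the properties the text describes (rows as periods, columns as subjects), and the function names are assumptions.

```python
from collections import Counter

def is_latin(square):
    """Each letter appears once in every row and once in every column."""
    n = len(square)
    letters = set(square[0])
    rows_ok = all(len(r) == n and set(r) == letters for r in square)
    cols_ok = all(set(col) == letters for col in zip(*square))
    return len(letters) == n and rows_ok and cols_ok

def predecessor_counts(square):
    """Count (previous, current) treatment pairs down each column.

    Rows are periods (order of treatment); columns are subjects.
    """
    pairs = Counter()
    for col in zip(*square):
        pairs.update(zip(col, col[1:]))
    return pairs

def balanced_for_residuals(square):
    """True if every treatment is preceded exactly once by each other one."""
    n = len(square)
    pairs = predecessor_counts(square)
    return len(pairs) == n * (n - 1) and set(pairs.values()) == {1}

cyclic = ["CABD", "ABDC", "BDCA", "DCAB"]    # A is always preceded by C
balanced = ["ABCD", "BCDA", "DABC", "CDAB"]  # fourth period is C, D, A, B
```

In the first square only four of the twelve possible ordered pairs ever occur, so direct and residual effects cannot be separated; in the second each ordered pair occurs exactly once.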

Balanced incomplete blocks. When the number 
of treatments, v, exceeds the size of block, k, that 
appears suitable, a balanced incomplete blocks 
design is often appropriate. In examining the taste 
preferences of adults for seven flavors of ice cream 
in a within-subject test, it is likely that a subject 
can make an accurate comparison among only 
three flavors before his discrimination becomes 
insensitive. Thus v = 7, k = 3. In a comparison of 
three methods of teaching high school students, 
the class may be the experimental unit and the 
school a suitable block. In a school district, it may 
be possible to find twelve high schools each having 
two classes at the appropriate level. Thus v = 3,
k = 2.

Balanced incomplete blocks (BIB) are an exten- 
sion of randomized blocks that enable differences 
among blocks to be eliminated from the experi- 
mental errors by simple adjustments performed in 
the statistical analysis. Examples for v = 7, k = 3 
and for v = 3, k = 2 are as follows (columns are
blocks):


The basic property of the design is that each 
pair of treatments occurs together (in the same 
block) equally often. 

In both plans shown, it happens that each row 
contains every treatment. This is not generally true 
of BIB designs, but this extra property can some- 
times be used to advantage. With v = 7, for in- 
stance, if the row specifies the order in which the 
types of ice cream are tasted, the experiment is 
also balanced against any consistent order effect. 
This extension of the BIB is known as an incom- 
plete Latin square or a Youden square. In the high 
schools experiment, the plan for v= 3 would be 
repeated four times, since there are twelve schools. 
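
One standard way to obtain plans with these properties is to develop an initial block cyclically. The sketch below is an assumption-laden illustration rather than a reproduction of any printed plan: treatments are numbered 0 to v − 1, the initial blocks {0, 1, 3} (mod 7) and {0, 1} (mod 3) are conventional choices, and the function names are invented.

```python
from collections import Counter
from itertools import combinations

def cyclic_blocks(v, initial_block):
    """Develop an initial block modulo v: block j is the initial block plus j."""
    return [[(x + j) % v for x in initial_block] for j in range(v)]

def pair_counts(blocks):
    """How often each unordered pair of treatments shares a block."""
    counts = Counter()
    for block in blocks:
        counts.update(frozenset(p) for p in combinations(block, 2))
    return counts

def is_balanced(blocks, v):
    """Every pair of treatments occurs together equally often."""
    counts = pair_counts(blocks)
    return len(counts) == v * (v - 1) // 2 and len(set(counts.values())) == 1

def rows_contain_all(blocks, v):
    """Blocks are columns; row i holds the i-th entry of every block."""
    return all(set(row) == set(range(v)) for row in zip(*blocks))

seven = cyclic_blocks(7, [0, 1, 3])  # v = 7, k = 3
three = cyclic_blocks(3, [0, 1])     # v = 3, k = 2
```

When `rows_contain_all` holds, the row position can carry a second classification such as order of tasting, which is the Youden square arrangement.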

Comparisons between and within subjects. 
Certain factorial experiments are conducted so 
that some comparisons are made within subjects 
and others are made between subjects. Suppose 
that the criterion is the performance of the subjects
on an easy task, $T_1$, and a difficult task, $T_2$,
each subject attempting both tasks. This part of
the experiment is a standard crossover design.
Suppose further that these tasks are explained to
half the subjects in a discouraging manner, $S_1$,
and to the other half in a supportive manner, $S_2$.
It is of interest to discover whether these preliminary
suggestions, $S$, have an effect on performance
and whether this effect differs for easy and hard
tasks. The basic plan, requiring four subjects, is
shown in the first three lines of Table 1, where $O$
denotes the order in which the tasks are performed.

The comparison $T_2 - T_1$, which gives the main
effect of $T$, is shown under the treatments line.
This is clearly a within-subject comparison since
each subject carries a + and a −. The main effect
of suggestion, $S_2 - S_1$, is a between-subject comparison:
subjects 3 and 4 carry + signs while
subjects 1 and 2 carry − signs. The $TS$ interaction,
measured by $T_2S_2 - T_1S_2 - T_2S_1 + T_1S_1$, is seen to
be a within-subject comparison.

Since within-subject comparisons are usually 
more precise than between-subject comparisons, 
an important property of this design is that it gives 
relatively high precision on the T and TS effects 
at the expense of lower precision on S. The design 
is particularly effective for studying interactions. 
Sometimes the between-subject factors involve a 
classification of the subjects. For instance, the sub- 
jects might be classified into three levels of anxiety, 
A, by a preliminary rating, with equal numbers of 
males and females of each degree included. In this 
situation, the factorial effects A, S (for sex), and 
AS are between-subject comparisons. Their interac- 
tions with T are within-subject comparisons. 

The example may present another complication. 
Subjects who tackle the hard task after doing the 
easy task may perform better than those who 
tackle the hard task first. This effect is measured 
by a TO interaction, shown in the last line in 
Table 1. Note that the TO interaction turns out to 
be a between-subject comparison. The same is true 
of the TSO three-factor interaction. 
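
Whether a given effect is a between-subject or a within-subject comparison can be read off from the signs it attaches to each subject's two observations. The following sketch encodes the four-subject plan with −1 for the first level of each factor and +1 for the second; this coding and all names are assumptions made for illustration.

```python
# Each subject performs two tasks; entries are (task, suggestion, order),
# coded -1 for the first level (T1, S1, O1) and +1 for the second.
plan = {
    1: [(-1, -1, -1), (+1, -1, +1)],  # T1S1 then T2S1
    2: [(+1, -1, -1), (-1, -1, +1)],  # T2S1 then T1S1
    3: [(-1, +1, -1), (+1, +1, +1)],  # T1S2 then T2S2
    4: [(+1, +1, -1), (-1, +1, +1)],  # T2S2 then T1S2
}

effects = {
    "T": lambda t, s, o: t,
    "S": lambda t, s, o: s,
    "O": lambda t, s, o: o,
    "TS": lambda t, s, o: t * s,
    "SO": lambda t, s, o: s * o,
    "TO": lambda t, s, o: t * o,
    "TSO": lambda t, s, o: t * s * o,
}

def classify(effect):
    """Between-subject if both observations of every subject carry the same
    sign; within-subject if they always carry opposite signs."""
    signs = [[effect(*obs) for obs in runs] for runs in plan.values()]
    if all(a == b for a, b in signs):
        return "between"
    if all(a == -b for a, b in signs):
        return "within"
    return "mixed"

kind = {name: classify(f) for name, f in effects.items()}
```

Under this coding $T$, $O$, $TS$, and $SO$ come out as within-subject comparisons, while $S$, $TO$, and $TSO$ come out as between-subject comparisons.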

In designs of this type, known in agriculture as
split-plot designs, separate estimates of error are
calculated for between-subject and within-subject
comparisons. With $4n$ subjects, the partition of
degrees of freedom in the example is shown in
Table 2 (if it is also desired to examine the $TO$
and $TSO$ interactions).

Table 1

Subject         1              2              3              4
Order        O1     O2      O1     O2      O1     O2      O1     O2
Treatment    T1S1   T2S1    T2S1   T1S1    T1S2   T2S2    T2S2   T1S2
T            −      +       +      −       −      +       +      −
S            −      −       −      −       +      +       +      +
TS           +      −       −      +       −      +       +      −
TO           +      +       −      −       +      +       −      −

Table 2

Source                     df
Between subjects
  S                         1
  TO                        1
  TSO                       1
  Error b               4(n − 1)
Within subjects
  O                         1
  T                         1
  TS                        1
  SO                        1
  Error w               4(n − 1)

Plans and computing instructions for all the 
common types of design are given in Cochran and 
Cox (1950); and Lindquist (1953), Edwards 
(1950), and Winer (1962) are good texts on ex- 
perimentation in psychology and education. 


WILLIAM G. COCHRAN 


[Directly related are the articles under LINEAR HY- 
POTHESES.] 
II
RESPONSE SURFACES 


Response surface methodology is a statistical 
technique for the design and analysis of experi- 
ments; it seeks to relate an average response to the 
values of quantitative variables that affect response. 
For example, response in a chemical investigation 
might be yield of sulfuric acid, and the quantitative 
variables affecting yield might be pressure and
temperature of the reaction. 

In a psychological experiment, an investigator 
might want to find out how a test score achieved 
by certain subjects depended upon duration of the 
period during which they studied the relevant mate- 
rial and the delay between study and test. In math- 
ematical language, the psychologist is interested in 
the presumed functional relationship $\eta = f(\xi_1, \xi_2)$
that expresses the response score, $\eta$, as a function
of the two variables duration, $\xi_1$, and delay, $\xi_2$.
If repeated experiments were made at any fixed
set of experimental conditions, the measured response
would nevertheless vary because of measurement
errors, observational errors, and variability
in the experimental material. We regard $\eta$ therefore
as the mean response at particular conditions;
$y$, the response actually observed in a particular
experiment, differs from $\eta$ because of an (all-inclusive)
error $e$. Thus $y = \eta + e$, and a mathematical
model relating the observed response to the
levels of $k$ variables can be written in the form

(1)    $y = f(\xi_1, \ldots, \xi_k) + e.$

[Figure 1. Geometrical representations of a relationship connecting test scores with study period, delay, and sleep deprivation. Panels (A) to (D); axes: test score (%), study period $\xi_1$, delay $\xi_2$; panel (C) shows test score contours.]



The appropriate investigational strategy depends 
heavily on the state of ignorance concerning the 
functional form, f. At one extreme the investigator 
may not know even which variables, $\xi$, to include
and must make a preliminary screening investiga- 
tion. At the other extreme the true functional form 
may actually be known or can be deduced from a 
mechanistic theory. 

Response surface methods are appropriate in the 
intermediate situation; the important variables are 
known, but the true functional form is neither 
known nor easily deducible. The general procedure 
is to approximate f locally by a suitable function, 
such as a polynomial, which acts as a “mathe- 
matical French curve.” 

Geometric representation of response relation- 
ships. The three curves of Figure 1A, showing a 
hypothetical relationship associating test score with 
study period for three different periods of delay, 
are shown in Figure 1B as sections of a response 
surface. This surface is represented by its response
contours in Figure 1C. Figure 1D shows how a third 
variable may be accommodated by the use of three- 
dimensional contour surfaces. 

Local graduation. It is usually most convenient
to work with coded variables like $x_1 = (\xi_1 - \xi_1^0)/S_1$,
$x_2 = (\xi_2 - \xi_2^0)/S_2$, in which $\xi_1^0$, $\xi_2^0$ are the coordinates
of the center of a region of current interest and
$S_1$ and $S_2$ are convenient scale factors.

Let $\hat{y}$ represent the calculated value of the response
obtained by fitting an approximating function
by the method of least squares [see LINEAR
HYPOTHESES, article on REGRESSION]. In a region
like $R_1$ in Figure 1C an adequate approximation can
be obtained by fitting the first-degree polynomial

(2)    $\hat{y} = b_0 + b_1 x_1 + b_2 x_2.$

The response contours of such a fitted plane are, of
course, equally spaced parallel straight lines. In
a region like $R_2$ a fair approximation might be
achieved by fitting a second-degree polynomial

(3)    $\hat{y} = b_0 + b_1 x_1 + b_2 x_2 + b_{11} x_1^2 + b_{22} x_2^2 + b_{12} x_1 x_2.$

Flexibility of functions like those in (2) and (3)
is greatly increased if the possibility is allowed
that $y$, $x_1$, and $x_2$ are suitably transformed values
of the response and of the variables. For example, it
might be appropriate to analyze log score rather 
than score itself. [Ways of choosing suitable trans- 
formations are described in STATISTICAL ANALYSIS, 
SPECIAL PROBLEMS OF, article on TRANSFORMA- 
TIONS OF DATA; and in Box & Cox 1964 and 
Box & Tidwell 1962.] 


Uses of response surface methodology 

A special pattern of points at which observations 
are to be made is called an experimental design. In 
Figure 1C are shown a first-order design in $R_1$,
suitable for fitting and checking a first-degree
polynomial, and a second-order design in $R_2$, suitable
for fitting and checking a second-degree
polynomial. Response surface methodology has 
been applied (a) to provide a description of how 
the response is affected by a number of varia- 
bles over some already chosen region of interest 
and (b) to study and exploit multiple response re- 
lationships and constrained extrema. In drug 
therapy, for example, the true situation might be 
as depicted in Figure 2. First-degree approximating 
functions fitted to each of the three responses
($\eta_1$, therapeutic effect; $\eta_2$, nausea; and $\eta_3$, toxicity)
could approximately locate the point $P$ where
maximum therapeutic effect is obtained with
nausea and toxicity maintained at the acceptable
limits $\eta_2 = 5$, $\eta_3 = 30$. Response surface methodology
has also been applied (c) to locate and explore
the neighborhood of maximal or minimal response.
Because problems in (c) often subsume
those in (a) and (b), only this application will be 
considered in more detail. 

[Figure 2. Dependence of three responses on combined dosages of drugs.]

Location and exploration of a maximal region.
Various tactics have been proposed to deal with the 
problem of finding where the response surface has 
its maximum or minimum and of describing its 
shape nearby. Because the appropriateness of a 
particular tactic usually depends upon factors that 
are initially unknown, an adaptive strategy of mul- 
tiple iteration must be employed, that is, the in- 
vestigator must put himself in a position to learn 
more about each of a number of uncertainties as 
he proceeds and to modify tactics accordingly. It is 
doubtful whether an adaptive strategy could be 
found that is appropriate to every conceivable re- 
sponse function. One such procedure, which has 
worked well in chemical applications and which 
ought to be applicable in some other areas, is as 
follows: When the initially known experimental 
conditions are remote from the maximum (a parallel
strategy applies in the location of a minimum),
rapid progress is often possible by locally fitting a 
sloping plane and moving in the indicated direc- 
tion of greatest slope to a region of higher response. 
This tactic may be repeated until, when the experi- 
mental sequence has moved to conditions near the 
maximizing ones, additional observations are taken 
and a quadratic (second-order) fit or analysis is 
made to indicate the approximate shape of the re- 
sponse surface in the region of the maximum. 

An example. In this example iteration occurs 
in (A) the amount of replication (to achieve suf- 
ficient accuracy), (B) the location of the region of 
interest, (C) the scaling of the variables, (D) the 
transformation in which the variables are con- 
sidered, and (E) the necessary degree of com- 
plexity of approximating functions and of the cor- 
responding design. The letters A, B, C, etc., are 
used parenthetically to indicate the particular type 
of iteration that is being furthered at any stage. 
Suppose that, unknown to the experimenter, the 
true dependence of percentage yield on tempera- 
ture and concentration is as shown in Figure 3A 
and the experimental error standard deviation is 
1.2 per cent. 

A first-degree approximation. Suppose that five
initial duplicate runs made in random order at
points labeled 1, 2, 3, 4, and 5 in Figure 3B yield
the results $y_1 = 24$, $y_1' = 27$, $y_2 = 38$, $y_2' = 40$, $y_3 = 42$,
$y_3' = 42$, $y_4 = 42$, $y_4' = 41$, $y_5 = 50$, $y_5' = 53$. The
average yields at the five points are then $\bar{y}_1 = 25.5$,
$\bar{y}_2 = 39$, $\bar{y}_3 = 42$, $\bar{y}_4 = 41.5$, $\bar{y}_5 = 51.5$. At this stage
it is convenient to work with the coded variables
$x_1 = (\text{temp.} - 70)/10$ and $x_2 = (\text{conc.} - 42.5)/2.5$.
Using standard least squares theory the coefficients
$b_0$, $b_1$, $b_2$ of equation (2) are then easily estimated
(for example, $b_1 = \tfrac{1}{4}\{-\bar{y}_1 + \bar{y}_2 - \bar{y}_4 + \bar{y}_5\} = 5.9$),
and the locally best-fitting plane is

(4)    $\hat{y} = 39.9 + 5.9 x_1 + 7.1 x_2.$

The differences in the duplicate runs provide an
estimate $s = 1.5$, with five degrees of freedom, of
$\sigma$, the underlying standard deviation. The standard
errors of $b_0$, $b_1$, and $b_2$ are then estimated as 0.5,
and no further replication (A) appears necessary
to obtain adequate estimation of $\eta$.
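
The arithmetic of this first-degree fit can be retraced in a few lines. The sketch below assumes that runs 1, 2, 4, and 5 sit at the corners of a two-level factorial in the coded variables and that run 3 is the center point; this placement is inferred from the contrast quoted for $b_1$ and is not stated explicitly in the text. Variable names are illustrative.

```python
from math import sqrt

# Assumed coded design points (x1, x2) and the duplicate yields from the text.
design = {1: (-1, -1), 2: (1, -1), 3: (0, 0), 4: (-1, 1), 5: (1, 1)}
yields = {1: (24, 27), 2: (38, 40), 3: (42, 42), 4: (42, 41), 5: (50, 53)}

means = {i: sum(y) / len(y) for i, y in yields.items()}

# The columns 1, x1, x2 are mutually orthogonal for this design, so each
# least squares coefficient is sum(x * ybar) / sum(x * x).
b0 = sum(means.values()) / len(means)
b1 = (sum(design[i][0] * means[i] for i in design)
      / sum(x1 * x1 for x1, _ in design.values()))
b2 = (sum(design[i][1] * means[i] for i in design)
      / sum(x2 * x2 for _, x2 in design.values()))

# Pure-error estimate of sigma^2 from the five pairs of duplicates (5 df):
# each pair contributes (difference)^2 / 2.
s2 = sum((a - b) ** 2 / 2 for a, b in yields.values()) / len(yields)
s = sqrt(s2)

# Each plotted mean is an average of two runs, so its variance is s2 / 2.
se_b0 = sqrt(s2 / 2 / 5)
se_b1 = sqrt(s2 / 2 / 4)  # the same value applies to b2
```

Before rounding, the coefficients are 39.9, 5.875, and 7.125, and both standard errors round to 0.5.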

Checking the fit. To check the appropriateness
of the first-degree equation it would be sensible to
look at the size of second-order effects. For reasons
of experimental economy a first-order design usually
contains points at too few distinct levels to allow
separate estimation of all second-order terms. The
design may be chosen, however, so as to allow
estimates of “specimen” second-order coefficients
or combinations thereof. In the present case estimates
can be made of $b_{12} = \tfrac{1}{4}(\bar{y}_1 - \bar{y}_2 - \bar{y}_4 + \bar{y}_5) = -0.9 \pm 0.5$
and $(b_{11} + b_{22}) = \tfrac{1}{4}(\bar{y}_1 + \bar{y}_2 + \bar{y}_4 + \bar{y}_5) - \bar{y}_3 = -2.6 \pm 1.2$.
Some inadequacy of the first-degree
equation is indicated, therefore, but this is
tentatively ignored because of the dominant magnitude
of $b_1$ and $b_2$.
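
The two check quantities and their standard errors can be recomputed from the five averages. This is a sketch under two assumptions: run 3 is the center point with the other four runs at the corners of the coded square, and the error variance is taken as $s^2 = 2.3$, the unrounded value that the duplicate differences give.

```python
from math import sqrt

ybar = {1: 25.5, 2: 39.0, 3: 42.0, 4: 41.5, 5: 51.5}  # averages from the text
s2_mean = 2.3 / 2  # variance of an average of two runs (assumed s^2 = 2.3)

# Interaction check: a contrast of the four corner averages.
b12 = (ybar[1] - ybar[2] - ybar[4] + ybar[5]) / 4
se_b12 = sqrt(4 * s2_mean / 16)

# Curvature check: average of the corners minus the center average.
b11_plus_b22 = (ybar[1] + ybar[2] + ybar[4] + ybar[5]) / 4 - ybar[3]
se_curvature = sqrt(4 * s2_mean / 16 + s2_mean)
```

The curvature estimate is about twice its standard error, which is the mild warning sign the text refers to.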

Steepest ascent. It is now logical to explore (B)
higher temperatures and concentrations. The points
6, 7, and 8 are along a steepest ascent path obtained
by changes proportional to $b_1 \times S_1 = 5.9 \times 10^\circ = 59^\circ$
in temperature and $b_2 \times S_2 = 7.1 \times 2.5\% = 17.75\%$
in concentration. Suppose that $y_6 = 59$,
$y_7 = 63$, and $y_8 = 50$. Graphical interpolation indicates
that the highest yield on this path is between
runs 6 and 7, and this is chosen (B) as the center
of the new region to be explored.
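
The direction of the path can be expressed as a pair of increments in the natural units. The sketch below only computes increments relative to the design center; the choice of 10-degree temperature steps is a hypothetical spacing for illustration and is not taken from the article.

```python
b = (5.9, 7.1)       # fitted slopes b1, b2 from equation (4)
scale = (10.0, 2.5)  # S1 (degrees) and S2 (per cent) per coded unit

# Changes in the natural units are proportional to b_i * S_i.
direction = tuple(bi * si for bi, si in zip(b, scale))

def concentration_step(delta_temp):
    """Concentration change that accompanies a chosen temperature change."""
    return delta_temp * direction[1] / direction[0]

# Increments from the center for three equally spaced steps (assumed spacing).
path = [(k * 10.0, concentration_step(k * 10.0)) for k in range(1, 4)]
```

Each 10-degree rise in temperature is accompanied by a rise of about 3 per cent in concentration.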

The path calculated as above is at right angles 
to contours of the fitted plane when 10-degree units 
of temperature and 2.5 per cent units of concentration
are represented by the same distances. That
the experimenter currently regards these units as 
appropriate is implied by his choice of levels in 
the design. 

Scaling correction. To correct unsuitable scal- 
ing (C) the investigator can adopt the rule that if 
a variable produces an effect that is small com- 
pared with that produced by the other variables, the 
center level for that variable is moved away from 
the calculated path and a larger change is made 
for this variable in the next set of runs. No change 
of relative scaling is indicated here, but progress 
up the surface would normally be accompanied by 
reduction in the sizes of $b_1$ and $b_2$. Also, the checks
have already indicated that second-order effects
can scarcely be estimated with adequate accuracy
in the present scaling. Thus, wider ranges in both
variables should be employed in a second design.

[Figure 3. A true response surface and three successive design patterns. Panels: (A) true response surface; (B) first-order design with steepest ascent path; (C) second-order design with fitted contours; (D) final design in canonical variables $X_1$ and $X_2$.]

A second-degree approximation. A widened
first-order design at the new location might give
$y_9 = 50$, $y_{10} = 66$, $y_{11} = 66$, $y_{12} = 63$, and $y_{13} = 52$,
as in Figure 3C.

Then $\hat{y} = 59.4 + 1.3 x_1 - 0.3 x_2$ is the best-fitting
plane, in which $x_1$ is given by $(\text{temp.} - 90)/15$,
$x_2$ is given by $(\text{conc.} - 18.75)/3.75$, and the estimated
standard error of the coefficients is about 0.8.
In the new scaling the check quantities are now
$b_{12} = -6.75 \pm 0.8$ and $b_{11} + b_{22} = -8.25 \pm 1.7$. It is
clear, without this time duplicating the design,
that first-order terms no longer dominate, and no
worthwhile further progress can be made by ascent
methods. To make possible the fitting and checking
of a second-degree polynomial (E), five additional
observations might be taken, say $y_{14} = 54$, $y_{15} = 54$,
$y_{16} = 57$, $y_{17} = 65$, $y_{18} = 55$. The last ten observations
now form a second-order design. A second-degree
equation fitted to these observations gives

(5)    $\hat{y} = 65.50 + 1.16 x_1 + 0.05 x_2 - 4.31 x_1^2 - 4.81 x_2^2 - 6.75 x_1 x_2.$
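
A least squares fit of the second-degree polynomial can be sketched with the normal equations. The coded coordinates used below are an assumption: a central composite layout with corner runs 9, 10, 12, 13, center runs 11 and 17, and axial runs 14, 15, 16, 18 at distance $\sqrt{2}$. The text does not list the coordinates, but this layout reproduces the coefficients of equation (5) to the rounding shown. The helper names are illustrative.

```python
from math import sqrt

R2 = sqrt(2.0)
# run: (x1, x2, observed yield); coordinates are assumed, yields are from the text.
runs = {
    9: (-1, -1, 50), 10: (1, -1, 66), 11: (0, 0, 66), 12: (-1, 1, 63),
    13: (1, 1, 52), 14: (0, -R2, 54), 15: (-R2, 0, 54), 16: (R2, 0, 57),
    17: (0, 0, 65), 18: (0, R2, 55),
}

def model_row(x1, x2):
    """Terms of the second-degree polynomial: 1, x1, x2, x1^2, x2^2, x1*x2."""
    return [1.0, x1, x2, x1 * x1, x2 * x2, x1 * x2]

def solve(a, rhs):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    m = [row[:] + [r] for row, r in zip(a, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x

X = [model_row(x1, x2) for x1, x2, _ in runs.values()]
y = [obs for _, _, obs in runs.values()]

# Normal equations (X'X) b = X'y give the least squares coefficients.
XtX = [[sum(r[i] * r[j] for r in X) for j in range(6)] for i in range(6)]
Xty = [sum(r[i] * yi for r, yi in zip(X, y)) for i in range(6)]
b0, b1, b2, b11, b22, b12 = solve(XtX, Xty)
```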

The design allows a check on the adequacy (E)
of the second-degree equation by providing estimates
of certain “specimen” combinations of third-order
terms

$b_{111} - b_{122} = \tfrac{1}{4}\{y_9 - y_{10} + y_{12} - y_{13} - \sqrt{2}\,y_{15} + \sqrt{2}\,y_{16}\} = -0.19 \pm 1.03$

and

$b_{222} - b_{112} = \tfrac{1}{4}\{y_9 + y_{10} - y_{12} - y_{13} - \sqrt{2}\,y_{14} + \sqrt{2}\,y_{18}\} = -0.60 \pm 1.03.$


The estimated standard errors of the linear coefficients
$b_1$ and $b_2$, of the quadratic coefficients $b_{11}$
and $b_{22}$, and of the interaction coefficient $b_{12}$ are,
respectively, 0.52, 0.62, and 0.73.

Before an attempt is made to interpret equation 
(5) there must be some assurance (A) that the 
change in response it predicts is large compared 
with the standard error of that prediction. For a 
design requiring $N$ observations and an approximating
equation containing $p$ constants, the average
variance of the $N$ calculated responses $\hat{y}$ is
$(p/N)s^2 = (6/12) \times 2.1 = 1.1$ for this example.
The square root (1.0 for this example) gives an
“average” standard error for $\hat{y}$. This may be compared
with the range of the predicted $\hat{y}$'s, which is
17.08, the highest predicted value being $\hat{y}_{11} = \hat{y}_{17} =
65.50$ and the lowest $\hat{y}_9 = 48.42$.

A more precise indication of adequacy may be 
obtained by an application of the analysis of vari- 
ance, but a discussion of this is outside the scope of 
the present account [see LINEAR HYPOTHESES, ar- 
ticle on ANALYSIS OF VARIANCE]. It is to be noted, 
however, that bare statistical significance of the re- 
gression would not ensure that the response sur- 
face was estimated with sufficient accuracy for the 
interpretation discussed below. 

Interpretation. Once adequate fit and precision 
have been obtained, a contour plot of the equation 
over the region of the experimental design is help- 
ful in interpretation. Especially where there are 
more than two variables, interpretation is further 
facilitated by writing the second-degree equation 
in canonical form (D). In most cases, this means 
that the center of the quadratic system is chosen 
as a new origin, and a rotation of axes is per- 
formed to eliminate cross-product terms. 
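
For two variables the canonical reduction can be carried out in closed form. The sketch below applies the usual steps (locate the stationary point, then take the eigenvalues of the matrix of second-order coefficients) to the rounded coefficients of equation (5); the variable names and the use of Cramer's rule are choices made for illustration.

```python
from math import sqrt

# Rounded coefficients of equation (5).
b0, b1, b2, b11, b22, b12 = 65.50, 1.16, 0.05, -4.31, -4.81, -6.75

# Second-order part as a symmetric matrix B = [[b11, h], [h, b22]], h = b12 / 2.
h = b12 / 2
det = b11 * b22 - h * h

# Stationary point: the gradient b + 2*B*x vanishes; solved by Cramer's rule.
xs1 = (-b1 * b22 + h * b2) / (2 * det)
xs2 = (-b2 * b11 + h * b1) / (2 * det)
ys = b0 + 0.5 * (b1 * xs1 + b2 * xs2)  # fitted response at the stationary point

# Eigenvalues of B are the coefficients of the canonical form
#   y_hat = ys + lam1 * X1**2 + lam2 * X2**2,
# where X1, X2 are the rotated coordinates measured from the stationary point.
mean = (b11 + b22) / 2
spread = sqrt(((b11 - b22) / 2) ** 2 + h * h)
lam1, lam2 = mean + spread, mean - spread
```

Both eigenvalues are negative (about −1.18 and −7.94), so the fitted surface has a maximum, near $x_1 = 0.29$, $x_2 = -0.20$; the much smaller first eigenvalue indicates that the response falls away slowly along one canonical axis.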

In a final group of experiments the new canonical
axes and scales could be used to position the
design. In Figure 3D the design points are chosen
so that they roughly follow a contour and make a
rather precise final fitting possible.

It might be asked, Why not simply use the 
twenty or so experimental points to cover the re- 
gion shown in Figure 3A with some suitable grid 
in the first place? The answer is that it is not 
known initially that the region of interest will be 
in the area covered by that diagram. The “content” 
of the space to be explored goes up rapidly as the 
number of dimensions is increased. 

Suitable designs. From the foregoing discus- 
sion it will be clear that the arrangements of ex- 
perimental points suitable for response surface 
study should satisfy a number of requirements. 
Ideally, a response surface design should (1) allow
$\hat{y}(x)$ to be estimated throughout the region of interest,
$R$; (2) ensure that $\hat{y}(x)$ is as “close” as possible
to $\eta(x)$; (3) give good detectability of lack
of fit; (4) allow transformations to be fitted; 
(5) allow experiments to be performed in blocks; 
(6) allow designs of increasing order to be built up 
sequentially; (7) provide an internal estimate of 
error; (8) be insensitive to wild observations; 
(9) require a minimum number of experimental 
points; (10) provide patterning of data allowing 
ready visual appreciation; (11) ensure simplicity 
of calculation; and (12) behave well when errors 
occur in settings of the x’s. 

A variety of designs have been developed, many 
of which have remarkably good over-all behavior 
with respect to these requirements. When maxi- 
mum economy in experimentation is essential, de- 
signs that fail to meet certain of these criteria may 
have to be used at some increased risk of being 
misled. 

G. E. P. Box 
III
QUASI-EXPERIMENTAL DESIGN 


The phrase “quasi-experimental design” refers to 
the application of an experimental mode of analysis 
and interpretation to bodies of data not meeting 
the full requirements of experimental control. The 
circumstances in which it is appropriate are those 
of experimentation in social settings—including 
planned interventions such as specific communica- 
tions, persuasive efforts, changes in conditions and 
policies, efforts at social remediation, etc.—where 
complete experimental control may not be possible. 
Unplanned conditions and events may also be 
analyzed in this way where an exogenous variable 
has such discreteness and abruptness as to make 
appropriate its consideration as an experimental 
treatment applied at a specific point in time to a 
specific population. When properly done, when at- 
tention is given to the specific implications of the 
specific weaknesses of the design in question, quasi- 
experimental analysis can provide a valuable ex- 
tension of the experimental method. 

History of quasi-experimental design. While 
efforts to interpret field data as if they were actu- 
ally experiments go back much further, the first 
prominent methodology of this kind in the social 
sciences was Chapin’s ex post facto experiment 
(Chapin & Queen 1937; Chapin 1947; Greenwood 
1945), although it should be noted that because of 
the failure to control regression artifacts, this mode 
of analysis is no longer regarded as acceptable. 
The American Soldier volumes (Stouffer et al. 
1949) provide prominent analyses of the effects of 
specific military experiences, where it is implausible 
that differences in selection explain the results. 
Thorndike’s efforts to demonstrate the effects of 
specific coursework upon other intellectual achieve- 
ments provide an excellent early model (for exam- 
ple, Thorndike & Woodworth 1901; Thorndike & 
Ruger 1923). Extensive analysis and review of this
literature are provided elsewhere (Campbell 1957; 
1963; Campbell & Stanley 1963) and serve as the 
basis for the present abbreviated presentation. 


True experimentation. The core requirement 
of a true experiment lies in the experimenter's 
ability to apply experimental treatments in complete
independence of the prior states of the materials
(persons, etc.) under study. This independence
makes resulting differences interpretable as effects 
of the differences in treatment. In the social sci- 
ences the independence of experimental treatment 
from prior status is assured by randomization in 
assignments to treatments. Experiments meeting 
these requirements, and thus representing true ex- 
periments, are much more possible in the social 
sciences than is generally realized. Wherever, for 
example, the treatments can be applied to individ- 
uals or small units, such as precincts or classrooms, 
without the respondents being aware of experimen- 
tation or that other units are getting different treat- 
ments, very elegant experimental control can be 
achieved. An increased acceptance by adminis- 
trators of randomization as the democratic method 
of allocating scarce resources (be these new hous- 
ing, therapy, or fellowships) will make possible 
field experimentation in many settings. Where in- 
novations are to be introduced throughout a social 
system and where the introduction cannot, in any
event, be simultaneous, a use of randomization in 
the staging can provide an experimental compari- 
son of the new and the old, using the groups receiv- 
ing the delayed introduction as controls. 

Validity of quasi-experimental analyses. Noth- 
ing in this article should be interpreted as minimiz- 
ing the importance of increasing the use of true 
experimentation. However, where true experimen- 
tal design with random assignment of persons to 
treatments is not possible, because of ethical con- 
siderations or lack of power, or infeasibility, appli- 
cation of quasi-experimental analysis has much to 
offer. 

The social sciences must do the best they can 
with the possibilities open to them. Inferences must 
frequently be made from data obtained under cir- 
cumstances that do not permit complete control. 
Too often a scientist trained in experimental meth- 
od rejects any research in which complete control 
is lacking. Yet in practice no experiment is per- 
fectly executed, and the practicing scientist over- 
looks those imperfections that seem to him to offer 
no plausible rival explanation of the results. In the
light of modern philosophies of science, no experiment
ever proves a theory; it merely probes it.
Seeming proof results from that condition in which 
there is no available plausible rival hypothesis to 
explain the data. The general program of quasi- 
experimental analysis is to specify and examine 
those plausible rival explanations of the results that 
are provided by the uncontrolled variables. A failure
to control that does not in fact lend plausibility
to a rival interpretation is not regarded as
invalidating. 

It is well to remember that we do make assured 
causal inferences in many settings not involving 
randomization: the earthquake caused the brick 
building to crumble; the automobile crashing into 
the telephone pole caused it to break; the language 
patterns of the older models and mentors caused 
this child to speak English rather than Kwakiutl; 
and so forth. While these are all potentially errone- 
ous inferences, they are of the same type as experi- 
mental inferences. We are confident that were we 
to intrude experimentally, we could confirm the 
causal laws involved. Yet they have been made 
assuredly by a nonexperimenting observer. This 
assurance is due to the effective absence of other 
plausible causes. Consider the inference about the 
crashing auto and the telephone pole: we rule out 
combinations of termites and wind because the 
other implications of these theories do not occur 
(there are no termite tunnels and debris in the 
wood, and nearby weather stations have no records 
of heavy wind). Spontaneous splintering of the pole
by happenstance coincident with the auto’s onset 
does not impress us as a rival, nor would it explain 
the damage to the car, etc. Analogously in quasi- 
experimental analysis, tentative causal interpreta- 
tion of data may be made where the interpretation 
in question is consistent with the data and where 
other rival interpretations have been rendered im- 
plausible. 

Dimensions of experimental validity. A set of 
twelve dimensions, representing frequent threats 
to validity, has been developed for the evaluation
of data as quasi-experiments. These may be re- 
garded as the important classes of frequently plau- 
sible rival hypotheses that good research design 
seeks to rule out. Each will be presented briefly 
even though not all are employed in the evaluation 
of the designs used illustratively here. 

Fundamental to this listing is a distinction be- 
tween internal validity and external validity. Inter- 
nal validity is the basic minimum without which 
any experiment is uninterpretable: Did in fact the 
experimental treatments make a difference in this 
specific experimental instance? External validity 
asks the question of generalizability: To what pop- 
ulations, settings, treatment variables, and meas- 
urement variables can this effect be generalized? 
Both types of criteria are obviously important, even 
though they are frequently at odds in that features 
increasing one may jeopardize the other. While
internal validity is the sine qua non, and while the
question of external validity, like the question of 
inductive inference, is never completely answer- 
able, the selection of designs strong in both types 
of validity is obviously our ideal. 

Threats to internal validity. Relevant to inter- 
nal validity are eight different classes of extraneous 
variables that if not controlled in the experimental 
design might produce effects mistaken for the effect 
of the experimental treatment. These are the fol- 
lowing. (1) History: other specific events in addi- 
tion to the experimental variable occurring between 
a first and second measurement. (2) Maturation: 
processes within the respondents that are a func- 
tion of the passage of time per se (not specific to 
the particular events), including growing older, 
growing hungrier, growing tireder, and the like. 
(3) Testing: the effects of taking a test a first time 
upon subjects’ scores in subsequent testing. (4) In- 
strumentation: the effects of changes in the cal- 
ibration of a measuring instrument or changes in 
the observers or scorers upon changes in the ob- 
tained measurements. (5) Statistical regression: 
operating where groups of subjects have been 
selected on the basis of their extreme scores. (6) 
Selection: biases resulting in differential recruit- 
ment of respondents for the comparison groups. 
(7) Experimental mortality: the differential loss of 
respondents from the comparison groups. (8) 
Selection—maturation interaction: in certain of the 
multiple-group quasi-experimental designs, such as 
the nonequivalent control group design, an inter- 
action of maturation and differential selection is 
confounded with, that is, might be mistaken for, 
the effect of the experimental variable [see LINEAR 
HYPOTHESES, article on REGRESSION; SAMPLE SUR- 
VEYS]. 

Threats to external validity. Factors jeopardiz- 
ing external validity or representativeness are: 
(1) The reactive or interaction effect of testing, in 
which a pretest might increase or decrease the 
respondent's sensitivity or responsiveness to the 
experimental variable and thus make the results 
obtained for a pretested population unrepresenta- 
tive of the effects of the experimental variable for 
the unpretested universe from which the experi- 
mental respondents were selected. (2) Interaction 
effects between selection bias and the experimental 
variable. (3) Reactive effects of experimental ar- 
rangements, which would preclude generalization 
about the effect of the experimental variable for 
persons being exposed to it in nonexperimental 
settings. (4) Multiple-treatment interference, a 
problem wherever multiple treatments are applied 
to the same respondents, and a particular problem
for one-group designs involving equivalent time 
samples or equivalent materials samples. 

Types of quasi-experimental design. Some com- 
mon types of quasi-experimental design will be 
outlined here. 

One-group pretest-posttest design. Perhaps the
simplest quasi-experimental design is the one-group
pretest-posttest design, $O_1 \; X \; O_2$ (O represents
measurement or observation, X the experimental
treatment). This common design patently leaves
uncontrolled the threats to internal validity of history,
maturation, testing, instrumentation, and, if subjects
were selected on the basis of extreme scores
on $O_1$, regression. There may be situations in which
the investigator could decide that none of these 
represented plausible rival hypotheses in his set- 
ting: A log of other possible change-agents might 
provide no plausible ones; the measurement in 
question might be nonreactive (Campbell 1957), 
the time span too short for maturation, too spaced 
for fatigue, etc. However, the sources of invalidity 
are so numerous that a more powerful quasi- 
experimental design would be preferred. Several of 
these can be constructed by adding features to this 
simple one. 

Interrupted time-series design. The interrupted 
time-series experiment utilizes a series of measure- 
ments providing multiple pretests and posttests, for 
example: 


$O_1 \; O_2 \; O_3 \; O_4 \; X \; O_5 \; O_6 \; O_7 \; O_8$.


If in this series, $O_4$–$O_5$ shows a rise greater than
found elsewhere, then maturation, testing, and regression
are no longer plausible, in that they would
predict equal or greater rises for $O_1$–$O_2$, etc.
Instrumentation may well be controlled too, although in
institutional settings a change of administration 
policy is often accompanied by a change in record- 
keeping standards. Observers and participants may 
be focused on the occurrence of X and may spe- 
ciously change rating standards, etc. History re- 
mains the major threat, although in many settings 
it would not offer a plausible rival interpretation. 
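
The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the eight observations are hypothetical, and the function names (successive_changes, interruption_contrast) are introduced here rather than taken from any source. The sketch separates the change spanning X from all other successive changes, so that the reader can see whether the rise at X is exceptional.

```python
from statistics import mean, stdev


def successive_changes(series):
    """Changes between consecutive observations O_i and O_(i+1)."""
    return [later - earlier for earlier, later in zip(series, series[1:])]


def interruption_contrast(series, n_pre):
    """Split the successive changes into the one spanning X and all others.

    series: observations in time order
    n_pre:  number of observations taken before the treatment X
    """
    changes = successive_changes(series)
    change_at_x = changes[n_pre - 1]  # last pretest -> first posttest
    other_changes = changes[:n_pre - 1] + changes[n_pre:]
    return change_at_x, other_changes


# Hypothetical series: O_1..O_4, then X, then O_5..O_8
observations = [10, 11, 10, 12, 19, 20, 19, 21]
change_at_x, other_changes = interruption_contrast(observations, n_pre=4)

print("change across X:", change_at_x)
print("other changes:", other_changes)
print("mean and sd of other changes:",
      round(mean(other_changes), 2), round(stdev(other_changes), 2))
```

A rise across X that clearly exceeds every other successive change argues against maturation, testing, and regression; it does nothing by itself to rule out history.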

Multiple time-series design. If one had avail- 
able a parallel time series from a group not receiv- 
ing the experimental treatment, but exposed to the 
same extraneous sources of influence, and if this 
control time series failed to show the exceptional 
jump from $O_4$ to $O_5$, then the plausibility of history
as a rival interpretation would be greatly reduced. 
We may call this the multiple time-series design. 
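
A minimal sketch of this comparison follows, assuming hypothetical data and helper names (jump_at_treatment, excess_jump) invented for the illustration. It subtracts the control series' change at the point of treatment from the experimental series' change; a control series that jumps at the same moment leaves history a plausible rival, while a flat control series does not.

```python
def jump_at_treatment(series, n_pre):
    """Change from the last pretest to the first posttest observation."""
    return series[n_pre] - series[n_pre - 1]


def excess_jump(treated, control, n_pre):
    """Jump in the treated series beyond the jump in the control series."""
    return jump_at_treatment(treated, n_pre) - jump_at_treatment(control, n_pre)


# Hypothetical series, four observations before X and four after
treated = [10, 11, 10, 12, 19, 20, 19, 21]
flat_control = [9, 10, 10, 11, 11, 12, 11, 12]      # no jump: history implausible
jumping_control = [9, 10, 10, 11, 17, 18, 17, 18]   # shared jump: history plausible

print("excess over flat control:", excess_jump(treated, flat_control, 4))
print("excess over jumping control:", excess_jump(treated, jumping_control, 4))
```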

Nonequivalent control group. Another way of 
improving the one-group pretest-posttest design is
to add a “nonequivalent control group.” (Were the
control group to be randomly assigned from the
same population as the experimental group, we
would, of course, have a true experimental design,
not a quasi-experimental design.) Depending on
the similarities of setting and attributes, if the non- 
equivalent control group fails to show the gain 
manifest in the experimental group, then history, 
maturation, testing, and instrumentation are con- 
trolled. In this popular design, the frequent effort 
to “correct” for the lack of perfect equivalence by 
matching on pretest scores is absolutely wrong 
(e.g., Thorndike 1942; Hovland et al. 1949; Camp- 
bell & Clayton 1961), because it introduces a re- 
gression artifact. Instead, one should accept any 
initial pretest differences, using analysis of covari- 
ance, gain scores, or graphic presentation. (This, 
of course, is not to reject blocking on pretest scores 
in true experiments where groups have been as- 
signed to treatments at random.) Remaining un- 
controlled is the selection—maturation interaction, 
that is, the possibility that the experimental group 
differed from the control group not only in initial 
level but also in its autonomous maturation rate. 
In experiments on psychotherapy and on the effects 
of specific coursework this is a very serious rival. 
Note that it can be rendered implausible by use of 
a time series of pretests for both groups, thus moving
again to the multiple time-series design. 
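
The regression artifact produced by matching on pretest scores can be made concrete with a small simulation. The sketch below is an illustration under assumptions of my own: two populations whose true means differ (60 and 40), fallible pretests and posttests with equal error variance, and no treatment effect whatever. The function names and the bucket-matching rule are hypothetical. Matched pairs are nearly equal at pretest, yet each member regresses toward its own population mean at posttest, so a spurious "effect" appears.

```python
import random
from collections import defaultdict
from statistics import mean


def simulate_group(n, true_mean, rng, true_sd=10.0, error_sd=10.0):
    """Return (pretest, posttest) pairs for a group with NO treatment effect."""
    people = []
    for _ in range(n):
        true_score = rng.gauss(true_mean, true_sd)
        pretest = true_score + rng.gauss(0, error_sd)
        posttest = true_score + rng.gauss(0, error_sd)
        people.append((pretest, posttest))
    return people


def match_on_pretest(group_a, group_b):
    """Pair members whose pretest scores round to the same integer."""
    buckets = defaultdict(list)
    for person in group_b:
        buckets[round(person[0])].append(person)
    pairs = []
    for person in group_a:
        candidates = buckets[round(person[0])]
        if candidates:
            pairs.append((person, candidates.pop()))
    return pairs


rng = random.Random(0)  # fixed seed so the run is repeatable
experimental = simulate_group(2000, true_mean=60, rng=rng)
control = simulate_group(2000, true_mean=40, rng=rng)
pairs = match_on_pretest(experimental, control)

pretest_gap = mean(e[0] - c[0] for e, c in pairs)
posttest_gap = mean(e[1] - c[1] for e, c in pairs)
print("matched pairs:", len(pairs))
print("mean pretest gap (matched):", round(pretest_gap, 2))
print("mean posttest gap (no treatment given):", round(posttest_gap, 2))
```

Under these assumptions the posttest gap is sizable although nothing was done to either group, which is the reason for accepting the initial pretest differences rather than matching them away.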

Other quasi-experimental designs. There is not 
space here to present adequately even these four 
quasi-experimental designs, but perhaps the strat- 
egy of adding specific observations and analyses to 
check on specific threats to validity has been illus- 
trated. This is carried to an extreme in the re- 
current institutional cycle design (Campbell & 
McCormack 1957; Campbell & Stanley 1963), in 
which longitudinal and cross-sectional measure- 
ments are combined with still other analyses to 
assess the impact of indoctrination procedures, etc. 
through exploiting the fact that essentially similar 
treatments are being given to new entrants year 
after year or cycle after cycle. Other quasi-experimental
designs are covered in Campbell and
Stanley (1963), Campbell and Clayton (1961), 
Campbell (1963), and Pelz and Andrews (1964). 

Correlational analyses. Related to the program 
of quasi-experimental analysis are those efforts to 
achieve causal inference from correlational data. 
Note that while correlation does not prove causa- 
tion, most causal hypotheses imply specific cor- 
relations, and examination of these thus probes, 
tests, or edits the causal hypothesis. Furthermore, 
as Blalock (1964) and Simon (1947-1956) have 
emphasized, certain causal models specify uneven
patterns of correlation. Thus the A → B → C model
implies that $r_{AC}$ be smaller than $r_{AB}$ or $r_{BC}$. However,
their use of partial correlations or the use of 
Wright's path analysis (1920) are rejected as tests 
of the model because of the requirement that the 
“cause” be totally represented in the “effect.” In 
the social sciences it will never be plausible that the 
cause has been measured without unique error and 
that it also totally lacks unique systematic variance 
not shared with the effect. More appropriate would 
be Lawley’s (1940) test of the hypothesis of single 
factoredness. Only if single factoredness can be 
rejected would the causal model, as represented by 
its predicted uneven correlation pattern, be the pre- 
ferred interpretation [see MULTIVARIATE ANALYSIS, 
articles on CORRELATION]. 
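
The uneven correlation pattern implied by a causal chain can be illustrated by simulation. The sketch below assumes a simple linear chain A → B → C with independent normal disturbances; the coefficients (0.8) and the helper pearson are choices made for the illustration only. Under these assumptions the correlation between the end points is approximately the product of the two adjacent correlations and is therefore the smallest of the three. The sketch shows the predicted pattern only; it is not a test of single factoredness.

```python
import random
from math import sqrt


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


rng = random.Random(1)  # fixed seed so the run is repeatable
n = 5000
a = [rng.gauss(0, 1) for _ in range(n)]
b = [0.8 * value + rng.gauss(0, 0.6) for value in a]  # B depends only on A
c = [0.8 * value + rng.gauss(0, 0.6) for value in b]  # C depends only on B

r_ab, r_bc, r_ac = pearson(a, b), pearson(b, c), pearson(a, c)
print("r_AB:", round(r_ab, 3))
print("r_BC:", round(r_bc, 3))
print("r_AC:", round(r_ac, 3), " product r_AB * r_BC:", round(r_ab * r_bc, 3))
```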

Tests of significance. A word needs to be said 
about tests of significance for quasi-experimental 
designs. It has been argued by several competent 
social scientists that since randomization has not 
been used, tests of significance assuming randomization
are not relevant. On the whole, the writer
disagrees. However, some aspects of the protest are 
endorsed: Good experimental design is needed for 
any comparison inferring change, whether or not 
tests of significance are used, even if only photo- 
graphs, graphs, or essays are being compared. In 
this sense, experimental design is independent of 
tests of significance. More importantly, tests of sig- 
nificance have mistakenly come to be taken as 
thoroughgoing proof. In vulgar social science usage, 
finding a “significant difference” is apt to be taken 
as proving the author’s basis for predicting the dif- 
ference, forgetting the many other plausible rival 
hypotheses explaining a significant difference that 
quasi-experimental designs leave uncontrolled. Cer- 
tainly the valuation of tests of significance in some 
quarters needs demoting. Further, the use of tests 
of significance designed for the evaluation of a 
single comparison becomes much too lenient when 
dozens, hundreds, or thousands of comparisons 
have been sifted. And in a similar manner, an ex- 
perimenter’s decision as to which of his studies is 
publishable and the editor’s decision as to which 
of the manuscripts are acceptable further bias the 
sampling basis. In all of these ways, reform is 
needed. 
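
The leniency of a single-comparison test when many comparisons have been sifted can be shown with elementary arithmetic. The sketch below assumes, for illustration only, that the comparisons are independent and that no real difference exists in any of them; the function name is hypothetical. It gives the chance that at least one comparison reaches the nominal level by chance alone.

```python
def chance_of_spurious_finding(alpha, comparisons):
    """P(at least one 'significant' result) when every null hypothesis is true.

    Assumes the comparisons are independent, which real data rarely guarantee.
    """
    return 1 - (1 - alpha) ** comparisons


for k in (1, 12, 100, 1000):
    print(k, "comparisons:", round(chance_of_spurious_finding(0.05, k), 3))
```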

However, when a quasi-experimenter has, for 
example, compared the results from two intact 
classrooms employed in a sampling of convenience, 
a chance difference is certainly one, even if only 
one, of the many plausible rival hypotheses that 
must be considered. If each class had but 5 stu- 
dents, one would interpret the fact that 20 per cent 
more in the experimental class showed increases
with less interest than if each class had 100 stu- 
dents. In this case there is available an elaborate 
formal theory for the plausible rival hypothesis of 
chance fluctuation. This theory involves the as- 
sumption of randomness, which is quite appro- 
priate when the null model of random association 
is rejected in favor of a hypothesis of systematic 
difference between the two groups. If a “significant 
difference” is found, the test of significance will 
not, of course, reveal whether the two classes differed
because one saw the experimental movie or
for some selection reason associated with class 
topic, time of day, etc., that might have interacted 
with rate of autonomous change, pretest instigated 
changes, reactions to commonly experienced events, 
etc. But such a test of significance will help rule 
out what can be considered as a ninth threat to 
internal validity; that is, that there is no difference 
here at all that could not be accounted for as a 
vagary of sampling in terms of a model of purely 
chance assignment. Note that the statement of 
probability level is in this light a statement of the 
plausibility of this one rival hypothesis, which al- 
ways has some plausibility, however faint. 
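
The contrast between classes of 5 and classes of 100 can be worked through with an exact test under a model of purely chance assignment. The sketch below is an illustration, not the author's procedure: it uses a one-sided exact (hypergeometric) test, and the particular rates (60 per cent showing increases in the experimental class against 40 per cent in the control class) are assumed here, since only the 20 per cent difference is given above. The function name is hypothetical.

```python
from math import comb


def chance_probability(increase_exp, n_exp, increase_ctl, n_ctl):
    """One-sided exact probability of a split at least this favorable to the
    experimental class, if increases were distributed between the two
    classes purely by chance (margins held fixed)."""
    total = n_exp + n_ctl
    total_increase = increase_exp + increase_ctl
    favorable = 0
    for k in range(increase_exp, min(n_exp, total_increase) + 1):
        favorable += comb(total_increase, k) * comb(total - total_increase, n_exp - k)
    return favorable / comb(total, n_exp)


# Same 20-point difference, very different plausibility of chance
p_small = chance_probability(3, 5, 2, 5)         # 3 of 5 versus 2 of 5
p_large = chance_probability(60, 100, 40, 100)   # 60 of 100 versus 40 of 100
print("classes of 5:  ", round(p_small, 4))
print("classes of 100:", round(p_large, 4))
```

With five students per class the chance hypothesis remains entirely plausible; with a hundred per class it becomes faint, although the test still says nothing about which systematic cause produced the difference.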


DONALD T. CAMPBELL


[Other relevant material may be found in HYPOTHESIS 
TESTING; PERSONALITY MEASUREMENT, article on 
SITUATIONAL TESTS; PSYCHOMETRICS; REASONING
AND LOGIC; SURVEY ANALYSIS.] 

EXPERIMENTAL PSYCHOLOGY

The subject matter of the field that is tradition- 
ally called experimental psychology is included 
under the entries DRIVES; EXPERIMENTAL DESIGN; 
FORGETTING; LEARNING; MOTIVATION; PERCEPTION;
PSYCHOLOGY, article on PHYSIOLOGICAL PSYCHOLOGY.


EXPLANATION 
See CAUSATION and SCIENTIFIC EXPLANATION. 


EXPLORATION OF DATA 
See STATISTICS and SURVEY ANALYSIS, article 
on METHODS OF SURVEY ANALYSIS.


EXPLORATORY DRIVES 
See STIMULATION DRIVES. 


EXPORTS 
See INTERNATIONAL TRADE. 


EXPRESSIVE BEHAVIOR 


The term “expressive behavior” refers to those 
aspects of behavior which manifest motivational 
states. “Motivational state” is used here to cover
emotional attitudes and moods, cognitive attitudes 
(attention, concentration), activation states (arousal,
fatigue), and more-or-less permanent attitudes
that are personality attributes. 

The study of expressive behavior has a dual 
origin in psychodiagnostics (falsely attributed to 
Aristotle) and in rhetoric (Quintilian). Interest in 
art and, particularly, in the stage (e.g., Engel
1785) stimulated detailed descriptions of expres- 
sive movement. More recently, expressive behavior 
has been discussed in connection with the philo- 
sophical problem of knowing other minds (Bain 
1859; Lipps 1905; Scheler 1913). At present, the 
main impetus for the investigation of expressive 
behavior stems from the study of social perception, 
emotion, and personality. 

“Expressive behavior” is a somewhat misleading 
term. Many behaviors which are not to be classified 
as “expressions” still have expressive aspects, e.g.,
the “deliberate” or “determined” manner of per- 
forming actions; the “hesitant” or “emphatic” in- 
tonational patterns in speech. The term is mis- 
leading, again, since it might suggest an expressive 
intention or purpose on the part of the subject 
that in fact is not presupposed. Labeling some be- 
havior as “expressive” does not imply anything 
about its function or purpose. Expression is not a 
specific category of behavior, but expressiveness is 
the result of a perspective on all behavior. Every 
behavior is expressive when viewed with respect to 
the motivational state suggested by it. The term 
“expressive behavior” is misleading, finally, in that 
it suggests the presence of something, an inner 
experience, which is expressed but exists distinct 
from its expression. This, again, is not necessarily 
the case. Some behavior is expressive without the 
subject’s experiencing the emotions suggested by 
his behavior, as in a theatrical performance or in 
an act of deceit. Actually, the investigation of the 
relationship of expressive behavior to inner states 
or other dispositions is one of the main tasks in 
this area. 

Phenomena that are functionally quite different 
are usually classed as expressive behavior; they 
vary in the manner in which they manifest motiva- 
tional states: 

(a) In both expressive movements (movements 
of limbs, head, facial features, and body) and the 
manner of performing purposive actions, the move- 
ment pattern itself is expressive, that is, it mani- 
fests the motivational state. Expressive aspects of 
vocal behavior and visceral manifestations of emo- 
tion can be classed along with expressive movement. 

(b) In expressive actions, the way in which the 
environment is treated, rather than the movement 
pattern itself, is expressive. 

(c) Verbal-symbolic behavior is expressive to
the extent that it evidences some motivational state 
of the speaker. 

(d) The products of creative behavior, in con- 
trast to the behavior pattern itself, are considered 
expressive. Professional dancing and singing can, 
of course, be classified under both (a) and (d). 

Expressive movement—descriptive analysis. The 
study of expressive movement implies (1) care- 
fully describing behavior patterns and (2) ascer- 
taining their meaning, in terms of the subject's 
introspective report, his earlier or subsequent be- 
havior, or the stimulus situation; or in terms of 
independently assessed personality traits. 

The traditional source for descriptive data on 
expressive movement is the theater. While posed 
expressions are still used as stimuli in experi- 
ments on recognition of expression, they present 
obvious and unnecessary difficulties for the analysis 
of expressive behavior as such. Spontaneous ex- 
pressions can be collected systematically by expos- 
ing the subject to specific stimuli (sudden noises, 
sweet music, pain); by time sampling during
standardized or experimentally varied conditions 
(stress, task failure); or by electrical brain stimula- 
tion (Hess 1962). For recording these movements, 
refined techniques are sometimes utilized, such as 
high-speed cinematography (Landis & Hunt 1939) 
or electromyography (Sainsbury 1955). Sometimes 
it has been feasible to obtain introspective reports 
as well (Frijda 1953). Some response patterns 
have been described in detail, such as the startle 
pattern (Landis & Hunt 1939) or animal rage pat- 
terns (Bard 1950). Correlational studies have been 
used to identify patterns of individual differences 
(e.g., Allport & Vernon 1933; for visceral response 
patterns, see Lacey et al. 1963). Many studies 
have established the existence of stable individual 
differences in single expressive traits, such as fre- 
quency of nervous movements (Sainsbury 1955) 
or muscular tension during stress (see Duffy 1962, 
chapter 11), or size and speed variables, such as 
size of writing, length and speed of stride, and
rate of speech, as used in the Allport and Vernon 
study (1933). 

A somewhat different approach has been fol- 
lowed in a series of German studies that originated 
in the work of Piderit (1867). These studies try 
to describe the variables of expressive movement 
for various body areas (facial expression, Lersch 
1932; general body movement, Strehle 1954; gait, 
Kietz 1956; gesture, Kiener 1962). They present a 
large number of hypotheses on personality attributes
presumably revealed by the various expressive
traits; interpretations are based upon impressions
gained during psychodiagnostic sessions.
There are no validation studies, however. In general,
very little has been established on the relationship
between expressive movement and personality,
except for some group-comparison studies.
For example, patients classified as neurotic or as 
anxious tend to manifest a higher habitual degree 
of muscular tension and more frequent nervous 
movements than normal or nonanxious controls 
(see Duffy 1962, chapter 11; Sainsbury 1955). 

Specificity and meaning. Traditionally it is 
held that expressive movement expresses emotion; 
also, that for every linguistically distinguishable 
emotion there is a corresponding distinct expressive 
movement pattern. The existence of expressions of 
other aspects of personality presents difficulties for 
the first hypothesis. Experimental investigation 
disproves the second. Landis (1924) elicited spon- 
taneous emotional reactions from a group of sub- 
jects. Comparison of these reactions with the 
subjects’ introspective reports revealed that no emo- 
tion invariably leads to the same expression in 
every person and at every moment. Recognition 
experiments, in which observers are presented with 
photographs or films showing facial or other ex- 
pressions, indicate that interpretations of expres- 
sion are quite often incorrect. Observers attach 
widely different labels to one and the same ex- 
pression (see Woodworth & Schlosberg 1955, chap- 
ter 5). 

The variety of interpretations for a given ex- 
pression does not mean, however, that expression 
of emotion is highly ambiguous. Woodworth dem- 
onstrated (1938) that there was a high degree of 
agreement among observers when expressions were 
judged in terms of emotion groups rather than 
separate emotions; Schlosberg (1954) showed the 
same for judgment in terms of three dimensions— 
pleasantness—unpleasantness, attention—rejection, 
and level of activation—and demonstrated at the 
same time the equivalence of these dimension 
judgments to judgment in terms of emotion groups. 
These studies suggest that expression does not rep- 
resent emotions as distinct and discontinuous 
states; it represents, rather, a set of continuous
emotional dimensions. This conclusion is supported 
by high correlations between dimensional values 
and measurements of facial features (Frijda & 
Philipszoon 1963). Every emotion occupies some 
place in this multidimensional expression space. 
However, several different motivational states may 
occupy the same place; hence the confusion in 
recognition experiments. As yet there is still some 
uncertainty about the dimensions involved. One of 
Schlosberg’s dimensions, attention-rejection, seems
superfluous in view of multidimensional scalings
based upon similarity judgments (Abelson & Ser- 
mat 1962; Shepard 1962). On the other hand, the 
importance of additional dimensions is suggested 
by other studies (Frijda & Philipszoon 1963). 

Expressive movement, then, appears to repre- 
sent the person’s state as defined by these dimen- 
sions, whose psychological nature can be summa- 
rized as the person's attitude, or his readiness to 
relate to his environment, and his state of activa- 
tion. Expressive movement, consequently, cannot 
properly be said to represent emotion. It may mani- 
fest some aspect of emotion, emotion being de- 
fined by the type of situational determinant in 
addition to the attitude and the activation state 
(Frijda 1958; Schachter & Singer 1962). However, 
the attitude and activation state may instead be 
part of a cognitive, nonemotional attitude. They 
may also represent some habitual attitude or activa- 
tion modus, in which case they constitute a per- 
sonality trait. 

Origin. In order to understand the origin and 
function of expressive movement, a distinction has 
to be made. Expressive movements which arise 
spontaneously out of the actual motivational state 
are called primary; expressions which are made 
more or less intentionally are called secondary. 
Secondary expressive behavior overlays and modi- 
fies the primary. People frequently exaggerate their 
spontaneous reactions for reasons of communica- 
tion or social participation. Sometimes they pro- 
duce expressive movements when no corresponding 
emotional attitude is experienced, as to give a social 
signal, to be polite, or to deceive. They also produce 
symbolic expressive movements—gestures of doubt, 
disbelief, denial, or approval—which serve linguis- 
tic functions, underlining or replacing verbal be- 
havior. 

Primary expressions seem to be unlearned. The 
evidence comes from different sources: develop- 
mental observations; data on generality over spe- 
cies (which in humans means ethnological evi- 
dence); and neuropsychological research [see 
INSTINCT]. 

Within the first few weeks of life a limited but 
still differentiated repertory of reactions can be 
observed: crying, generalized excitement, startle, 
orienting responses, quiet relaxation, and facial 
contortions typical of disgust (Malrieu 1960). 
Somewhat later, smiling appears. These expres- 
sions can be observed under conditions that make 
social learning unlikely: absence of social rein- 
forcement (Dennis 1938); blindness; and even a 
combination of blindness and deafness (Thompson 
1941). The blind manifest these expressions in 
situations that would call forth the same reactions
in normal children. 

Systematic cross-cultural research on expressive 
behavior has been rare since 1867, when Darwin 
sent a questionnaire to informants around the 
world. Yet, from scattered reports the generaliza- 
tion seems warranted that the same primary ex- 
pressive patterns are present in every culture, 
where they indicate the same motivational states; 
or, rather, that everywhere they appear at least 
with this common meaning. Laughter is used 
everywhere to express joy, even if occasionally it 
may express other things as well and even if joy 
occasionally is expressed differently. 

Some expressive reactions—those considered in- 
dicative of rage, fear, or attention—have been 
evoked in animals by midbrain stimulation (Hess 
1962); laughter, smiling, and weeping occur in 
human subjects who have cerebral lesions or who 
are subjected to electrical stimulation in the 
thalamic region (Hassler & Riechert 1961). Still, 
these data may be interpreted (e.g., by Hess) as 
the evocation of emotional impulses as such, and 
these impulses, of course, might also utilize learned 
behaviors. 

Social determinants. The inborn forms of ex- 
pressive behavior are supplemented by and modi- 
fied by social learning. To what extent this is the 
case has been demonstrated by Efron’s (1941) com- 
parison of gestural behavior of Jews and Italians 
in New York City. Jews and Italians who have 
been minimally acculturated to the general Amer- 
ican way of life possess quite different gestural 
habits, although both gesticulate considerably more 
than the average American. Among more accul- 
turated groups these differences largely disappear. 
The findings concerning cultural differences, how- 
ever, in no way conflict with the notion of universal 
human expressive patterns with fixed meanings. 

The influence of culture on expression can be 
summarized as follows. 

(a) Culture determines to what extent emo- 
tional impulses are shown and, consequently, how 
frequent and pronounced expressive behavior is. 
The extent to which expressive behavior is encour- 
aged is culturally determined. 

(b) Culture may determine under what condi- 
tions certain expressions are allowed or prescribed. 
Weeping by men is, in Western culture, permitted 
only under special circumstances; in prerevolution- 
ary China the amount of weeping when a relative 
died was carefully graded by a codex, according to 
the closeness of the relationship (Granet 1922). 

(c) There are cultural differences with regard 
to the events that elicit given emotional reactions. 
In these cases there is not so much difference in
expressive behavior as in emotional behavior. 

(d) Certain more-or-less general motor habits— 
manner of walking or sitting, gestural accompani- 
ment of speech, intonement of speech—may be 
culturally determined. Such habits can be quite 
typical for people of a given culture (Efron 1941; 
LaBarre 1947). They may reflect some culturally
determined mental attitude or they may be the ex- 
pression of nothing but a cultural-movement norm. 

(e) Secondary expressive movements, particu- 
larly those with linguistic meanings, are based upon 
cultural conventions: the gestures for “yes” and 
“no” are well-known examples. 

Nature of expressive movement. Why should 
there be expressive movement, and why does it 
take a particular form? Several explanatory prin- 
ciples have been adduced, which are all supported 
by some empirical evidence. 

(a) Expressive behavior is either adaptive be- 
havior or conditioned but originally adaptive behav- 
ior. The addition of “originally adaptive” is obvi- 
ously needed, since people make “disgusted” faces, 
possibly useful for getting rid of bad-tasting sub- 
stances, when they hear morally disgusting stories. 
It seems certain that much in expressive movement 
is adaptive not only with regard to our forebears, 
as Darwin would have it, but here and now. Facial 
expressions of fear often form part, or the begin- 
ning, of general protective responses. Expressive 
movement, if it is adaptive, is so in a rather special 
way: not by modifying the environment, but by 
modifying the organism’s relationship to his en- 
vironment, through approach and withdrawal tend- 
encies and through increase or decrease of sensory 
readiness of different kinds. To be sure, expression 
may sometimes be adaptive in only a subjective 
way—for instance, when a child tries to hide from 
people’s glances by holding its hands before its 
face. To the extent that expressive movement is 
adaptive, it is connected to the motivational states 
in a quite intimate and intrinsic manner, since 
motives are by definition tendencies for establish- 
ing or destroying certain kinds of relationships. 
Primary expressive movement, according to this 
view, is the execution of behavior tendencies that 
define or partly define emotions and other motiva- 
tional states. 

(b) Expressive behavior is the direct manifestation
of emotional (or other) activation. Drowsiness,
energy, joy, and alertness refer to hardly
more than quantitative variations of activation; the 
corresponding expressive movements are indeed 
mainly different degrees of generalized muscular
activity, as correlation studies indicate (Frijda &
Philipszoon 1963). In addition, expressive behavior 
embodies the degree of activity control. When emo- 
tions disrupt orientation and planning, behavior 
becomes disorganized and is, consequently, “ex- 
pressive” of the disrupted state. In this reformula- 
tion of Darwin’s “irradiation principle,” the ex- 
pressive components again are connected to the 
emotional impulses in no fortuitous manner. The 
expressive components are the consequences of 
the motivational impulses.

(c) Expressive movement might be communica- 
tive behavior either instinctively, by means of he- 
reditary patterns, or intentionally, by means of 
conventional symbols. There is ethological evidence 
for instinctiveness in animal mating ceremonies or 
warning calls but little evidence to support or sug- 
gest instinctiveness in human expression, except, 
perhaps, the crying of the infant. Voluntary com- 
municative expression is, of course, evident in 
verbal behavior and in secondary expressive move- 
ment. Primary expressions are rapidly utilized for 
communication, as is notable in the development of 
infant crying. 

(d) Expressive movement might be a release 
phenomenon, serving to discharge emotional ten- 
sions. Introspectively, expressing emotions often 
gives relief. Freeman and Pathman (1942) found 
that experimentally induced emotional tension dis- 
sipated more rapidly in subjects manifesting much 
restless behavior than in those manifesting little. 
The fact that there is tension discharge does not, 
of course, imply that this is the raison d'être for 
the discharging behavior. The explanation may 
hold, however, for those nervous movements which 
arise during emotional stress or conflict (“autistic 
gestures,” Krout 1935), such as scratching, rub- 
bing the nose, nail-biting, etc. 

The foregoing principles seem to account for 
most expressive movement; they seem to apply 
equally well to extralinguistic vocal expression. 
There are two important exceptions, however. 
Laughter and weeping do not seem to allow of any 
interpretation in terms of past or present adaptive 
movement, although such an interpretation was 
tried by Darwin; they cannot be smoothly inter- 
preted as activation modi or mere consequences of 
loss of activity control; and their release function 
does not explain their form either. As yet, laughter 
and weeping are riddles of human behavior. 

Expressive actions. Every human activity indi- 
cates some attitude and is thereby expressive, even 
if only of unemotional matter-of-factness. This 
holds particularly for choice behavior—choice of
interests, of mates, of possessions, of clothing, etc.
Activities which demand special mention, however, 
are those that are clearly objectively useless and 
clearly motivated by some sort of emotional ex- 
citement—a child’s hiding under its mother’s skirt, 
banging on the table, tearing its clothes, etc. In 
part these are what Lewin (1927) called field 
actions, which try to change the relationship with 
the environment. In part they have the distinct 
function of releasing emotional tensions which can- 
not find their outlet in a more proper, more ade- 
quate way. Little systematic study has been made 
of these phenomena notwithstanding their social 
importance, which is shown by the behavior of 
youthful audiences. 

Verbal-symbolic behavior. Verbal-symbolic be- 
havior is expressive, of course, primarily in that 
the subject desires to communicate his feelings, 
attitudes, or ideas. There is little relationship be- 
tween a definition of expressive behavior that 
covers these phenomena and one that is focused 
upon expressive movement. 

Expressive creative behavior. Creative behavior 
can justifiably be called expressive: its products 
stem from some drive to structure and shape. It is 
primarily this drive, which may get its energy 
from various sources, which art expresses. Artistic 
production may also, occasionally, serve the pur- 
pose of communication of feelings or ideas. Artistic 
products are frequently expressive, moreover, in 
the sense in which this term is primarily used here. 
Paintings, music, and poetry appear invested with 
emotional meanings for the spectator; they evi- 
dence motivational states. Still, there is an im- 
portant distinction which separates artistic produc- 
tion from the execution of expressive movements. 
Functionally, both belong to entirely different 
classes of phenomena. Expressive movement is a 
direct manifestation of tendencies inherent in the 
emotional impulse. Expression in art is, as Langer
(1942) emphasized, a representation of some mo- 
tivational state in the language of the particular 
artistic mode—pictorial, musical, etc. It is a sym- 
bol of a motivational state that need not even be 
actually experienced at the time of creation. In the 
fact of its being a representation resides an effect, 
and possibly a function, of artistic production: in 
the creative process emotional impulses or experi- 
ences become structured, and this may facilitate 
coming to grips with them. Sometimes artistic 
products may be pervaded by traces of primary 
expressive movement: this is clearest in dancing 
and singing, but it is also true of painting and of 
some aspects of writing style. Yet the intention to
create shape or structure in a conventional symbol
system—even if the convention is peculiar to the
individual artist—makes this class of phenomena 
irreducible to other kinds of expressive behavior. 


Nico H. FRIJDA 


[Directly related is the entry EMOTION. Other relevant 
material may be found in ATTENTION; CREATIVITY; 
SYMPATHY AND EMPATHY; TRAITS.]


EXTERNAL ECONOMIES
AND DISECONOMIES 


The concepts of external economies and disecon- 
omies (“externalities”) treat the subject of how the 
costs and benefits that constrain and motivate a 
decision maker in a particular activity may deviate 
from the costs or benefits that activity creates for 
a larger organization. Most of the economic liter- 
ature on externalities has focused on the operation 
of an entire economic system, with particular ref- 
erence to the effectiveness of prices, markets, com- 
petition, and profit motivation as regulators of 
production and consumption.

Economic theory suggests that a system charac- 
terized by private ownership of resources and suffi- 
cient competition will maximize total income and 
economic welfare. The system will establish an 
equilibrium in which product prices equal their 
costs on their respective margins of production.
Costs include an opportunity rate of return on invested
capital, which is an element of business
accounting profit, and the rewards, or “rent,” that 
especially endowed resources may command. Pro- 
duction costs also reflect technological constraints, 
and producers employ the least costly method of 
producing any given output. A further character- 
istic of the equilibrium is that similar resources, 
including capital, obtain equal earnings or returns 
in all activities. If earnings were unequal, resources 
would enter more profitable activities and leave 
less lucrative ones until earnings equality comes 
about. The resulting allocation of resources is also 
consistent with consumers’ preferences. Finally, 
consumers’ demands, through their influence on 
market prices and hence profits, determine the 
allocation of resources.

The system permits the wide diffusion of decision
making that is necessary if complex systems are to
operate at all.
Each decision maker only needs to have knowledge 
about the things he consumes, or produces, or his 
occupation. That individuals can so narrow their 
focus permits a division of labor and, in turn, the 
resulting gains of specialization. The vital mechanism
(and social institution) that facilitates such
specialization is the price system, or market organization.
The price system is an “information system”
that provides producers and consumers with
the signals that guide their behavior. Hence, the 
economic system is highly interdependent: the 
combined behavior of individual decision makers 
spontaneously determines relative prices and quan- 
tities of items produced and consumed, while rel- 
ative prices are the signals, constraints, and 
opportunities to which individual decision makers 
respond and adapt. 

Such a general equilibrium system has two spe- 
cific qualities: (1) Production costs of each item, 
on its respective margin of production, when viewed 
in a social cost sense, equal the price of each 
item. (2) The price of each end product accurately 
reflects the incremental satisfaction that consumers 
attach to it. These two qualities constitute a “social 
optimum” in that national income and economic 
welfare are maximized [see, however, WELFARE
ECONOMICS]. Note that it is only optimal if the
marginal social costs of each activity equal the 
social benefits they create. If the social cost of an 
activity exceeds the costs relevant to the decision 
makers in the activity, there is an external dis- 
economy. If the benefits of an activity exceed its 
marginal cost, there is an external economy. 

Due to the extreme interdependence within an 
economy, the behavior of a given industry can 
increase the cost of other industries in ways which 
need not be socially undesirable. Some of these 
phenomena, too, have been associated with the 
subject of external economies and diseconomies. 
One of the difficulties in the evaluation of external- 
ities is the problem of determining which are so- 
cially desirable or undesirable and should be pro- 
moted or counteracted by public policy measures 
and which do not warrant government interference 
with the private sector. 

The subject of external economies and disecon- 
omies thus treats possible mechanical shortcomings 
of an economy that cause individual decision makers 
to operate in a fashion that thwarts the full attain- 
ment of broad social objectives. To some students 
the possible wide extent of externalities is sufficient 
basis to justify extensive government intervention 
in the private sector of the economy. To other 
students this point is debatable. The resolution of 
these differences has been, and remains, a major 
unsettled issue in economics. 


External diseconomies 


Technical external diseconomies. Technical ex- 
ternal diseconomies, sometimes called “nuisance 
effects,” were extensively discussed by A. C. Pigou
([1920] 1960, part 2, chapter 9). They arise from
undesirable by-products of a production process. 
An example used by Pigou is the case of steam 
locomotives emitting sparks that cause fires. A 
farmer's livestock that eats his neighbor’s crops is 
another example. Extensive lists of unwanted by- 
products may be drawn up in modern societies— 
from air and water pollution to traffic congestion 
associated with the automobile. 

Such unwanted by-products are a natural conse- 
quence of many production processes. They impose 
a cost upon society similar to the cost of productive 
resources necessary to produce the desired product. 
They create a social problem insofar as their cost 
may not be properly allocated between different 
segments of the economy. 

Let us consider further Pigou’s example of spark-emitting
steam locomotives. Assume that the marginal
cost of employing resources to produce a
ton-mile of railroad freight service is 3 cents.
The railroad system, however, “causes” 0.5 cents 
worth of damage per ton-mile because of fires 
inflicted on farmers’ crops adjoining the right-of- 
way. Whether such behavior creates an unwar- 
ranted social cost, and what the appropriate social 
policy should be to deal with it, pose some subtle 
and complex issues. 

The main force of Pigou’s treatment of the sub- 
ject is that the “social cost” of producing a ton-mile 
of freight is 3.5 cents (3 cents for the railroad’s 
own costs, plus 0.5 cents for the destroyed crops). 
The policy prescription is that the railroad should 
be made to pay farmers for their destroyed crops 
or that railroads should be taxed or restrained in 
other ways that will prevent damage. 

Coase (1960) has demonstrated, however, that 
this traditional approach to “nuisance effects” is 
wrong. The relationship is reciprocal. Crop damage 
is “caused” just as much by the farmer’s growing 
crops along the railroad’s right-of-way as by loco- 
motives emitting sparks (indeed, the doctrine of 
“causation” is spurious). Moreover, to restrain rail- 
roads in arbitrary ways may impose a greater social 
loss (due to higher-cost railroad services) than the 
loss of some crops. The proper solution is to design 
a system that maximizes the economy's total 
product. 

Such a solution might be found by recognizing 
that in producing the crops associated with the 0.5 
cents per ton-mile of damage, farmers must pur- 
chase resources worth, say, 0.4 cents. Under these 
conditions, the railroad could pay farmers 0.1 cent 
not to grow crops along its right-of-way. Farmers 
would be just as well off as if there were no railroad;
freight costs would be 3.1 cents per ton-mile,
instead of 3.5 cents if farmers were arbitrarily
awarded “full” damages; and 0.4 cents worth of 
resources would be freed to produce other prod- 
ucts. Indeed, this kind of solution is often worked 
out spontaneously by bargaining between the con- 
cerned parties or is brought about through legal 
adjudication. 
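
The arithmetic of this example can be checked with a short sketch. It uses only the figures given above (in cents per ton-mile); the function and variable names are illustrative and are not part of the original argument.

```python
# Illustrative arithmetic only. All figures are in cents per ton-mile
# and are taken from the railroad example in the text.
RAILROAD_COST = 3.0   # railroad's own marginal cost
CROP_DAMAGE = 0.5     # value of crops destroyed by sparks
FARM_INPUTS = 0.4     # resources farmers purchase to grow those crops


def full_damages():
    """Railroad compensates farmers for the full value of the burned crops."""
    freight_cost = RAILROAD_COST + CROP_DAMAGE
    resources_freed = 0.0  # crops are still planted, so no inputs are released
    return freight_cost, resources_freed


def pay_not_to_plant():
    """Railroad pays farmers the net income they forgo by not planting."""
    farmer_net_income = CROP_DAMAGE - FARM_INPUTS
    freight_cost = RAILROAD_COST + farmer_net_income
    resources_freed = FARM_INPUTS  # inputs become available for other products
    return freight_cost, resources_freed


if __name__ == "__main__":
    for label, rule in (("full damages", full_damages),
                        ("pay not to plant", pay_not_to_plant)):
        cost, freed = rule()
        print(f"{label}: freight cost {cost:.1f}, resources freed {freed:.1f}")
```

Under the second arrangement the farmer's net income is unchanged, the freight cost is lower, and the 0.4 cents of farm inputs are released for other uses.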

The social problems associated with external 
“nuisance effects” arise when certain scarce re- 
sources are treated as if they were “free goods”— 
because of faulty specification of property rights, 
or because it is difficult to identify in some quan- 
titative way who causes the nuisance or who suf- 
fers from it (or both), or because the administrative 
costs of “solving” the problem may exceed the cost of
the nuisance itself. Many students suggest
that activities imposing unregistered social costs 
upon society be subjected to special excise taxes; 
however, the precise design of excise taxes that 
would not themselves distort resource allocation is 
difficult. Other students urge extensive state regulation.
However, a distressing number of nuisance
effects are due to activities that are already either 
regulated or managed by the state, for example, 
highway systems and government-owned public 
utilities, suggesting that effective solutions may 
not be easily attained from that quarter. 

Sweeping solutions to the nuisance effect prob- 
lem do not appear readily available. Thus far in 
virtually all social systems they have been coped 
with on an ad hoc basis. Perhaps one of the best 
ways to achieve better social guidelines for treat- 
ment of these problems is for economists to give 
more attention to the precise content of property 
rights, in terms of their economic effects, and for 
lawyers to employ economic analysis to evaluate the 
social utility of legal principles applied to torts. 

Pecuniary external diseconomies. Consider in- 
dustry A (for illustrative simplicity we assume that 
it only requires labor as a resource) which expands
its output (see Table 1). 


Table 1

                                          Average and      “Marginal
                                          marginal cost    cost” per
                                          per unit         unit to
Output    Workers   Wage     Total cost   to firm          industry
100,000   100,000   $10.00   $1,000,000   $10.00           -
101,000   101,000   $10.50   $1,060,500   $10.50           $60.50


This example illustrates an “increasing cost” 
industry. The wage increase is necessary to induce 
more workers to enter the industry. The operators 
responsible for hiring the additional workers, by 
forcing up the wage, may be said to impose an additional
cost upon their colleagues. This kind of cost
increase has been termed an “external diseconomy.”

It was with reference to cases like this that some 
students suggested (e.g., Pigou [1920] 1960, pp. 
223-225) that the free operation of private busi- 
ness firms resulted in too much output by “increas- 
ing cost industries” and that they should be subject 
to taxation to restrict output. The implication, as 
illustrated by the arithmetical example, is that the 
$60.50 is the “social cost” per unit of the additional 
output; whereas the price that actually emerges in 
the market is $10.50. 

The implication that the increasing cost was a 
social diseconomy raised some fundamental ques- 
tions about the nature of costs. The clarification 
was achieved by D. H. Robertson ([1924] 1952, 
pp. 143-159) and especially F. H. Knight ([1924] 
1952, pp. 160-179) and reiterated by Ellis and 
Fellner ([1943] 1952, pp. 242-263). The essential 
points are as follows: 

Under competition, with many producers in an 
industry, each producer will view the cost (or 
wage) of hiring an additional worker as his mar- 
ginal cost. In our example, it is $10.50. The $10.50 
is also the average cost of production. In a private, 
competitive equilibrium, the average cost, the mar- 
ginal cost to each producer, and the market price 
will be equal. If we take a collective view of the 
industry (which might be the case if it were oper- 
ated by a socialist trust or a private monopoly) the 
“marginal cost” of $60.50 appears, which results 
from reckoning the possible impact of industry 
expansion upon the wage level. The question is: 
Which marginal cost concept is valid from an over- 
all social welfare viewpoint? 

The $10.50, rather than the $60.50, is the rele- 
vant measure of the marginal social cost. It repre- 
sents what workers on the margin of production in 
the given industry could earn producing alternative
products in other industries. Increasing the
output of industry A requires enticing workers from 
industry B. If $10.50 is necessary to attract a 
worker into A, it is because he can earn $10.50 in 
B. Consumers give up $10.50 worth of B to get an 
increment of A. The marginal social cost of A is 
therefore $10.50. It is also the marginal cost upon 
which private decision makers focus and the signal 
that the price system generates. 
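
The two marginal cost figures can be reproduced from Table 1 with a brief sketch. The data are those of the table; the function names are illustrative, and the split of the $60.50 into two parts is simple arithmetic added here for clarity rather than a step taken in the text.

```python
# Figures from Table 1; each worker produces one unit of output.
BEFORE = {"output": 100_000, "wage": 10.00}
AFTER = {"output": 101_000, "wage": 10.50}


def total_cost(row):
    """Wage bill of the industry (labor is the only resource)."""
    return row["output"] * row["wage"]


def firm_marginal_cost(row):
    """A single competitive producer takes the going wage as given."""
    return row["wage"]


def industry_marginal_cost(before, after):
    """Change in the industry wage bill per additional unit of output."""
    extra_cost = total_cost(after) - total_cost(before)
    extra_output = after["output"] - before["output"]
    return extra_cost / extra_output


def wage_rise_per_extra_unit(before, after):
    """Higher pay to workers already employed, spread over the new output."""
    rise = (after["wage"] - before["wage"]) * before["output"]
    return rise / (after["output"] - before["output"])


if __name__ == "__main__":
    print(firm_marginal_cost(AFTER))                # 10.5
    print(industry_marginal_cost(BEFORE, AFTER))    # 60.5
    print(wage_rise_per_extra_unit(BEFORE, AFTER))  # 50.0
```

The $60.50 is thus the $10.50 wage of each added worker plus $50.00 of higher pay to the 100,000 workers already employed. Only the $10.50 corresponds to output given up in industry B.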

That the shift in consumer demand and the con- 
sequential raising of the wage level in industry A 
operate to create an apparent marginal cost of
$60.50 is simply a result of the fact that product 
prices and wages (and other resource earnings 
rates) are mutually determined by the operation of 
the price system. For a central authority to try to
prevent the wage increase in industry A would
(apart from thwarting consumer preferences) cre- 
ate many problems. First, other means to get addi- 
tional workers into the industry would have to be 
found. If, even after getting additional workers into 
the industry, authorities sought to maintain the 
$10.00 wage and product price, they would create 
a rationing problem insofar as consumers would 
demand more of the commodity at $10.00 than 
they would at $10.50. Finally, the higher wage of 
$10.50 for all workers would induce the proper use 
of the specialized resources: at a $10.00 wage, pro- 
duction managers would not use the workers as 
efficiently as they would at a $10.50 wage. [For a 
further discussion of these points, see COST.]


External economies 


Technical external economies. Consider the 
situation where a group of farmers dam a stream 
in order to obtain a supply of irrigation water. The 
resulting pond may stimulate the fish population 
and thus enhance fish output and the earnings of 
fishermen. Increased irrigated acreage devoted to 
apples will increase the supply of nectar, improve 
the productivity of bees, and increase the output of 
honey; conversely, a larger bee population can in- 
crease pollenization and raise apple yields. 

Such external economies are creatures of multi- 
product activities and are similar to the technical 
external diseconomies cited above. They are prob- 
ably not as frequent because private producers are 
adroit at forming arrangements whereby they can 
capture their by-products and derive a profit from 
them. For example, in our irrigation pond—fish ex- 
ample, neither the value of irrigation water nor the 
extra fish may—separately—justify the expense of 
the dam that makes both possible. But a merger 
between a fisherman and a farmer would create 
the necessary arrangement by which the full gain 
could be captured privately and thus accrue to 
society. 

Technical external economies may nevertheless 
exist in important areas—where property arrange- 
ments are inadequate to capture privately the full 
benefits of an activity—and are sufficient to justify 
state intervention. For example, public education 
and health are areas where, while it may be of 
some economic worth to an individual to educate 
himself (or his child) and to keep his family 
healthy, greater social benefits are obtained if the 
state pushes the activity beyond the margins that 
individual incentive would achieve. Multiple-pur- 
pose river development projects are further cases 
in point. Such projects can provide electric power, 
flood control, navigable waterways, irrigation, and
recreational facilities. In principle, a private group
with a charter granting it proprietary rights over 
an entire river basin could construct and operate 
an appropriate multipurpose river development 
project. But such a monopoly would have to be 
carefully controlled by the state to prevent the pri- 
vate group from exploiting such a powerful owner- 
ship right in unsocial ways. Hence, it is simpler for 
the state to design, create, and operate such multi- 
purpose projects. 

It should be recognized, however, that this type 
of argument is used to justify or rationalize many 
kinds of subtle and not-too-subtle forms of state in- 
tervention in support of various industries or forms 
of consumption. In prewar Germany agriculture 
enjoyed state subsidies on the ground that peasant 
boys made good soldiers; in the United States at 
present the maritime industry enjoys subsidies and 
the mineral industries enjoy preferential tax treat- 
ment in the cause of national defense. Whether 
the relationship between the costs and the benefits 
of each of the many government interventions war- 
rants the government action is moot: very few 
such programs are actually subjected to rigorous 
cost-benefit analysis based upon modern quantita- 
tive and analytical techniques. 

Economies of scale. When an industry expands 
its output, it will normally procure additional goods 
and services from other firms or industries. Some 
of the supplying industries may operate under con- 
ditions which would permit them to enjoy “econo- 
mies of scale.” Such a condition means that an 
industry is not fully utilizing its capacity, perhaps 
because the market demand is not large enough to 
take all of the output the plant is efficiently capable 
of producing. Where economies of scale exist, it is 
possible to produce additional output at lower aver- 
age unit costs. Because the industry utilizes its 
capacity more intensively, it spreads its overhead 
and fixed capital costs over a larger output and 
enjoys an “internal economy.” Industries that can 
exploit internal economies are also termed “de- 
creasing cost industries.” 
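
A minimal numerical sketch may make the spreading of overhead concrete. The cost figures below are hypothetical, chosen only for illustration, and do not come from the text; the sketch also assumes a constant cost for each additional unit.

```python
# Hypothetical figures, chosen only to illustrate the mechanism.
FIXED_COST = 1_000_000.0  # overhead and fixed capital cost per period (assumed)
VARIABLE_COST = 2.0       # cost of each additional unit of output (assumed)


def average_unit_cost(output):
    """Fixed cost spread over the units produced, plus the variable cost."""
    return FIXED_COST / output + VARIABLE_COST


if __name__ == "__main__":
    for output in (100_000, 200_000, 400_000):
        print(output, average_unit_cost(output))
```

With these assumed figures, average unit cost falls from 12.0 to 7.0 to 4.5 as output doubles and doubles again, which is the sense in which fuller use of capacity yields an internal economy.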

When a given industry expands its output in a 
way that necessitates purchases from decreasing 
cost industries, the industry that expanded initially 
may also enjoy lower costs. For example, as coal 
mines in a given district increase output, the rail- 
road serving the mines may experience lower aver- 
age unit costs. If the lower unit costs are passed on 
to the coal mine operators, the latter experience a 
“pecuniary external economy.” 

The extent to which such pecuniary external 
economies occur, and the impact they have on market
prices, depends on several factors that can only
be determined empirically on a case-by-case basis.
First, it is a question of fact as to just how im- 
portant economies of scale are in the economic 
system and how much they vary between
different industries. Although railroads and electric 
utilities are often cited as examples of decreasing 
cost industries, what if the railroad or power plant 
is already fully utilized? Second, when a firm enjoys 
economies of scale, the activity is imperfectly com- 
petitive, or even monopolistic. As illustrated by the 
coal mining-railroad example, the railroad enjoys 
a monopoly subject only to the competitive con- 
straint imposed by, say, motor trucks. Unless there 
is some institutional arrangement such as a public 
utility commission that forces a rate reduction, the 
larger coal traffic and consequent lower unit costs 
will only increase railroad profits. The “internal 
economy” will not become an external economy to 
anyone else. 

Finally, whether an external pecuniary economy 
actually permits lower prices for the expanding 
industry’s end product depends upon two opposing 
forces: the external pecuniary economy will lower 
cost; however, the industry's expansion can bid up 
the prices of hired resources and other inputs. The 
net balance of these opposite forces can only be 
determined through examination of the composi- 
tion of the expanding industry’s required inputs. 

Division of labor and economic growth. An- 
other set of external economies is dynamic and 
closely associated with economic growth and de- 
velopment. As an industry expands, the growth can 
create a number of supplying firms and activities 
which, through increased specialization, afford 
lower-cost products and labor services. There can 
arise specialized banking and financial facilities, 
firms that specialize in machinery design and repair,
warehousing and transportation specialists,
and numerous other activities oriented to servicing 
the industry. A labor force will emerge that is more 
sophisticated. If an industry is concentrated in a 
given region, knowledge gained in one segment 
rapidly spreads and speeds the rate at which cost 
reductions occur and are “competed away” through 
lower prices. The phenomenon was well treated by 
Young (1928). 

Thus industry expansion stimulates the division 
of labor. The keener division of labor lowers costs.
The lower costs and resulting lower prices increase 
output even more, which permits a further division 
of labor. Such a process helps explain the historical 
development of great regional industrial areas. The 
phenomenon, when it cuts across an entire econ- 
omy, can provide a basis for a “take-off” toward the 
goal of a high degree of economic development.

The external economies associated with industrial
development and growth (as well as those
associated with economies of scale) pose special 
problems in the planning and public policy of un- 
developed countries. These countries usually pro- 
duce raw materials or semifinished goods for ex- 
port and import finished products. Policy makers 
thus face interesting but perhaps difficult options. 
To exploit externalities originating in the division 
of labor at a minimum cost of scarce capital, the 
best place to start might be the industries that are 
already the most developed. However, these indus- 
tries are likely to be the raw material or semifin- 
ished product export industries. If they are devel- 
oped further, the external economies may accrue 
mainly to foreigners through lower-priced exports. 
It may thus appear more attractive to concentrate 
on the development of industries that will produce 
products which are extensively imported. The foun- 
dation for such a development strategy may be 
meager, however, and the cost-effectiveness per 
unit of investment may be low. 

But it is in manufacturing industries where great 
gains—apparently resulting from exploiting both 
the dynamics of the division of labor and econo- 
mies of scale—have eventually occurred. In recog- 
nition of these historical phenomena in Western 
developed countries, some students (e.g., Rosen- 
stein-Rodan 1943) have advanced the doctrine of 
“balanced growth.” This doctrine suggests that eco- 
nomic development should be promoted on a wide 
front so that the external economies of each indus- 
try will be mutually reinforcing, thus generating a 
cumulative process of over-all industrial develop- 
ment. 

The precise investment strategy that a develop- 
ing country should adopt is not evident on a purely 
theoretical basis. Much depends upon the price 
elasticities and income elasticities of its traditional 
export industries. Moreover, a country can design 
a structure of tariffs and export duties that could 
prevent an inordinate amount of the benefits due 
to external economies from accruing to foreigners. 
And by earning more foreign exchange through 
more efficient operation of its export industries, a 
country may more effectively gain the resources to 
finance its internal capital investment programs; 
or resources may be freed from the export indus- 
tries to be available for domestically oriented en- 
deavors. Finally, careful attention should be given 
to the precise qualities of the economy’s resources 
that may provide the basis for the proposed state- 
supported domestic—as contrasted with export- 
oriented—industries. At a minimum, it appears 
that a development program must be integrated
with a foreign trade policy. [For further discussion
of the issues in this section, see ECONOMIC GROWTH,
articles on THEORY and MATHEMATICAL THEORY.]

Growth repercussions. Another variety of ex- 
ternal economies may be designated as growth or 
investment repercussions. They were labeled “pe- 
cuniary external economies,” however, by Scitovsky 
(1954). Consider the example where the steel in- 
dustry enjoys high profits. It expands its capacity 
and consequently increases its output. The larger 
output reduces steel prices. The cost to steel-using 
industries thus falls, and they enjoy higher profits. 
Their enhanced profits, although external to the 
steel industry, may be attributed to additional in- 
vestment in the steel industry. 

From this sort of sequence, Scitovsky developed 
the following line of argument: The profit signal 
revealed to the steel industry alone is an inadequate 
measure of the profit that should guide investment 
in the steel industry. If the steel-using industries 
were integrated with the steel-producing industry, 
managers of the integrated industries would have 
a better guide for their decisions. However, the 
repercussions and interactions of investment in a
major industry can extend throughout the entire 
economy, which would suggest that the entire 
economy be “integrated.” 

Thus it is asserted that the price system pro- 
vides a poor guide for investment decisions. The 
policy implication is that central planning and de- 
cision making is a better way to allocate invest- 
ment than is decentralized decision making, since 
the central authority can explicitly take into ac- 
count such external economies. A milder policy 
prescription is that private investors be provided 
with better information about one another's inten- 
tions, which may be done through the French vari- 
ant of economic planning [see PLANNING, ECO- 
NOMIC, article on WESTERN EUROPE]. 

The contention that private investors may be un- 
able to exploit this class of pecuniary “external 
economies” recognizes that, in fact, a market mech- 
anism and its system of price indicators do not 
provide “perfect” intelligence of what the future 
holds. But the price system is not the sole source 
of information in an economic system: business- 
men communicate with one another in other ways; 
they communicate with engineers and scientists on 
technological possibilities, and they conduct con- 
sumer surveys. Conversely, central planners or gov- 
ernment officials also operate under imperfect 
knowledge. The critical substantive issue here is 
therefore: can the aggregate of private investors— 
each highly knowledgeable about his own business 
and responding to prices he confronts in the market
place—more efficiently use investable resources
than can a central authority, which may have less 
detailed knowledge about consumer preference and 
technology but a broader view of the economy in 
its entirety? Both types of decision making will be 
“imperfect” because all decision makers have im- 
perfect knowledge. In part, the question will turn 
upon how highly developed the price system and 
private communication systems are in a particular 
society. In a developing economy the information 
system may be poor. Hence, a case may be made 
for a high degree of central planning in such a 
setting. 

However, even if an instance can be found of 
the price system being an inferior information sys- 
tem, the case for central planning and control of 
investment is not established. The price system 
combined with private investment decision making 
provides widely diffused control over investable 
resources. Centrally controlled investment decision 
making is subject only to constraints that affect 
the entire economy. The choice is therefore one 
between many small decision makers making many 
small mistakes because of a poor information sys- 
tem, as opposed to the central authority possibly 
making fewer mistakes but perhaps making monu- 
mental ones. Which system is the most efficient is 
not obvious. 


External economies and diseconomies are a man- 
ifestation of the fact that, in complex systems, one 
man’s decision or behavior can often have an un- 
designed impact upon others. The trick in system 
design is to establish arrangements by which the 
mutually interacting and dependent behavior of all 
decision makers harmonizes so that the larger sys- 
tem operates in an optimal or efficient way. 

The kinds of problems and phenomena we have 
discussed are not unique to the operation of a pri- 
vate enterprise social economy, although they have 
been most extensively treated by economists in 
such a context. The large multiproduct corpora- 
tion, the government agency, a military service, or 
a university—organizations that may be character- 
ized as “closed” systems and may be “centrally man- 
aged” to a high degree—have identical problems. 
Decision making and authority are necessarily dif- 
fused (governments consist of departments and 
bureaus, armies consist of divisions and squads, 
etc.); the decisions of many must nevertheless 
result in some coordinated and mutually consistent 
behavior; decision makers must be constrained as 
well as motivated; finally, they must be able to 
obtain knowledge about their constraints and op- 
portunities, which includes the impact of the 
behavior of others. In varying degrees, discussions 
of externalities focus on these fundamental aspects 
of system or organization design and management. 

The problems are basically those of specifying 
over-all system objectives, measuring effectiveness 
criteria, identifying and measuring the relevant 
cost concept, determining the relative merits of 
alternative information systems (with particular 
reference to the cost and worth of obtaining and 
communicating information), and specifying the 
appropriate “decision rules” that should guide indi- 
vidual decision makers. 

Much of the economic literature on “external- 
ities” suggests that economists have often failed to 
meet these problems head-on. On the other hand, 
the discipline of economics and much of the liter- 
ature on the general operation of the price system 
—the product of nearly two hundred years of effort 
to understand the workings of a complex social 
economy—provide worthwhile insights into the 
problems that confront all large organizations. 


J. A. STOCKFISCH 


[See also WELFARE ECONOMICS.] 


I. STATISTICAL ASPECTS 
II. PSYCHOLOGICAL APPLICATIONS 


I 
STATISTICAL ASPECTS 


In many fields of research—for example, agri- 
culture (Banks 1954), psychology (Burt 1947), 
economics (Geary 1948), medicine (Hammond 
1944; 1955), and the study of accidents (Herdan 
1943), but notably in psychology and the other 
social sciences—an experimenter frequently has 
scores for each member of a sample of individuals, 
animals, or other experimental units on each of a 
number of variates, such as cognitive tests, per- 
sonality inventories, sociometric and socioeconomic 
ratings, and physical or physiological measures. If 
the number of variates is large, or even moderately 
so, the experimenter may wish to seek some re- 
duction or simplification of his data. One approach 
to this problem is to search for some hypothetical 
variates that are weighted sums of the observed 
variates and that, although fewer in number than 
the latter, can be used to replace them. The sta- 
tistical techniques by which such a reduction of 
data is achieved are known collectively as factor 
analysis, although it is well to note here that the 
principal component method of analysis discussed 
below (see also Kendall & Lawley 1956) has cer- 
tain special features. The derived variates are gen- 
erally viewed merely as convenient descriptive 
summarizations of the observed data. But occa- 
sionally their composition is such that they appear 
to represent some general basic aspects of everyday 
life, performance or achievement, and in such 
cases they are often suitably labeled and are 
referred to as factors. Typical examples from 
psychology are such factors as “numerical ability,” 
“originality,” “neuroticism,” and “toughmindedness.” 
This article describes the statistical procedures in 
general use for arriving at these hypothetical vari- 
ates or factors. 

Preliminary concepts. Suppose that for a ran- 
dom sample of size N from some population, scores 
exist on each of p jointly normally distributed 
variates $x_i$ ($i = 1, 2, \dots, p$). If the scores on each 
variate are expressed as deviations from the sample 
mean of that variate, then an unbiased estimator 
of the variance of $x_i$ is given by the expression 


$a_{ii} = (N-1)^{-1} \sum x_i^2,$ 


summation being over the sample of size N. Simi- 
larly, an unbiased estimator of the covariance be- 
tween variates $x_i$ and $x_j$ is given by 


$a_{ij} = (N-1)^{-1} \sum x_i x_j.$ 


Note that this is conventional condensed notation. 
A fuller, but clumsier, notation would use $x_{i\nu}$ for 
the deviation ($\nu = 1, \dots, N$) so that $\sum x_i^2$ really 
means $\sum_{\nu} x_{i\nu}^2$. 
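
The two estimators above can be sketched in a few lines. This is only an illustration, assuming NumPy is available; the helper name `sample_covariance` and the small score matrix `scores` are invented for the example and do not come from the article.

```python
import numpy as np

def sample_covariance(scores):
    """Unbiased variance-covariance matrix A from an N x p score matrix."""
    scores = np.asarray(scores, dtype=float)
    n_obs = scores.shape[0]
    # Express each variate as deviations from its sample mean.
    deviations = scores - scores.mean(axis=0)
    # a_ij = (N - 1)^{-1} * sum over individuals of x_i * x_j
    return deviations.T @ deviations / (n_obs - 1)

# Hypothetical data: N = 5 individuals, p = 3 variates.
scores = np.array([[2.0, 4.0, 1.0],
                   [3.0, 5.0, 0.0],
                   [5.0, 9.0, 2.0],
                   [4.0, 6.0, 3.0],
                   [6.0, 8.0, 4.0]])
A = sample_covariance(scores)
```

The diagonal of `A` holds the variance estimates $a_{ii}$ and the off-diagonal cells the covariance estimates $a_{ij}$.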

In practice, factor analysis is often used even in 
cases in which its usual assumptions are known 
to be appreciably in error. Such uses make the 
tacit presumption that the effect of the erroneous 
assumptions will be small or negligible. Unfortu- 
nately, nearly nothing is known about the circum- 
stances under which this robustness, or nonsensi- 
tivity to errors in assumptions, is justified. Of 
course, the formal manipulations may always be 
carried out; the assumptions enter crucially into 
distribution theory and optimality of the estimators. 

The estimated variances and covariances between 
the p variates can conveniently be written in 
square matrix form as follows: 


$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1p} \\ a_{21} & a_{22} & \cdots & a_{2p} \\ \vdots & \vdots & & \vdots \\ a_{p1} & a_{p2} & \cdots & a_{pp} \end{pmatrix}$ 


Since $a_{ij} = a_{ji}$, the matrix A is symmetric about its 
main diagonal. 

From the terms of A, the sample correlations, 
$r_{ij}$, between the pairs of variates may be obtained 
from 

$r_{ij} = \frac{a_{ij}}{(a_{ii} a_{jj})^{1/2}},$ 

with $r_{ii} = 1$. The corresponding matrix is the cor- 
relation matrix. 
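
As a rough sketch of this conversion, assuming NumPy, the correlation matrix can be obtained by dividing each covariance by the product of the two standard deviations. The function name and the numerical matrix `A` are made up for illustration.

```python
import numpy as np

def correlation_from_covariance(A):
    """r_ij = a_ij / sqrt(a_ii * a_jj) for every pair of variates."""
    A = np.asarray(A, dtype=float)
    sd = np.sqrt(np.diag(A))          # standard deviations sqrt(a_ii)
    return A / np.outer(sd, sd)

# Hypothetical covariance matrix for p = 3 variates.
A = np.array([[4.0, 2.0, 0.6],
              [2.0, 9.0, 1.5],
              [0.6, 1.5, 1.0]])
R = correlation_from_covariance(A)
```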

The partial correlation concept is helpful here. 
If, to take the simplest case, estimates of the corre- 
lations between three variates are available, then 
the estimated correlation between any two, say $x_i$ 
and $x_j$, for a given constant value of the third, $x_k$, 
can be found from the expression 


$r_{ij \cdot k} = \frac{r_{ij} - r_{ik} r_{jk}}{[(1 - r_{ik}^2)(1 - r_{jk}^2)]^{1/2}}$ 

and is denoted by $r_{ij \cdot k}$. 
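
A small numerical illustration of the partial correlation formula may help. The function name and the correlation values below are hypothetical; the first call uses values chosen so that the third variate accounts completely for the correlation between the other two.

```python
import math

def partial_correlation(r_ij, r_ik, r_jk):
    """Correlation between variates i and j with variate k held constant."""
    numerator = r_ij - r_ik * r_jk
    denominator = math.sqrt((1.0 - r_ik ** 2) * (1.0 - r_jk ** 2))
    return numerator / denominator

# Here r_ij equals r_ik * r_jk, so nothing is left once k is removed.
fully_explained = partial_correlation(0.3, 0.6, 0.5)

# Here some correlation between i and j remains after k is removed.
partly_explained = partial_correlation(0.5, 0.6, 0.5)
```

The first case is the situation that factor analysis looks for, with a hypothetical factor playing the role of the third variate.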

In terms of a correlation matrix, the aim of fac- 
tor analysis can be simply stated in terms of partial 
correlations (see Howe 1955). The first question 
asked is whether a hypothetical random variate $f_1$ 
exists such that the partial correlations $r_{ij \cdot f_1}$, for 
all i and j, are zero, within the limits of sampling 
error, after the effect of $f_1$ has been removed. (If 
this is so, it is customary to say that the correlation 
matrix, apart from its diagonal cells, is of rank 
one, but details will not be given here.) If the 
partial correlations are not zero, then the question 
is asked whether two hypothetical random variates, 
$f_1$ and $f_2$, exist such that the partial correlations 
between the variates are zero after the effects of 
both $f_1$ and $f_2$ have been removed from the original 
matrix, and so on. (If $f_1$ and $f_2$ reduce the partial 
correlations to zero, then the matrix, apart from 
its diagonal cells, is said to be of rank two, and 
so on.) The aim of the procedure is to replace the 
observed variates with a set of derived variates that, 
although fewer in number than the former, are still 
adequate to account for the correlations between 
them. In other words, the derived variates, or fac- 
tors, account for the variance common to the ob- 
served variates. 


Historical note. Factor analysis is generally 
taken to date from 1904, when C. E. Spearman 
published an article entitled “ ‘General Intelligence’ 
Objectively Determined and Measured.” Spearman 
postulated that a single hypothetical variate would 
in general account for the intercorrelations of a 
set of cognitive tests, and this variate was his 
famous factor “g.” For the sets of tests that Spear- 
man was considering, this hypothesis seemed rea- 
sonable. As further matrices of correlations became 
available, however, it soon became obvious that 
Spearman’s hypothesis was an oversimplification 
of the facts, and multiple factor concepts were de- 
veloped. L. L. Thurstone, in America, and C. Burt 
and G. H. Thomson, in Britain, were the most ac- 
tive pioneers in this movement. Details of their 
contributions and references to early journal arti- 
cles can be found in their textbooks (Thurstone 
1935; 1947; Burt 1940; Thomson 1939). These 
writers were psychologists, and the statistical 
methods they developed for estimating factors were 
more or less approximate in nature. The first rig- 
orous attempt by a mathematical statistician to 
treat the problem of factor estimation (as distinct 
from principal components) came with the publi- 
cation in 1940 of a paper by D. N. Lawley entitled 
“The Estimation of Factor Loadings by the Method 
of Maximum Likelihood.” Since 1940, Lawley has 
published other articles dealing with various factor 
problems, and further contributions have been 
made by Howe (1955), by Anderson and Rubin 
(1956), and by Rao (1955), to mention just a few. 
Modern textbooks on factor analysis are those of 
Harman (1960) and Lawley and Maxwell (1963). 

While methods of factor analysis, based on the 
above model, were being developed, Hotelling in 
1933 published his principal components model, 
which, although it bears certain formal resem- 
blances to the factor model proper, has rather dif- 
ferent aims. It is widely used today and is described 
below. 

The basic factor equations. The factor model 
described in general correlational terms above can 
be expressed more explicitly by the equations 

(1)  $x_i = \sum_{s=1}^{k} l_{is} f_s + e_i, \qquad k < p.$ 

In these equations k (the number of factors) is 
specified; $f_s$ stands for the factors (generally re- 
ferred to as common factors, since they usually 
enter into the composition of more than one vari- 
ate). The factors are taken to be normally distrib- 
uted and, without loss of generality, to have zero 
means and unit variances; to begin with, they will 
be assumed to be independent. The term $e_i$ refers 
to a residual random variate affecting only the 
variate $x_i$. There are p of these $e_i$, and they are 
assumed to be normally distributed with zero 
means and to be independent of each other and 
of the $f_s$. Their variances will be denoted by $v_i$; 
the diagonal matrix of the $v_i$ is called V. The 
$l$-values are called loadings (weights), $l_{is}$ being 
the loading of the ith variate on the sth factor. 
The quantities $l_{is}$ and $v_i$ are taken to be unknown 
parameters that have to be estimated. If a subscript 
for individual were introduced, it would be added 
to $x_i$ and $f_s$, but not to $l_{is}$ or $v_i$. 

If the population variance—covariance matrix 
corresponding to the sample matrix A is denoted 
by C, with elements $c_{ij}$, then it follows from the 
model that 


(2)  $c_{ii} = \sum_{s=1}^{k} l_{is}^2 + v_i$ 

and 

(3)  $c_{ij} = \sum_{s=1}^{k} l_{is} l_{js} \qquad (i \neq j).$ 

If the loadings for p variates on k factors are de- 
noted by the $p \times k$ matrix L, with transpose $L'$, 
eqs. (2) and (3) can be combined in the single 
matrix equation 


(4)  $C = LL' + V.$ 
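
To make eq. (4) concrete, the following sketch builds a population covariance matrix from a loading matrix and residual variances. It assumes NumPy; the loadings in `L` are invented for illustration (p = 6 variates, k = 2 factors), and the residual variances are chosen so that every variate has unit variance.

```python
import numpy as np

# Hypothetical loadings l_is: rows are variates, columns are factors.
L = np.array([[0.8, 0.0],
              [0.7, 0.2],
              [0.6, 0.3],
              [0.1, 0.7],
              [0.0, 0.8],
              [0.2, 0.6]])

# Residual variances v_i chosen so that c_ii = 1 (a correlation metric).
v = 1.0 - np.sum(L ** 2, axis=1)
V = np.diag(v)

# Eq. (4): C = L L' + V.
C = L @ L.T + V

# Eq. (3) for one pair of variates: c_12 = sum over s of l_1s * l_2s.
c_12 = float(np.sum(L[0] * L[1]))
```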


Estimating the parameters in the model. Since 
the introduction of multiple factor analysis, various 
approximate methods for estimating the param- 
eters $l_{is}$ and $v_i$ have been proposed. Of these, the 
best known is the centroid, or simple summation, 
method. It is well described in the textbooks men- 
tioned above, but since the arithmetic details are 
unwieldy, they will not be given here. The method 
works fairly well in practice, but there is an arbi- 
trariness in its procedure that makes statistical 
treatment of it almost impossible (see Lawley & 
Maxwell 1963, chapter 3). For a rigorous approach 
to the estimation of the factor parameters, I turn 
to the method of maximum likelihood, although 
this decision requires some justification. The maxi- 
mum likelihood method of factor estimation has 
not been widely used in the past for two reasons. 
First, it involves very onerous calculations which 
were well-nigh prohibitive before the development 
of electronic computers. Second, the arithmetic 
procedures available, which were iterative, fre- 
quently did not lead to convergent estimates of the 
loadings. But recently, largely because of the work 
of the Swedish statistician K. G. Jöreskog, quick 
and efficient estimation procedures have been 
found. These methods are still being perfected, but 
a preliminary account of them is contained in a 
recent paper (Jöreskog 1966). When they become 
better known, it is likely that the maximum likeli- 
hood method of factor analysis will become the 
accepted method. An earlier monograph by Jöres- 
kog (1963) is also of interest. In it he links up 
work by Guttman (1953) on image theory with 
classical factor analytic concepts (see also Kaiser, 
in Harris 1963). (The image of a variate is defined 
as that part of its variance which can be estimated 
from the other variates in a matrix.) 

The first point to note about eqs. (1) is that 
since the p observed variates $x_i$ are expressed in 
terms of p + k other variates, namely, the k com- 
mon factors and the p residual variates, which are 
not observable, these equations are not capable of 
direct verification. But eq. (4) implies a hypoth- 
esis, $H_0$, regarding the covariance matrix C, which 
can be tested: that it can be expressed as the sum 
of a diagonal matrix with positive diagonal ele- 
ments and a symmetric positive semidefinite matrix 
with at most k nonzero latent roots; these matrices are 
respectively V and $LL'$. The value postulated for k 
must not be too large; otherwise, the hypothesis 
would be trivially true. If the $v_i$ were known, it 
would only be necessary to require k < p, but in 
the more usual case, where they are unknown, the 
condition can be shown to be $(p + k) < (p - k)^2$. 
Since the $x_i$ are assumed to be distributed in a 
multivariate normal way, the log-likelihood func- 
tion, omitting a function of the observations, is 
given by 


(5)  $L = -\tfrac{1}{2} n \ln |C| - \tfrac{1}{2} n \sum_{i,j} a_{ij} c^{ij},$ 


where $n = N - 1$, $|C|$ is the determinant of the 
matrix C, and $c^{ij}$ is the element in the ith row and 
jth column of its inverse, $C^{-1}$. To find maximum 
likelihood estimators of $l_{is}$ and $v_i$, (5) is differen- 
tiated with respect to them and the results are 
equated to zero. A difficulty arises, however, when 
k > 1, for there are then too many parameters in 
the model for them to be specified uniquely. This 
can be seen by an examination of eq. (4), for if L 
is postmultiplied by an orthogonal matrix M, the 
value of $LL'$, which is now given by $LMM'L'$, is 
unaltered since $MM' = I$, the identity matrix. 
This means that the maximum likelihood method, 
although it provides a unique set of estimates of 
the $c_{ij}$, leads to equations for estimating the $l_{is}$ 
which are satisfied by an infinity of solutions, all 
equally good from a statistical point of view. 
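
The indeterminacy described here is easy to check numerically. In this sketch, which assumes NumPy, the loading matrix `L` and the 30-degree rotation are arbitrary illustrative choices; any orthogonal M would serve.

```python
import numpy as np

# Hypothetical loadings for p = 6 variates on k = 2 factors.
L = np.array([[0.8, 0.0],
              [0.7, 0.2],
              [0.6, 0.3],
              [0.1, 0.7],
              [0.0, 0.8],
              [0.2, 0.6]])

# An orthogonal matrix M: a plane rotation through 30 degrees.
theta = np.deg2rad(30.0)
M = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

L_rotated = L @ M

# The common part of the covariance matrix before and after rotation.
common_before = L @ L.T
common_after = L_rotated @ L_rotated.T
```

The loadings themselves change under the rotation, but `common_before` and `common_after` agree, so the fitted covariance matrix cannot distinguish between the two sets of loadings.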

In this situation all the statistician can do is to 
select a particular solution, one that is convenient 
to find, and leave the experimenter to apply what- 
ever rotation he thinks desirable. Thus the custom 
is to choose L in such a way that the $k \times k$ matrix 
$J = L'V^{-1}L$ is diagonal. It can be shown that the 
successive elements of J are the latent roots, in 
order of magnitude, of the matrix $V^{-1/2}(A - V)V^{-1/2}$, 
so that for a given value of V, the determination 
of the factors in the factor model resembles the 
determination of the principal components in the 
component model. 
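
The resemblance to a component analysis can be sketched as follows for the case in which V is treated as given. This is an illustration under stated assumptions, not the estimation procedure of the cited authors: it assumes NumPy, the function name `loadings_given_v` is invented, and the example feeds in an exact population matrix built from made-up loadings rather than sample data.

```python
import numpy as np

def loadings_given_v(A, v, k):
    """Loadings L with L' V^{-1} L diagonal, for fixed residual variances v."""
    A = np.asarray(A, dtype=float)
    v = np.asarray(v, dtype=float)
    scale = 1.0 / np.sqrt(v)
    # Form V^{-1/2} (A - V) V^{-1/2}.
    reduced = scale[:, None] * (A - np.diag(v)) * scale[None, :]
    roots, vectors = np.linalg.eigh(reduced)     # roots in ascending order
    order = np.argsort(roots)[::-1][:k]          # keep the k largest roots
    roots_k = np.clip(roots[order], 0.0, None)   # guard against tiny negatives
    vectors_k = vectors[:, order]
    # L = V^{1/2} U_k Theta_k^{1/2}, so that L' V^{-1} L = Theta_k.
    L = np.sqrt(v)[:, None] * vectors_k * np.sqrt(roots_k)[None, :]
    return L, roots_k

# Hypothetical population: p = 6 variates, k = 2 factors, unit variances.
L_true = np.array([[0.8, 0.0], [0.7, 0.2], [0.6, 0.3],
                   [0.1, 0.7], [0.0, 0.8], [0.2, 0.6]])
v_true = 1.0 - np.sum(L_true ** 2, axis=1)
A = L_true @ L_true.T + np.diag(v_true)

L_hat, roots = loadings_given_v(A, v_true, k=2)
J = L_hat.T @ np.diag(1.0 / v_true) @ L_hat
```

The recovered `L_hat` is a rotated version of `L_true`: it reproduces the same common covariance, and `J` comes out diagonal with the latent roots in order of magnitude.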

The maximization of eq. (5) with the above 
diagonalization side condition leads to the equations 


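$\hat{c}_{ii} = a_{ii}$ 

or 

(6)  $\hat{v}_i = a_{ii} - \sum_{s=1}^{k} \hat{l}_{is}^2$ 

and 

(7)  $\hat{L}' = \hat{J}^{-1} \hat{L}' \hat{V}^{-1} (A - \hat{V}),$ 


where circumflex accents denote estimates of the 
parameters in question. Eq. (7) can usually be 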
solved by iterative methods and details of those in 
current use can be found in Lawley and Maxwell 
(1963), Howe (1955), and Jöreskog (1963; 1966). 
The calculations involved are onerous, and when p 
is fairly large, say 12 or more, an electronic com- 
puter is essential. 

A satisfactory property of the above method of 
estimation, which does not hold for the centroid 
and principal component methods, is that it can be 
shown to be independent of the metric used. A 
change of scale of any variate $x_i$ merely introduces 
proportional changes in its loadings. 

Testing hypotheses on number of factors. In 
the factor analysis of a set of data the value of k 
is seldom known in advance and has to be esti- 
mated. To begin with, some value of it is assumed 
and a matrix of loadings L for this value is esti- 
mated. The effects of the factors concerned are 
now eliminated from the observed covariance (or 
correlation) matrix, and the residual matrix, 
$A - \hat{L}\hat{L}'$, is tested for significance. If it is found to 
be statistically significant, the value of k is increased 
by one and the estimation process is repeated. The 
test employed is of the large sample chi-square type, 
based on the likelihood ratio method of Neyman 
and Pearson, and is given by 


(8)  $\chi^2 = n \ln (|\hat{C}| / |A|),$ 


with $\tfrac{1}{2}\{(p - k)^2 - (p + k)\}$ degrees of freedom. A 
good approximation to expression (8), and one 
easier to calculate, is 


(9)  $\chi^2 = n \sum_{i<j} (a_{ij} - \hat{c}_{ij})^2 / (\hat{v}_i \hat{v}_j).$ 


There is also some evidence to suggest that the 
test can be improved by replacing n by 
$n' = n - \tfrac{1}{6}(2p + 5) - \tfrac{2}{3}k$. 
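
The test statistic, its degrees of freedom, and the suggested replacement for n can be sketched as below. The sketch assumes NumPy and takes the estimated loadings and residual variances as given; the function names and the sample size of 200 are invented for illustration, and expression (8) is only meaningful when those estimates are maximum likelihood ones.

```python
import numpy as np

def degrees_of_freedom(p, k):
    """Half of (p - k)^2 - (p + k); the bracket is always an even number."""
    return ((p - k) ** 2 - (p + k)) // 2

def likelihood_ratio_chi_square(A, L_hat, v_hat, n_obs, corrected=True):
    """Expression (8), optionally with n replaced by n'."""
    A = np.asarray(A, dtype=float)
    p, k = L_hat.shape
    C_hat = L_hat @ L_hat.T + np.diag(v_hat)
    n = n_obs - 1
    if corrected:
        n = n - (2 * p + 5) / 6.0 - 2.0 * k / 3.0
    # Work with log-determinants for numerical stability.
    _, logdet_c = np.linalg.slogdet(C_hat)
    _, logdet_a = np.linalg.slogdet(A)
    return n * (logdet_c - logdet_a), degrees_of_freedom(p, k)

# Hypothetical case in which the model reproduces A exactly.
L_hat = np.array([[0.8, 0.0], [0.7, 0.2], [0.6, 0.3],
                  [0.1, 0.7], [0.0, 0.8], [0.2, 0.6]])
v_hat = 1.0 - np.sum(L_hat ** 2, axis=1)
A = L_hat @ L_hat.T + np.diag(v_hat)
statistic, df = likelihood_ratio_chi_square(A, L_hat, v_hat, n_obs=200)
```

With a perfectly fitting model the statistic is zero; in practice it would be compared with the chi-square distribution on `df` degrees of freedom.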

Factor interpretation. As already mentioned, 
the matrix of loadings, L, given by a factor analysis 
is not unique and can be replaced by an equivalent 
set LM where M is an orthogonal matrix. This fact 
is frequently used by experimenters when inter- 
preting their results, a matrix M being chosen that 
will in some way simplify the pattern of loadings 
or make it more intuitively meaningful. For exam- 
ple, M may be chosen so as to reduce to zero, or 
nearly zero, as many loadings as possible in order 
to reduce the number of parameters necessary for 
describing the data. Again, M may be chosen so 
as to concentrate the loadings of variates of similar 
content, say verbal tests, on a single factor so that 
this factor may be labeled appropriately. Occasion- 
ally, too, the factors are allowed to become corre- 
lated if this seems to lead to more meaningful 
results. 

It is now clear that given a matrix of loadings 
from some analysis, different experimenters might 
choose different rotation matrices in their interpre- 
tation of the data. This subjective element in factor 
analysis has led to a great deal of controversy. To 
avoid subjectivity, various empirical methods of 
rotation have been proposed which, while tending 
to simplify the pattern of loadings, also lead to 
unique solutions. The best known of these are the 
varimax and the promax methods (for details see 
Kaiser 1958; Hendrickson & White 1964). But 
another approach to the problem, proposed inde- 
pendently by Howe (1955), Anderson and Rubin 
(1956), and Lawley (1958), seems promising. 
From prior knowledge the experimenter is asked 
to postulate in advance (a) how many factors he 
expects from his data and (b) which variates will 
have zero loadings on the several factors. In other 
words, he is asked to formulate a specific hypothe- 
sis about the factor composition of his variates. 
The statistician then estimates the nonzero load- 
ings and makes a test of the “goodness of fit” of 
the factor structure. In this approach the factors 
may be correlated or uncorrelated, and in the 
former case estimates of the correlations between 
them are obtained. The equations of estimation and 
illustrative examples of their application can be 
found in Howe (1955) and in Lawley and Maxwell 
(1963; 1964); the latter gives a quick method of 
finding approximate estimates of the nonzero load- 
ings. 

Estimating factor scores. As the statistical the- 
ory of factor analysis now stands, estimation is a 
twofold process. First, the factor structure, as de- 
scribed above, of a set of data is determined. In 
practice, however, it is often desirable to find, in 
addition, equations for estimating the scores of 
individuals on the factors themselves. One method 
of doing this, developed by Thomson, is known as 
the “regression method.” In it the $l_{is}$ are taken to be 
the covariances between the $f_s$ and the $x_i$, and then 
for uncorrelated factors the estimation equation is 


(10)  $\hat{f} = L'C^{-1}x,$ 

or, more simply from the computational viewpoint, 

(11)  $\hat{f} = (I + J)^{-1} L'V^{-1}x,$ 


where $\hat{f} = \{\hat{f}_1, \hat{f}_2, \dots, \hat{f}_k\}'$, $x = \{x_1, x_2, \dots, x_p\}'$, 
and, as before, $J = L'V^{-1}L$, and I is the identity 
matrix. If sampling errors in L and V are neglected, 
the covariance matrix for the errors of estimates 
of the factor scores is given by $(I + J)^{-1}$. 
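
That the two forms of the regression estimate agree can be checked numerically. The sketch below assumes NumPy; the loadings, residual variances, and the score vector `x` for one individual are all hypothetical.

```python
import numpy as np

# Hypothetical loadings and residual variances (p = 6, k = 2).
L = np.array([[0.8, 0.0], [0.7, 0.2], [0.6, 0.3],
              [0.1, 0.7], [0.0, 0.8], [0.2, 0.6]])
v = 1.0 - np.sum(L ** 2, axis=1)
V_inv = np.diag(1.0 / v)
C = L @ L.T + np.diag(v)
J = L.T @ V_inv @ L
identity = np.eye(L.shape[1])

# Hypothetical standardized scores of one individual on the six variates.
x = np.array([0.5, -1.0, 0.3, 1.2, 0.8, -0.4])

# Eq. (10): requires solving a p x p system.
f_eq10 = L.T @ np.linalg.solve(C, x)

# Eq. (11): requires only a k x k system, since V is diagonal.
f_eq11 = np.linalg.solve(identity + J, L.T @ V_inv @ x)

# Covariance matrix of the errors of estimate.
error_cov = np.linalg.inv(identity + J)
```

The computational advantage of the second form is that only a k x k matrix has to be inverted, which mattered a great deal when p was large and computing was expensive.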

If the factors are correlated and their estimated 
correlation matrix is denoted by P, then eqs. (10) 
and (11) become, respectively, 


(12)  $\hat{f} = PL'C^{-1}x,$ 

and 

(13)  $\hat{f} = (P^{-1} + J)^{-1} L'V^{-1}x,$ 


while the errors of estimates are given by $(P^{-1} + J)^{-1}$. 

An alternative method of estimating factor scores 
is that of Bartlett (1938). Here, the principle 
adopted is the minimization, for a given set of 
observations, of $\sum_i e_i^2 / v_i$, which is the sum of 
squares of standardized residuals. The estimation 
equation now is 


(14)  $\hat{f}^{*} = J^{-1} L'V^{-1}x.$ 


It is of interest to note that although the sets of 
estimates obtained by the two methods have been 
reached by entirely different approaches, a compar- 
ison shows that they are simply related. For uncor- 
related factors the relationship is 


$\hat{f}^{*} = (I + J^{-1}) \hat{f};$ 

for correlated factors it is 

$\hat{f}^{*} = (I + J^{-1}P^{-1}) \hat{f}.$ 
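
The relationship between the two kinds of estimate for uncorrelated factors can be verified with a short calculation. As before, this is a sketch assuming NumPy, with invented loadings and an invented score vector; the variable names are illustrative only.

```python
import numpy as np

# Hypothetical loadings and residual variances (p = 6, k = 2).
L = np.array([[0.8, 0.0], [0.7, 0.2], [0.6, 0.3],
              [0.1, 0.7], [0.0, 0.8], [0.2, 0.6]])
v = 1.0 - np.sum(L ** 2, axis=1)
V_inv = np.diag(1.0 / v)
J = L.T @ V_inv @ L
identity = np.eye(L.shape[1])

# Hypothetical scores of one individual.
x = np.array([0.5, -1.0, 0.3, 1.2, 0.8, -0.4])
weighted = L.T @ V_inv @ x                    # L' V^{-1} x

# Regression estimate: (I + J)^{-1} L' V^{-1} x.
f_regression = np.linalg.solve(identity + J, weighted)

# Bartlett estimate: J^{-1} L' V^{-1} x.
f_bartlett = np.linalg.solve(J, weighted)

# The stated relationship: Bartlett = (I + J^{-1}) * regression.
f_from_relation = (identity + np.linalg.inv(J)) @ f_regression
```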


Comparing factors across populations. If fac- 
tors can be viewed as representing “permanent” 
aspects of behavior or performance, ways of identi- 
fying them from one population to another are 
required. In the past, identification has generally 
been based on the comparison of matrices of load- 
ings. In the case of two matrices, a common ap- 
proach, developed by Ahmavaara (1954) and 
Cattell and Hurley (1962), is to rotate one into 
maximum conformity in the least squares sense 
with the other. For example, the matrix required 
for rotating $L_1$ into maximum conformity with $L_2$, 
when they both involve the same variates, is ob- 
tained by calculating the expression $(L_1'L_1)^{-1}L_1'L_2$ 
and normalizing it by columns. The factors repre- 
sented by $L_1$ in its transformed state are likely to 
be more or less correlated, but estimates of the 
correlations between them are given by $(M'M)^{-1}$, 
standardized so that its diagonal cells are unity, 
where $M = (L_1'L_1)^{-1}L_1'L_2$ and $M'$ is its transpose. 
This procedure is fairly satisfactory when the 
sample covariance matrices involved do not differ 
significantly. When they do, the problem of iden- 
tifying factors is more complicated. 
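
The least squares rotation of one loading matrix toward another can be sketched as follows. The sketch assumes NumPy; the function name `conformity_transform` is invented, and the example uses a second matrix that is an exact orthogonal rotation of the first, so the procedure should recover that rotation and report uncorrelated factors.

```python
import numpy as np

def conformity_transform(L1, L2):
    """Column-normalized (L1'L1)^{-1} L1'L2 and the implied factor correlations."""
    M = np.linalg.solve(L1.T @ L1, L1.T @ L2)   # least squares solution
    M = M / np.linalg.norm(M, axis=0)           # normalize each column
    G = np.linalg.inv(M.T @ M)
    d = np.sqrt(np.diag(G))
    factor_corr = G / np.outer(d, d)            # diagonal cells set to unity
    return M, factor_corr

# Hypothetical loadings from a first population.
L1 = np.array([[0.8, 0.0], [0.7, 0.2], [0.6, 0.3],
               [0.1, 0.7], [0.0, 0.8], [0.2, 0.6]])

# Loadings from a second analysis that differ only by a 25-degree rotation.
theta = np.deg2rad(25.0)
rotation = np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])
L2 = L1 @ rotation

M, factor_corr = conformity_transform(L1, L2)
L1_transformed = L1 @ M
```

With real data the two matrices will not be exact rotations of one another, and the off-diagonal cells of `factor_corr` then indicate how far the transformed factors have become correlated.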

A possible approach to it has been suggested by 
Lawley and Maxwell (1963, chapter 8), who make 
the assumption that although two covariance ma- 
trices, $C_1$ and $C_2$, involving the same variates may 
be different, they may still have the same L-matrix. 
This could occur if the two $k \times k$ covariance ma- 
trices $\Gamma_1$ and $\Gamma_2$ between the factors themselves 
were different. To keep the model fairly simple, 
they assume that the residual variances in the 
populations are in each case V and then set up 
the equations 

(15)  $C_1 = L\Gamma_1 L' + V,$ 

      $C_2 = L\Gamma_2 L' + V.$ 


For this model Lawley and Maxwell show how 
estimates of L, V, $\Gamma_1$, and $\Gamma_2$ may be obtained from 
two sample covariance matrices $A_1$ and $A_2$. They 
also supply a test for assessing the significance of 
the difference between the estimates of $\Gamma_1$ and $\Gamma_2$, 
and also for testing the “goodness of fit” of the 
model. 


The method of principal components 

The principal component method of analyzing a 
matrix of covariances or correlations is also widely 
used in the social sciences. The components corre- 
spond to the latent roots of the matrix, and the 
weights defining them are proportional to the cor- 
responding latent vectors. 

The model can also be stated in terms of the 
observed variates and the derived components. An 
orthogonal transformation is applied to the $x_i$ 
($i = 1, 2, \dots, p$) to produce a new set of uncorre- 
lated variates $y_1, y_2, \dots, y_p$. These are chosen such 
that $y_1$ has maximum variance, $y_2$ has maximum 
variance subject to being uncorrelated with $y_1$, and 
so on. This is equivalent to a rotation of the coordi- 
nate system so that the new coordinate axes lie along 
the principal axes of an ellipsoid closely related to 
the covariance structure of the $x_i$. The transformed 
variates are then standardized to give a new set, 
which will be denoted $z_s$. When this method is used, 
no hypothesis need be made about the nature or 
distribution of the $x_i$. The model is by definition 
linear and additive, and the basic equations are 

(16)  $x_i = \sum_{s=1}^{p} w_{is} z_s, \qquad i, s = 1, 2, \dots, p,$ 

where $z_s$ stands for the sth component, and $w_{is}$ is 
the weight of the sth component in the ith variate. 
In matrix notation eqs. (16) become 


$x = Wz,$ 


where $x = \{x_1, x_2, \dots, x_p\}'$, $z = \{z_1, z_2, \dots, z_p\}'$, and 
W is a square matrix of order p with elements $w_{is}$. 

Comparison of eqs. (16) with eqs. (1) shows 
that in the principal component model residual 
variates do not appear, and that if all p components 
are obtained, the sample covariances can be repro- 
duced exactly, that is, $A = WW'$. Indeed, there is 
a simple reciprocal relationship between the ob- 
served variates and the derived components. 

A straightforward iterative method for obtaining 
the weights $w_{is}$ is given by Hotelling in his original 
papers; the details are also given in most textbooks 
on factor analysis. In practice, all p components are 
seldom found, for a small number generally ac- 
counts for a large percentage of the variance of the 
variates and can be used to summarize the data. 
There is also a criterion, developed by Bartlett 
(1950; 1954), for testing the equality of the re- 
maining latent roots of a matrix after the first k 
have been extracted; this is sometimes used to help 
in deciding when to stop the analysis. 
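
A minimal sketch of the component calculation is given below. It does not follow Hotelling's iterative method; instead it assumes NumPy and uses a library routine for the latent roots and vectors of a symmetric matrix. The function name and the covariance matrix `A` are invented for illustration.

```python
import numpy as np

def principal_component_weights(A):
    """Weights W (x = W z, z standardized) and latent roots, largest first."""
    A = np.asarray(A, dtype=float)
    roots, vectors = np.linalg.eigh(A)        # ascending order
    order = np.argsort(roots)[::-1]
    roots, vectors = roots[order], vectors[:, order]
    # Each column of weights is a latent vector scaled by the root's square root.
    W = vectors * np.sqrt(roots)
    return W, roots

# Hypothetical covariance matrix for p = 3 variates.
A = np.array([[4.0, 2.0, 0.6],
              [2.0, 9.0, 1.5],
              [0.6, 1.5, 1.0]])
W, roots = principal_component_weights(A)

# Proportion of total variance accounted for by each component.
variance_share = roots / roots.sum()

# With all p components the covariance matrix is reproduced exactly.
reproduced = W @ W.T
```

Inspecting `variance_share` is the usual informal way of deciding how many components are worth keeping.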

The principal component method is most useful 
when the variates $x_i$ are all measured in the same 
units. Otherwise, it is more difficult to justify. A 
change in the scales of measurement of some or 
all of the variates results in the covariance matrix 
being multiplied on both sides by a diagonal matrix. 
The effect of this on the latent roots and vectors is 
very complicated, and unfortunately the compo- 
nents are not invariant under such changes of 
scale. Because of this, the principal component 
approach is at a disadvantage in comparison with 
the proper factor analysis approach. 

A. E. MAXWELL 


[See also CLUSTERING; DISTRIBUTIONS, STATISTICAL, 
article on MIXTURES OF DISTRIBUTIONS; LATENT 
STRUCTURE; STATISTICAL IDENTIFIABILITY.] 


II 
PSYCHOLOGICAL APPLICATIONS 


The essential statistical problem of factor anal- 
ysis involves reduction or simplification of a large 
number of variates so that some hypothetical var- 
iates, fewer in number, which are weighted sums 
of the observed variates, can be used to replace 
them. If psychological experimenters were satisfied 
with this sole, statistical objective, there would be 
no problem of psychological interpretation and of 
meaning of factors. They would simply be conven- 
ient abstractions. However, psychologists and psy- 
chometricians, starting with Charles Spearman 
(1904), the pioneer factor analyst, have wanted to 
go beyond this objective and have thereby created 
the very large psychological literature in this field. 
The goal of factor analysts following in the Spear- 
man tradition has been to find not only convenient 
statistical abstractions but the elements or the basic 
building blocks, the primary mental abilities and 
personality traits in human behavior. Such theo- 
rists have explicitly accepted chemical elements, 
sometimes even the periodic table, as their model 
and factor analysis as the method of choice in 
reaching their goal. 


Factor interpretation and methodology 


Factor extraction methods. There are several 
variations of factor methods, certain of which are 
more amenable to psychological interpretation 
than others. For example, the experimenter can 
start his analysis from a variance—covariance ma- 
trix or from a correlational matrix with estimated 
communalities (discussed below) in the principal 
diagonal. If he is interested in psychological inter- 
pretations of factors, he almost uniformly selects 
the latter, since use of the variance—covariance 
procedure results in obtaining factors that contain 
unknown amounts of common-factor, nonerror- 
specific, and error components. For purposes of 
psychological interpretation, including generalizing 
to new samples of psychological measures, the in- 
clusion of nonerror-specific and error variance in 
the factors is undesirable. The intercorrelations 
and communalities, on the other hand, are deter- 
mined only by the common factors. 

The experimenter also has a choice among sev- 
eral methods of factor extraction, including the 
centroid, principal components (sometimes called 
principal axes), and maximum likelihood methods. 
Choice among these is based largely on feasibility 
criteria. The first was used almost exclusively be- 
fore the advent of high-speed digital computers. 
The third is generally acknowledged to be superior 
statistically to the second, but it is too expensive in 
time and computer resources to use. The second is at present 
the method most frequently used by psychologists, 
since it extracts a maximum amount of variance 
with each successive factor. The centroid method 
only approximates this criterion, although fre- 
quently it is a close approximation. There is thus 
no pressing need to redo all previous work involv- 
ing the centroid method now that computational 
facilities are available. The maximum likelihood 
method can and should be used, as a check on 
conclusions reached with the more economical 
principal components, when size of matrix and 
computer availability make it feasible. 

The communality problem. When the experi- 
menter elects to analyze correlations and commu- 
nalities, he must estimate the latter. These com- 
munalities represent the proportion of common 
factor variance in the total variance of a variable: 
the amount that a variable has in common with 
other variables in a particular study. Unfortunately, 
from the methodological viewpoint, there is no way 
to obtain an unbiased estimate of the communality. 
Several rule-of-thumb methods are available, and 
there are theoretically sound upper and lower 
bounds for the communality estimate. 

An unbiased reliability estimate can be used as 
an upper bound for the estimated communality. 
Reliability and communality differ to the extent 
that reliability includes specific nonerror variance. 
A lower-bound estimate in the population of per- 
sons is the squared multiple correlation between 
each variable and all of the others (Guttman 1954). 
The reader should note, however, that while this 
procedure provides a lower-bound estimate for the 
population, a sample value can be seriously in- 
flated. The multiple correlation coefficient capital- 
izes on chance very effectively. For example, when 
the number of variables equals the number of ob- 
servations, the multiple correlation in the sample 
is necessarily unity, although the population value 
may in fact be zero. The investigator who wants a 
lower-bound estimate may still utilize the Guttman 
theorem if he estimates the population values from 
sample values that are corrected for their capitali- 
zation on chance. 
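The inflation of sample squared multiple correlations can be seen in a small simulation. This is a sketch under stated assumptions: the sample sizes are arbitrary, the helper names are hypothetical, and the shrinkage formula in `shrink` is the familiar adjusted R-squared, offered only as one common correction rather than the one any particular investigator used. The case in which observations exactly equal variables is avoided because the sample correlation matrix is then singular.

```python
import numpy as np

def squared_multiple_correlations(R):
    """SMC of each variable with all the others: 1 - 1 / diag(R^-1)."""
    return 1.0 - 1.0 / np.diag(np.linalg.inv(R))

def shrink(smc, n_obs, n_predictors):
    """Adjusted R-squared; one common correction (an assumption of this sketch)."""
    return 1.0 - (1.0 - smc) * (n_obs - 1) / (n_obs - n_predictors - 1)

rng = np.random.default_rng(0)
n_vars = 10

# The variables are independent, so every population SMC is exactly zero.
small = rng.standard_normal((12, n_vars))      # 12 observations
large = rng.standard_normal((5000, n_vars))    # 5000 observations

smc_small = squared_multiple_correlations(np.corrcoef(small, rowvar=False))
smc_large = squared_multiple_correlations(np.corrcoef(large, rowvar=False))
smc_small_shrunk = shrink(smc_small, n_obs=12, n_predictors=n_vars - 1)
```

With only twelve observations the sample values are far above the population value of zero, while the large sample gives values near zero. The shrunken estimates are always lower than the raw ones and can fall below zero, in which case they are usually set to zero.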

Number of factors. If he is interested in the 
psychological meaning of his factors, the experi- 
menter has a further choice among criteria for de- 
termining the number of factors to retain and 
interpret. When estimated communalities are em- 
ployed, no one of the possible criteria is more than 
a rule of thumb. The various criteria lead to rad- 
ically different decisions concerning the number 
of factors to be retained; and different investi- 
gators, in applying one, several, or all of these 
criteria, will reach different conclusions about the 
number of factors. 

One class of criteria for determining the num- 
ber of factors has been characterized as empha- 
sizing psychological importance without regard to 
sampling stability. Some investigators use some 
absolute value of the factor loadings, e.g., .30, 
either rotated or unrotated or both, without regard 
to the number of observations on which the corre- 
lations are based. A more recent suggestion has 
been to retain factors whose principal roots were 
greater than unity (Kaiser & Caffrey 1965). Such 
criteria appear to make an assumption that the 
number of observations is very large, so that the 
factors and loadings that are large enough psycho- 
logically are at the same time not the result of 
sampling error. 
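The roots-greater-than-unity rule, and its silence about sample size, can be sketched as follows. The loadings, the sample size of 30, and the function name `count_roots_above_unity` are all assumptions made for illustration.

```python
import numpy as np

def count_roots_above_unity(R):
    """Number of principal roots (eigenvalues) of R that exceed 1."""
    return int((np.linalg.eigvalsh(R) > 1.0).sum())

# Hypothetical population: six tests, two uncorrelated common factors.
loadings = np.array([
    [0.8, 0.0], [0.7, 0.0], [0.6, 0.0],
    [0.0, 0.8], [0.0, 0.7], [0.0, 0.6],
])
R_population = loadings @ loadings.T
np.fill_diagonal(R_population, 1.0)            # unities in the diagonal

# Pure noise: ten independent variables observed on only 30 persons.
rng = np.random.default_rng(0)
noise = rng.standard_normal((30, 10))
R_noise = np.corrcoef(noise, rowvar=False)

n_population = count_roots_above_unity(R_population)
n_noise = count_roots_above_unity(R_noise)
```

For the population matrix the rule recovers the two factors that were built in. The noise matrix contains no common factors, yet the rule still retains at least one, because sample roots sum to the number of variables and are not all equal in a finite sample.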

A second class of criteria has been characterized 
as emphasizing the number of observations, even 
though there are no known sampling distributions 
for factors or factor loadings. In several related 
criteria, factor loadings and/or residuals are compared 
in one way or another with the standard 
error of correlation coefficients of zero magnitude 
for the sample size involved. Factoring of the in- 
tercorrelations of random normal deviates as a 
method of obtaining empirical sampling errors has 
also been used. 
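One way of using random normal deviates to obtain empirical sampling errors is sketched below: the roots of the observed correlation matrix are compared with the roots obtained from random data of the same dimensions. The data-generating model, the 95th-percentile threshold, and the 200 replications are assumptions of this sketch, not a prescription from the literature cited here.

```python
import numpy as np

def sorted_roots(X):
    """Principal roots of the correlation matrix of X, largest first."""
    return np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))[::-1]

rng = np.random.default_rng(1)
n_obs, n_vars = 300, 8

# Hypothetical data with two real common factors (loadings of 0.8).
L = np.zeros((n_vars, 2))
L[:4, 0] = 0.8
L[4:, 1] = 0.8
F = rng.standard_normal((n_obs, 2))                       # factor scores
E = rng.standard_normal((n_obs, n_vars))                  # unique parts
X = F @ L.T + E * np.sqrt(1.0 - (L**2).sum(axis=1))

observed = sorted_roots(X)

# Roots from random normal deviates with the same numbers of rows and columns.
random_roots = np.array([
    sorted_roots(rng.standard_normal((n_obs, n_vars))) for _ in range(200)
])
threshold = np.percentile(random_roots, 95, axis=0)

# Retain leading factors only while the observed root beats the random one.
exceeds = observed > threshold
n_retain = len(exceeds) if exceeds.all() else int(np.argmin(exceeds))
```

Unlike a fixed cutoff, the threshold here rises as the number of observations falls, so the criterion takes the sample size into account.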

A third criterion involves the “psychological 
meaning” of the rotated factors: the investigator 
merely states in effect that he is satisfied with the 
results of his analysis. Since any behavioral scien- 
tist of any modest degree of ingenuity can rational- 
ize the random grouping of any set of variables, 
this does not appear to be a useful criterion scien- 
tifically. Without agreement on an objective cri- 
terion, however, psychological meaning of the 
factors tends to be the principal criterion used in 
deciding upon the number of factors to interpret. 

Even in situations where probabilities of alpha 
and beta errors can be estimated, different investi- 
gators, depending on their temperaments or on 
social consequences, may set quite different stand- 
ards for such errors. In determining the number of 
factors, however, there are no objective methods 
of error estimation, and the range of probabilities 
of alpha and beta errors resulting from differences 
among investigators or differences in social consequences 
is increased several-fold. For example, for 
one matrix of personality variables two investiga- 
tors differ by a ratio of four to one in their assess- 
ment of the proper number of factors to retain and 
interpret. The difference between 12 and 3 factors 
is far from trivial. Such discrepancies reduce fac- 
tor analysis to a hypothesis formation technique. 
As a method of discovery of psychological princi- 
ples, or of hypothesis testing generally, ambiguities 
of this magnitude cannot be tolerated. The lack of 
a suitable test for number of factors has opened the 
door for a great deal of poor research. 

Factor rotations. After the factors are extracted, 
the experimenter has to decide whether to rotate 
or not. Rotation of axes to psychologically mean- 
ingful positions follows inevitably from an interest 
in finding the psychological elements. 

The rotation problem is seen most clearly in the 
two-factor case. First, the two factors are concep- 
tualized as orthogonal (perpendicular) dimensions 
extending from values of -1.00 to +1.00. Then 
the points representing the loadings of the tests on 
these factors are fixed in the space defined by these 
dimensions. Imagine now that a pin is inserted at 
the origin of the two dimensions and that these 
are now rotated about the pin. Wherever they stop, 
new coordinates can be determined for the test 
points. It must be noted that the test points are 
located as accurately by the new dimensions as by 
the original ones, and that the intercorrelations of 
the tests are described with equal accuracy. There 
are, in point of fact, an infinite number of posi- 
tions of the coordinates and thus an infinite num- 
ber of mathematical solutions to the factor prob- 
lem. The investigator interested in psychological 
meaning rotates the dimensions into some psycho- 
logically unique position. It is important, where 
possible, that factor descriptions of measures re- 
main stable from sample to sample of either per- 
sons or measures, or both. This can be achieved, 
apparently in the great majority of cases, with an 
adequate rotational solution. 
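The indeterminacy described above can be checked numerically: rotating the two axes about the origin changes every loading but leaves the communalities and the reproduced intercorrelations untouched. The loadings and the angle below are invented for illustration.

```python
import numpy as np

# Hypothetical unrotated loadings of five tests on two orthogonal factors.
A = np.array([
    [0.70,  0.40],
    [0.65,  0.35],
    [0.60, -0.30],
    [0.55, -0.45],
    [0.50,  0.10],
])

def rotate(loadings, degrees):
    """Rotate the two reference axes about the origin by the given angle."""
    t = np.radians(degrees)
    T = np.array([[np.cos(t), -np.sin(t)],
                  [np.sin(t),  np.cos(t)]])
    return loadings @ T

B = rotate(A, 37.0)                       # 37 degrees is an arbitrary choice

reproduced_before = A @ A.T               # correlations implied by the factors
reproduced_after = B @ B.T
communalities_before = (A**2).sum(axis=1)
communalities_after = (B**2).sum(axis=1)
```

Since any angle gives the same reproduced matrix, the data alone cannot select a position; an additional criterion has to do so.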

Rotation is almost uniformly performed when 
factors obtained are from a correlation matrix 
having communality estimates in the diagonal. 
Factors obtained from the variance-covariance matrix, 
on the other hand, are generally not rotated 
and are preferred by the experimenter interested 
in description alone rather than in explanation. 
The experimenter also has a choice among several 
different rotational methods, based upon different 
criteria and leading to either orthogonal or oblique 
factors. 

Orthogonal versus oblique rotation. Orthogonal 
rotations offer the simplicity of uncorrelated dimen- 
sions in exchange for a poorer fit of the test points. 
Oblique rotations offer a better fit for the test points 
in exchange for a complexity of correlated dimen- 
sions. If oblique rotations are used, the investigator 
can also elect to factor in the second and perhaps 
higher orders; i.e., he can factor the intercorrela- 
tions among his first-order factors, among his sec- 
ond-order factors, and so on. After factoring in 
several orders, the investigator also has the option 
of presenting and interpreting his results in the 
several orders, or, by means of a simple transfor- 
mation, he can convert the oblique factors in sev- 
eral orders to orthogonal, hierarchical factors in a 
single order. 

Until the advent of high-speed digital computers, 
basically the only method for achieving a given 
rotational result was hand rotation. There are now 
several computer programs for rotation to either 
orthogonal or oblique structure. 

If the investigator elects an orthogonal solution 
to his problem, he has a number of programs 
among which to choose. One of the earlier pro- 
grams is the quartimax of Neuhaus and Wrigley 
(1954). This was followed by Kaiser’s varimax pro- 
gram (1958). An important difference between the 
two is that quartimax typically produces a general 
factor in ability data which is a function of the 
sampling of test variables, i.e., the general factor 
may reflect verbal, perceptual, or other specific emphasis, 
depending upon the nature of the tests 
sampled. Varimax provides results that are more 
stable from one test battery to another. This is 
achieved by a more even distribution of variance 
among the rotated factors. In the opinion of many 
investigators, varimax rotations have achieved a 
near-ultimate status for the orthogonal case, but 
Schonemann (1964) has now developed a program 
that he calls varisim, which spreads existing var- 
iance more evenly among the several factors than 
varimax does. Results from the two programs are 
not completely parallel even for well-defined fac- 
tors. There is as much rationale for varisim as for 
varimax. In consequence, the ultimate status of 
varimax has been dislodged, and we are again 
faced with a somewhat arbitrary choice among or- 
thogonal rotational methods. 
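A minimal sketch of an analytic orthogonal rotation is given below. It uses the widely circulated singular-value iteration for the orthomax family, in which a weight `gamma` of 1 gives the raw varimax criterion and 0 gives quartimax. It is not Kaiser's or Neuhaus and Wrigley's original program, it omits the row normalization usually applied with varimax, and the function names and test loadings are hypothetical.

```python
import numpy as np

def orthomax(loadings, gamma=1.0, max_iter=100, tol=1e-8):
    """Orthogonal rotation: gamma=1 is raw varimax, gamma=0 is quartimax."""
    p, k = loadings.shape
    T = np.eye(k)
    previous = 0.0
    for _ in range(max_iter):
        rotated = loadings @ T
        # Target matrix derived from the gradient of the orthomax criterion.
        target = rotated**3 - (gamma / p) * rotated * (rotated**2).sum(axis=0)
        u, s, vt = np.linalg.svd(loadings.T @ target)
        T = u @ vt                                  # nearest orthogonal matrix
        current = s.sum()
        if previous != 0.0 and current < previous * (1.0 + tol):
            break
        previous = current
    return loadings @ T, T

def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings."""
    return np.var(loadings**2, axis=0).sum()

# Hypothetical simple structure, deliberately rotated 30 degrees out of position.
simple = np.array([[0.8, 0.0], [0.7, 0.0], [0.6, 0.0],
                   [0.0, 0.8], [0.0, 0.7], [0.0, 0.6]])
t = np.radians(30.0)
mix = np.array([[np.cos(t), -np.sin(t)], [np.sin(t), np.cos(t)]])
unrotated = simple @ mix

rotated, T = orthomax(unrotated, gamma=1.0)
```

In this constructed case the rotation returns the loadings to the simple structure from which they were built. With real data the varimax and quartimax settings generally give different positions, which is the arbitrariness the text describes.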

Oblique rotational programs are now fairly nu- 
merous and exhibit variability in results compa- 
rable to that among orthogonal ones. There is one 
important difference: no oblique program has as 
yet achieved the status that varimax once had. Be- 
cause of the various sources of dissatisfaction with 
existing programs, there is much more research 
activity in the area of oblique rotation than in or- 
thogonal rotation. There is still frequent resort to 
visually guided rotations if the investigator is striv- 
ing for an oblique structure. 

Methodological summary. The investigator who 
wishes to find psychological meaning in his data, 
the one who is trying to discover the basic building 
blocks or causal entities in human behavior, has a 
difficult task. Important decisions for which there 
are no sound foundations must be made at several 
steps in the procedure. Communalities must be es- 
timated; the estimate of the number of factors to 
be extracted and retained for rotations must be 
based upon inadequate criteria; and although sub- 
jective bias possibly resulting from hand rotations 
has been eliminated by rotations obtained on high- 
speed computers, the choice of rotational program 
among either oblique or orthogonal solutions may 
lead to quite different results. 

In the absence of sound estimation methods, the 
criterion of replicability is typically offered as a 
substitute. Replicability is a very important crite- 
rion in science generally. When applied to factor 
analysis, however, one must be aware that seem- 
ingly parallel results may have been forced on the 
data, typically without intention on the part of the 
experimentalist to do so. For example, considerable 
congruence of factor patterns can be obtained from 
the intercorrelations of two independent sets of 
random normal deviates by extracting as many fac- 
tors as variables and by rotating to oblique simple 
structure. The result will be one-to-one correspondence 
of the factors. The intercorrelations of the 
factors will differ, but even these differences will 
not be large, since they are randomly distributed 
about zero. 

Methods of assessing the congruence of factor 
patterns also leave something to be desired. The 
most common method by far is that of visual in- 
spection and unaided judgment. The most precise, 
the correlation between two estimated factor scores 
in the same sample, is rarely seen. Claimed replica- 
tion of a factor is frequently without adequate 
foundation. 


Early general factor interpretations 


Mental energy. Spearman did not have avail- 
able any of the above-described techniques for the 
factor analysis of relationships among variables. 
Neither did he have access to the multitude of tests 
now available. He hypothesized that one general 
factor was sufficient to account for the intercorre- 
lations among his variables, and he developed rela- 
tively simple methods to test this hypothesis. (Pres- 
ent methods of multiple factor analysis include 
Spearman’s single factor as a special case.) In psy- 
chological interpretation Spearman is of interest, 
however, because he interpreted his single factor 
as “mental energy.” This was considered the sole 
basis or building block of mental ability or intelli- 
gence. [See SPEARMAN.] 

Multiple bonds. Spearman’s interpretation was 
challenged by Godfrey Thomson (1919) and by 
Edward Thorndike (Thorndike et al. 1926). Thom- 
son proved that correlational matrices having the 
form required to satisfy Spearman’s one-factor in- 
terpretation could also be “explained” by the pres- 
ence of many overlapping elements. Thorndike dis- 
cussed connections (bonds) between stimuli and 
responses as an alternative to Spearman’s mental 
energy concept. Considering that there are many 
thousands of stimuli to which a person will respond 
differentially, and that tests sample these, the ex- 
tent to which there is overlap in the elements 
sampled by two measures determines the degree to 
which they are correlated. If the intercorrelations 
of several measures have the formal properties nec- 
essary for Spearman’s unitary mental energy ex- 
planation (one factor), they also can be explained 
by multiple bonds or overlapping elements (multiplicity 
of factors). [See THORNDIKE.] 
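Thomson's argument can be illustrated with a small numerical sketch. It assumes, for simplicity, that the bonds are independent and equally weighted and that each test draws its bonds at random; the numbers of bonds and tests and the sampling rates are invented. Under these assumptions the correlation between two test totals equals the number of shared bonds divided by the geometric mean of the numbers of bonds in each test.

```python
import numpy as np

rng = np.random.default_rng(0)
n_bonds, n_tests = 2000, 8

# Each hypothetical test samples its own random subset of the bonds.
sampling_rates = np.linspace(0.2, 0.8, n_tests)
membership = rng.random((n_tests, n_bonds)) < sampling_rates[:, None]

# Population correlations implied by the overlap of sampled bonds.
counts = membership.sum(axis=1)
member_int = membership.astype(int)
overlap = member_int @ member_int.T
R = overlap / np.sqrt(np.outer(counts, counts))

# A single "general factor" with loadings g reproduces R almost exactly.
g = np.sqrt(counts / n_bonds)
off_diagonal = ~np.eye(n_tests, dtype=bool)
max_misfit = np.abs(R - np.outer(g, g))[off_diagonal].max()
```

The matrix `R` has very nearly the one-factor form Spearman required, although nothing resembling a single general influence was built into the simulation.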

Unitary mental energy is a basic building block, 
a general influence or “cause”; multiple bonds are 
a complex of stimulus-response connections that 
are acquired in a dynamic, complex physical and 
social environment. Multiple bonds that underlie 
the behavior under observation cannot be said to 
cause that behavior in the same sense that mental 
energy is said to cause intellectual performance. 
Recourse to parsimony in this instance is not an 
acceptable solution, since the two explanations are 
so different. It should come as no surprise, for ex- 
ample, to learn that Spearman and his followers 
have stressed genetic bases for intelligence, while 
the multiple bonds notion lends itself most readily 
to a stress on environmental forces and learning. 


Multiple factors 


Thurstone’s primary mental abilities. Although 
Thurstone (1938) is considered to have broken 
with Spearman, the break was related only to the 
number of factors required to account for intelli- 
gence. Thurstone considered that some seven to 
nine factors were sufficient to account for the in- 
tercorrelations of the more than fifty tests he used. 
However, Spearman himself had come to doubt the 
single-factor explanation; the break was more ap- 
parent than real. On the issue of what lies behind 
factors there was no break. Careful reading of 
Thurstone’s writings makes it quite clear that to 
him factors were much more than descriptive de- 
vices. Factors were functional unities; their ubiq- 
uity strongly suggested genetic determiners; after 
all, they were called primary mental abilities. [See 
THURSTONE.] 

Ferguson’s learning emphasis. However, just as 
a single factor can be replaced by multiple overlap- 
ping bonds, so also can multiple group factors be 
replaced by sets of overlapping bonds. One need 
only assume that environmental pressures and 
learning come in somewhat separate “chunks.” 
Demographic differences, e.g., parental occupation, 
region of the country, rural-urban differences, etc., 
could account for some of the “chunking” required. 
Ferguson (1956) has produced a very satisfactory 
explanation along these lines in which learning 
and transfer are important variables. Various kinds 
of learning are facilitated or inhibited by the variety 
of environments in which children develop. Learn- 
ing transfers, both positively and negatively, to 
novel situations. The amount and direction of the 
transfer are determined by stimulus and environ- 
mental similarities. Learning and transfer, along 
with environmental differences, produce the clus- 
tering of measures on which the factors depend. 
[See LEARNING, article on TRANSFER.] 

Physical analogies. Thurstone (1947), in order 
to convince himself and others that factors were 
“real,” constructed a factor problem that has at- 
tracted a good deal of attention. He showed that if 
dimensions of boxes were factored, the result was 
a three-factor solution which could be rotated into 
a position such that the factors represented the 
dimensions of length, breadth, and depth, the three 
basic dimensions of Euclidean space. He also 
showed that these factors were correlated, i.e., an 
oblique solution gave a better fit to the data than 
did an orthogonal one. The obliquity reflects, of 
course, the fact that the dimensions of man-made 
boxes tend to be correlated, i.e., long boxes tend to 
be big boxes. 
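A simplified version of the box demonstration can be sketched as follows. This is not Thurstone's data or procedure: the boxes are simulated, the particular measurements are invented, and logarithms are taken so that every measurement is an exact linear combination of the three dimensions, which Thurstone's raw measurements were not. A shared "size" component is assumed in order to make the dimensions correlate.

```python
import numpy as np

rng = np.random.default_rng(0)
n_boxes = 500

# Hypothetical boxes: a shared size component makes the dimensions correlate.
size = rng.normal(0.0, 0.3, n_boxes)
log_dims = size[:, None] + rng.normal(0.0, 0.3, (n_boxes, 3))
lx, ly, lz = log_dims.T                      # log length, breadth, depth

# Nine measurements, each a product of powers of the dimensions, in logs.
measures = np.column_stack([
    lx, ly, lz,
    lx + ly, lx + lz, ly + lz,               # areas of the faces
    lx + ly + lz,                            # volume
    2 * lx + ly, 2 * ly + lz,                # further arbitrary composites
])

R = np.corrcoef(measures, rowvar=False)
roots = np.linalg.eigvalsh(R)[::-1]          # principal roots, largest first
dim_corr = np.corrcoef(log_dims, rowvar=False)
```

Only three roots differ from zero, so three factors account for all nine measurements, and the positive entries of `dim_corr` show why an oblique position fits such data better than an orthogonal one.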

In a situation more relevant to behavior Cattell 
and Dickman (1962) have demonstrated that the 
intercorrelations of the performance of balls in 
several “tests” yield four factors that can be iden- 
tified as size, weight, elasticity, and string length. 
It is clear from this and the preceding example that 
factor analysis can sometimes identify known 
physical factors in data. 

One question about these examples is the certi- 
tude with which the factors can be identified after 
rotation, granting that the correct number of fac- 
tors can be obtained by present methods. Thurstone 
suggested the criteria of simple structure for the 
adequacy of rotations. Generally speaking, simple 
structure is achieved when the number of zero 
loadings in a factor table has been maximized 
while increasing the magnitude of loadings on a 
small number of variables. The application of these 
criteria to the examples described resulted in clear- 
cut identification of the three and four factors. It 
has been shown by Overall (1964), however, that 
if Thurstone had started with a different set of 
measurements, the criteria of simple structure for 
rotations would have led to differently defined fac- 
tors, i.e., they would not have been the “pure” 
physical dimensions but would have represented 
complex combinations of those dimensions. 

A more basic question is whether psychological 
data are similar to physical data, i.e., whether psy- 
chological dimensions obtained by factoring are 
similar to physical dimensions. The demonstration 
that three or four physical factors, as the case may 
be, can be recovered from correlational data does 
not prove that factors in psychological data have 
a similar functional unity. Not only is Thomson’s 
alternative explanation theoretically acceptable for 
multiple factors, but it makes good psychological 
sense as well. Psychological tests measure perform- 
ance on each of a series of items. These perform- 
ances make up the total score. Although Thomson 
would not have suggested a one-to-one correspondence 
between item and element or stimulus-response 
bond, one can conclude that there are at 
least as many elements represented in a test as 
there are items. Thus the multiple bonds approach 
fits the actual measurement situation so well that 
the adherents of the other point of view must bear 
the burden of proof—and for psychological, not 
physical, data. 

Guilford’s structure of intellect. The work of 
J. P. Guilford has been most influential in the 
factor analysis of human abilities (e.g., 1956). It 
has increased by ten times the small number of 
primary mental abilities proposed by Thurstone, 
but the approach to their interpretation remains 
much the same. Guilford’s thinking about the 
nature of factors is modeled very closely after the 
periodic table of the chemical elements; he has in 
fact proposed a structure which points out missing 
factors and has proceeded in his own empirical 
work to “discover” many of these. 

In spite of similarities in thinking about the na- 
ture of factors, the discrepancy in numbers be- 
tween Guilford and Thurstone is highly significant, 
and it illustrates a basic difficulty with psycholog- 
ical tests and the attempts to find causal entities 
from the analysis of their intercorrelations. Not 
only do psychological tests measure performance 
on a relatively large number of pass-fail items, but 
there is at present no necessary or sufficient meth- 
odological or theoretical basis for deciding which 
items should be added together to make up a single 
test score (Humphreys 1962). The number of fac- 
tors has proliferated in Guilford’s work because he 
has produced large numbers of homogeneous experi- 
mental tests. By additional test construction, mak- 
ing each test more and more homogeneous, the 
number of factors could be increased still further. 
As a matter of fact, there is no agreed-upon stop- 
ping place short of the individual test item, i.e., a 
single item represents the maximum amount of 
homogeneity. This logic results in the same num- 
ber of primary mental abilities as there are ability 
test items. 

The progression from Thurstone to Guilford can 
be interpreted as further evidence for the multiple 
bonds theoretical approach. On the other hand, 
positing a functional unity inside the organism for 
each item represents a scientific dead end. 

Cattell’s structure of personality. The work of 
R. B. Cattell has been most influential in the factor 
analysis of the domain of personality (e.g., 1957). 
Cattell’s thinking about the character of factors 
does not differ materially from that of Spearman, 
Thurstone, and Guilford in that for Cattell, factors 
are real influences. 

The number of identified personality factors has 
increased considerably under Cattell’s direction. 
Although measurement problems differ, Cattell’s 
work parallels that of Guilford with human abili- 
ties. Self-report questionnaires present the multiple 
items problem with yes-no scoring of items. Personality 
investigators also have the problem of 
deciding which items should be added together in 
any given score. A great deal of additional work, 
however, has been done with rating scales and with 
so-called objective tests of personality. “Density” 
of sampling of the test or rating domain, a concept 
introduced by Cattell, is still involved in the pro- 
liferation of factors, even though the mechanism 
is not that of item selection. Thus, in obtaining 
ratings, one must decide on the number and over- 
lap in meaning of traits to be rated. One must 
decide whether to include both extroversion and 
sociability or, even closer, both ascendance and 
dominance. While there is no rigorous method to 
depend on in the sampling of measures, decisions 
about what will be tested still affect the number 
of factors and their importance. 

Furthermore, it is also typical of many experi- 
mental designs that large numbers of variables 
relative to the number of observations are an- 
alyzed; that many of these variables have low re- 
liability and thus low communality; that many 
factors are retained for rotational purposes; and 
that rotations are made to an oblique structure. 
All of these elements contribute to possible capi- 
talization on chance. 

It is of interest that Cattell uses as a primary 
rotational criterion a count of the number of vari- 
ables in the hyperplane, i.e., the multidimensional 
plane defined by all factors other than the one in 
question. (More simply, a measure having a zero 
loading on a factor is located geometrically some- 
place in the factors hyperplane.) This criterion 
places a premium on the extraction of a large 
number of factors relative to the number of meas- 
ures, on the use of variables of low reliability, and 
on the use of variables unrelated to the major 
purpose of the analysis. In the opinion of many 
critics Cattell has increased the probability of 
making Type I errors beyond tolerable bounds, 
although neither he nor his critics can assign a 
value to alpha in this situation. 


A dramatic example of the difficulties that may 
be involved in typical factor analytic research is 
given by some data described by Horn (1967). 
He obtained a good fit to an oblique factor pattern 
derived from an analysis of ability and personality 
variables by factoring the intercorrelations of the 
same number of random normal deviates, based 
upon the same number of observations, as the 
psychological variables. This finding highlights the 
principle that replication of findings may be of 
little import in factor analytic investigations. 

It is also apparent that the essential reason for 
factor analyzing intercorrelations, to seek some 
reduction or simplification of data, has not been 
realized. The number of variables and the number 
of factors have grown astronomically, and the end 
is not yet in sight. It is highly possible that the 
search for psychological meaning, the search for 
the basic building blocks or elements, has been re- 
sponsible. If psychological data are different from 
physical data in important respects, and if the 
multiple bonds are a more accurate representation 
of the data than the chemical elements point of 
view, researchers would profit from taking another 
look at the reasons why they factor analyze. An 
economical description of complex data is itself an 
important scientific goal. 

LLOYD G. HUMPHREYS 


[Directly related are the entries CLUSTERING; MULTI- 
VARIATE ANALYSIS; TRAITS. Other relevant material 
may be found in INTELLIGENCE AND INTELLIGENCE 
TESTING; PSYCHOLOGY, article on CONSTITUTIONAL 
PSYCHOLOGY; and in the biographies of SPEARMAN 
and THORNDIKE.] 
FAHLBECK, PONTUS ERLAND 


Pontus Fahlbeck (1850-1923) was one of the 
cultural leaders of Sweden around the turn of the 
century. Successful in a career which combined 
scientific, political, and business interests, he was, 
at first, professor of history and political science at 
the University of Lund and, later, professor of polit- 
ical science and statistics at that university. It was 
largely thanks to his efforts that a chair in statis- 
tics, the first in Sweden, was established at Uppsala 
in 1910. In addition, he endowed a widely known 
foundation at the University of Lund that supports 
research in political science. 

His scientific and political books and articles re- 
flect, on the one hand, his broad socioeconomic per- 
spectives and, on the other, his keen interest in 
the political problems of the day. Some influence 
from German Kathedersozialismus and social-law 
theory can be traced in his treatment of social 
problems. He described himself as moderately right 
wing in political matters and radical in social 
matters. 

Fahlbeck began his academic career in history, 
concentrating on problems of European medieval 
history, and then took up political science, dealing 
with the Swedish constitution and its development 
in relation to social organization. His involvement 
deepened as he gradually moved toward general 
social science, or sociology in a broad sense. His 
interest focused on human society as a system of 
classes, the socioeconomic functions of class dif- 
ferentiation, and the transition from feudal society, 
with its class privileges, to the free groupings of 
modern society. In modern society, as he saw it, 
the legal differences between the social strata have 
been erased. 

Fahlbeck’s magnum opus is Sveriges adel (“The 
Swedish Nobility”; 1897-1902). Volume 1 is a 
historical study tracing each family from its ennoblement 
through subsequent generations. Out of 
a total of 2,735 families, some 25 per cent were 
extinct by 1898. The statistical analysis focuses on 
the process of extinction and includes death rates 
and survival tables, calculated by family. The sec- 
ond part is a demographic-statistical study of the 
then contemporary nobility. This many-faceted 
analysis has been referred to as a pioneering work. 
Throughout, Fahlbeck developed his argument on 
two planes—careful statistical analysis and specu- 
lative discussion. Notable are his strong emphasis 
on the importance of social circulation and of the 
leadership of the upper classes, his resigned atti- 
tude toward the deadly dangers resulting from the 
excessive refinement of the upper classes, and his 
dissatisfaction with Malthus and, in particular, 
neo-Malthusianism. 

In general, Fahlbeck will be remembered not so 
much for the concrete results he produced but for 
the way in which he understood and dealt with 
some of the key issues which then confronted scien- 
tists. Like other brilliant men of his epoch, Fahl- 
beck was in a frustrating situation. He was aware 
of rapid socioeconomic developments and their 
great challenge to the rising social sciences, and 
he was also vaguely aware that the available scien- 
tific methods were inadequate to meet the chal- 
lenge. At times he seemed to be far ahead of his 
contemporaries, groping for something that he 
could not articulate and that has perhaps still not 
been reached. 

Fahlbeck’s attitude toward statistical methods 
was ambivalent. He set forth his views on statistics, 
as an independent branch of social science, in two 
programmatic articles (1897; 1918). In the first 
he enthusiastically supported the actual and poten- 
tial use of statistics and the numerical method in 
the social sciences. By the time he wrote the sec- 
ond article, Fahlbeck’s attitude had changed. Dur- 
ing the interval between the appearance of the two 
articles, the flourishing period of German demo- 
graphic and social statistics had been cut off by the 
emergence of a strong, antitheoretical position in 
official statistics (von Mayr, Zahn). In England, in 
the meantime, the use of statistics in the biological 
sciences had begun, leading to a rapid and epoch- 
making development of advanced statistical meth- 
ods, especially in the experimental area. In the 
period between the writing of the two articles, Fahl- 
beck had himself successfully used statistical meth- 
ods in his study of the nobility. Yet, in his 1918 
article he dismissed the advanced methods of sta- 
tistics as “higher mathematics” and argued that in 
the social application of statistics there is room 
only for simple methods. 

Despite the loose arguments and clichés of the 
1918 article, Fahlbeck’s position is not without its 
merits. Although his argument that the study of 
human behavior and social phenomena is the cru- 
cial area for statistical work is extreme, he was 
right to stress the key importance of studying them 
statistically. And his untenable view that simple 
methods suffice for the social sciences may have 
been an outgrowth of his frustration with the 
methods then available. His primary goal was the 
development of appropriate methods, comparable 
with those of the natural sciences, for dealing with 
the highest order of phenomena in the scientific 
hierarchy, that is, with human phenomena. 


HERMAN WOLD 


[See also ELITES; STRATIFICATION, SOCIAL; and the 
FALANGISM 


Falangism is the Spanish variant of the fascist 
doctrines that gained vogue in Europe during the 
1930s. Its origins are purely theoretical, for falan- 
gism existed as an ideology on paper for several 
years before it became a significant political move- 
ment. The originators of Falangist doctrine were 
Ramiro Ledesma Ramos (1905-1936), sometime 
postal clerk and unemployed intellectual, and José 
Antonio Primo de Rivera (1903-1936), a young 
aristocrat and son of the former dictator General 
Miguel Primo de Rivera. The ideological basis of 
Falangist doctrine was first expounded by Ledesma 
in 1931 under the label “national syndicalism.” 
Intoxicated by the spectacle of German national 
socialism, Ledesma endeavored singlehandedly to 
fabricate a Spanish fascistic program that would 
combine the two main radical forces in early 
twentieth-century Europe—nationalism and social- 
ism. Ledesma’s notion of “national syndicalism” 
was that of a revolutionary movement that would 
carry out the socioeconomic program of Spanish 
anarchosyndicalism under the aegis of a dictatorial 
nationalist state. 

Falangism took formal shape in 1933-1934, 
after the founding of a new political movement 
called Falange Española (Spanish Phalanx). This 
organization was largely the creation of José An- 
tonio Primo de Rivera, a restless, romantic, and 
energetic young man who wanted to complete the 
work begun by his father. Amid the frustrating 
experiences of the first two years of the Second 
Spanish Republic, he groped for a formula that 
could unify all the Spanish people and promote na- 
tional rejuvenation. Like Ledesma, he thought it 
could be supplied by some kind of “Spanish fas- 
cism,” which would combine nationalistic dictator- 
ship and sweeping socioeconomic reform. Certain 
conservatives helped him found the Falange on 
October 29, 1933, and it soon absorbed the tiny 
group of radicals who followed Ledesma. 

The classic doctrine of falangism was defined at 
the end of 1934 in a program written principally 
by Ledesma and referred to as the “Twenty-seven 
Points” (see Primo de Rivera 1942). The major 
principles announced were: 


(1) Political unity of Spain and elimination of re- 
gional separatism. 

(2) Abolition of political parties. 

(3) Establishment of a nationalist dictatorship led 
by the party. 

(4) Use of violence in regenerating Spain. 

(5) Development of Spanish imperial power. 

(6) Expansion and strengthening of the armed 
forces. 

(7) Recognition and support of Catholicism as the 
official religion of Spain but rejection of any clerical 
influence in government. 

(8) A sweeping economic reform, referred to as 
“revolution,” which emphasized the following: 

(a) Establishment of a complete system of na- 
tional syndicates, embracing employers and employees, 
to organize, coordinate, and represent all of Spain's 
economic activity. 


(b) Sweeping agrarian reform, reclaiming waste- 
land, improving techniques, concentrating scattered 
holdings, and reorganizing the great latifundia. 

(c) Stimulation of industrial expansion. 

(d) Basic respect for private property, but na- 
tionalization of all credit facilities to eliminate capital- 
ist usury. 


Together with these specific goals, a general mys- 
tique of Spanish nationalism and an interpretation 
of Spanish history exalting certain peculiarities of 
Spanish life were expounded, and almost all of 
Spanish liberalism and the nation’s experience 
with constitutional parliamentary government were 
denounced. The fanaticism in the Spanish past was 
interpreted as a triumph of the nation’s spirit. The 
Inquisition and some of the nation’s more abso- 
lutist rulers were exalted and used as evidence that 
Spaniards needed forceful, authoritarian rule. 

At the time this program was drafted the Falange 
probably had the support of only one per cent of 
the Spanish people. Its economic program remained 
vague because almost none of the party's few intel- 
lectuals had concrete experience with economic 
matters. One of the more confusing of the Twenty- 
seven Points was the declaration “We have a will 
to empire,” for the Falangist notion of “empire” 
was never made clear. One Falangist leader soon 
went out of his way to indicate that “empire” meant 
only cultural influence and diplomatic leadership 
and not political or territorial domination. Other 
Falangists spoke in more aggressive terms, pro- 
posing the annexation of Portugal. 

In October 1934 José Antonio Primo de Rivera 
was made jefe nacional (national chief) of the 
movement. In his person the jefe summarized the 
contradictions of Falangist doctrine, for he was an 
intellectual liberal and manifestly uncomfortable 
with the fascist aggressiveness of Falangist ide- 
ology. Most Falangists had originally referred to 
their movement as “fascist,” but Primo de Rivera 
and other party leaders soon became worried about 
close identification with Italian fascism and Ger- 
man Nazism. By 1935 they were emphasizing that 
falangism was not intended to be “fascist” at all, 
but merely a native development of Spanish 
nationalism. 

Since falangism began as an intellectual abstrac- 
tion and changed greatly when finally placed in 
practice, it can be adequately understood only 
through a study of its historical evolution. The 
Falange was a rabid foe of the Spanish Republic, 
but it lacked popular support and had already been 
driven underground by the police and rendered 
physically impotent when the rebellion against the 
republic broke out, on July 17, 1936. The revolt 
was organized entirely by the Spanish Army, and 
the Falange, which had no official standing, was 
forced to collaborate with the rebels on the army's 
terms. During the first months of the civil war the 
movement lost most of its original leaders, for 
José Antonio Primo de Rivera, Ramiro Ledesma, 
and others were shot by the Republicans. 

Nonetheless, in this climate of civil war, the 
membership of the party swelled enormously, for 
falangism was the only radical new ideology with 
which the Spanish right could combat the militant 
left. The military dictatorship had few clear po- 
litical ideas, and after Generalissimo Francisco 
Franco emerged as head of the new Nationalist 
government, he and his advisers searched for some 
kind of official political ideology and structure. 
Falangism seemed the answer, for it stressed the 
defense of traditional Spanish institutions, pro- 
posed a solution to class warfare, and was the 
ideology most congenial to Nazi Germany and 
fascist Italy, the two states on which the military 
dictatorship had to rely to win the war. On April 
19, 1937, the Falange was therefore elevated to the 
role of the state party of Spain, and the Twenty- 
seven Points were adopted as a government 
program. 

Falangism was thus taken over as a tool of the 
military dictatorship; the Falangists did not in any 
way take over the Franco state. Franco became 
head of the Falange, which was reorganized, 
watered down, and mixed with a variety of more 
conservative groups. Officially, the party was fused 
with the reactionary, monarchist Carlist militia 
and renamed “Falange Española Tradicionalista de 
las Juntas de Ofensiva Nacional Sindicalista.” The 
27th point of the Falangist program, which had 
prohibited any such fusion, was then dropped. 
Power rested securely in the hands of Franco and 
the military hierarchy, almost none of whom was 
truly “Falangist.” Indeed, most Spanish officers 
tended to sneer at the orthodox Falangists, calling 
them “our Reds.” 

Falangism had two main tasks after the end of 
the civil war, in 1939. The first was the creation 
of a nationwide system of workers’ and employers’ 
syndicates. This soon blanketed all urban and in- 
dustrial regions, although it was slow to include 
the rural areas. The syndicates were organized by 
state functionaries from the top downward. They 
did not represent the workers but controlled them 
in the interests of the state and of the employers. 
Such a situation contrasted sharply with the 
original Falangist theory, in which the syndicates 
were supposed to be dynamic revolutionary organisms, 
helping to carry out a national economic 
reformation. 

The second major task of falangism was ideo- 
logical: to provide an authoritarian nationalist po- 
litical theory that would supply intellectual and 
emotional support for the dictatorship. Falangism 
furnished the substance of the state propaganda 
machine, its antiliberal, antiparliamentary ideology 
being used to discredit all elements not within the 
Franco government and to indoctrinate the new 
generation of Spanish youth with loyalty to the 
regime. Falangist demagogy was widely employed 
in an attempt to convince the workers and the rest 
of the population that the present conservative dic- 
tatorship, protecting propertied interests and the 
status quo, was somehow a dynamic national 
revolution. 

At no time was the Falange party organization 
allowed a position of independent authority inside 
the state, but Falangists were given a near monop- 
oly on bureaucratic positions and many jobs in the 
lower echelon of the government apparatus. At 
the same time, the Falange National Council was 
packed with conservatives and army men so that 
there would be no danger of the movement’s origi- 
nal ideological radicalism becoming more than 
rhetoric. As a result of the great influx of wartime 
members, the majority of Falangists had little 
sense of revolutionary national-syndicalist doc- 
trine. They were largely an amorphous collection 
of opportunists and timeservers, who supported the 
regime as a means of gaining employment. 

Since the nationalistic authoritarianism of Fa- 
langist propaganda was extended, on the interna- 
tional plane, to intense enmity against Britain, the 
United States, and the other Western democracies, 
falangism was especially useful to the Franco regime 
during the years 1937-1942, when the Nazi-fascist 
star was rising and the Spanish regime 
hoped to find a place inside the fascist new order. 
During this period it was often found convenient 
to revert to the original definition of falangism and 
to call it “Spanish fascism.” 

After 1943, however, with fascism on the wane 
in Europe, this definition came to be as much a lia- 
bility as an asset to the Franco regime. From that 
time on, great efforts were made to erase the 
overtly fascistic, imperialistic aspects of Falangist 
ideology. The official government spokesmen and 
even Falangist leaders began to place especially 
heavy emphasis on the “Catholic” nature of falan- 
gism, to play up the “religious” aspect of the re- 
gime’s political orientation. Falangist ideologues 
now began to say that not everything done by 
liberal regimes was bad, that in fact the Falange 
itself was rather “liberal,” or at least broad-minded 
and forgiving. The amount of money provided the 
Falangist organization fell off rapidly by 1945, as 
the Spanish regime tried anxiously to provide itself 
with a new facade. No more was heard about the 
Spanish “empire” or the virtues of violence. There 
was even some speculation that the Falange would 
be dismantled altogether. However, there was never 
any danger of the Franco regime's going that far, 
for the vague Falangist ideology was the only po- 
litical ideology the military dictatorship possessed. 
Rather, the Falangist “new line” was expanded. 
“National revolution” now meant merely the con- 
tinuation of the existing syndical system to keep 
the workers in their place. “Nationalism” and “anti- 
liberalism” were used mostly to provide propaganda 
support for the regime and not to defame the 
progressive nations of the Western world. 

Franco had always stressed the fact that falan- 
gism had not achieved final form as a doctrine, that 
the program of the state would continue to change. 
Presumably, it would never cease to do so. After 
1945 emphasis was placed on the wide variety of 
political and ideological forces in the original 
“movement” behind the Franco government. Con- 
servatives, moderate liberals, constitutional mon- 
archists, clericals, and reactionary Carlists, who had 
always resented the relative ideological monopoly 
of falangism, were now permitted more voice. 

In 1947 Franco arranged a plebiscite to turn his 
regime into a sort of regency, with some member 
of the Spanish Bourbon dynasty slated to succeed 
him when he wearied of his role as dictator or died. 
Falangist doctrine had always been more or less 
antimonarchist. In fact, much of the rank and file 
of the Falangist organization was rabidly so, deem- 
ing monarchism incompatible with any kind of 
vigorous modern political movement. Therefore, 
Franco's official establishment of the monarchy as 
his successor seemed to reduce the function of 
falangism even further. 

The true result of the transformation of the 
regime into a pseudo regency was nonetheless that 
it increased the significance of falangism for the 
dictatorship. Franco's own monarchism was largely 
fraudulent; he had primarily sought a way to pacify 
restlessness about the future. Publicly favoring 
monarchist interests, Franco actually sustained 
Falangist propaganda and the Falangist bureauc- 
racy so that monarchist influence would not be- 
come too strong. The Spanish dictator could thus 
threaten the monarchists with a revival of extreme 
falangism whenever they tried to place any pres- 
sure on his regime. 

Although he cared even less about ideology than 
before, Franco probably felt more comfortable with 
falangism and the party bureaucracy than with 
most other elements of his regime. The Falangists 
were so completely dependent on his personal be- 
nevolence that they dared not seriously oppose him, 
and their ideology of nationalism and authoritarian 
rule was still the most comfortable political ra- 
tionale for his heterogeneous, but increasingly 
light-handed, tyranny. 

The immediate future of the regime was assured 
in 1951, when the United States began its policy 
of rapprochement with Franco. It became clear 
that the regime would no longer suffer interna- 
tional pressure because of its quasi-fascist texture, 
and Franco responded by placing a regular Falan- 
gist at the head of the bureaucracy and appointing 
four new Falangist ministers in the 1951 cabinet 
shake-up. Falangist propagandists felt freer to re- 
turn to some of their old prejudices. Nonetheless, 
and despite the fact that the United States was still, 
in the classic Falangist definition, a decadent liberal 
parliamentary democracy, the new doctrine had to 
be one of friendship with America, for the latter 
had indirectly become a principal supporter of the 
Franco dictatorship. 

The growing prosperity of Spain in the 1950s 
finally made it possible for the regime to attempt 
to realize a few peripheral aspects of the original 
Falangist program of economic reform. A program 
of state-operated industrialization, directed by the 
National Institute of Industry (INI), lavished vast 
amounts of money on a great variety of industrial 
projects between 1951 and 1958. At a much slower 
pace, a new beginning was made in state irri- 
gation projects and agrarian reform. This, however, 
had little to do with falangism per se, being 
directed by separate bureaucracies and paying little 
heed to the incomplete plans of the creators of 
falangism. 

The last effort to define falangism politically and 
to turn it into a living, controlling force was made 
in 1956-1957. The growth of political opposition 
had made a mild shake-up of the government 
necessary. A commission of old-guard Falangists 
was called together, and three separate antepro- 
yectos (draft proposals) were drawn up to define 
the Falangist position. (The anteproyectos were 
not officially published, but copies of them were 
made available to this author by one of the Falan- 
gist leaders involved.) These made it clear that the 
Twenty-seven Points were quite dead. There was 
no mention of “empire” or “violence.” Instead, the 
new catechism stressed the pre-eminence of Cathol- 
icism, national unity, and social justice and the 
viability of a moderated capitalism. 

With regard to the structure of the state, the 
Falangists approved Franco's Cortes (parliament), 
a rubber-stamp assembly elected indirectly from a 
state-controlled list of nominees. However, they 
requested the elevation of the Falangist National 
Council to a sort of upper chamber or senate. Their 
project acknowledged a king as successor to Franco 
in the role of chief of state but requested the ap- 
pointment of a prime minister, or chief of govern- 
ment, who would carry on ordinary executive func- 
tions. He was to be responsible primarily to the 
chief of state, although it was stipulated that three 
adverse votes by the Falangist National Council 
should bring his resignation. Continuation of the 
one-party system was projected, and the Franco 
system of establishing a series of “fundamental 
laws” in lieu of a constitution was approved. 

Such a project contained a wholly original state- 
ment of Falangist notions of political structure, for 
at no time previously had the party’s organic goals 
been made so clear. Nonetheless, the anteproyectos 
were irrelevant, for there was no chance that they 
would ever be realized. The most influential pres- 
sure groups in Franco Spain—the hierarchy of the 
army and the Roman Catholic church—reacted 
with extreme anger. The anteproyectos were quietly 
buried without ever having been made public, and 
in the 1957 cabinet change falangism was further 
downgraded. 

By that time the major interest in Spain was 
economic development, not politics. The Franco 
regime preferred to give the appearance of liberal- 
ization and a more purely technical kind of gov- 
ernment. The tendency was thus toward the tran- 
scendence of falangism by a more “prismatic” 
political doctrine, permitting expression of plural- 
istic tendencies, within a limited range, under one- 
man rule. Such an orientation was much more 
effective in an era of increasing integration with 
liberal western Europe. 

It was perhaps impossible to expect that Fa- 
langism, as an ideology or as a movement, could 
survive and flourish in the postfascist epoch of 
western Europe. Having attained a position of 
pseudo responsibility without effective power, it 
had served as an effective scapegoat, to be blamed 
or downgraded whenever the regime felt a show of 
“liberalization” to be necessary. Nonetheless, it did 
leave a significant legacy, since it provided the 
principal rationale for authoritarian unity and gov- 
ernment in Spain. At the same time, by its rhe- 
torical emphasis on economic reform and social 
justice, it probably constituted the most liberal 
force within the regime. 

STANLEY G. PAYNE 


FALLACIES, STATISTICAL 


This article will be mainly concerned with statis- 
tical fallacies, but it should be noted that most 
other fallacious types of reasoning can be carried 
over into statistics. 

Most fallacies seem foolish when pinpointed, but 
they are not the prerogative of fools and statis- 
ticians. Great men make mistakes, and when they 
admit them remorsefully, they reveal a facet of 
their greatness. The reason for mentioning the mis- 
takes of eminent people in this article is to make it 
more fun to read. 

Many fallacies, statistical or otherwise, have their 
origin in wishful thinking, laziness, and busyness. 
These conditions lead to oversimplification, the desire 
to win an argument at all costs (even at the 
cost of overcomplication), failure to listen to the 
opposition, too-ready acceptance of authority, too-ready 
rejection of it, too-ready acceptance of the 
printed word (even in newspapers), too-great reli- 
ance on a machine or formal system or formula 
(deus ex machina), and too-ready rejection of them 
(diabolus ex machina). These emotionally deter- 
mined weaknesses are not themselves fallacies, but 
they provoke them. For example, they provoke spe- 
cial pleading, the use of language in more than 
one sense without notice of the ambiguity (if the 
argument leads to a desirable conclusion), the in- 
sistence that a method used successfully in one 
field of research is the only appropriate one in 
another, the distortion of judgment, and the forget- 
ting of the need for judgment. 

A logical or syntactical fallacy. We begin with 
an example of a fallacious argument in which the 
conclusion is correct: 


“No cat has no tail. One cat has one more tail 
than no cat. Therefore one cat has one tail.” 


A good technique for exposing fallacious reasoning 
is to use the same form of argument in order to 
deduce an obviously false result: 


“No cat has eight tails. One cat has one more 
tail than no cat. Therefore one cat has nine 
tails.” 


The fallacy can be explained by careful atten- 
tion to syntax, specifically by noting that the fol- 
lowing two propositions have been confused: (1) It 
is false that any cat has eight tails, and (2) the 
object named “no cat” has eight tails. P. M. S. 
Blackett once said, exaggerating somewhat, that a 
physicist is satisfied with an argument if it leads to 
a result that he believes to be true. 

Arguments from authority. The book Popular 
Fallacies by Alfred S. E. Ackermann is more con- 
cerned with fallacies of fact than of reasoning, and 
here many fallacies depend on the acceptance of 
authority. It is interesting to see that the author 
was himself misled by authority on at least two 
occasions. 

First, he argues that it is a fallacy “that cigarette 
smoking is especially pernicious,” appealing to the 
opinions of several authorities: for example, “Of 
the various forms of smoking, cigarette smoking is 
the most wholesome, preferably without a holder,” 
according to Sir Robert Armstrong-Jones, F.R.C.P., 
in the Daily Mail, January 1, 1927 (Ackermann 
[1907] 1950, pp. 174-175). (The current medical 
opinion is that, of cigarettes, cigars, and pipes, 
cigarettes are the least wholesome, at any rate in 
regard to lung cancer. Of course, Armstrong-Jones 
might be right after all.) 

Then Ackermann refers to the thesis “that there 
is a prospect of atomic energy being of practical 
use.” Lord Rutherford is quoted, from the Evening 
News, September 11, 1933, as saying that “anyone 
who expects a source of power from the transfor- 
mation of these atoms is talking moonshine” (ibid., 
pp. 708-709). 

It would be unfair to blame Ackermann for rely- 
ing on these authorities, but it is useful to hold in 
mind that the highest authorities can be wrong, 
even when they are emphatic in their opinions. 
Their desire not to seem too academic should some- 
times be allowed for, especially when they hold an 
administrative appointment. 

What should the question really be? When 
Gertrude Stein was on her deathbed, one of her 
friends asked her, “What is the answer?” After a 
few seconds she whispered back, “What is the 
question?” 

It is important for the statistician to satisfy him- 
self that a right question is being asked, by in- 
quiring into the purposes behind the question. 
Chambers (1965) states that when a member of 
Parliament asked for some inland revenue figures 
that were not available from the published statis- 
tics, his invariable rule was to find out the purpose 
for which the information was needed. More often 
than not he found that the figures sought were 
irrelevant, that other figures already published were 
more helpful, or that the M.P. was misguided over 
the whole business. 

It is often reasonable to make exploratory in- 
vestigations without a clear purpose in mind. The 
fallacy we have just pointed out is the assumption 
that the questioner necessarily asks for informa- 
tion that is very relevant to his purposes, whether 
those purposes are clear or vague. The fallacy of 
giving the “right answer to the wrong question” is 
further discussed by A. W. Kimball (1957). 

Ignoring the “exposure base.” Consider, for 
example, the reports on traffic deaths that are 
issued after public holidays. Many readers con- 
clude from the increased number of deaths that 
it is more dangerous to drive on public holidays 
than on ordinary days. The conclusion may or may 
not be correct, but the reasoning is fallacious, since 
it ignores the fact that many more people drive 
automobiles on public holidays and thus more peo- 
ple are exposed to the possibility of an accident. 
If holidays and ordinary days are compared on the 
basis of deaths per passenger mile, it might turn 
out that the holiday death rate is lower, since the 
reduction of the average speed caused by the volume 
of traffic also may reduce the seriousness, if 
not the number, of accidents. 
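
The comparison can be sketched in a few lines of Python. The figures below are hypothetical, invented only to show how a larger death count can coexist with a lower rate once exposure (passenger miles) is taken into account; the function name and the per-million scaling are likewise illustrative choices.

```python
def deaths_per_million_miles(deaths: int, passenger_miles: float) -> float:
    """Return the exposure-adjusted rate: deaths per million passenger miles."""
    if passenger_miles <= 0:
        raise ValueError("passenger_miles must be positive")
    return deaths / passenger_miles * 1_000_000


# Hypothetical figures, not real traffic statistics.
ordinary = {"deaths": 100, "passenger_miles": 2.0e9}
holiday = {"deaths": 150, "passenger_miles": 4.0e9}

ordinary_rate = deaths_per_million_miles(**ordinary)
holiday_rate = deaths_per_million_miles(**holiday)

# More deaths occur on the holiday, yet the rate per mile driven is lower.
print(f"ordinary day: {ordinary['deaths']} deaths, rate {ordinary_rate:.4f}")
print(f"holiday:      {holiday['deaths']} deaths, rate {holiday_rate:.4f}")
```

With these invented numbers the raw count rises by half while the rate per passenger mile falls, which is exactly the situation in which reasoning from the count alone misleads.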

“Deus ex machina”—the precision fallacy. When 
we know a machine or formal system that can 
produce an exact answer to a question, we are 
tempted to provide an answer and inquire no fur- 
ther. But exact methods often produce exact an- 
swers to wrong questions. 

One of the main aims of statistical technique is 
to fight the danger of wishful thinking and achieve 
a measure of objectivity in probability statements. 
But absolute objectivity and precision are seldom 
if ever attainable: there is always, or nearly always, 
a need for judgment in any application of statis- 
tical methods. (This point is especially empha- 
sized in Good 1950.) Inexperienced statisticians 
often overestimate the degree of precision and ob- 
jectivity that can be attained. An elementary form 
of the precision fallacy, which is less often com- 
mitted by statisticians than by others, is the use of 
an average without reference to “spread.” A related 
trap is to gauge closeness by some measure of 
spread but to ignore systematic errors (bias) [see 
ERRORS, article on NONSAMPLING ERRORS]. 

The use of an average without a measure of 
spread can be especially misleading, or even comic, 
when the sample is small and the spread is there- 
fore large. But even if the mean and standard devi- 
ation of the population are given, they can be mis- 
leading if the population is very skew. For skew 
populations it is better to give some of the quantiles 
[see STATISTICS, DESCRIPTIVE, article on LOCATION 
AND DISPERSION]. 
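
A small Python sketch may make the point about skew populations concrete. The ten values are a hypothetical sample chosen to be strongly right-skewed; only the standard-library `statistics` module is used.

```python
import statistics

# Hypothetical, strongly right-skewed sample (arbitrary units).
sample = [1, 1, 2, 2, 2, 3, 3, 4, 5, 77]

mean = statistics.mean(sample)
sd = statistics.stdev(sample)                  # sample standard deviation
median = statistics.median(sample)
quartiles = statistics.quantiles(sample, n=4)  # three cut points

below_mean = sum(1 for x in sample if x < mean)

print(f"mean = {mean}, sd = {sd:.1f}")
print(f"median = {median}, quartiles = {quartiles}")
print(f"{below_mean} of {len(sample)} values lie below the mean")
```

Here the mean is pulled far above nearly every observation by a single large value, and "mean minus one standard deviation" is negative although no observation is. The median and quartiles describe where the bulk of the sample actually lies.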

Randomization. An example of the precision 
fallacy occurs in connection with the important 
technique of randomization. Let us consider the 
famous tea-tasting experiment (see Fisher 1935, 
chapter 1; Good 1956). A lady claims to be able to 
tell by tasting, with better than random chance, 
whether the milk is put into her tea first or last. 
We decide to test her by giving her twenty cups of 
tea to taste, in ten of which the milk is poured first, 
and in ten last. If the lady gets many more than ten 
of her assertions right and if we have not random- 
ized the order of the twenty trials, we might sus- 
pect that whatever sequence we selected for some 
psychological reason, the lady might have tended 
to guess for similar psychological reasons. So we 
randomize the order and can then apparently make 
use of the hypergeometric tail-area probability as a 
precise, objective, and effectively complete sum- 
mary of the statistical significance of the experi- 
ment [see EXPERIMENTAL DESIGN and RANDOM 
NUMBERS]. 
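
The hypergeometric tail-area probability mentioned above can be computed directly. The sketch below assumes, as the text does not state explicitly, that the lady knows ten cups of each kind are presented and therefore names exactly ten cups as "milk first"; under that assumption, k correct milk-first cups imply 2k correct assertions out of twenty. The function name is illustrative.

```python
from math import comb


def tail_probability(correct_milk_first: int, cups_each: int = 10) -> float:
    """P(at least `correct_milk_first` milk-first cups identified) under pure guessing.

    Assumes the taster labels exactly `cups_each` of the 2 * `cups_each` cups
    as milk-first, so the count of correct labels is hypergeometric.
    """
    total = comb(2 * cups_each, cups_each)
    favourable = sum(
        comb(cups_each, k) * comb(cups_each, cups_each - k)
        for k in range(correct_milk_first, cups_each + 1)
    )
    return favourable / total


# Eight or more of the ten milk-first cups correct (16 or more of 20 assertions).
print(f"P(k >= 8) = {tail_probability(8):.4f}")
```

Under these assumptions, getting at least eight of the ten milk-first cups right has a probability of roughly 0.0115 by chance alone. The argument that follows in the text concerns whether this figure remains a complete summary once the experimenter has seen the random order.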

But suppose we now examine the random sequence 
and spot some pattern in it, for example, 
that cups 3, 6, 9, 12, 15, and 18 had the milk in 
first. This would at once undermine the precise 
validity of our result in relation to its relevance to 
the hypothesis under investigation. We can partly 
remove this difficulty by means of “restricted ran- 
domization,” but we cannot completely remove it, 
because every finite sequence exhibits some special 
features, however recondite. Only by judgment, 
necessarily subjective, can we decide that any given 
feature is unimportant. The only way we can pre- 
serve the precision is to make use of a “statistician’s 
stooge” to perform the experiment for us, including 
the randomization (Good 1960-1961). He must 
report the number of successes to us but must on 
no account tell us the randomized order. Precision 
in a randomized experiment can be obtained only 
at the price of suppressing some of the information. 
The point made here is not universally accepted by 
statisticians. 

If the number of cups of tea is very large, then 
the importance of the above criticism of random- 
ization will usually be negligible. But long experi- 
ments are expensive, and in the statistical design 
of experiments the expense can never be ignored. 

Randomization in itself is by no means a falla- 
cious technique; what is fallacious is the notion 
that without the suppression of information, it can 
lead to a precise tail-area probability relevant only 
to a null hypothesis. 

The suppression of information. In its crudest 
forms the suppression of information is often at 
least as wicked as an outright lie. We shall later 
refer to some of the cruder forms. But we have just 
seen that randomization loses its precision unless 
some information is suppressed, and we shall now 
argue more forcibly that it is a fallacy to suppose 
that the suppression of information is always 
culpable. 

One way of seeing this is in terms of digital 
communication. When an electrical “pulse” of a 
given shape is liable to have been attenuated and 
distorted by noise, some circuitry is often incorporated 
for the purpose of re-forming (“regenerating”) 
the pulse. This circuitry, as it were, accepts the 
hypothesis that the pulse is supposed to be present 
rather than absent. Since noise in the electronic 
system makes the probability less than one that 
the supposed pulse is present, the regeneration loses 
some information. But allowing for the nature of 
the subsequent communication channel, it can be 
proved that the loss is often more than compen- 
sated. (This will not be proved here, but it is not 
surprising to common sense.) In pedagogy the corresponding 
principle is that simplification is necessary 
when teaching beginners. In statistics the 
corresponding device is known as the reduction of 
the data, that is, the reduction of a mass of data 
to a more easily assimilable form. If the statistics 
are “sufficient,” then there is no loss of information, 
but we often have to be satisfied with “insufficient” 
statistics in order to make an effective reduction 
of the data. Thus it is fallacious to say that the 
suppression of information is always a statistical 
crime. (Note that apparently sufficient statistics 
might not really be so if the model is wrong. People 
sometimes publish, say, only a mean and variance 
of a sample, and this prevents readers from check- 
ing the validity of the model for which these statis- 
tics would be sufficient.) [See SUFFICIENCY.] 
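
The parenthetical warning can be illustrated with a short Python sketch. The two samples are hypothetical and were constructed to share the same mean and (population) variance while having visibly different shapes, so a reader given only those two numbers could not tell them apart or check an assumed model.

```python
import statistics

# Two hypothetical samples built to have identical mean and variance.
sample_a = [1, 1, 1, 1, 5, 5, 5, 5]     # two equal clusters
sample_b = [-1, 3, 3, 3, 3, 3, 3, 7]    # one cluster plus two outliers

summary_a = (statistics.mean(sample_a), statistics.pvariance(sample_a))
summary_b = (statistics.mean(sample_b), statistics.pvariance(sample_b))

print("summary A (mean, variance):", summary_a)
print("summary B (mean, variance):", summary_b)
print("same summaries:", summary_a == summary_b)
print("same data:", sorted(sample_a) == sorted(sample_b))
```

The published summaries coincide although the data do not, which is why a reduction that is sufficient under one model may conceal evidence that the model is wrong.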

Terminological ambiguities. Important exam- 
ples of terminological ambiguities occur both in the 
philosophy of statistics and in its practical appli- 
cations. Often they are as obvious as the “no cat” 
ambiguity, once they are pointed out. But before 
they are pointed out, they lead to a great deal of 
argument at cross purposes. Thus, many of the 
problems in the philosophy of probability clear 
themselves up as soon as we distinguish between 
various kinds of probability (see Good 1959a). 
(We shall not discuss here whether they can all be 
reduced to a single kind or whether they all “exist.” 
But they are all talked about.) [See PROBABILITY, 
article on INTERPRETATIONS.] 

There is tautological, or mathematical, prob- 
ability, which occurs in mathematical theories and 
requires no operational definition. It occurs also in 
the definition of a “simple statistical hypothesis,” 
which is a hypothesis for which some probabilities 
of the form P(E|H) are assigned by definition. 
(Here E represents an event or a proposition as- 
serting that an event obtains, and the vertical 
stroke stands for “given” or “assuming.”) There 
are physical, or material, probabilities, or chances. 
These relate to tautological probabilities by means 
of the linguistic axiom that to say that H is true is to 
say that the physical probability of E is P(E|H), 
for some class of events, E. There are logical prob- 
abilities, or credibilities. There are subjective, or 
personal, probabilities, which are the intensities of 
conviction that a man will use for betting purposes, 
after mature consideration. There are multisubjec- 
tive probabilities, belonging to groups of people. 
And there are psychological probabilities, which are 
the probabilities that people behave as if they ac- 
cept, even before applying any criterion to test their 
consistency. By confusing pairs of these six kinds 
of probability, fifteen different kinds of fallacy can 
be generated. For example, it is often said that 
there is no sense in talking about the probability
that a population parameter has a certain value,
for “it either has the value or it does not, and there- 
fore the probability is either 0 or 1.” It need hardly 
be mentioned, to those who are not choked with 
emotion, that the probability need not be 0 or 1
when it is interpreted not as a physical probability 
but as a logical or as a subjective probability. 

Even physical probabilities can be confused with 
each other, since they can be mistakenly referred 
to the same event. For example, apparent varia- 
tions in the incidence of some crime or disease 
from one place or time to another are very often 
found to be due to variations in the methods of 
classification. Adultery would appear to increase 
enormously if Christ’s definition were suddenly to 
be accepted in the law—“Whosoever looketh on a 
woman to lust after her hath committed adultery 
with her already in his heart” (Matthew 5.28). 
Since partners to adultery do not often turn in 
official reports, perhaps a better example is that of 
crime records. An example with documentation, 
quoted by Wallis and Roberts (1956), is that of 
felonies in New York. It was alleged that there had 
been an increase of 34.8 per cent from 1949 to 
1950, but later it appeared that this was at least 
largely due to a revised method of classification. 

This class of practical statistical fallacies is ex- 
tremely common in the social sciences, and one 
should be very much on guard against it. As a fur- 
ther example, two standard definitions of the num- 
ber of unemployed in the United States differ by 
a factor of over 3, namely, the “average monthly 
rate” and the “total annual rate.” Putting it roughly, 
one measure for any given year is the average 
monthly number of people unemployed; the other, 
larger measure is the number of people who were 
unemployed at any time during the year. [See 
EMPLOYMENT AND UNEMPLOYMENT.] 

An example of a terminological fallacy is the 
confusion of “some” and “all.” It is perpetrated by 
John Hughlings Jackson in the following excerpt: 
“To coin the word, verbalising, to include all ways 
in which words serve, I would assert that both 
halves of the brain are alike in that each serves in 
verbalising. That the left half does is evident, be- 
cause damage of it makes a man speechless. That 
the right does is inferable, because the speechless 
man understands all I say to him in ordinary mat- 
ters” (quoted in Penfield & Roberts 1959, p. 62). 
Some damage to the left hemisphere seems here to 
have been confused with destruction of all of it. 

Another example of the “some and all” fallacy 
is to assume that since some poems are better than 
others in the opinion of any reasonable judge, then, 
given any set of poems, one of them must be the 
best. More generally, the possibility of partial order-
ing is easily overlooked. But sometimes the as-
sumption of partial ordering, although truer than
complete ordering, is too complicated for a given 
application. In a beauty competition, for example, 
each girl might be the best of her kind, but it might 
be essential to award the prize to only one of them. 

In some social surveys respondents are asked to 
rank several objects in order of merit. An alternate 
design, which will often be less watered down by 
the need to reach decisions in doubtful cases, is to 
ask for comparisons of pairs of objects but to per- 
mit “no comparison” as a response for any given 
pair. 

Ignoring a relevant concomitant variable. “The 
death rate in the American Army in peacetime is 
lower than that in New York City. Therefore leave
New York and join the army.” The fallacy is that 
the methods of selection for the army are biased 
toward longevity, both by age and by health, and 
these clearly relevant variables have been ignored. 
The fallacy can also be categorized as failure to 
control for exposure, since many inhabitants of 
New York City are subject to the possibility of 
death from infant diseases, chronic diseases, and 
old age, whereas very few men in the army are so 
exposed. The example can also be regarded as one 
of “biased sampling,” a category of fallacy to be 
considered later. 

Ignoring half of a contingency table. It is com- 
monly believed that government scientists in the 
United Kingdom earn more on the average than 
university teachers. But, as Rowe pointed out 
(1962), the average age of university teachers is 
less than that of government scientists, because a 
large proportion of lecturers leave the universities 
before their mid-thirties. Rowe showed that the 
median earning of university teachers above the 
age of 35 is greater than that of government sci- 
entists. But he did not estimate what these men 
would have earned in government service. Thus, 
although he refuted the original argument, he did 
not ask the really relevant question. This question 
is very difficult to answer. A possible approach, 
which would shed some light, would be to find out 
the distribution of salaries as a function of job, age, 
and intelligence quotient. 

The perennial problem here is which covariates 
to choose and where to stop choosing them, since 
the list of possibilities is typically impracticably 
large. A related issue is that the greater the num- 
ber of conditioning classificatory variables (dimen- 
sionality of the contingency table), the fewer the 
cases in the relevant cross-classification cell. This 
is one of the unsolved problems of actuarial sci-
ence, where the problem of estimating probabilities
in multidimensional contingency tables is philo- 
sophically basic (for some discussion of this prob- 
lem, with references, see Good 1965). 

It sometimes happens that a fact is almost uni- 
versally ignored, although in retrospect it is clearly 
highly relevant. In the late 1950s people in the 
United States were arguing that the standard of 
college teaching staffs was deteriorating, since the 
proportion of newly employed teachers who held a 
doctorate was decreasing. What was overlooked 
was that the proportion of teachers who took their 
doctorates after becoming teachers was increasing. 
Cartter (1965) states that Bernard Berelson was 
almost alone in his correct interpretation of the 
situation. 

Biased sample. At one time, most known quasi- 
stellar radio sources lay approximately in a plane, 
and this seemed to one writer to have deep cosmo- 
logical significance. But these radio sources could 
not be definitely identified with optical sources un- 
less they were located with great accuracy, and for 
this purpose they had to be occluded by the moon. 
Also, as it happened, most of the observations had 
been made from the same observatory. Hence there 
was a very strong bias in the sampling of the 
sources (this was mentioned by D. W. Dewhurst 
in a lecture in Oxford on January 28, 1965). As 
Sir Arthur Eddington once pointed out, if you catch 
fish with a net having a 6-inch mesh, you are liable 
to formulate the hypothesis that all fish are more 
than 6 inches in length. [See ERRORS, article on
NONSAMPLING ERRORS.]

It is sometimes overlooked that atrocity stories 
usually form a biased sample. Newspapers tend to 
report the atrocities of political opponents more 
than those of friends. An exception was the Nazi 
atrocities, which were so great that the evidence 
for them had to be overwhelming before they could 
be believed. (For example, there appears to be 
no reference to them in the 1951 edition of the 
Encyclopaedia Britannica.) 

Sometimes inferences from a sample are biased 
because of seasonal variations. According to Starnes 
(1962), Democratic Secretary of Labor Willard 
Wirtz stated just before an election that over “four 
and a half million more Americans have jobs than 
when this Administration took office in January of 
1961.” Wirtz later admitted that the figure should 
have been 1.224 million, and he said, “It isn’t 
proper to compare January figures with October 
figures without a seasonal adjustment.” Similarly, 
the Republican governor of New York, Nelson A.
Rockefeller, once referred to a “net increase of
450,000 jobs” since he had taken office. The figure
is worthless because it again ignores the adjust- 
ment for seasonal variations. 

Bias is difficult to avoid in social surveys, for 
example, in the use of questionnaires, where poor 
wording is frequent and where one sometimes (es- 
pecially in political and commercial surveys) finds 
tendentious wording. 

Even with an unbiased sample, it is possible to 
get a biased conclusion by computing the signifi- 
cance level of various tests of the null hypothesis 
and selecting the one most favorable to one’s wishes. 
Although these tests are based on the same sample 
and are therefore statistically interdependent, there 
will be a reasonable probability that one out of 
twenty such tests will reach a 5 per cent signifi- 
cance level. A suggestion of how to combine such 
“parallel” tests is given by Good (1958a). 
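
As a rough illustration of why selecting the most favorable of several tests is misleading, the sketch below computes the chance that at least one of k tests reaches the 5 per cent level when the null hypothesis is true. It assumes, purely for simplicity, that the tests are statistically independent, which parallel tests based on the same sample are not; the function name and the values of k are illustrative only.

```python
def prob_at_least_one_significant(k, alpha=0.05):
    """Chance that at least one of k tests reaches level alpha when the
    null hypothesis is true (independence of the tests is an assumption)."""
    return 1.0 - (1.0 - alpha) ** k

for k in (1, 5, 20):
    print(k, round(prob_at_least_one_significant(k), 3))
```

Under this simplifying assumption the chance rises from 5 per cent for a single test to well over one half for twenty tests; dependence among the tests changes the figure but not the moral.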

The suppression of the uninteresting. Suppose 
we have done an experiment, and it reaches a sig- 
nificance level of 5 per cent. Should we reject the 
null hypothesis? Perhaps the experiment has been 
performed by others without significant results. If 
these other experiments were taken into account, 
the total significance of all the experiments com- 
bined might be negligible. Moreover, the other re- 
sults might have been unpublished because they 
were nonsignificant and therefore uninteresting. 
This explains why some apparent medical ad- 
vances do not fulfill their early promise. The pub- 
lished statistics are biased in favor of what is 
interesting. As one physician said, “Hasten to use 
the remedy before it is too late” (Good 1958b, 
p. 283; Sterling 1959). 

Sample too small. One of the most frequent 
and elementary statistical fallacies is the reliance 
on too small a sample. In 1933 Meduna, believing 
that schizophrenia and epilepsy were incompatible 
because of the rarity of their joint occurrence, 
started to induce convulsions in mental patients by 
chemical means. Consequently, the beneficial effect 
of convulsions on depressives was eventually acci- 
dentally discovered. Meduna’s sample was too small, 
and in fact it has now been found that schizo- 
phrenia and epilepsy are positively correlated 
(Slater & Beard 1963). One moral of this story is 
that experiments can be worth trying without theo- 
retical reason to believe that they might be suc- 
cessful. 

Misleading use of graphs and pictures. Graphs 
and pictures are often used in newspapers in the 
hope of misleading readers who are not experienced 
in interpreting them. Sometimes graphs are inade-
quately labeled; sometimes the scale is chosen so
as to make a small slope appear large; sometimes
the graph is drawn on a board and the board is 
pictured in perspective so as to accentuate the most 
recent slope; sometimes too little of a time series 
is shown, and the graph is started at a trough 
(a device that is useful for salesmen of stocks, 
when they wish the public to invest in a particular 
equity).

A useful method for misleading with pictures is 
to depict, say, salaries by means of objects such as 
cash boxes whose linear dimensions are propor- 
tional to the salaries. In this way an increase is 
made to appear much larger than it really is. An- 
other useful method for misleading the public is 
attributed by Huff (1954) to the First National 
Bank of Boston. The bank represented governmen- 
tal expenditure by means of a map of the United 
States in which states of low population densities 
were shaded to indicate that total government 
spending was equal to the combined income of the 
people of those states. The hope was that the reader 
would get the impression that federal spending, as 
a fraction of the total income of the United States, 
was equal to the total area of the shaded states 
divided by the whole area of the country. [See 
GRAPHIC PRESENTATION.]
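
The cash-box device works because an object whose linear dimensions are proportional to a quantity has an area proportional to its square and a volume proportional to its cube. A minimal sketch, with hypothetical salary figures chosen only for illustration:

```python
def apparent_ratio(old_salary, new_salary, dimensions=3):
    """Ratio of pictured sizes when every linear dimension of the picture
    is scaled in proportion to the salary."""
    return (new_salary / old_salary) ** dimensions

old, new = 1000.0, 2000.0   # hypothetical salaries
print(new / old)                                # true ratio: 2.0
print(apparent_ratio(old, new, dimensions=2))   # flat picture, by area: 4.0
print(apparent_ratio(old, new, dimensions=3))   # solid box, by volume: 8.0
```

A doubling of salary is thus pictured as an eightfold increase when the reader judges the boxes by volume.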

“Smaller” versus “smaller than necessary.” The 
confusion of “smaller” with “smaller than neces- 
sary” will be illustrated in a hereditary context, and 
an oversimplification of the theory of natural selec- 
tion will be pointed out. Let us suppose that it is 
true that intelligent people tend to have fewer chil- 
dren than less intelligent people and that the level 
of intelligence is hereditary. (We are not here con- 
cerned with whether and where this supposition is 
true, nor with the precise interpretation of “intelli- 
gent.”) It then appears to follow that the average 
level of intelligence will necessarily decline. This 
fallacy will be perpetrated on most readers of Chap- 
ter 5 of the book by the eminent zoologist Peter B. 
Medawar (1960, p. 86), in spite of the words itali- 
cized by us in the following quotation: “If innately 
unintelligent people tend to have larger families, 
then, with some qualifications, we can infer that 
the average level of intelligence will decline.” In 
order to show that the argument without the quali- 
fication is invalid it is sufficient to use a mathe- 
matical model that, for other purposes, would be 
much oversimplified (see Behrens 1963). Imagine 
a population in which 10 per cent of men are in- 
telligent and 90 per cent are unintelligent and that, 
on the average, 100 intelligent fathers have 46 
sons, of whom 28 are intelligent and 18 unintelli-
gent, whereas 100 unintelligent fathers have 106
sons, of whom 98 are unintelligent and 8 are intel-
ligent. It will be seen from Table 1 that the propor-
tion of intelligent males would remain steady in
expectation.

Table 1. Hypothetical proportions of intelligent and
unintelligent sons

                                 SONS
FATHERS                Intelligent    Unintelligent
100 intelligent             28              18
900 unintelligent           72             882
Total                      100             900
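
The arithmetic behind Table 1 can be checked with a short sketch. The rates are those of the hypothetical model in the text; the function and variable names are introduced here only for illustration.

```python
from fractions import Fraction

# Expected sons per father in the hypothetical model: an intelligent father
# has 0.28 intelligent and 0.18 unintelligent sons; an unintelligent father
# has 0.08 intelligent and 0.98 unintelligent sons.
SONS_PER_FATHER = {
    "intelligent": (Fraction(28, 100), Fraction(18, 100)),
    "unintelligent": (Fraction(8, 100), Fraction(98, 100)),
}

def next_generation(intelligent, unintelligent):
    """Expected numbers of (intelligent, unintelligent) sons."""
    i_from_i, u_from_i = SONS_PER_FATHER["intelligent"]
    i_from_u, u_from_u = SONS_PER_FATHER["unintelligent"]
    return (intelligent * i_from_i + unintelligent * i_from_u,
            intelligent * u_from_i + unintelligent * u_from_u)

sons = next_generation(100, 900)
share = sons[0] / (sons[0] + sons[1])
print(sons, share)
```

Although intelligent fathers have fewer than half as many sons as unintelligent fathers in this model, the expected share of intelligent sons is again one tenth.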

But now it must be determined whether the 
right question is being asked. Suppose we were 
convinced that the general level of intelligence was 
decreasing, and we made suggestions accordingly 
for encouraging the more intelligent to have more 
children. Should we not put these suggestions for- 
ward even if the general level of intelligence were 
increasing? Would we not like to see the rate of 
increase also increase? Yes, of course. Looking 
from this point of view, we might not fully agree 
with Medawar’s arguments, but we might well agree 
with some of his recommendations. 

“Regression fallacy.” If we select a short or tall 
person at random, the chances are that his rela- 
tives will be closer in average height than he is to 
the mean height of the population. Francis Galton 
described this phenomenon as “regression.” If now 
we consider the heights of the sons of tall men and 
of short men, we might infer that the variability 
of heights is decreasing with time. This would be 
an example of the regression fallacy. One way of 
seeing that the argument must be fallacious is by 
considering the heights of the parents of short and 
tall people: we would then infer that the variability 
of heights is increasing with time! 
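
The point can be illustrated by a small simulation. This is only a sketch: heights are expressed in standard units, and the father-son correlation of 0.5, the sample size, and the cutoff for "tall" are assumptions made for illustration.

```python
import random
import statistics

random.seed(0)
R = 0.5        # assumed father-son correlation (illustrative)
N = 20000      # number of simulated father-son pairs

fathers = [random.gauss(0.0, 1.0) for _ in range(N)]
# Sons are constructed to have the same mean and variance as fathers.
sons = [R * f + (1 - R * R) ** 0.5 * random.gauss(0.0, 1.0) for f in fathers]

tall_fathers = [f for f in fathers if f > 1.0]
sons_of_tall_fathers = [s for f, s in zip(fathers, sons) if f > 1.0]
tall_sons = [s for s in sons if s > 1.0]
fathers_of_tall_sons = [f for f, s in zip(fathers, sons) if s > 1.0]

print(statistics.mean(tall_fathers), statistics.mean(sons_of_tall_fathers))
print(statistics.mean(tall_sons), statistics.mean(fathers_of_tall_sons))
print(statistics.pvariance(fathers), statistics.pvariance(sons))
```

Sons of tall fathers are on average nearer the mean than their fathers, and fathers of tall sons are nearer the mean than their sons, yet the variance is essentially the same in both generations.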

Wallis and Roberts (1956) mention several other 
examples of the regression fallacy. One is the wide- 
spread belief that the second year in the major 
leagues is an unlucky one for new baseball players 
who have successfully finished their first year. 

Invalid use of formulas or theorems. The use 
of formulas or theorems in situations where they 
are not valid is a special case of the deus ex machina 
class of fallacies and is very frequent. The follow- 
ing are a few examples. 

Implicit assumption of independence. In an ex- 
periment consisting of n trials, each successful with 
probability p, is the variance (the square of the 
standard deviation) of the number of successes 
equal to np(1 − p), as it would be if independence
held? (An example would be the quality inspection 
of items on an assembly line.) The formula is so
familiar that it is tempting to assume that it is 
always a good approximation. But familiarity breeds 
mistakes. For a Markov chain the variance can be 
quite different (see, for example, Good 1963), as 
it can also be when sampling features of children 
in families or fruit on trees. 
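
How far the variance can depart from np(1 − p) may be seen in the simplest dependent case. The sketch below assumes a stationary two-state Markov chain, for which the covariance between trials k steps apart is p(1 − p) times the k-th power of the lag-1 correlation; the function name and the parameter rho are introduced here for illustration.

```python
def variance_of_successes(n, p, rho=0.0):
    """Variance of the number of successes in n trials, each successful with
    probability p, when the trials form a stationary two-state Markov chain
    with lag-1 correlation rho (rho = 0 gives independent trials)."""
    covariance_terms = sum((n - k) * rho ** k for k in range(1, n))
    return p * (1 - p) * (n + 2 * covariance_terms)

n, p = 100, 0.1
print(variance_of_successes(n, p))             # independent trials: n p (1 - p)
print(variance_of_successes(n, p, rho=0.5))    # positively dependent trials
print(variance_of_successes(n, p, rho=-0.1))   # negatively dependent trials
```

Positive dependence inflates the variance and negative dependence deflates it, so that the binomial formula can err in either direction.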

Another example of a fallacious assumption of 
independence relates to the variability of physio- 
logical traits. Why, even if there were only eight 
traits, each trichotomized into equal thirds, only 
one person out of 3^8 = 6561 would be in the middle
(normal) group for all eight traits! 

Assuming form determines distribution. Let $n_{ij}$
be the frequency of the “dinome,” that is, pair of
adjacent digits $(i, j)$, in a sequence of $N$ random
sampling digits ($i, j = 0, 1, 2, \ldots, 9$). Clearly
$\sum_{i,j} n_{ij} = N - 1$. Let

    $\psi^2 = \frac{100}{N-1} \sum_{i,j} \left( n_{ij} - \frac{N-1}{100} \right)^2$.

It has been erroneously assumed at least four times
in the statistical literature that $\psi^2$ has asymptoti-
cally (for large $N$) a tabular chi-squared distribu-
tion. In one case this led to the unfair rejection of
a method of producing pseudo random numbers.
Presumably the erroneous distribution arose from
the typographical identity of the expression for $\psi^2$
with the familiar statistic of the chi-square test.
(For references to three of these papers and to a
paper that gives a correct method of using $\psi^2$, see
Good 1963.) The misapplication of the above so-
called serial test is particularly disastrous when
working with binary digits (0 and 1), that is, with
base 2.
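
For concreteness, the statistic can be computed as in the following sketch, which counts overlapping pairs of adjacent digits and compares each count with its expected value. The function name and the `base` parameter are introduced here for illustration; the code computes the statistic only and makes no claim about its distribution.

```python
from collections import Counter

def dinome_psi_squared(digits, base=10):
    """Serial-test statistic for overlapping pairs of adjacent digits:
    the sum over all base**2 possible pairs of
    (observed - expected)**2 / expected, with expected = (N - 1) / base**2."""
    n_pairs = len(digits) - 1
    counts = Counter(zip(digits, digits[1:]))   # overlapping pairs (i, j)
    expected = n_pairs / base ** 2
    return sum((counts.get((i, j), 0) - expected) ** 2 / expected
               for i in range(base) for j in range(base))

print(dinome_psi_squared([0, 1, 0, 1], base=2))
```

Because each digit other than the first and last belongs to two pairs, the counts are not those of independent trials, which is one way of seeing why the tabular chi-squared distribution cannot be taken for granted.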

Assuming the winner leads half the time. There 
is a fallacy in assuming that in a long sequence of 
statistically independent fair games of chance be- 
tween two players the ultimate winner will be in 
the lead about half the time. This is a misapplica- 
tion of the law of large numbers. That it is a fallacy 
depends on one of the most surprising theorems in 
the theory of probability, the so-called arc sine law. 
In fact, the probability that a specified player will 
be in the lead for less than a fraction x of the time 
is approximately (2/π) arc sin x^(1/2) (see, for exam-
ple, Feller 1950-1966, vol. 1, p. 251). This implies
that however long the game, it is much more likely 
that a specified player will be ahead most of the 
time or behind most of the time than that he will 
be about even; for example, the probability that a 
specific player will be ahead 90 per cent or more 
of the time, or behind 90 per cent or more of the 
time, is about .40, while the probability that the 
player will be ahead between 40 and 60 per cent
of the time is only about .13. As Feller says, the
arc sine law “should serve as a warning to those 
who easily discover ‘obvious’ secular trends” in eco- 
nomic and social phenomena. 
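
The figures quoted from the arc sine law can be reproduced approximately with a few lines of code. The sketch uses the asymptotic formula (2/π) arc sin of the square root of x; the function name is introduced here for illustration.

```python
import math

def prob_lead_fraction_below(x):
    """Arc sine law approximation: chance that a specified player leads
    for less than a fraction x of a long sequence of fair games."""
    return (2.0 / math.pi) * math.asin(math.sqrt(x))

# Ahead at least 90 per cent of the time, or behind at least 90 per cent.
extreme = prob_lead_fraction_below(0.1) + (1.0 - prob_lead_fraction_below(0.9))
# Ahead between 40 and 60 per cent of the time.
middle = prob_lead_fraction_below(0.6) - prob_lead_fraction_below(0.4)
print(round(extreme, 2), round(middle, 2))
```

The two probabilities come out at roughly four tenths and a little over one eighth, in line with the rounded values given above.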

The “maturity of the chances.” An elementary 
misapplication of the law of large numbers, or “law 
of averages,” is known as the maturity of the 
chances. In World War I many soldiers took shelter
in bomb craters on the grounds that two bombs 
seldom hit the same spot. For the same reason 
P. S. Milner-Barry, the British chess master, de- 
cided to retain his London flat after it was bombed 
in World War II. As a matter of fact it was bombed
again. At roulette tables, it is said, the chips pile 
up on the color that has not occurred much in 
recent spins. Of course, in practice, if a coin came 
down heads fifty times running, it would be more 
likely than not, in logical probability, to come down 
heads on the next spin, not less likely. In fact, it 
would probably be double-headed. There are cir- 
cumstances, of course, when an event is less likely 
to occur soon after it has just occurred: this would 
be true for some kinds of accidents and in many 
situations where one is sampling without replace- 
ment. Usually the question is basically empirical, 
but the expression “maturity of the chances,” or 
“Monte Carlo fallacy,” usually refers to sequences 
of events that are statistically independent, at least 
to a good approximation. 
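
The remark about the double-headed coin can be made quantitative with Bayes' theorem. The sketch below is illustrative only: the initial probability of one in a thousand million that the coin is double-headed is a hypothetical figure, and a fair coin is taken as the only alternative.

```python
from fractions import Fraction

def posterior_double_headed(prior, heads_in_a_row):
    """Posterior probability that the coin is double-headed after an
    unbroken run of heads (the alternative being a fair coin)."""
    likelihood_fair = Fraction(1, 2) ** heads_in_a_row
    return prior / (prior + (1 - prior) * likelihood_fair)

prior = Fraction(1, 10**9)      # hypothetical prior, chosen for illustration
post = posterior_double_headed(prior, 50)
prob_next_head = post * 1 + (1 - post) * Fraction(1, 2)
print(float(post), float(prob_next_head))
```

Even with so small an initial probability, fifty heads in succession make the double-headed hypothesis almost certain, so that the next toss is far more likely than not to give heads.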

Law of large numbers misapplied to pairs. A 
mnemonic for the fallacy of misapplying the law 
of large numbers when considering pairs of objects 
selected from a set is the well-known “birthday 
problem.” If 24 people are selected at random, then 
it is more likely than not that at least one pair of 
them will have the same birthday (that is, month 
and day). This is simple to prove, but a good intui- 
tive “reason” for it is that the number of pairs 
of people in a group of 24 people is 276, and 
exp(−276/365) < 1/2. (The crude argument here is
based on a Poisson approximation to the probability 
of no “successes” in 276 roughly independent trials 
with common success probability 1/365.) The re- 
sult is true a fortiori if births are not distributed 
uniformly over days of the year. 
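
Both the exact calculation and the crude Poisson argument are easily checked. The sketch assumes 365 equally likely and independent birthdays; the function name is introduced here for illustration.

```python
import math

def prob_shared_birthday(people, days=365):
    """Exact chance that at least two of `people` share a birthday,
    assuming birthdays are independent and uniform over `days`."""
    p_all_different = 1.0
    for k in range(people):
        p_all_different *= (days - k) / days
    return 1.0 - p_all_different

people = 24
pairs = math.comb(people, 2)                 # number of pairs of people
poisson_no_match = math.exp(-pairs / 365)    # crude Poisson approximation
print(pairs, round(poisson_no_match, 3), round(prob_shared_birthday(people), 3))
```

The approximate chance of no coincidence falls below one half, and the exact chance of at least one coincidence exceeds one half, as stated.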

C. R. Hewitt, in a “Science Survey” program of 
the British Broadcasting Corporation in Febru- 
ary, 1951, stated that the probability is less than 
1/64,000,000,000 that two fingerprints of different 
people will be indistinguishable. From this he in- 
ferred that no two people have indistinguishable 
fingerprints and thus committed the birthday fal- 
lacy. The argument is fallacious even if we ignore 
resemblances of fingerprints among relatives, since 
the number of pairs of people in the world exceeds
4,000,000,000,000,000,000. The conclusion might
be correct. 
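
The orders of magnitude involved can be set side by side in a short sketch. The world population of three thousand million is an assumed round figure for the period, and Hewitt's upper bound is treated, for the sake of argument, as if it were the actual probability for every pair.

```python
import math

world_population = 3_000_000_000     # assumed round figure
p_match = 1 / 64_000_000_000         # Hewitt's bound, taken as the value
pairs = math.comb(world_population, 2)
expected_matching_pairs = pairs * p_match
print(pairs > 4 * 10**18, round(expected_matching_pairs))
```

On these assumptions the expected number of indistinguishable pairs runs to tens of millions, so the quoted probability by itself cannot support the inference; the conclusion could be rescued only if the true probability were very much smaller than the bound.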

A similar fallacy arises in connection with pre-
cognition. Suppose, entirely unrealistically, that
there is just one remarkable and well-documented 
case of somebody in the world having an appar- 
ently precognitive dream. How small must the ap- 
parent probability be in order that the report, if 
true, should by itself convince us of precognition? 
Presumably its reciprocal should be at least of the 
order of the population of the world times the num- 
ber of dream experiences of a man times the num- 
ber of his waking experiences. This triple product 
might be as large as 1,000,000,000,000,000,000,-
000,000,000 (or 10^27). This informal application
of statistics should discourage a too ready assump- 
tion that the evidence from apparently precognitive 
dreams is overwhelming. A formal application of 
statistical methods to this problem is very difficult. 
This discussion is not intended to undermine a 
belief in the possibility of precognition, but it is a 
plea for a better evaluation of the evidence. 

Failure to use precise notation. An example of 
the fallacy of failure to use sufficiently explicit 
notation is given by the “fiducial argument.” The 
purpose of R. A. Fisher’s fiducial argument (1956, 
pp. 52-54) was to produce a final (posterior) dis- 
tribution for a parameter without assuming an 
initial (prior) distribution for it. This was ambi- 
tious, to say the least, since de nihilo nihilum. 

The argument starts off from a parametric dis- 
tribution for a random variable, X. Fisher selected 
an example of which the following is a special 
case. For each positive number $x_0$, suppose that

    $P(X > x_0 \mid \theta) = \exp(-x_0 \theta)$,

where $\theta$ is a positive parameter in whose value
and final distribution we are interested. Writing
$x_0 = u/\theta$, we get $P(X\theta > u \mid \theta) = \exp(-u)$. From
this it can be proved, using the usual axioms of
probability theory (although Fisher omitted the
proof), that $P(X\theta > u) = \exp(-u)$ for any positive
number, $u$, provided that an initial distribution for
$\theta$ is assumed to exist. (It is not necessary to as-
sume that this distribution is in any sense known.)
Hence $P(\theta > \theta_0) = \exp(-x\theta_0)$, where $\theta_0 = u/x$.
Fisher infers from this that

    $P(\theta > \theta_0 \mid x) = \exp(-x\theta_0)$,

where $\theta_0$ is any real positive number. But this last
equation does not follow from the axioms of proba-
bility unless the initial probability density of $\theta$ is
proportional to $1/\theta$. The fallacy in the fiducial
argument was due to Fisher’s failure to indicate 
what is “given” in his probability notation. So great 
was Fisher’s authority that there are still many
statisticians who make use of the fiducial argu- 
ment; thus the analysis given here is currently con- 
sidered controversial [see FIDUCIAL INFERENCE]. 
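
The role of an initial density proportional to the reciprocal of the parameter can be checked directly. The following is a sketch of the calculation, assuming that the random variable has an exponential density given the parameter (as the survival function in the example implies) and writing $\pi$ for the initial density; the symbols $\pi$ and $f$ are introduced here only for illustration.

```latex
\pi(\theta) \propto \frac{1}{\theta}, \qquad
f(x \mid \theta) = \theta e^{-x\theta}
\;\Longrightarrow\;
\pi(\theta \mid x) \propto \frac{1}{\theta}\,\theta e^{-x\theta} = e^{-x\theta},
\qquad
P(\theta > \theta_0 \mid x)
  = \frac{\int_{\theta_0}^{\infty} e^{-x\theta}\,d\theta}
         {\int_{0}^{\infty} e^{-x\theta}\,d\theta}
  = e^{-x\theta_0}.
```

With a different initial density the result changes. A uniform initial density, for instance, gives a final density proportional to $\theta e^{-x\theta}$ and hence $P(\theta > \theta_0 \mid x) = (1 + x\theta_0)e^{-x\theta_0}$, which is not the fiducial result.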

Assuming order of operations reversible. An ex- 
ample of the fallacy of assuming that the order of 
two mathematical operations can be interchanged 
is the assumption that the expectation of a square 
is equal to the square of the expectation. This 
occurs in M. J. Moroney (1951, p. 250), where he 
says that evidently the expected value of chi- 
square for a multinomial distribution is zero. 
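
That the expectation of a square generally exceeds the square of the expectation is seen at once in a small exact example. The fair die used here is merely an illustration and is not taken from Moroney.

```python
from fractions import Fraction

faces = range(1, 7)                                       # a fair six-sided die
mean = sum(Fraction(x, 6) for x in faces)                 # E[X]
mean_of_square = sum(Fraction(x * x, 6) for x in faces)   # E[X^2]
variance = mean_of_square - mean ** 2                     # never negative

print(mean_of_square, mean ** 2, variance)
```

The difference between the two quantities is the variance. In the same way, each deviation entering chi-square has expectation zero, but its square has a positive expectation.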

Correlation and causation. Positive correlation 
does not imply causation, either way round. There 
is a positive correlation between the number of 
maiden aunts one has and the proportion of cal- 
cium in one’s bones. But you cannot acquire more 
maiden aunts by eating calcium tablets. (Younger 
people tend to have more maiden aunts and more 
bone calcium.) In the New Hebrides people in good
health are lousier than people with fever. The ad- 
vice to acquire lice cannot be rationally given, since 
lice avoid hot bodies [see Huff 1954, p. 99; see also 
CAUSATION]. 

Zero correlation does not imply statistical inde- 
pendence, although it does so for a bivariate nor- 
mal distribution and for some other special families 
of distributions. 
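
A standard small example, not taken from the article, shows zero correlation without independence: a variable taking the values −1, 0, and 1 with equal probability, together with its square.

```python
from fractions import Fraction

# X is -1, 0, or 1 with equal probability, and Y = X**2 is determined by X.
outcomes = [(-1, 1), (0, 0), (1, 1)]
prob = Fraction(1, 3)

e_x = sum(prob * x for x, _ in outcomes)
e_y = sum(prob * y for _, y in outcomes)
covariance = sum(prob * x * y for x, y in outcomes) - e_x * e_y

p_x0_and_y0 = prob                  # the single outcome (0, 0)
p_x0_times_p_y0 = prob * prob       # P(X = 0) * P(Y = 0)
print(covariance, p_x0_and_y0 == p_x0_times_p_y0)
```

The covariance is exactly zero, yet the joint probability differs from the product of the marginal probabilities, so the two variables are dependent.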

If there is a positive correlation between A and 
B and also between B and C, this does not imply 
that the correlation between A and C is positive, 
even for a trivariate normal distribution. But the 
implication does follow if the sum of the squares 
of the first two correlation coefficients exceeds 
unity. 
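
The condition follows from the requirement that a correlation matrix of three variables have a non-negative determinant, which confines the third correlation to an interval around the product of the other two. A minimal sketch, with illustrative values and a function name introduced here:

```python
import math

def correlation_bounds(r_ab, r_bc):
    """Interval of possible values for the correlation between A and C,
    given the correlations r_ab and r_bc."""
    slack = math.sqrt((1 - r_ab ** 2) * (1 - r_bc ** 2))
    return r_ab * r_bc - slack, r_ab * r_bc + slack

print(correlation_bounds(0.6, 0.6))   # squares sum to 0.72: lower bound negative
print(correlation_bounds(0.8, 0.8))   # squares sum to 1.28: lower bound positive
```

The lower bound is positive exactly when the sum of the squares of the two given coefficients exceeds unity.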

If the time order is wrong, then causation is un- 
likely, to say the least. In one survey vaccination 
was found to be positively correlated with various 
infectious diseases, when one looked at different 
districts in India. This was used by antivaccination- 
ists for propaganda. If they had not been emo- 
tionally involved, they would probably have noticed 
that in several districts increased vaccination had 
followed an increase in the incidence of disease 
(Chambers 1965). 

Post hoc, ergo propter hoc (“after this, therefore 
because of this”). D. O. Moberg, in a lecture in 
Oxford on February 2, 1965, stated that premarital 
intercourse seemed to be positively correlated with 
divorce and inferred that the propensity to divorce 
was increased by premarital intercourse. The infer- 
ence might be correct, but an equally good explana- 
tion is that premarital intercourse and divorce are 
both largely consequences of the same attitude 
toward the institution of matrimony. It is also pos-
sible that untruthful responses are associated with
a propensity to divorce or with a propensity to 
avoid divorce. 

Ecological correlation. Suppose we find that in 
American cities the illiteracy rate and the percent- 
age of foreign-born are associated. This does not 
imply the same association for individuals (see 
Goodman 1959). It would even be possible that 
every foreign-born person was highly literate. Cities 
might attract foreign-born people and also attract 
or produce illiteracy. 

Wrong criteria for suboptimization. Granted 
that in most decision problems it is not so much a 
matter of optimization as of “suboptimization,” that 
is, of approximate optimization, there is still an 
acute problem in choosing what to suboptimize. 
Various fallacies arise through choosing a wrong 
criterion or through not using a criterion at all (see 
Koopman 1956; Good 1962). Often a criterion is 
selected from too narrow a point of view, ignoring 
questions of consistency with higher-level criteria. 
For example, when coeducation at New College, 
Oxford, was being discussed at another Oxford col- 
lege, the question of the relative requirements for 
education of men and women was ignored, but the 
effect on the atmosphere of the senior common 
room was mentioned. Another fallacy is to ignore 
the “spillover,” or side effects, of some project. 
Sometimes, when an urgent decision is required, 
the cost in delay of detailed theory is unjustifiably 
ignored. At other times the cost of the theory is said 
to be too heavy, and the fact is overlooked that the 
results of this theory might be valuable in similar 
circumstances in the future and that the training 
of the theoretician is important. Sometimes the cri- 
terion of profitability is given too little weight, 
sometimes too much (see also McKean 1958; Hitch 
& McKean 1954). 

Statistics of statistical fallacies. There is some 
unpublished work by Christopher Scott on the sta- 
tistics of statistical fallacies and errors for the 
specialized field of sample surveys conducted by 
mail. Scott read the 117 articles and research re- 
ports that had been written in English on this topic 
up to the end of 1960. He excluded 22 of the re- 
ports either because they were duplicates of others 
or because they gave almost no details of method. 
Of the remaining 95 articles, he found one or more 
definite errors in 54 and definite shortcomings in 
another 13. Among the definite errors there were 
14 cases in which the experimental variable was 
not successfully isolated, that is, a change in tech- 
nique was reported as causing a change in the re- 
sult, whereas the latter change could reasonably be 
ascribed to variation in some concomitant variable. 


There were 9 cases in which obviously relevant 
data, such as sample size or response rate, were 
not reported, and 7 cases in which a necessary sig- 
nificance test was not given. There is not space 
here for further details, and hopefully they will be 
published elsewhere. 

For misuses of the chi-square test, see Lewis and 
Burke (1949). 

Good fallacies. It is a fallacy to suppose that 
all fallacies are bad. A clearly self-contradictory 
epigram can be a neat way of conveying truth or 
advice, to everybody except quibblers. For example: 

“Only a half-truth can be expressed in a nut- 
shell.” 

“Everything in moderation.” 

“It would be a non sequitur if it were not a 
tautology.” 

“Races in which people were immortal became 
extinct by natural selection.” 

“There’s nothing wrong with chess players that 
not being people wouldn’t put right.” 


In this article it has been necessary to omit ref- 
erence to many kinds of fallacies. A more complete 
listing is given in the categorization of logical and 
statistical fallacies by Good (1959b). 


IRVING JOHN GOOD


[See also ERRORS, article on NONSAMPLING ERRORS.]
FAMILY


The articles under this heading discuss the cen- 
tral aspects of the family as a universal institution 
in society. Closely related are the entries KINSHIP; 
MARRIAGE. The reproduction function of the family 
is discussed in FERTILITY; FERTILITY CONTROL; 
the socializing function of the family is reviewed 
in SOCIALIZATION as well as in CULTURE AND 
PERSONALITY; DEVELOPMENTAL PSYCHOLOGY; IN- 
FANCY; PERSONALITY, article on PERSONALITY DE- 
VELOPMENT; MORAL DEVELOPMENT. Stages in 
family life are reviewed in ADOLESCENCE; AGING; 
DEATH; LIFE CYCLE; as well as in FAMILY, article 
on DISORGANIZATION AND DISSOLUTION. Specialized
aspects of the family are reviewed in ADOPTION; 
AFFECTION; INCEST; NUPTIALITY; PATERNALISM; 
SEXUAL BEHAVIOR. Many anthropologists, psycholo- 
gists, and sociologists have contributed to an un- 
derstanding of the family; the following biographies 
are of particular relevance: BACHOFEN; BURGESS; 
ENGELS; FRAZER; FREUD; GESELL; HALL; LE PLAY; 
MCLENNAN; MAINE; MALINOWSKI; MORGAN, LEWIS 
HENRY; RADCLIFFE-BROWN; TYLOR; WALLER; 
WESTERMARCK. 


I. COMPARATIVE STRUCTURE Raymond T. Smith
II. DISORGANIZATION AND DISSOLUTION Clifford Kirkpatrick


I
COMPARATIVE STRUCTURE


One of the few widely accepted generalizations 
of social science asserts that “the family” is an 
institution found in all human societies. There is 
controversy over just what constitutes the family 
and what its functions might be, but the generalization
itself is widely accepted. In modern Euro-American
societies it is accepted that the normal
family is a coresidential group which consists of 
a married couple and their own children and which 
lives apart from other kin. This has become the
reference type for comparative study, and it is ar- 
gued that in modern, highly differentiated societies 
such as those of North America and Europe the 
family has become a highly specialized agency ex- 
hibiting the essential and irreducible characteris- 
tics which are significant for cross-cultural com- 
parison (Parsons & Bales 1955, p. 10). 

In all human societies provision must be made 
for biological and social reproduction if the society 
is to continue, and it is generally assumed that the 
family performs at least these functions. Mating is 
never simply random, and children require a long 
period of care by a limited number of individuals 
with whom they develop relations of intimacy if 
they are to grow up as normal human beings ca- 
pable of playing adult roles. This conclusion is 
based partly upon experimental evidence (Bowlby 
1951) and partly upon inference from the fact that 
in all known societies children are raised in small 
kinship-based groups and there are customary 
modes of regulation between children and their so- 
cially recognized parents and between the parents 
themselves. Looked at in this way there is a very 
close relationship between the family and the do- 
mestic group, and both constitute systems of rela- 
tionship which vary over time. The form of these 
systems is tied to the physical processes of individ- 
ual birth, maturation, and death, so that families, 
like individuals, pass through developmental stages 
(Fortes 1958). The processes of individual matura- 
tion and death are social as well as physical, since 
individuals are participants in social systems as 
well as in biological processes. Societies live longer 
than either individuals or families, but societies 
exist only through the patterned interaction of in- 
dividuals who share common understandings. Mat- 
ing and procreation provide for the continuity of 
the biological species, while socialization relates 
it to the equally important continuity of social 
structure and cultural pattern. It has been argued
on both theoretical and empirical grounds that the 
primary function of the family is socialization, so 
that an intimate relation is established between bio- 
logical and social processes, a relation which is re- 
flected in the dependence of demographic trends 
upon social custom.

The generalizations set out above seem to hold 
true in a large number of different societies, but it 
is not certain to what extent the observations and 
descriptions used have been influenced by the def- 
initions adopted by field investigators. Some writers 
have expressed doubt that the family is a universal 
human institution, but both the assertion and its
refutation depend upon some agreed identification
of what is being discussed. 


Definitions 


Domestic groups. A domestic group may be de- 
fined as a group of people who habitually share a 
common dwelling and a common food supply. 
These minimal activities of domestic groups may 
be greatly extended, and domestic groups may vary 
in size and stability. The word “family” has its 
origin in a Latin word which could be roughly 
equated with “domestic group,” but for sociological 
purposes the two must be sharply distinguished. 
Domestic groups may be made up of individuals 
between whom no kinship ties exist, and, con- 
versely, members of one family may be distributed 
over two or more domestic groups. The term “house- 
hold” may be used interchangeably with “domestic 
group.” 

Biological family. In Euro-American societies
the basic model of kinship and family ties is that 
of biological relatedness and sexual intercourse, so 
that kin and familial relations are thought of in 
terms of physical descent or sexual relations. This 
is not so in all societies, even though it appears 
that family relations are almost invariably inti- 
mately associated with sexual regulation and re- 
production. Social relations of family and kinship 
can develop independently of genetic links or sex- 
ual relations, so that adoption or other forms of 
fictive kinship are just as real as blood ties. Even 
where actual genetic links exist they do not con- 
stitute social relations; these must be learned and 
developed separately. Similarly, marriage is always 
more than sexual mating. Schneider (see Aspects 
of the Analysis . . . 1965) has expressed in a par- 
ticularly pointed and convincing way the objections 
to confusing an analytical definition of kinship 
with definitions stressing biological relatedness. 

The relation between mother and child appears 
to be in a rather special category, since the human 
infant is part of its mother’s physical being before 
birth and continues to be dependent upon her or 
a substitute for a considerable period afterward. 
The development of artificial feeding has modified 
this close dependence, but experience in rearing 
motherless infants has shown that adverse effects 
result from deprivation of close continuous contact 
with a mother figure in early infancy. The “mother 
figure” can be any suitable person, and experiments 
with monkeys have shown that, for them, even 
contraptions of wire and cloth can serve as 
“mothers” in some sense. This raises the question 
of just what we mean when we use words such as
“mother” and should lead to a careful distinction
between the activities involved in mothering on 
the one hand and the biological relationship of the 
mother to the child on the other. In the case of the 
father-child relationship, the more frequent dis- 
junction between biological and social paternity 
has led to a distinction between pater and genitor. 
The former term is used to refer to a child's legally 
recognized father and the latter to his supposed bio- 
logical father. In some societies a child’s father 
(pater) may be a woman or a dead man. Among 
the Nuer of east Africa, the kin of a man killed in 
war before he had married would sometimes marry 
a woman to “his name.” After payment of the bride 
price the woman cohabited with a lover, but any 
children born to her were the legal offspring of the 
dead “husband,” inheriting from him and sacrific- 
ing to him as their ancestor. Cases such as this 
make clear the differences between biological and 
social links, and although the family is frequently, 
or even usually, coincident with a particular con- 
stellation of biological links, it cannot be defined 
in terms of them. 

Nuclear family. The term “nuclear family” (or 
“elementary,” “simple,” or “basic”) is most fre- 
quently used to refer to a group consisting of a 
man, a woman, and their socially recognized chil- 
dren. This is a straightforward use of the term; it 
refers to concrete groups, and the qualification 
“nuclear” suggests that this is the unit out of which 
more extensive family groups are built or grow. 
The group need not be coresidential provided reg- 
ular relationships are maintained among its mem- 
bers. It need not exist as a separate and isolated 
entity but may be contained within more extensive 
groups provided it is given some recognition. It has 
been suggested that the nuclear family is the uni- 
versal form of family relations, always fulfilling 
“distinctive and vital functions—sexual, economic, 
reproductive, and educational . . .” (Murdock 1949, 
p. 3). “The nuclear family is a universal human 
social grouping. Either as the sole prevailing form 
of the family or as the basic unit from which more 
complex familial forms are compounded, it exists 
as a distinct and strongly functional group in every 
known society” (p. 2). The evidence for this state- 
ment is far from conclusive, and much of it that 
does seem to support the generalization may be 
biased simply because of the way in which the data 
have been collected and presented. Cases such as 
the Nayar and the Ashanti (see below) tend to dis- 
prove the assertion, and this has now been recog- 
nized by Murdock. 

Parsons has argued that the nuclear family exhibits
characteristics which seem to be necessary
(on theoretical grounds) for the socialization of 
children and the stabilization of adult personalities 
(Parsons & Bales 1955). He has argued elsewhere 
that the incest taboo is also universal in human societies
for similar reasons and that this taboo results
in the perpetual creation of new nuclear-family 
groups through marriage (Parsons 1954). Even if 
one accepts the ideas that sexual intercourse must 
be regulated and that children must be cared for 
and socialized within small groups with the char- 
acteristics exhibited by the nuclear family, this 
does not necessarily mean that the nuclear family, 
as defined by Murdock, is universal. Levy and 
Fallers (1959) have suggested that a distinction 
should be made between the nuclear family as a 
concrete group and the “nuclear-family relationship 
complex.” The latter would consist of the relationships
of husband-wife, mother-son, mother-daughter,
father-son, father-daughter, brother-sister,
brother-brother, and sister-sister considered as a
system of interaction between roles. By looking at 
the nuclear family as a system of roles rather than 
as a concrete grouping of individuals, it is possible 
to see that role behavior appropriate to this com- 
plex might be distributed among a number of in- 
dividuals, groups, or agencies that do not them- 
selves constitute a single group. Malinowski showed 
that certain aspects of what he considered to be 
the normal “father” role are played by the mother’s 
brother in some societies with matrilineal inheri- 
tance and descent. Similarly, in some societies the 
“mother” role is played in whole or in part by 
mother surrogates, such as mother’s sisters, 
mother’s mothers, paid nurses, or teachers. In or- 
der to make statements such as these it is necessary 
to assume that we know what the “real” or “normal” 
nuclear-family role complex is, and here it is evident
that there is a tendency to take the Euro-American
family pattern as the type case. Despite
this drawback, the concentration of attention upon 
a role complex rather than concrete groups is a step 
toward freeing the concept of nuclear family from 
the rigidity imposed upon it by Murdock. If one 
asserts that the nuclear-family relationship complex
is institutionalized in all human societies (instead
of speaking of the universal occurrence of
nuclear-family groups), then it is possible to
account for the normal development of children 
brought up in groups that are not nuclear families 
in the restricted sense. Where one parent is missing 
or no siblings are present, children may still be 
affected by the institutionalized nuclear-family role 
complex, different elements of which are activated 
by other kinsmen, neighbors, teachers, or even
more remote individuals. We may also relate the 
necessary biological and social functions to this 
role complex rather than to concrete nuclear fam- 
ilies and leave it to empirical investigation to show 
just how the roles are embodied in a particular so- 
ciety. While this goes some way toward freeing 
investigation from the analytical restrictions im- 
posed by the concept of the universality of the nu- 
clear family, there are good grounds for going even 
further and asking just what is meant by the roles 
involved in the supposedly universal nuclear-family 
role complex. 

Compound family. The term “compound fam- 
ily” is used to refer to a concrete group formed 
through the amalgamation of nuclear-family units 
or parts of them. A polygynous household consist- 
ing of one man, his three wives, and their respec- 
tive children would constitute a compound family, 
as would a family group constituted by remarried 
widows or divorcees with children from a previous 
marriage. A compound family need not constitute 
a coresidential group. 

Joint family. According to the handbook Notes 
and Queries on Anthropology (see British Associa- 
tion . . . 1874), a joint family exists when “two or 
more lineally related kinsfolk of the same sex, their 
spouses and offspring, occupy a single homestead 
and are jointly subject to the same authority or 
single head.” An example would be a group consist- 
ing of a man and his wife with their married sons 
and their wives and children. It would be wrong to 
think of such a group as being a mere mechanical 
aggregate of nuclear families. Joint families gen- 
erally arise, exist, and persist because they carry out 
activities more extensive than would be possible for 
a nuclear-family group. Joint families grow as 
younger members bring in spouses rather than set- 
ting up independent households. As the younger 
married couples beget children, it may be possible to 
detect the existence of a number of nuclear-family 
cells within the structure of the joint family. The 
younger married couples will often have their own 
living quarters and may establish their own cooking 
facilities. If they have their own budgetary arrange- 
ments, then the joint family has really split—even 
if the young couples do not actually move out of the 
house. On the other hand, the joint family may be 
such a cohesive unit that it is difficult to see nuclear 
families within it as separate groups in any mean- 
ingful sense. The men may form one solidary group 
and the women another; the children may regard 
all the women of the house as their “mothers.” 
Eventually, as the original family grows too big,
either the joint family will split or some persons
will leave to form separate groups.

Extended family. An extended family is a dis- 
persed version of the joint family. That is, the 
members of the constituent groups of an extended 
family do not all live together in one dwelling. They 
usually live close together and engage in common 
activities. 

The concept of extended family really exhausts 
the usefulness of the word “family” for this kind of 
empirical classification, since an extended family 
is already a short lineage and can be discussed in 
terms of lineage theory, or it is a kindred and may 
be analyzed as such. Since birth is the criterion of 
membership in lineages and kindreds, they tend to 
develop considerable interest in the birth, training, 
and ultimate loyalties of their recruits, so that there 
is a close fit between these formations and the 
“families” which provide and train the recruits. 


Comparative family studies 


Because they are a part of all human experience 
and such a powerful agency in the formation of 
adult attitudes, familial relations have always been 
a subject of interest and discussion. Their chang- 
ing and varied forms have been studied by such 
diverse and ancient disciplines as theology, classics, 
law, and philosophy. This interest quickened during 
the period known in Europe as the Enlightenment, 
but it was the late nineteenth-century interest in 
evolution that stimulated extensive comparative 
studies of family forms. Fantastic theories were 
proposed in an attempt to reconstruct the history of 
mankind and to account for the existence of seem- 
ingly pointless customs and strange kinship ter- 
minologies among non-European peoples. Ba- 
chofen’s postulated development of society from 
promiscuity through a great period of mother right 
and female dominance to father right and patri- 
archy was an attempt to account for the recognized 
importance of matrilineal descent systems and the 
universal importance of motherhood. Other writers 
of the period argued in much the same way, al- 
though disagreeing on developmental priorities. 
Such writers as Sir Henry Maine, Edward Tylor, 
J. F. McLennan, Lewis H. Morgan, Friedrich En- 
gels, Sir James Frazer, E. Westermarck, and Robert 
Briffault carried forward the debate on origins, and, 
although their theories were highly speculative, 
they encompassed a wide range of cases and paid 
particular attention to historical sources. Despite 
the shortcomings of both theories and sources, 
these writers laid the foundations for modern 
studies by systematizing existing information on
the family, and they are still worth reading for
their insight into problems that have been neglected.

It is surprising that students of the family have
not paid more attention to a series of basic socio- 
logical works which concern themselves with the 
classification of social relationships. Maine, Tönnies,
and Durkheim, in particular, set up dichotomous
categories which they believed to represent
fundamentally different types of social relation- 
ship: relations of status as opposed to contract; 
relations of Gemeinschaft as opposed to Gesell- 
schaft; mechanical versus organic solidarity. In all 
these systems one pole was most clearly represented 
by relations of a familial or kinship type. Since it 
was generally assumed that we know what familial 
relations are, this reference was intended to be illustrative,
although Tönnies, in his Gemeinschaft
und Gesellschaft, attempted to show how natural 
will develops within the complex of relations that 
go to make up the family. The crux of the matter 
is that what we term familial relations are relations 
exhibiting a particular kind of solidarity that in- 
volves persons in total and permanent social rela- 
tions. This type of relationship is contrasted with 
the partial and specialized character of Gesellschaft 
or contractual relations. Just where the boundaries 
of familial relations are drawn varies from one so- 
ciety or subgroup to another and is, of course, 
affected by a large number of empirical factors.

Intensive field-work methods developed during
the early twentieth century by Boas, Kroeber, Lowie, 
and others in the United States and by Malinowski 
and Radcliffe-Brown in Britain produced a new 
body of detailed information on kinship, marriage, 
and the family. Malinowski’s work on the family 
in Australia and Melanesia was a study of prim- 
itive matriliny and was intended to show (in part 
at least) that the nuclear family emerges naturally 
even where there is ignorance of the male role in 
the physiological process of reproduction. The Tro- 
briand Islanders studied by Malinowski asserted 
that children are conceived when a spirit enters 
the body of a woman and that sexual intercourse 
is simply a matter of mutual satisfaction to the 
persons involved. Malinowski was impressed by 
the fact that men developed important social rela- 
tionships with their wives’ children even though 
they did not consider that there was any biological 
connection involved. Furthermore, men did not ex- 
ercise the kind of authority over their children that 
is often associated with “paternity”; the mother’s 
brother was more important in that respect. He 
concluded from these observations that the nuclear 
family is a universal institution but does not always
assume the emotional configuration suggested
by Freud’s theory of the Oedipus complex; it is
rather a “functional formation dependent upon the
structure and upon the culture of a society” (Malinowski
[1927] 1953, pp. 142-143). The implication
of this idea was that the nuclear-family complex
could be activated by various persons and
agencies—in the case of societies with matrilineal 
descent groups it may be that repressed hatred be- 
tween uncle and nephew replaces that found be- 
tween father and son in patriarchal societies. 

Radcliffe-Brown’s brilliant and lucid treatment 
of the jural significance of kinship ties enabled him 
to look at the family in terms of the formal duties, 
rights, and obligations between the members and 
between them and external groups. Whereas Mali- 
nowski regarded wider kinship ties as “mere exten- 
sions” of the nuclear-family complex and the senti- 
ments generated within it, Radcliffe-Brown spoke 
of the varying modes of incorporation, for jural 
purposes, of the unit of mother and children. He 
saw the relationship between a mother and her 
children as being somehow “basic” because of the 
close emotional and nurturant ties involved, while 
other kinship ties could be stressed or neglected to 
varying degrees depending upon the structure of 
the whole social and kinship system. In other words, 
Radcliffe-Brown’s concern was not with the family 
as a locus of emotional solidarity, or socialization 
functions, or sexual satisfactions; he was more 
concerned to explore the ways in which the empir- 
ical reference points of birth, sexual relations, and 
child rearing could be variously stressed for legal 
purposes. Apart from the apparent invariability of 
the mother-child relationship, he was prepared to 
accept the fact that all the other activities and 
functions could be distributed in widely varying 
ways. 

In the generation since these pioneers of field 
investigation, detailed work has been carried out in 
many parts of the world, yielding new information 
and continuous refinement of theory. The develop- 
ment of the Yale cross-cultural survey is an attempt 
to organize material from a wide range of sources 
in order to make inductive generalizations about 
human societies or to test particular hypotheses. 
Other writers make similar generalizations based 
upon a more selective use of sources. In either case 
it is evident that the generalizations are no better 
than the observations on which they are based. 
This is well demonstrated by Murdock’s generaliza- 
tions based on the cross-cultural survey files. Ob- 
servations reported in monographic studies depend 
to a considerable extent upon the investigator’s preconceived
ideas and categories or upon his interests.
In the field of family studies all these may be
severely limited. 

Parallel to the development of a wide-ranging 
comparative study of kinship and family structure, 
there has been a continuing tradition of interest 
in the family life of Europe and America. In the 
nineteenth century, Le Play constructed a typology 
of European family structures ranging from the 
patriarchal systems of the semifeudal eastern Eu- 
ropean areas, which resulted in the development of 
extended families, through the famille-souche, or 
stem-family, of the semi-industrial areas to what 
he considered to be the unstable and disorganized 
systems of the urban industrial areas. Le Play’s 
typology, or something very like it, seems to have 
regained favor as a universal system of classifica- 
tion (see Levy in Aspects of the Analysis . . . 1965). 
The stem-family type is of particular interest be- 
cause it seems to combine an emphasis upon the 
ideal of family continuity with an actual situation 
where most of the domestic units are small and 
self-liquidating. 

As a result of their work on the development of 
capitalism and its institutional structure, Marx and 
Engels became interested in the development of the 
family, an interest which was quickened by the 
publication of Lewis H. Morgan’s Ancient Society 
(1877). Marx read this work and made extensive 
notes on it, but it was Engels who published an 
extensive commentary on it after Marx’s death, 
under the title The Origin of the Family, Private 
Property and the State. In this work Marx’s general 
theories of social and economic history are com- 
bined with Morgan’s speculative history of kinship 
institutions. The most general conclusion of the 
book is that a stable monogamous family system, 
dominated by male authority and prescribed and 
supported by law, has really developed as a device 
for the perpetuation of the private ownership of 
property. This conclusion has been widely misin- 
terpreted to mean that Engels was opposed to any 
sort of family relationships and that the family 
should be abolished in socialist societies. On the 
contrary, Engels went out of his way to point out 
that the emotional and sexual elements involved in 
family relations can only be paramount when con- 
siderations of property do not enter into the pic- 
ture. Thus, he argued that the proletariat make and 
break marital unions only upon the basis of mutual 
attraction, whereas in bourgeois marriage the conjugal
pair are indissolubly united even when they
have no love for each other and perhaps engage in 
adultery with other available partners. 

The influence of Freud’s work upon comparative
family studies can only be mentioned in passing. It
has produced some of the most penetrating hypotheses
concerning the content of family relationships,
particularly in European and American societies
where psychoanalysis has been most widely
practiced as a therapeutic technique. Studies of 
the relation between culture and personality focus 
upon the familial milieu as the context of person- 
ality development, and students of this field have 
produced some of the most detailed and intimate 
documentation of family life in non-Western cul- 
tures. It is also evident that Freud’s thought has 
influenced the formation of hypotheses about the 
nature and functions of family structure, at least 
since the work of Malinowski. 


Theoretical and research problems 


Despite the considerable time and effort devoted 
to family studies and despite the refinement of 
definitions and the improvement of concepts, our 
knowledge of the family remains rudimentary. We 
use the term “family” in the loosest possible way, per- 
haps because of the very nature of the data them- 
selves. For comparative study it is necessary to 
exercise great care in the construction of theoretical 
models in order to avoid distortions in observation 
and description. The bulk of family theory is de- 
rived from the Euro-American cultural tradition 
and from the study of European and American 
societies. The development and extension of psy- 
choanalysis has produced both a large collection of 
empirical data and a growing sophistication in its 
interpretation. But this has been accompanied by 
a tendency to attribute functional necessity to ob- 
served patterns, so that there is a danger of assum- 
ing that the functions performed by small family 
groups in Euro-American societies must be per- 
formed by similar groups, similarly defined and 
constituted, in all societies. Parsons (see Parsons 
& Bales 1955) has tried to show the systematic 
relation between nuclear-family structure on the 
one hand and socialization for participation in 
wider social systems on the other. This analysis is 
very convincing in its establishment of the general 
conditions which seem to be necessary for adequate 
human socialization, but those conditions can be 
expressed in very general terms and could be met 
by a wide variety of actual persons and groups. If 
the nuclear family is to be thought of as that group 
that carries out primary socialization, then it is 
necessary to investigate the empirical composition 
of such groups in a wide range of societies. 

Should we continue to use the model of nuclear 
family or even of nuclear-family relationship complex
as a measure for cross-cultural comparison?
Is this anything more than a device for making all
observations fit our conception of what should 
exist? Or is it mainly a problem of language? Will 
more flexible terms that are not derived directly 
from a particular cultural tradition enable us to 
overcome these difficulties? Similar problems have 
arisen in debates over the question of whether all 
societies have “law” or “a political system.” In spite 
of some recent defense of the idea of the universal- 
ity of the nuclear family from linguistic analysts 
(Lounsbury 1965), it would seem to be desirable 
to keep an open mind on the question. 

Much of the work already done in the study of 
comparative family structure has been based upon 
the collection of information about norms, that is, 
the ideal modes of behavior in different societies; 
and, as we have suggested above, these have usually
been translated into terms of the norms of
Euro-American society. Many anthropologists have
supplemented interview data and informants’ statements
with case material, genealogical investigation,
sample surveys, and statistical analyses. The
aim has been either to try to determine what the 
system “really is” or to discover the manner in 
which norms work out in practice. Pioneering work 
in this regard was done by Audrey Richards, John 
Barnes, and Meyer Fortes. Fortes has always 
stressed the importance of regarding the family as 
a process in time and of looking at the cyclical de- 
velopment of family structure. As the individuals 
who compose family groups grow older and as new 
members are born and others die, the structure of 
the group changes as well as the relationships be- 
tween the individuals concerned. New possibilities 
for this type of analysis are presented by the devel- 
opment of computers. With the aid of these ma- 
chines it is possible to simulate the experience of 
a whole community of families over a long time 
period by constructing models of family structure 
or household composition and then applying to 
them such variable elements as birth and death 
rates, age at marriage, duration of unions, and fer- 
tility patterns. In this way it is possible to deter- 
mine what varieties of family and household types 
would be produced under varying circumstances 
over varying time periods. 
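
The kind of simulation described above can be illustrated with a
minimal sketch. It is not drawn from any of the studies cited; every
name, rate, and rule in it (simulate, classify, p_stay, the flat death
rate, marriage only between never-married persons of different
households) is an assumption chosen for brevity. It shows only how
demographic rates, applied year by year to a model population, yield a
distribution of household types.

```python
import random
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass(eq=False)  # identity comparison; avoids recursing through spouse links
class Person:
    age: int
    female: bool
    household: int
    spouse: Optional["Person"] = None
    ever_married: bool = False


def classify(members):
    """Crude household label based on the number of intact couples."""
    couples = sum(1 for p in members if p.female and p.spouse is not None)
    if couples >= 2:
        return "multi-couple"  # roughly, a joint family
    if couples == 1:
        return "couple only" if len(members) == 2 else "single couple with others"
    return "no couple"


def simulate(years=60, founders=30, seed=0, death_rate=0.02,
             fertility=0.25, marriage_age=20, p_stay=0.5):
    """Return a Counter of household types after `years` (all rates hypothetical)."""
    rng = random.Random(seed)
    people, next_household = [], 0
    for _ in range(founders):  # start from founding couples, one per household
        husband = Person(25, False, next_household, ever_married=True)
        wife = Person(22, True, next_household, spouse=husband, ever_married=True)
        husband.spouse = wife
        people += [husband, wife]
        next_household += 1

    for _ in range(years):
        # Ageing and deaths (flat yearly risk, certain death after age 90).
        survivors = []
        for p in people:
            p.age += 1
            if p.age > 90 or rng.random() < death_rate:
                if p.spouse is not None:
                    p.spouse.spouse = None  # the survivor is widowed
            else:
                survivors.append(p)
        people = survivors

        # First marriages only, and only across households (assumed rule).
        single = [p for p in people if not p.ever_married and p.age >= marriage_age]
        grooms = [p for p in single if not p.female]
        brides = [p for p in single if p.female]
        rng.shuffle(brides)
        for groom in grooms:
            bride = next((b for b in brides if b.household != groom.household), None)
            if bride is None:
                continue
            brides.remove(bride)
            groom.spouse, bride.spouse = bride, groom
            groom.ever_married = bride.ever_married = True
            if rng.random() < p_stay:
                bride.household = groom.household  # couple joins groom's household
            else:
                groom.household = bride.household = next_household
                next_household += 1

        # Births to married women under 45; the child joins the mother's household.
        people += [Person(0, rng.random() < 0.5, m.household)
                   for m in people
                   if m.female and m.spouse is not None and m.age < 45
                   and rng.random() < fertility]

    households = {}
    for p in people:
        households.setdefault(p.household, []).append(p)
    return Counter(classify(members) for members in households.values())
```

Calling simulate(p_stay=0.0) and simulate(p_stay=0.8) with the same
seed gives a rough sense of how a single residence rule changes the
mix of household types; with p_stay=0.0 every new couple founds its
own household, so no multi-couple households can arise in this model.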

A major controversy that has developed within 
anthropology has considerable bearing on the whole 
subject of comparative family studies. This contro- 
versy starts with Lévi-Strauss’ idea that the building 
block of kinship systems is not the biological or 
nuclear family but a constellation of roles which 
includes the siblings of a marrying couple as well 
as their children. In disagreeing with Radcliffe-Brown’s
contention that it is the relations between
members of the “elementary family” that constitute
the unit of structure from which all kinship sys- 
tems are built, he says: 


Of course, the biological family is ubiquitous in human 
society. But what confers upon kinship its socio-cul- 
tural character is not what it retains from nature, but, 
rather, the essential way in which it diverges from 
nature. A kinship system does not consist in the ob- 
jective ties of descent or consanguinity between indi- 
viduals. . . . The essence of human kinship is to require 
the establishment of relations among what Radcliffe- 
Brown calls “elementary families.” Thus, it is not the 
families (isolated terms) which are truly “elementary,” 
but, rather, the relations between those terms. (Lévi- 
Strauss [1958] 1963, pp. 50-51) 


From this starting point Lévi-Strauss proceeds to 
a complex analysis of preferential and prescriptive 
marriage systems, but the perspective which he 
introduced leads to a new assessment of intrafa- 
milial relations. 

Briefly, the question is whether the relations of 
nuclear family and of filiation always constitute a 
basic “natural” system which tends to counterbal- 
ance particular kinship ties stressed for jural pur- 
poses, or whether we should regard nuclear-family 
relations as being nothing more than a fortuitous 
complex of highly variable relations some of which 
may be dispensed with altogether. For example, if 
filiation is always bilateral (because every child 
must have a “father” and a “mother”), then where 
patrilineal descent is stressed for jural purposes 
the strong jural tie between father and child will 
be balanced by the tie between mother and child 
and, by extension, between the child and its 
mother’s kin—a tie of complementary filiation. 
Conversely, it could be argued that where matri- 
lineal descent is stressed, a compensating bond will 
exist between father and children and between the 
child and the father’s kin. The point to be noted in 
this argument is that the ties between parents and 
children may be accorded great legal significance 
or may be simply affective, but they derive ulti- 
mately from the bonds of filiation and descent 
which arise within the nuclear-family relationship 
complex. These relations are socially rather than
biologically defined, but they focus upon activities 
which are familial. 

The contrary view is that relationships between 
a child and its parents may derive from the fact 
that the parents are married to each other rather 
than from any bond of filiation. Thus, the Trobri- 
and Islanders assert that there is no genetic link 
between fathers and their children. A man is a 
“father” because he is the husband of the child’s 
mother. Of course this ideological view of the matter
seems to be related to the political and legal
sphere rather than to the field of domestic relations, 
where fatherhood may be rooted in a set of activ- 
ities rather than a set of abstract principles. 

It is not necessary to document the details of this 
controversy here; adequate references will be found 
in Leach (1961) and Schneider (see Association 
of Social Anthropologists . . . 1966). The point of 
interest here is Leach’s contention that relations 
between children and parents may be as variable 
as any other social relations and that we should be 
prepared to find societies in which even the mother-child
relationship is conceived of as one wholly derivative
from the fact of the mother’s marriage to
the father, who is thought of as the person with
whom the child has a real consanguineal relationship.
In such a situation the mother is an affine;
she is not “mother” necessarily at all, but “father’s 
wife.” 

The controversy outlined here does not focus 
upon the internal relations or the structure of the 
family as such; it is concerned with an understand- 
ing of the structure of total social systems. But it 
does bring into quite sharp focus the need to aban- 
don a dogmatic view of the universality of nuclear- 
family relations and the need to differentiate clearly 
between the activities involved in domestic rela- 
tions, the facts of biological interconnectedness, 
and the interlocking of familial and wider kinship 
relations. It seems likely that future research will 
operate with a bigger collection of more differentiated
concepts derived from a splitting up of such
notions as descent, filiation, sibling bonds, alliance, 
and, of course, the general notion of nuclear family. 

Throughout the discussion so far it has been as- 
sumed that comparative family studies refer to 
comparisons between different societies and that 
“structure” means societal norms. Some of the most 
interesting modern research is concerned with the 
study of differences in family structure within societies,
studies which throw light upon the nature
of society itself. It has long been recognized that 
the structure of family relations appears to vary 
with class, ethnic group, and other factors. This 
raises the question of whether the variation lies 
at the level of culture (that is, in the norms themselves)
or whether the actual distribution of empirically
ordered “types” of family is the result of constraints
operating to prevent the full realization of
cultural ideals. For example, it is generally accepted 
that Hindus value a joint family system in which 
marriages are contracted at an early age and are 
indissoluble and in which brides are brought to 
live in their husband’s father’s house, where they
come under the authority of the senior woman and
the men form a corporate group. In fact, it would 
seem that such ideal patterns are met only by the 
wealthier families, where a common interest in 
property serves as a binding factor, while the poorer 
families tend to be small and to consist of only a 
couple with their own minor children. It has also 
been suggested that the size of households would 
be limited under preindustrial conditions every- 
where by the early death of parents and high infant 
mortality rates. Such arguments ignore the fact 
that purely demographic constraints can be circum- 
vented, since family and kinship relations are not 
biological relations and since adoption or other 
forms of culturally defined kinship ties can be used 
as the basis of household formation. 

Important though adoption and the collapsing 
of relationships may be as a means of keeping func- 
tioning groups intact, it must be realized that the 
exigencies of birth-and-death rates and other demo- 
graphic variables do have an effect upon house- 
hold composition, just as the biological processes of 
maturation have an effect. Sociologically, a more 
important consideration is the extent to which 
norms vary from group to group and result in the 
precipitation of different configurations of family 
organization. In the United States there has been 
discussion of “the culture of poverty” in relation to 
the incidence of fatherless families among lower-status
and lower-income groups. In such cases it is
difficult to determine whether the pattern of fam- 
ily living among these lower-income groups is de- 
termined by a simple inability to command the 
means of achieving valued patterns of behavior or 
whether different values are operative. In such 
cases the study of family life becomes a very sen- 
sitive index to the whole mode of integration of 
the over-all social system. This question is discussed 
further below. 

Although knowledge of the genesis of emotional 
disturbance and social deviance is much more ex- 
tensive than it was 25 years ago, it is still inade- 
quate. The complexity of the relationship system 
of family groups is understood in only the most 
general terms despite the considerable body of detailed
information pertaining to Euro-American
families. The profound hatreds and bitter rivalries 
which are the counterpart of the relations of fa- 
milial solidarity are well known to anthropologists, 
both as the source of conflicts with political and 
economic significance and as the basis of witch- 
craft beliefs and magical and religious practices. 
These conflicts are the product of the interaction 
systems themselves within small “multibonded”
groups and may have considerable significance for
wider system organization. It has often been re- 
marked that the family is the unit within which 
primary socialization takes place and within which 
the growing child is taught the ultimate value ori- 
entations of the whole society. This is not necessar- 
ily done formally, but the values may be implicit 
in the general structure of the familial relationship 
system itself. If this is so, then it is important to 
know how these relationships are structured in dif- 
ferent societies and just how the values are inter- 
nalized by the growing child. 


The range of variability 


In the large Ashanti villages of central Ghana 
one can often see children carrying pots of food 
from their mother’s house to the house of their 
father. The children may eat with their father and 
then go home to sleep at the house where their 
mother lives, while the mother may visit the father 
for the night. This pattern of divided residence and 
visiting back and forth seems to exist because the 
Ashanti traditional social system stresses matrilin- 
eal descent for important social purposes, such as 
inheritance of land, succession to office, and polit- 
ical status. Women often value the tie to their 
brother as highly as, or higher than, the relation- 
ship to their husband, since it is from the brother 
that their children will inherit. Because the child’s 
place in society is mainly determined through its 
relationship to its mother and her matrilineal kin, 
the breaking of a marital tie is of little legal con- 
sequence either for the spouses or for the children. 
It is true that fathers have always had responsibility 
for the upbringing of their own children, including 
the provision of food and the teaching of crafts 
and skills, but these duties could be carried out 
even when the child did not reside with the father 
and even after the parents had broken their con- 
jugal tie. According to Fortes (1949) another fac- 
tor inducing women to remain in their own homes 
even after marriage is the very close relationship 
between mother and children, and particularly be- 
tween mothers and daughters. This case demon- 
strates the way in which interests and relationships 
can cut right across the solidarity of the nuclear 
family, and it would be very difficult to demon- 
strate that the nuclear family is either a normal or 
a necessary coresidential unit among the Ashanti. 
The Ashanti concept of family is quite different 
from the European concept; Fortes mentions that 
persons who speak English often say, “Your mother 
is your family, your father is not.” 

The most extreme and best known example of
the effect of matrilineal descent upon familial relationships
was to be found among the Nayar caste
of southern India in the period before the full ef- 
fects of British rule were felt. Among the Nayar, 
it appears that the marital relationship was re- 
duced to a merely symbolic level, being contracted 
at around the time that the girl reached puberty 
and shortly thereafter being ritually broken. After- 
ward, the women were allowed to have informal 
love affairs with men who visited them at night. 
The households consisted of a group of brothers 
and their sisters and the sisters’ children, and any 
children born to the women of the household became
members of the matrilineal joint family. The
children were discouraged from developing any 
strong attachments to either their pater or genitor, 
and while it would be impossible to argue that the 
role of father was completely absent from this so- 
cial system, it was obviously diminished in favor 
of the solidarity of the property-owning matrilineal 
group. It seems likely that many of the functions 
that are normally fulfilled by fathers in relation to 
socialization and personality development were 
carried out by male members of the matrilineage 
(Gough 1952). 

Cases such as these have led some writers, in- 
cluding Radcliffe-Brown, to suggest that the basic 
structural unit of kinship systems is the unit of 
mother and children, in other words, to separate 
the nuclear family into a number of paired relation- 
ships, or dyads, and to see how, in differing socie- 
ties, they fit in relation to each other and to other 
groups. This is certainly a step in the right direc- 
tion toward loosening the categories involved in 
the nuclear-family complex and exploring more 
fully their independent connections. Even in socie- 
ties with pronounced patriarchal authority we find 
that the mother-children unit often forms an im- 
portant subgroup, given special recognition and 
marked by close emotional ties among its members. 

Among the Swazi as described by Hilda Kuper 
(see Radcliffe-Brown & Forde 1950), descent, in- 
heritance of economically valuable possessions, and 
succession to office are patrilineal, but women are
given a position of great prominence in the organ- 
ization of domestic life. Homesteads are built in 
the form of a semicircular group of huts surround- 
ing a cattle pen, each hut being occupied by a 
mother and her children. In the center of the arc 
is the “great hut” occupied by the mother of the 
male head of the compound (or one of his senior 
wives if his mother is dead). When a young man 
or a group of full brothers sets up a new homestead, 
their mother is installed in the place of honor, for
it was she who reared them, watched over their
rights, guarded their inheritance, fed and nurtured 
them. The father is respected as the head of the 
group, revered, and eventually worshiped as an an- 
cestor, but the mother is loved and cherished. This 
pattern is common in patriarchal societies where 
the father is an authoritarian figure and the holder 
of property that the heir requires in order to attain 
full maturity—thus introducing an underlying ten- 
sion into the formally respectful relationship. Par- 
sons has suggested that the father role is always 
an “instrumental” one as opposed to the more emo- 
tional or “expressive” quality of the mother role 
(Parsons & Bales 1955, pp. 157-158). In a pene- 
trating study comparing two adjacent groups in 
west Africa which share a basically similar culture, 
Goody has shown that in one of them the matrilin- 
eal inheritance of certain types of property alters 
the interpersonal relationships between close kin 
and even alters the mode of ancestor worship 
(1962). Since intrafamilial relationship patterns 
seem to be so responsive to the effect of outside 
influences, it is premature to assert too many con- 
clusions about the structure of the nuclear family 
as such. The specialization of males and females 
in instrumental and expressive roles may be an 
aspect of sex-role differentiation in this case rather 
than of role specialization within a small group 
structure. Certainly women take on instrumental 
roles in the absence of men, but they retain the 
predominantly expressive quality appropriate to 
females. 

In tribal societies where kinship is the basis of 
recruitment to all or most important social roles, 
the family must articulate directly with descent 
groups and becomes a mechanism for the continu- 
ous generation of new kinship ties. As Fortes says, 
“the workshop, so to speak, of social reproduction, 
is the domestic group” (Goody 1958, p. 2). In more 
differentiated social systems with an increased di- 
vision of labor, a literary tradition, and a well-de- 
veloped class or caste system, it is still true that 
families may be large, multifunctional units. In 
China and India, for example, the whole weight of 
tradition, values, and religious sentiment favored 
the growth of large patrilineal joint families and 
patriarchal authority. The uneven distribution of 
wealth and status seems to affect the realization of 
these ideals, so that the majority of households are 
small and contain only parents and children. This 
is not simply a matter of demographic determin- 
ism; the absence of a sufficient property base leads 
to the early secession of mature sons, who try to 
make out for themselves. Large joint families are
found mainly among landowning and merchant
groups, where sons have a continuing material in- 
terest in the patrimony to reinforce and sustain 
filial piety. Such large “families” are really lineages 
or descent groups which constitute perpetual cor- 
porations managing a common enterprise. 

Le Play thought of the famille-souche, or stem- 
family, as preserving familial continuity while 
adapting to the demands of industrialism. While 
many of the younger members of the family mi- 
grated or went off to work in industrial centers, the 
main family, or stem, continued and was usually 
located on its own land or in its own house. A regular 
mode of inheritance designed to prevent fragmenta- 
tion was imposed to ensure both the continuity and 
the integrity of the familial estate. The existence of 
such a system in twentieth-century rural Ireland 
has been described by Arensberg and Kimball 
(1940), and its presence in parts of the United 
States has been depicted (Zimmerman & Frampton 
1935). On the small farms of rural Ireland it is 
customary for one son to inherit the family farm, 
while the other children marry out or migrate to 
the towns. The heir usually postpones his marriage 
until his father is nearly ready to hand over con- 
trol, thus ensuring that the farm will not have to 
support more persons than it can carry and ensur- 
ing the continuity of one male line on an undivided 
farm. The stem-family is not a product of industrial 
society. Recent work by Laslett indicates that the 
preindustrial family system in England did not en- 
courage the growth of large extended families oc- 
cupying single dwellings; on the contrary, the nu- 
clear-family household seems to have been the 
normal type of dwelling group, and when large 
household groups are found, they appear to have 
been made up by servants or craft laborers rather 
than by kin. There was also a continuous process 
of downward mobility in which the children of 
wealthy or high-status families were sloughed off 
into lower positions, while the main heir took over 
an intact patrimony (Laslett 1965). 

The urban family system of modern Europe and 
America developed out of the system found in 
agrarian society (or “the world we have lost,” as 
Laslett terms it). It has been a constant preoccupa- 
tion of writers on the modern family to think of it 
as being somehow stripped of functions and re- 
duced to a pale shadow of its former ample self. 
The ideal is that of a nuclear-family group living 
in its own house independently of other kin and 
subsisting upon the wages or salary of the husband-father.
Falling birth rates, an increase in the incidence
of divorce, a decline in home food processing,
clothes making, and so on: all these have been
held to signify a decline in family living and a shift 
toward increasing individualism and material val- 
ues. Parsons and others have argued that the family 
system of the urban United States is not a denuded 
form of a more “normal” or “natural” family system 
but is itself a highly specialized form that articu- 
lates most satisfactorily with a highly differenti- 
ated economic and political system and with insti- 
tutionalized values that stress achievement rather 
than inheritance. It is argued that the smallness 
and relative isolation of the family from other kin- 
ship ties is an adaptation that makes possible the 
spatial and status mobility of its members. The unit 
of mother and children remains the basic affective 
group within which there is close emotional iden- 
tification, and the husband-father is closely inte- 
grated with it both emotionally and in terms of his 
status-conferring and economic-support functions. 
It is upon his position in the occupational system 
that the status and the style of life of the whole 
family group depend. This functional interpretation
of the importance of the nuclear family in industrial
society may be considerably weakened by
the work of Laslett, referred to above. If the nuclear 
family has been the major form of European family
structure for many centuries, predating the indus- 
trial revolution, and if this family form was the 
product of social values rather than of demographic 
or other constraints, then the modern family forms 
may be simply the continuation of a culturally pre- 
ferred form. 

Studies of the structure of kinship and family 
relations in modern industrial societies are rare in 
spite of the considerable literature on special prob- 
lems of family living. Recent work in Britain has 
shown that lower-class families often live in close 
proximity to near kin and maintain intimate con- 
tact with them despite the “isolation” of the nuclear 
family (Young & Willmott 1957). The cooperation 
between mother and married daughters is partic- 
ularly noticeable, and most writers have remarked 
upon the great importance of women in their role 
as mothers of grown-up children. Fathers are im- 
portant, of course, as wage earners and supporters 
of their families, but in the lower class the status 
of the family tends to be fixed less by the occupa- 
tion of the husband-father than by birth into a par- 
ticular neighborhood and class. The network of re- 
lations between female kin is reinforced by daily 
contact, mutual help, and support in times of eco- 
nomic hardship, and is balanced by a tendency for 
men to spend their leisure time together rather 
than with the members of their own nuclear family.
A more extreme form of matrifocal family structure
occurs in low-status ethnic groups where there
appears to be some correlation with insecurity in 
male employment. In the plantation areas of the 
southern United States, the West Indies, and Latin 
America and in the urban areas of developing so- 
cieties in Africa and Asia, one often finds high ille- 
gitimacy rates and unstable mating patterns. An 
array of types of domestic group is produced, the 
most distinctive being those in which a mother is 
surrounded by adult children and some of her 
daughter’s children. Money income flows into such
a household from a variety of sources, such as children’s
earnings, payment to daughters for the support
of illegitimate children, the older woman’s own
labor, and perhaps government relief funds. This 
income is reallocated around the essential tasks of 
child rearing and nurturance, and it appears that 
a household group of this kind is a useful adapta- 
tion to conditions of economic insecurity. This 
form of domestic group appears to be always re- 
garded as a deviant form, partly because of the 
instability of male roles in relation to it. Unlike the 
matrilineal situation, men do not have important 
legal ties to their sisters; it is the mother who is the 
focus of the familial solidarity, and her position 
derives from affective rather than jural relations. 
It would be wrong to suggest that this is the typical 
household or familial form in the societies in which 
it occurs. Even at the class levels where it is most 
frequently found it constitutes only a small proportion
of actual household groups, but its significance
is as an expression of the strength of mother-child
relations and the relative weakness of marital
ties. 


This brief sketch has outlined only some of the 
major features of family structure. Variations 
within single societies have hardly been touched 
upon, and nothing has been said about pathological 
forms. Nor has attention been directed toward the 
attempts which have been made from time to time 
to do away with family solidarity in favor of alle- 
giance to other groups, although such attempts 
provide valuable experimental data. The Israeli co- 
operative communities are probably the best recent 
examples of this, although no definite conclusions 
about the necessity or otherwise of nuclear-family 
relationships can be drawn (Spiro 1958). What 
these cases do perhaps indicate, along with some 
of the matrilineal and matrifocal cases, is that 
marriage is not the only means of securing social 
“legitimacy” for children. Spiro shows that children
brought up in central nurseries and schools
are in a sense children of the whole community,
in spite of the specific relationships they may de- 
velop with their “own” parents. This is possible 
because of the relatively small size of the com- 
munities. 

However the family may be defined or struc- 
tured, it always constitutes an area of diffuse and 
permanent solidarity between a limited number of 
individuals, and this is probably its most impor- 
tant distinguishing characteristic. This is a view 
at least suggested by W. H. R. Rivers in his Social
Organization when he deliberately identifies the 
family of parents and children as the basic unit 
in European society but points out that in other 
societies the family may be much bigger, that bio- 
logical connection is no guide to kinship, and that 
genealogies express kinship ties but may be fabri- 
cated to fit social reality. Rivers is very wary about 
drawing any precise line between family relation- 
ships and other kinship ties. Such lines can be 
drawn in particular cases but are very difficult to 
define a priori. 

RAYMOND T. SMITH 


[Directly related are the entries KINSHIP and MARRIAGE;
see also CONFLICT, article on ANTHROPOLOGICAL
ASPECTS.]

II
DISORGANIZATION AND DISSOLUTION 


Aside from some slight semantic disagreements, 
most social scientists agree that some form of a 
familial group is universal among human beings. 
Likewise in every society some of the familial 
groups are imperfect in their functioning and de- 
serving of the label “disorganized.” Generally there 
is some relationship between disorganization and 
a more climactic, terminal situation called “dissolu- 
tion.” Family groups are eventually dissolved by 
death or some form of separation, insofar as these 
groups are organized around the matings of men 
and women. Of course, a family process, in the 
sense of overlapping and interacting generations, 
is immortal, providing there is reproductive re- 
placement before the familial group of a particular 
generation disintegrates (Kirkpatrick [1955] 1963, 
pp. 192-195). 


Nature and significance 


The significance of familial disorganization and 
dissolution depends upon the institutional arrange- 
ments for family life in a particular culture. If 
stress is laid on the nuclear family, then kinfolk 
are less available to aid members of the smaller 
familial group. Some families are vulnerable in 
the sense that their members expect permanence, 
have made deep emotional commitments, and do 
not readily redirect affection and loyalty. On the 
other hand, people may be less hurt by familial 
instability in cultures where instability is an estab- 
lished norm and where kinfolk are helpful and 
meaningful. 

In most cultures, the familial group helps pre- 
pare children both for membership in a society 
and for effective functioning as parents. Disorganization
and dissolution are significant in that
they tend to interfere with this socialization process.
Furthermore, disorganization and dissolution
are often cumulative and recurrent in the family 
process. There is a convergence of evidence in 
the United States that people with happy marriages 
tend to come from happy homes. Divorce also runs 
in families; divorced persons are divorce-prone, 
and children suffering from bereavement or pa- 
rental divorce may be involved with delinquency, il- 
legitimacy, and unwise marriage. The consequences 
of disorganization and dissolution experienced by 
the parents are commonly visited upon the off- 
spring, perhaps for several generations. Some 
children, however, are warned by the fate of their 
parents and in marriage make an effort to reverse 
a trend toward disorganization. 

Terminology. It is impossible to make a useful 
analysis of the nature and significance of family 
disorganization and dissolution without precise 
terminology. In current usage “disorganization” 
seems to imply maladjustment, malfunctioning, 
psychological decay, and the existence of family 
problems. Goode defines family disorganization as 
“the breakup of a family unit, the dissolution or 
fracture of a structure of social roles when one or 
more members fail to perform adequately their 
role obligation.” He lists five types of family disorganization,
namely, illegitimacy; dissolution,
whether as a result of divorce, annulment, separation,
or desertion; “empty shell family”; unwilled
absence of one spouse; and “unwilled” major role 
failures (Goode 1964, pp. 91-92). 

Clearly, such a taxonomy makes no distinction between
disorganization and dissolution. It seems
from the inclusion of illegitimacy that a family can 
be “disorganized” by role failure even before it is 
organized. Caplow defines an organization as “a 
social system that has an unequivocal collective 
identity, an exact roster of members, a program of 
activity, and procedures for replacing members” 
(1964, p. 1). Following this line of thought, it 
would seem more proper to regard the “illegitimate 
family” as unorganized rather than as a type of 
family disorganization. 

Distinction between disorganization and dissolu- 
tion. In the present discussion a distinction will 
be made between family disorganization and family 
dissolution, and marital disorganization will be re- 
garded as a special case of family disorganization. 
Family dissolution means disintegration, disrup- 
tion, or chronic instability; it is characterized by 
“dependence, agency intrusion, member extraction, 
premature atypical departure and family violence” 
(Kirkpatrick [1955] 1963, p. 565). 

Marital dissolution, as a special form of family
dissolution, may take the form of suicide, murder
of spouse, annulment, separation (ranging from 
psychological withdrawal to legal separation), and, 
finally, divorce. The withdrawal from spouse may 
be accompanied by substitution for wife of a co- 
wife, concubine, mistress, or prostitute. Sometimes 
dissolution is by stages, for example, in a sequence 
of avoidance, suit for separate maintenance, legal 
separation, an interlocutory divorce decree, and 
finally an absolute divorce with the right of re- 
marriage. 

Family dissolution is an imperfect index of 
family disorganization. Divorce is an imperfect 
index of marital disorganization because there may 
be disorganization without divorce. For example, 
divorce was very difficult to obtain in England 
prior to liberalization of the laws in 1857 and again 
in 1937. When the divorce door is closed, marital 
disorganization may be inconspicuously endured. 
On the other hand, a couple with high expectations 
of marital bliss but relatively little marital disorganization
may seek divorce as an exit from marriage.
That marital disorganization is often not extreme 
is shown by the frequency of reconciliation, drop- 
ping of divorce suits, and remarriage to the former 
spouse. Thus, Harvey J. Locke (1951, pp. 54-55) 
found that about a fourth of the divorced couples 
he studied were in the same marital adjustment 
category as the less happy fourth of the married 
couples.

Religious and moralistic viewpoints. Family 
disorganization and dissolution are viewed by 
many people in moralistic and religious terms. The 
family at one time performed functions so im- 
portant that disharmony and disruption were 
viewed with alarm. One assumption of the patri- 
archal system in western European culture was 
that paternal authority could insure efficiency and 
stability in family life. It is probably still the assumption
of middle-class people in the United
States that “nice” people get on together and as 
spouses stay together.

The Roman Catholic viewpoint concerning 
family disorganization and dissolution has re- 
mained rather clear and constant since the Council 
of Trent, 1545-1563. The ideal is patriarchal har- 
mony established by a marriage sacrament directed 
toward the goals of parenthood and marital love. 
Catholic doctrine recognizes various impediments
to a valid marriage, such as disparity of religion, lack of
consent, or intention to seek a divorce. Given the
impediment of disparity of religion, the marriage 
of a Catholic to a non-Catholic, especially one un- 
baptized, can be nullified, thus permitting the 
Catholic party to obtain a civil divorce and marry 
a Catholic partner “in favor of the Faith.” Even
a marriage between Catholics may be dissolved by 
papal dispensation if the marriage is unconsum- 
mated and grounds exist for the marital disruption. 
In addition to nullification and dissolution by dis- 
pensation, the church grants separation from bed 
and board, which does not include the right of re- 
marriage. The consummated and valid marriage 
between two Catholics is indissoluble and permits 
no remarriage unless terminated by death. Certain 
Protestant denominations hold equally religious 
and moralistic views (Kirkpatrick [1955] 1963, pp. 
573-575). 

The legalistic viewpoint. The legalistic view, 
which is prevalent in the United States, had its 
origins in canon law and was influenced by the 
common law of England. It is complicated by the 
lack of uniformity among the various statutes and 
decisions of the 50 states; the practice of seeking 
divorce in states other than the state of residence 
has brought controversial cases to the U.S. Su- 
preme Court (Harper & Skolnick [1952] 1962, pp. 
451-458). 

Basic to the legalistic doctrine of divorce in the 
United States is adversary procedure. It is assumed 
that a relatively innocent party may be granted a 
divorce because of the relative guilt of the other 
party with reference to “grounds” specified in the 
statutes of the various states. There are four legal
doctrines associated with the basic adversary pro- 
cedure. If spouses cooperate in providing evidence 
calculated to obtain a divorce on certain grounds, 
that is collusion, and the divorce should be denied. 
If one party forgives an offense such as adultery 
and continues to live with the spouse, condonation 
is assumed, and the derivative inference is made 
that the damage to the innocent party was not 
great enough to justify a divorce. If one party 
shows that the other party is equally guilty, suc- 
cessful recrimination is established and the divorce 
is not to be granted. If one party plots the guilty 
behavior of the other party, that is connivance, 
and the divorce would be denied to the allegedly 
innocent party. Thus the legal operation of divorce 
is withheld from marriages so sick as to menace 
the well-being of both adults and children in the 
familial group (Kirkpatrick [1955] 1963, p. 595). 

Social pathology. Disorganization and dissolu- 
tion are subject to a third viewpoint, that of social 
pathology. This viewpoint assumes that well-func- 
tioning, organized families are “normal”; malad- 
justment and disruptions are regarded as deviant 
behavior. The families with problems should be 
brought back into line by monetary aid, psycholog- 
ical manipulation, and a therapy designed to help 
people achieve conformity to the social norms.
With little reflection upon alternative frames of 
reference, there is concern for the cure of patho- 
logical situations. 

Sociological viewpoint. A fourth viewpoint is 
that of scientific sociology. In the choice of this 
label there is no intended implication that sociology 
is a distinct, mature, and sophisticated science. 
Perhaps at the present time it is merely an attempt 
at clear, honest, abstract thinking about social 
phenomena, increasingly aided by systematic ob- 
servation and analysis. From such an intellectual 
orientation, which is relatively free from concealed 
value assumptions and distorting preconceptions, 
family dissolution is sometimes seen as a relief of 
family disorganization. For certain infections, am- 
putation is the choice of a lesser evil as compared 
with further spread of infection. So it may be with 
divorce. 

The sociological viewpoint is generally deter- 
ministic and assumes that social changes make 
inevitable certain reactions in terms of disorganiza- 
tion and reorganization. According to this view, 
familial institutions, groups, and relationships are 
part of a larger social structure in which every 
aspect of the total configuration is related to every 
other aspect. Family disorganization may be one 
facet of a larger social disorganization. 


Methods and trends in research 


Family crisis. Research concerning crises of the 
family has been extensive. The great depression in 
the United States brought forth books on unem- 
ployment as a crisis for the family, and World 
War I brought an interest in bereavement and
other trials visited upon the family by wartime
conditions. More recently scholars have speculated
as to the devastating influence that an atomic war 
would have on the family. 

The confusion of terminology in regard to dis- 
organization and dissolution is compounded by the 
concept of family crisis. Death, like divorce, can 
cause the dissolution of family units, and often 
bereavement is placed in the category of family 
crisis. Hill writes of the event (A) which interacts 
with the ability of the family to meet a crisis (B); 
both (A) and (B) interact with the definition which 
the family makes of the event (C), and thus the 
situation is an (X), the crisis for a particular
family. One family may take in its stride an im- 
pact which would crush another family that is 
lacking in resources and prone to regard a trifle 
as a crisis (Waller [1938] 1951, p. 460). 

Endogenous and exogenous disorganization. 
Kirkpatrick drew a distinction between endogenous
and exogenous disorganization, the former
having an internal origin and the latter a relatively 
external origin. Distinctions were also made be- 
tween the normal and atypical, the timely and the 
premature, and the objectively severe as compared 
with the objectively trivial. It was suggested that 
the term “crisis” be restricted to exogenous disor- 
ganization due to an objectively severe atypical or 
premature event (Kirkpatrick 1955, p. 503). Insofar 
as family dissolution results from exogenous dis- 
organization, as defined above, it would seem 
proper to speak of exogenous family dissolution; 
an example would be the wiping out of a family 
by flood or fire. 

Bereavement. The assassination of President 
Kennedy in 1963 was a conspicuous atypical, ex- 
treme experience which brought premature be- 
reavement. The disorganization in the Kennedy 
family, in the sense of shock and sorrow, was 
exogenous disorganization; and the dissolution, in 
the sense of the loss of the head of the family, 
could be regarded as exogenous dissolution. Per- 
haps all bereavement, save that due to violence 
within the family group, could be regarded as 
exogenous, and if premature or atypical, it would 
be a family crisis. An expected death from natural 
causes of old age would be less clearly a crisis 
by the criteria which have been set forth, although 
some degree of disorganization and dissolution 
might result. There is no claim of exclusive cate- 
gories in the suggested terminology, and the dis- 
tinctions are relative. 

It is important to make a distinction between 
father-deceased bereavement and mother-deceased 
bereavement in view of their differential implica- 
tions for the family in terms of economic conse- 
quences and implications for the offspring in terms 
of sex typing, identification, and preparatory af- 
fectional relationships with parents of the opposite 
sex. Many aspects of bereavement experience have 
been probed by Howard Becker, Willard Waller, 
Thomas D. Eliot, David M. Fulcomer, and others. 
Perhaps more attention should be paid to the loss 
of common memories. If a beloved person is no 
longer available to confirm a happening, there may 
be doubt about the reality of prior experience and 
uncertainty about personal identity. If the identity 
of A is essentially the identity of A in relation to 
B, the loss of B drastically alters the subjective 
identity of A. Improved knowledge of familial dis- 
organization and dissolution calls not only for 
greater precision of terminology but also for fur- 
ther exploration of bereavement as a family crisis 
that may lead to subtle forms of exogenous disorganization.

Demographic investigation. Investigators have
explored the distribution of familial disorganization 
and dissolution over space, by social categories, and 
in time. For example, relief cases of a certain type 
may be more prevalent in certain urban areas and 
within social categories such as “nonwhites.” Fur- 
thermore, there may be fluctuations in the case 
load of social welfare agencies by time periods. 
Unemployment associated with the business cycle 
has generally meant an increase in the number of 
families dependent upon external financial as- 
sistance. 

The most extensive investigation has been that 
of divorce regarded as a measure of familial dis- 
solution and, more specifically, of marital dissolu- 
tion. Numerous studies have noted variations in 
divorce rates by geographical region, sometimes 
considered in combination with various social cate- 
gories. Paul Glick has studied “marriage instability” 
as measured by separation ratios, divorce ratios, 
and proportion of children not living with both 
parents. His hypothesis that marital disruption in 
the United States tends to be more closely related 
to urbanization than to regions of residence was 
only partially supported (1963, pp. 47-54). 

Various writers such as William J. Goode, 
Robert Winch, and Christine H. Hillman have 
stressed the higher divorce ratios among persons 
of lower socioeconomic status (Goode 1964, pp. 
88-90; Winch [1952] 1963, pp. 706-708). How- 
ever, this evidence contains some inconsistencies 
when men and women are considered separately. 
One finds a rather high incidence of divorce among 
college women as compared both with less edu- 
cated women and with men at the same college 
level (Kirkpatrick [1955] 1963, p. 589). It should 
be stressed that the proportion divorced in a par- 
ticular group or category of people is affected by 
remarriage and the rapidity of remarriage. If 
every person receiving a divorce decree remarried 
on the same day, there would be no residue of 
unmarried “divorced persons” in the population. 

Desertion, another mode of marital dissolution, 
may not be accurately reflected in the census cate- 
gories “separated,” “spouse absent,” and “divorced,” 
because of remarriage. During 1955 a Philadelphia 
court recorded 4,224 desertions handled, in con- 
trast to only 2,812 divorces (Kephart 1961, p. 
548). Desertion may be a substitute for divorce or 
preliminary to the legal action of divorce. 

The demographic approach stresses the chang- 
ing prevalence of divorce in various countries. The 
trend of the divorce rate after World War I was
upward in many European countries, such as 
France, England, and Sweden, as well as in the 
United States and Canada (Jacobson 1959, table
42, p. 90; table 47, p. 98; Kirkpatrick [1955] 1963,
p. 586). For the United States, rates are available 
from 1920 through 1960, expressed as divorces 
per year per one thousand married females 15 
years of age and over. A curve based on such rates 
shows a drop during the depression in the 1930s, 
a huge peak in 1946 following the war, a decline to 
a plateau, and a figure of 9.2 on that plateau in 
1960 as compared with 8.0 in 1920 (Carter & 
Plateris 1963). From a sociological point of view, 
this increase is far from spectacular. 
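
The comparison of the 1920 and 1960 rates can be made explicit with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and are not drawn from the sources cited, and only the two rates (8.0 and 9.2) come from the text. It computes a rate per one thousand married females 15 years of age and over, and the proportional change between two such rates, which for these figures is about 15 per cent over forty years.

```python
def divorce_rate(divorces: int, married_females_15_plus: int) -> float:
    """Divorces per 1,000 married females aged 15 and over in one year."""
    if married_females_15_plus <= 0:
        raise ValueError("population at risk must be positive")
    return 1000.0 * divorces / married_females_15_plus


def relative_change(old: float, new: float) -> float:
    """Proportional change from an earlier rate to a later rate."""
    return (new - old) / old


# Rates quoted in the text (Carter & Plateris 1963).
change_1920_to_1960 = relative_change(8.0, 9.2)
print(f"{change_1920_to_1960:.0%}")  # prints "15%"
```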

The clinical approach. For many decades in- 
vestigators have utilized methods less objective but 
perhaps more penetrating than those of the demog- 
raphers. The various theories of Freud concerning 
emotion-laden interaction in the family group were 
integrated by J. C. Flügel. Waller, in his brilliant
pioneer study of divorced couples (1930), drew 
upon psychoanalytic concepts in his analysis of 
alienation and postdivorce adjustment. He ex- 
pressed original intuitions concerning motivation, 
as, for example, the suspicion that some ex-wives 
seek alimony not out of greed but because of jeal- 
ous disinclination to see an ex-husband financially 
capable of marrying another woman. Some psy- 
chiatrists probably go to an extreme in regarding 
the seeking of divorce as an expression of neu- 
roticism. 

Historical and comparative studies. The older 
and well-known accounts of the family, such as 
those by Edward Westermarck, George Elliott 
Howard, and Arthur W. Calhoun, contain much 
historical and comparative information concerning 
disorganization and dissolution. A book by Lichten- 
berger, published in 1931, gives a thoughtful and 
comprehensive analysis of divorce. 

More recently, Murdock (1950) made a sys- 
tematic survey of divorce practices in 40 nonin- 
dustrial societies. In many of these societies he 
found divorce to be more frequent than in the 
United States, but he also found more preventives 
and functional adaptations. From this anthro- 
pological perspective, the American family does 
not seem to be in a state of hopeless decay. Fol- 
lowing Murdock’s lead, various anthropologists 
have investigated determinants of divorce rates in 
various cultures. Ackerman (1963), from a study 
of 62 societies, found certain forms of homogamy 
to be associated with low divorce rates. Christensen 
(1963) has given an elegant demonstration of the 
fact that the association between premarital preg- 
nancy and divorce varies with a culture’s sexual 
permissiveness. 

Measurement and prediction. Beginning in the 
1930s, researchers attempted to measure and pre- 
dict marital disorganization. The investigators often 
collected information concerning background factors
at the same time that information was requested
concerning marital adjustment. Some of
the correlations between the so-called prediction 
scales (based upon various items, for the most 
part premarital) and the adjustment scales prob- 
ably involved a “halo effect” (see Kirkpatrick [1955] 
1963, chapters 15, 18, and appendix). Well-ad- 
justed respondents tended to report favorably on 
both their marriages and their childhood back- 
grounds. The reverse was probably true for mal- 
adjusted couples. 

In the longitudinal or forecasting studies of such 
authors as Ernest W. Burgess and Paul Wallin, 
Truman L. Kelley and Louis M. Terman, informa- 
tion was first obtained on a sample of individuals, 
and then after a period of years further information
was gathered concerning their marital adjustment.
[See the biography of BURGESS.] In general,
correlations between scores on a prediction instru- 
ment and some categories of outcome tended to 
be lower when the halo effect was controlled by 
the passage of time. 

In connection with this body of research, a great 
deal of information was collected concerning fac- 
tors associated with “marital adjustment” or other 
criteria of success. The findings of many of these 
studies were brought together, with the revelation 
that there was rather a striking lack of convergence 
in evidence (Kirkpatrick 1947). Cautious verbal 
generalizations about the convergence of findings 
concerning factors related to marital adjustment 
and a guide to the sources from which these find- 
ings were derived are available (see Kirkpatrick 
[1955] 1963, chapter 15 and appendix A). 


Improvement of scientific knowledge 


More precise terminology, pertinent distinctions, 
and rigorous theory would all be helpful in further- 
ing scientific knowledge in the area under discus- 
sion. Since disorganization and dissolution are 
aspects of family behavior, any advance in knowl- 
edge of family systems in general would indirectly 
contribute to knowledge in this specific area. 

Theories of mating. Winch has systematically 
carried over his functional theory of the family 
to the area of family disorganization and dissolu- 
tion. If he is correct in his theory of complementary 
mating, a foundation would be laid for generaliza- 
tions concerning the kind of matings which would 
avoid family disorganization and dissolution 
([1952] 1963, pp. 567-655). 

Kirkpatrick derived a theory of mating from a 
basic postulate concerning the influence of prior 
satisfactions and dissatisfactions in the course of
family experience; this theory was called a theory
of “selective needs” based on prior satisfactions
and dissatisfactions. In some respects it is different 
from the Winch theory, and in other respects it is 
convergent and supplementary (Kirkpatrick [1955] 
1963, pp. 300-307). According to the Kirkpatrick 
theory, an extreme of marital disorganization and 
perhaps consequent familial disorganization would 
exist when the spouse violates “minus-plus needs”
by providing more of the particular kind of dis- 
satisfaction which was distressing during a prior 
phase of family experience. For example, the wife 
who fails to compensate for the dissatisfactions ex- 
perienced by her husband with his mother and 
aggravates these same dissatisfactions by behavior 
similar to that of the mother, risks marital dis- 
organization and possibly divorce. 

Criteria of scientific worth. Improvement of 
scientific knowledge in this as in other areas de- 
pends upon meeting five different criteria of scien- 
tific worth (Kirkpatrick 1939; 1959). One criterion 
is that of verifiability. In one sense verifiability im- 
plies repeatability of procedure, as in the case of 
experiments in the natural science field. There is 
some assurance that research is verifiable by others 
when it is presented as already verified by the in- 
vestigators in the sense of proper sampling, validity 
and reliability of instruments, and correctness of 
logical inference. It is striking that so many findings
concerning factors favorable to success in
marriage have been nonverifiable or unverified. 

A second criterion is transmissibility. A marriage 
counselor may achieve striking success and yet be 
unable to transmit his art or technique to a pupil. 
Often a description of divorce rates in a certain 
country cannot be transmitted to another scholar 
because symbols lack identical meanings for each 
person. 

A third criterion of scientific worth is consist- 
ency. The contribution having this quality is logi- 
cally sound, and the terms are carefully defined 
and consistently used with reference to their 
assigned meanings. Theory, hypotheses, research 
design, findings, and conclusion are interrelated. 
Any convergence or discrepancy in evidence is 
systematically presented. 

The fourth criterion is scope. Family disorgani- 
zation and disruption may be analyzed in a single 
brief case study, or a massive three-volume work 
may describe this aspect of family systems with 
global coverage. For example, Goode made an 
admirable study of divorce as affecting a sample 
of women in Detroit (1956) and in another book 
analyzed the family patterns as affected by indus- 
trialism in varied countries of the world (1963). 

A fifth and somewhat multiple criterion is that 
of implication. Research having the quality of implication
has one or more of four characteristics
—namely, prediction, practical application, meth- 
odological innovation, and creative integration. It 
would be useful to construct a questionnaire that 
would predict divorce with 100 per cent accuracy. 
A study would have implications when, if applied 
to family counseling, it prevented 90 per cent of 
the needless divorces. It would be a methodological 
contribution to integrate the various measures of 
marital dissolution into an improved index of 
marital disorganization. Finally, the subcriterion 
of creative integration would be illustrated by a 
new integration of existing theories concerning 
family disorganization into a generalized theory 
that generated hypotheses subject to definitive 
verification by empirical research. 

Statistics and indices. The improvement of 
scientific knowledge concerning disorganization 
and dissolution depends upon improved statistics. 
In recent years some improvements have been 
made in marriage and divorce statistics in the 
United States, which are often inferior to those of 
less wealthy countries. In 1958 a reporting system 
known as the “divorce registration area” was estab- 
lished. For inclusion in the divorce registration 
area a state must have central files of divorce 
records; include on its record forms certain stand- 
ard items; receive regularly reports from local 
areas such as counties; and cooperate with the 
National Vital Statistics Division in regard to test- 
ing data for completeness and accuracy (Carter 
1960). Further support to the movement for im- 
proved statistics was given by a committee on 
marriage and divorce statistics of the American 
Sociological Association, which published a report 
in June 1958. As part of an action program initi- 
ated by the federal agencies concerned with mar- 
riage and divorce statistics, a national sample of 
marriage and divorce transcripts was collected for 
the year 1960, and the official improved statistics 
for 1960 are based on this new source, which ex- 
tends coverage beyond the 18 states in the divorce 
registration area. 

There are various ways in which scientific 
knowledge would be advanced by improvement of 
marriage and divorce statistics. Certain needs and 
research possibilities will be mentioned. 

The duration of marriage prior to divorce is a 
very important item of information. While it is 
claimed that the duration of marriage to the time 
of the decree is known for 96.2 per cent of all 
divorces granted in the United States during 1960, 
this information has not been fully used to compute
the probability of divorce for couples marrying
at a particular time. Such true probabilities for
earlier years have been computed by Monahan
(1959) and Jacobson (1959), but there is a tend- 
ency to regard mistakenly the ratio of divorces to 
marriages of the same year as expressing the risk 
of divorce. 
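
The difference between a true cohort probability and the same-year ratio can be shown with a small worked example. The sketch below is an illustration under stated assumptions: every figure is hypothetical, the function and variable names are invented for the purpose, and it does not reproduce the methods of Monahan or Jacobson. It follows each marriage cohort forward to count how many of its marriages ended in divorce, and contrasts that with dividing one year's decrees by the same year's marriages.

```python
# All figures are hypothetical, chosen only to illustrate the arithmetic.
marriages = {1950: 1000, 1951: 1000, 1952: 500}

# divorces[(year_of_marriage, year_of_decree)] = number of decrees
divorces = {
    (1950, 1951): 20,
    (1950, 1952): 30,
    (1951, 1952): 20,
    (1952, 1952): 5,
}


def cohort_probability(cohort_year: int, through_year: int) -> float:
    """Share of marriages contracted in cohort_year that were dissolved
    by divorce on or before through_year."""
    ended = sum(
        n for (married, decreed), n in divorces.items()
        if married == cohort_year and decreed <= through_year
    )
    return ended / marriages[cohort_year]


def same_year_ratio(year: int) -> float:
    """Decrees granted in a year divided by marriages contracted that year,
    regardless of when the divorcing couples married."""
    decreed = sum(n for (_, d), n in divorces.items() if d == year)
    return decreed / marriages[year]


cohort_1950 = cohort_probability(1950, 1952)  # 50 of 1,000 marriages
ratio_1952 = same_year_ratio(1952)            # 55 decrees over 500 marriages
print(cohort_1950, ratio_1952)
```

In this hypothetical case the same-year ratio for 1952 is more than twice the cohort figure, simply because few couples married in 1952 while most of the decrees went to couples who married earlier.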

Improved divorce statistics would reveal on a
national basis the details of each party's experience
with marriage and divorce prior to the particular
divorce being recorded and tabulated.

It would also be helpful to have national statis- 
tics on the total number of persons of various 
types who have been affected by divorce. People 
pass out of categories which imply a record of 
divorce, but they may carry the subjective experi- 
ence of divorce with them into a remarriage or a 
relationship with a stepparent. 

Statistics concerning divorce should be more 
closely related to improved statistics concerning 
separation and desertion. It may be that concern 
about the divorce of a particular date is inappro- 
priate in view of the fact that the granting of a 
decree is merely a legal event far removed in time 
from significant experiences of the individual con- 
cerned (Monahan 1962). 

It is highly important to have national statistics 
which are cross tabulated, showing the character- 
istics of one divorced spouse as matched with 
those of the other spouse. For research purposes it 
would be useful to have such cross tabulations by 
previous marital experience, religion, age at di- 
vorce, education, occupation and social status, and 
children resulting from prior marriages of the 
divorced spouses. 

An interrelation of divorce, marriage, and birth 
statistics on a national basis would be helpful. 
Christensen (1963) has already demonstrated that 
such comparisons are useful in showing the asso- 
ciation of premarital pregnancy with divorce. It 
may be that computer storage and retrieval of data 
will make the method of “record linkage” increas- 
ingly applicable. 

Statistics concerning family dissolution which 
permit valid international comparisons would be 
desirable. Obviously, a country which regards di- 
vorce as a private family matter and lacks a gov- 
ernmental agency concerned with divorce may 
seem to have a low divorce rate because of under- 
reporting. A logical extension of trends in the 
United States would be an international divorce 
registration area, having as members countries 
with acceptable statistics. Other countries might 
seek then to improve their indices of family dis- 
solution. 

Given the process of changing marital status, 
which is conspicuous in the United States and in 
other industrialized countries, there is need for
statistics that contribute to an index of what might 
be called the velocity of family dissolution. The 
speed of separation, divorce, remarriage, and redivorce
could be known only with the aid of information
concerning prior family experience, which
would be reported at each transition. It is known 
from unusually good statistics in Iowa for 1953 
that the duration of marriage prior to a divorce 
ranges from an average of 6.3 years for primary 
marriages, to 3.2 when both parties have been 
married before, to 1.8 for couples breaking up 
their third marriage (Monahan 1959). Kirkpatrick 
attempted a crude measure of velocity of family 
dissolution by dividing the number of divorces per 
year by the number of persons who in the year 
following were in the status of divorced (1955, 
p. 534). A good index of velocity of family disso- 
lution would require improved statistics and would 
take into account the speed with which children 
lose natural parents to acquire a stepparent and 
perhaps a series of such stepparents. 
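
The crude measure described above is a simple ratio, and a minimal sketch may make its construction concrete. The function name and the input figures below are hypothetical; only the definition (divorces in a year divided by the number of persons in the status of divorced in the following year) is taken from the text, and the sketch makes no claim about how such an index ought to be refined.

```python
def crude_velocity(divorces_in_year: int, divorced_persons_next_year: int) -> float:
    """Divorces granted in a year divided by the number of persons
    recorded as divorced in the following year."""
    if divorced_persons_next_year <= 0:
        raise ValueError("count of divorced persons must be positive")
    return divorces_in_year / divorced_persons_next_year


# Hypothetical counts for two populations with the same number of decrees.
slow_turnover = crude_velocity(400, 1600)
fast_turnover = crude_velocity(400, 800)
print(slow_turnover, fast_turnover)
```

With the number of decrees held constant, the ratio rises as fewer persons remain in the divorced status, which is one way of reading it as a rough indicator of how quickly that status is vacated.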

Knowledge of causes and consequences. The 
distinction between disorganization and dissolu- 
tion must again be stressed. As compared with 
couples equally disorganized but undivorced (either 
contemporary or living in a previous century), 
certain modern couples seek divorce for special 
reasons. Often the woman is able to get a job and 
hence can insist on a high standard of marital 
happiness. Thus, dissolution may be a preventive 
of disorganization. Some people are less tied to 
marriage than others; family functions, religious 
taboos, or life-long commitment to parental roles 
may simply have less meaning for them (Landis 
1963). The door of divorce is more open to certain 
couples than to others because of lenient laws in 
their locality, relative freedom to migrate elsewhere 
for an easy divorce, encouragement of relatives, 
opportunities for remarriage, and in general, the
prospect of freedom without the economic, reli- 
gious, and psychological penalties which have 
threatened other couples in various times and 
places. It can even be argued that divorce itself 
is a cause of divorce, in that frequent divorce 
weakens the norm of marital stability. Divorce 
frees bad matrimonial risks for remarriage and 
subsequent divorce; offspring imitate divorced
parents; and the divorce of numerous friends 
makes the choice of divorce more normal and 
respectable (Kirkpatrick [1955] 1963, pp. 577-582).

Persons facing divorce during the latter half of 
the twentieth century in the United States have 
varied problems. Variations in the divorce laws of 
the 50 states contribute to the possibility of legal
tangles. The divorce procedure is neither consist- 
ently routinized nor consistently established as an 
individualized therapeutic procedure; certainly, as 
compared with the bereaved person, the divorced 
person lacks a clearly defined role to play. The 
more meaningful any marriage is, the more pain- 
ful will be its dissolution. But the confusion in- 
volved in divorce as compared with bereavement 
is often almost as disturbing as the previous frus- 
tration and deprivation. Seriously maladjusted 
couples are confused and torn between the choices 
of continued sickness in their marriage and the 
amputation brought about by the choice of divorce. 
The grim challenge is to find the choice which 
brings the lesser evil. 

Effect on children. It can be argued that re- 
search concerning the effect of disorganization 
and disintegration on children is important, even 
if it is imperfect by the criteria previously men- 
tioned. Children are the helpless victims of the 
mistakes of their elders; they have little control 
over their destinies and usually cannot select the 
stepparent who comes to them in the course of 
the divorce-remarriage process. 

Much of the research has failed to compare the 
children of disorganized but undissolved homes 
with those from disorganized and broken homes 
of specific types. Landis and Nye found that chil- 
dren from broken homes fared as well or better, 
according to certain criteria, than children from 
intact but unhappy homes (Nye 1957). Burchinal 
(1964) considered homes reintegrated by remar- 
riage, as well as broken and unbroken homes, and 
found no differential effect on adolescents. 

In regard to the convergence of valid findings, 
much depends on the methods used, the type of 
effect on children, and the age of offspring at 
which possible effects are investigated. Landis
(1963) found rather impressive evidence that 
happy parental marriages were favorably related 
to the dating experience of offspring. Robert R. 
Bell and James DeBurger, both using the sibling- 
cooperator method (Kirkpatrick & Cotton 1951, 
p. 82), found that parental divorce showed greater 
influence than bereavement on the marital adjust- 
ment of grown offspring (see Kirkpatrick [1955] 
1963, pp. 665-674).


Applications of knowledge 


Effective application of improved knowledge of 
family disorganization and dissolution depends on 
the emerging pattern of interprofessional coopera- 
tion. Lawyers, doctors, psychologists, social work- 
ers, educators, home economists, clergymen, and 
demographers, as well as social scientists, draw
upon the growing body of knowledge in this area. 
That thriving interdisciplinary organization in the 
United States—the National Council on Family 
Relations—draws members from all of the above- 
mentioned professions. Furthermore, the emerging 
international organizations concerned with the 
family tend to be interprofessional in character. 

Legal changes. Legal changes, both actual and 
proposed, tend to reflect a sharper international 
perspective. Varied explanations of the divorce 
trends in England include the influence of changes 
in the divorce laws. However, Max Rheinstein 
(1960) found little influence of legal changes on 
divorce rates in Germany. In the United States, a 
tabulation of the divorce laws in the various states 
as of 1952 was cross tabulated by the crude divorce 
rates of 1950 and the percentages of respondents 
in the various states answering “No” to a question 
on a poll of 1936 which read, “Should divorces be 
easier to obtain in your state?” There was a strik- 
ing lack of correspondence between severity of the 
laws, behavior as to obtaining divorces, and public 
opinion concerning legal restrictions (Kirkpatrick 
1955, p. 538). Of course, extreme differences in 
divorce law among various countries or extreme 
changes within an area, for instance, in South 
Carolina, do correspond in some degree with di- 
vorce rates. 

David and Vera Mace note a more “folksy” and 
less legalistic type of divorce procedure in Russia 
than in the United States (Mace & Mace 1963, 
pp. 203-226). Olson (1961) makes similar obser- 
vations with regard to Japan. In view of the long 
and, thus far, futile campaign in the United States 
to achieve either a uniform divorce law in the 
various states or federal control by a change in 
the constitution, it is interesting to observe that 
Australia established such a uniform federal law 
in 1959. This may well have some effect on the 
pattern of Australian divorce rates. 

Family counseling. Counseling and therapy 
offer one means of applying improved knowledge 
concerning family disorganization and dissolution. 
At times, legal reforms converge with provisions 
for family counseling. In Australia the Matrimonial 
Causes Act (mentioned above) restricts dissolution 
before three years of marriage and provides for 
marriage counseling and conciliation; it authorizes 
federal payment for such services to approved 
marriage guidance organizations (Harvey 1964). 
The provision for counseling in connection with 
the divorce procedure in Australia and Japan may 
give encouragement to persons in the United States
who, like Judge Paul W. Alexander, plead for a
therapeutic approach, abolition of adversary pro- 
cedure, and family courts that provide for coun- 
seling prior to the granting of a divorce (Kirk- 
patrick [1955] 1963, pp. 631-632). 

There is some agreement, furthered by experi- 
ence with counseling and therapy, that prevention 
of family trouble should be sought by a focus on 
education and counseling at an early period in the 
life span. Since family dissolution results in no 
small measure from the coming together of the 
wrong combinations of people, there is an argu- 
ment for premarital counseling and guidance in 
the process of selecting mates. Many issues, how- 
ever, remain controversial. For example, the small 
number of trained counselors in relation to poten- 
tial demand provides an incentive to sacrifice 
quality to quantity by approving the activity of 
many relatively untrained persons as counselors. 

Perhaps the dilemma of quality versus quantity 
may be solved to some extent by group counseling 
and therapy instead of individual treatment. The 
mutual support found in such groups is impressive. 
Certain instruments constructed for measuring 
attitudinal agreement, marital adjustment, and 
empathy in dyadic relationships might be adapted 
for self-analysis on the part of individuals and 
couples with minimal outside aid from experts 
(Kirkpatrick & Hobart 1954, pp. 10-19). 

International trends. The international outlook 
with regard to family disorganization and dissolu- 
tion offers an intellectual challenge that goes be- 
yond the parochial applications of existing knowl- 
edge. However, some promising beginnings have 
been made; Goode, for instance, has noted the 
influence of industrialization upon differential 
family instability among the different social classes 
of various countries. These classes may vary, of 
course, as to degree of urbanization, type of kin- 
ship system, and feministic orientation (de Vis- 
scher 1956). The most valid generalization seems 
to be that with the spread of Western technology 
there tends to be increasing similarity in the fam- 
ily systems of the world, including their provision 
for marital dissolution. The diffusion of feminism 
may temporarily increase family dissolution, as 
women demand equality in the privilege of divorce 
and become economically independent of both hus- 
bands and kinfolk (Sysiharju 1960). 

Family systems tend to have a certain resiliency 
under the impact of drastic social change. The free 
divorce of the 1920s in communist Russia has been 
replaced by a conservative and restrictive proce- 
dure. Hitler attempted reactionary change of the 
German family (Kirkpatrick 1938), but after the
political and military crisis had passed, the general 
feministic and egalitarian trends characteristic of 
Western civilization reappeared (Schelsky 1953). 
As the dominance of the independent nuclear fam- 
ily is proclaimed, evidence appears of the survival 
of functional kinship systems. Carle C. Zimmer- 
man, who expressed concern about the return of 
the unstable “atomistic family,” found substantial 
evidence that “friend-families” tend to be support- 
ive and homogeneous in regard to low incidence 
of divorce (Zimmerman & Cervantes 1956, pp. 91- 
117; see also Shanas & Streib 1965). 

Improved facilities for communication are char- 
acteristic of the modern world, on the international, 
the national, and the familial level. Cultural inter- 
action may increase both the similarity of familial 
institutions and agreement concerning human 
goals such as peace, freedom, and happiness— 
ends to which social institutions are means. If 
improved communication reduces repression, hol- 
low ritual, and pretense at every level, the pursuit 
of happiness and freedom may lead to an increase 
in divorce rates or other indices of familial disso- 
lution in conservative areas of society. Neverthe- 
less, if dissolution be distinguished from disorgani- 
zation, certain increases in dissolution may mean 
merely a spread of higher aspirations and a reluc- 
tance to endure familial disorganization with its 
associated misery. 

CLIFFORD KIRKPATRICK 


[See also MARRIAGE, article on FAMILY FORMATION; 
and the biographies of WALLER; WESTERMARCK.] 
FAMINE


“Famine is like insanity, hard to define but glar- 
ing enough when recognized. . . . one country will 
define as food shortage what another country 
would call famine” (Taylor 1947, pp. 98, 102). In 
recent years, and particularly in the United States, 
where food surpluses have been embarrassing po- 
litically (perhaps also morally), journals have been 
prone to report from abroad as “famines” what sub- 
sequently appear as shortages or merely threats of 
shortage. Shortages are not infrequently relieved 
before they become famines. 

True famine is shortage of total food so extreme 
and protracted as to result in widespread persisting 
hunger, notable emaciation in many of the affected 
population, and a considerable elevation of com- 
munity death rate attributable at least in part to 
deaths from starvation. Criteria do not exist to 
measure the degree of hunger, emaciation, or ele- 
vation of death rate serving to differentiate famine 
from shortage. The archetypical famine extends 
over a wide area and affects a large population. 
Starvation deaths on a small scale, as among mem- 
bers of an isolated family, a small hamlet, a group 
of travelers in wild country, an icebound ship, 
would not commonly be characterized as famine. 
Acute shortage of food for a few weeks, such as 
preharvest hunger in some parts of the underde- 
veloped world, is not famine. Lack of a particular 
customary food, such as sugar or beef, is not famine
if there is abundance of other items. Shortage
of a particular vitamin or mineral in a population, 
evidenced perhaps by uncommonly heavy incidence 
of scurvy, beriberi, pellagra, rickets, or impaired 
vision, is not famine, although in recent decades 
the word has been applied to such shortages.

The members of a community beset by famine 
gradually become greatly emaciated and increas- 
ingly weak and listless—eventually to the point of 
lying in homes or along the streets and roads, ut- 
terly inactive, skeletonized, often with swollen 
bellies, waiting for death. In famine-stricken re- 
gions beggars are encountered in abnormal num- 
bers. There are riots, aimless wanderings, purpose- 
ful migrations; men, women, and children comb 
fields, alleys, and dumps, hoping to find a scrap of 
edible material. Livestock owned by the poor are 
sold or eaten. 

A house-to-house survey in north China during 
the famine of 1920-1921 revealed that people were 
eating, among other items not in their normal diets, 
“flour made of ground leaves, fuller’s earth, flower 
seed, poplar buds, corncobs, . . . sawdust, . . . cotton 
seed, elm bark, . . . peanut hulls, sweet potato vines
ground . . .” (Mallory 1926, p. 2). Kravchenko
(1946, p. 113), who as an official of the govern- 
ment witnessed the famine of 1932-1933 in the 
Soviet Union, quotes a young peasant woman: “I 
will not tell you about the dead. . . . The half-dead, 
the nearly-dead are even worse. There are hundreds 
of people in Petrovo bloated with hunger. I don’t 
know how many die every day. Many are so weak 
that they no longer come out of their houses. A 
wagon goes around now and then to pick up the 
corpses. We've eaten everything we could lay our
hands on—cats, dogs, field mice, birds. When it’s 
light tomorrow you will see the trees stripped of 
their bark. . . . And the horse manure has been 
eaten. . . . Sometimes there are whole grains in it.”
Upon such horrors in a famine-ridden area are su- 
perimposed an upsurge of burglary, robbery with 
violence, murder for gain. Cannibalism occurs but 
is a rare as well as secret event of which little can 
be known. Sorokin (1942, p. 81) holds the opin- 
ion that less than one-third of one per cent of a 
population in noncannibalistic societies would 
practice cannibalism under pressure of starvation. 
Disease flourishes abnormally as resistance is
reduced by low food intake.

The calamity of famine falls most heavily upon 
the poor, unless the state is dispossessing and pun- 
ishing a former aristocracy or bourgeoisie. Food 
prices begin to rise even before damages to crops or
military or political interferences with food inflow 
become generally apparent. People with ample purchasing
power begin to accumulate stocks of food,
either for their own future use or for sale at higher 
prices in weeks to come. Markets are swept clear 
of foodstuffs. Employment shrinks, and wages, 
where there is employment, seem not to rise in 
proportion to food prices. Families with low in- 
comes (if not dependents of the wealthy) feel the 
pinch of hunger first. They sell their possessions— 
their clothing, household furnishings, house tim- 
bers; even the means customarily used to provision 
themselves with food are sold. According to
Woodham-Smith, during the great Irish potato famine of
the 1840s fishermen all over Ireland pawned or 
sold their gear to buy meals (1962, p. 291). Dur- 
ing that and other famines many poor peasants 
have eaten the seed necessary to produce a new 
crop. Children have been sold in Chinese famines;
and men have sold themselves into slavery. Pros- 
titution burgeons. Buyers seem always to appear, 
for although famine may decimate a large popula- 
tion, it does not annihilate. Some members of a 
stricken community profit from the circumstances 
of famine. Some are protected by their position in, 
or by power of, the government. 

Causes. Famine has many causes. Nearly a 
century ago Walford (1878-1879, p. 450) listed 
12, classifying them into natural causes beyond 
human control and artificial causes within human 
control. This distinction remains valid in a general 
way, although it is certainly true that man has 
learned to modify some of the natural causes as 
well as to minimize their impact. Natural causes 
include drought, excessive rains and flood, unsea- 
sonably cold weather, typhoons and other high 
winds, tidal waves, depredations by vermin and 
such insects as locusts, and plant diseases. They 
tend chiefly to reduce production of food and to 
destroy stocks. Occasionally, though mostly for
short periods, floods or frosts restrict the flow of 
foodstuffs from surplus to deficit areas. The artifi- 
cial causes—commonly political—include warfare 
that involves siege or blockade, or destruction of 
food stocks or growing grain; and wartime strains 
on economies that diminish manpower, machines, 
or fertilizers, thus reducing cultivated acreage, 
yields, and production. Revolutions, particularly 
when they involve a struggle between peasantry 
and officialdom, may reduce food acreages and 
yields and thus contribute to famine; so may exces- 
sive taxation or collection from peasants of grain 
surpluses, which happened in Soviet Russia in 
1932-1933. It is difficult to perceive in the vague 
history of famines a major one in which political 
causes alone were operative, although this may be 
said of a good many minor famines—typified in 
sieges of cities. An age-old device of war is to
impose famine on the enemy.

The great famines of the world have been due to 
natural forces, frequently intensified, however, by 
political factors. Sometimes economic or demo- 
graphic situations—prevalence of poverty, includ- 
ing unemployment, peasant agriculture of the bare 
subsistence type, or many landless agricultural la- 
borers in a population of high density—are re- 
garded as causes of famines. They certainly make 
for vulnerability to famine, but unlike natural or 
political catastrophes are chronic rather than epi- 
sodic in character. 

The principal natural causes of major famines 
have been deficiency of rainfall (drought) or excess 
of rainfall (flood). Probably no major famines, but 
only localized and minor ones, have been due to ex- 
cessively cold weather, high winds, or infestations 
of insects or vermin. Even a great swarm of locusts 
consuming every growing plant would rarely spread 
over an area more than a fraction as large as that 
covered by major drought or flood. Plant disease in 
the form of potato blight, however, did emerge on 
one occasion—in Ireland in 1845-1849—as the 
outstanding natural cause of a famine of great se- 
verity. Drought outranks flood as a major cause, 
except perhaps in north China, where in some sum- 
mers the Yellow River may rise so high that it over- 
flows its diked banks and renders unproductive the 
vast agricultural plains of its valley. Few flood fam- 
ines of major proportions are recorded elsewhere, 
but one did occur, in the years 1315, 1316, and 
1317, in the British Isles and on the Continent east 
and north of the Pyrenees and Alps at least through 
present-day Poland; mortality was high (Lucas 
1930). Continuous rain greatly reduced the har- 
vests of grain crops, and pestilence (murrain) 
killed many farm animals. 

Famine follows upon extreme shortage, insuffi- 
ciently relieved by inshipment, of the staple starchy 
food crop of the afflicted area. That crop is usually 
grain—usually wheat or rye in temperate zones; 
rice, a millet, or sorghum in warmer climates. Fam- 
ine or general food shortage in northeastern Brazil, 
however, will chiefly represent deficiency of the 
manioc crop; in Ireland it was the potato crop. 
The grains and starchy roots provide the bulk of the 
energy-yielding food for most of the world’s popu- 
lation. In the absence of shortage of grain crops or 
(rarely) of starchy roots, a major natural famine 
is unlikely to occur. Grain is relatively cheap per 
thousand calories of nutriment, readily storable, 
and easily transported and processed into meal or 
flour. It is the most serviceable foodstuff to be 
brought in to ward off or relieve famine. 
Geographical incidence. Whether natural or
artificial, famine is always regional or local, never 
world-wide or continent-wide—or even nationwide 
in such vast countries as India, China, Russia, and 
Brazil. Conceivably, some large areas of the world 
have escaped, if judgment can be based upon the 
two major chronicles of famine (see Walford 
1878-1879; Minnesota. . . 1950). Therein no men- 
tion is made of famine in Australia, in the great 
islands of the East Indies, or in Africa south of the 
Sahara. In North America and Central America, 
the only listing is of a famine in Mexico in 1051 
(“Famine which caused the Toltecs to migrate”), 
and this is perhaps not clearly authenticated. South 
America seems to have experienced major famine 
only in northeastern Brazil (the sertão), an area 
subject to recurrent severe droughts. Although the 
chroniclers of world famine make no mention of 
Japan, historians record three famines there in 
different regions, in 1732-1733, 1783-1787, and 
1832-1836, severe enough to provoke violent riots 
(Sansom 1963, p. 222).

Europe west of Russia has witnessed no natural 
famine since the great Irish calamity of the 1840s, 
although artificial famines on a much smaller 
scale accompanied World War II, at least in Greece
and in the western part of Holland. There were 
food shortages elsewhere both then and during and 
after World War I. Ancel Keys (see Minnesota
. . . 1950, p. 1251) lists since 1850 one famine in
Persia (1871), one in Asia Minor (1874-1875), 
one in Egypt (1897), one in Brazil (1877), and 
one or two in Morocco (1877-1878), indicating 
infrequent occurrence in those countries. In Russia 
over the same period no fewer than ten famines 
are noted, and in India 13, not counting the most 
recent one, the great Bengal famine of 1943. The 
number of famines in China since 1850 is uncer- 
tain, but a severe one occurred in 1877-1878; 
others in 1919-1920 and 1929-1930; and lesser 
ones in 1906, 1911, 1916, and 1924. Russia, India, 
and China over the past century have encompassed 
the outstanding famine areas of the world. Each 
contains regions adjacent to deserts, where rainfall 
is regularly low, highly variable, and of summer 
incidence; crops tend to fail in the exceptionally 
low-rainfall years. These regions also have a rather
dense and impoverished agricultural population. 
In Russia the region most frequently drought- 
ridden centers in the Volga basin; in China, the 
valley of the Yellow River; in India, the northwest 
and the Deccan plateau. Each country also con- 
tains regions of abundant and dependable rainfall, 
where famine rarely occurs. But in all the great
famines of the twentieth century natural and artificial
causes worked simultaneously—drought and
flood, war and revolution. 

Relation to disease. Since famine stimulates 
human diseases, statistical differentiation between 
deaths from starvation and deaths from disease is 
practically impossible, as is close measurement of 
the degree to which famine elevates death (and 
morbidity) rates above normal levels. A dependable
ranking of the famines of even the past century
from most to least lethal is out of the question.
Nevertheless, it can probably be said that there 
was mortality of a million persons or more above 
average at least in the Irish famine of 1845-1849; 
the Indian of 1877-1878, 1896-1897, 1899-1900, 
and 1943; the Russian of 1921-1922 and 1932- 
1933; and the Chinese of 1877-1878 and 1929- 
1930. 

Famine reduces resistance to many diseases, including
malaria, influenza, and tuberculosis; smallpox,
cholera, typhus, or relapsing fever may plague
the afflicted regions, especially if the population is 
crowded into unsanitary refugee camps, as on the 
fringes of cities. Acute deficiency diseases take a 
much larger toll than usual, for reduced food con- 
sumption is certain to bring intake of some of the 
essential vitamins and minerals below require- 
ments. Acute and protracted diarrhea (“bloody 
flux”), induced by polluted water and the eating 
of improper materials, appears to be a lethal 
scourge, particularly among children. Famine not
only increases death rates but also reduces birth 
rates, thus slowing growth of population. Advanc- 
ing scientific knowledge of diseases and growth of 
both national and international health services in 
the twentieth century have greatly lessened the 
risk of high mortality as a result of famine.

Relation to migration. The circumstances of 
famine induce people to flee from it, not only to 
escape but also to seek work that will permit them 
to restore purchasing power in some form to family 
and friends left behind. Refugees from the country- 
side often flock to cities, especially centers of gov- 
ernment. Of those who flee, many return upon the 
abatement of famine conditions, but others find 
new homes. The drought-ridden sertão of north- 
eastern Brazil has witnessed both the flight and 
return and the permanent export of population— 
not abroad, but to other parts of the nation (James 
1942, p. 425; Smith 1879, pp. 398-435). Scarcity- 
induced migration that crosses national frontiers 
has not been common. The conspicuous example in 
history is the great migration of more than a 
million people from Ireland during and after the 
famine of the 1840s, the bulk of whom came to 
the United States, remaining to participate in and
influence that country’s development. Internal mi-
grations that may have occurred in Russia, China, 
and India have not been carefully recorded or 
studied. In general, demographers appear not to 
lay much stress on famine as a cause of the surg- 
ing migrations of history or prehistory. The unre- 
corded breakup of families attributable to famine 
migration must have caused millions of individual 
catastrophes. The famine in Ireland led to the con- 
viction in Great Britain that at all times basic food 
must be available as cheaply as possible to the 
poor, and the Corn Laws that had long held grain
prices high were repealed in 1846. Famines or
shortages there and elsewhere have tended to force 
a lightening of the burden of taxes and rents upon 
peasant farmers and a wider acceptance by govern- 
ments of responsibility for prevention and relief. 

Remedial measures. Five centuries ago famine 
was regarded almost throughout the world as in- 
evitable and was so accepted, often as a manifesta- 
tion of divine wrath. Occasionally, however, there 
were rulers who sought to prevent or relieve it. The 
Biblical story of Joseph in Egypt, storing grain in 
“fat” years against the “lean” that might follow, 
exemplifies probably the most common method of 
famine prevention in antiquity and medieval times. 
The rulers of the Inca Empire guarded against 
famine by storage and by construction of irrigation 
canals. Irrigation, a safeguard against famine be- 
cause it both elevates and stabilizes the acre-yields 
of crops, was practiced some 5,000 years ago in 
Sumer and is very ancient elsewhere in Asia and 
north Africa, but famine can hardly have been the 
sole stimulus for irrigation. Flood control by dikes 
and dams is also an ancient device which militates 
against famine but has other values. Destruction 
of stores of grain and of irrigation and flood-control 
works is obviously a method of creating artificial 
famine. 

Natural famines having their origin mainly in 
drought or flood, sometimes in plant disease or 
insect pest, are not now regarded as inevitable. 
Within nations, a naturally induced or threatened 
food shortage is certain to be met by domestic 
efforts to ward off or relieve it. This was not true 
in the Soviet Union as late as 1932-1933 but oc- 
curred there in 1963-1964, following a very severe 
drought, when a huge quantity of grain was im- 
ported and paid for. So it was in China a year or 
two earlier. India and Pakistan have been able 
since war ended in 1945 to arrange, partly on the 
basis of international credits, for sporadic grain 
imports sufficient to preclude famine; so also have 
Brazil and Yugoslavia. The disposition of all gov- 
ernments by the 1960s was to prevent or relieve 
famine or shortages within their own borders; and
of some governments to donate or loan funds or 
food surpluses to prevent or relieve famine beyond 
their own borders. The capacity of nations to pay 
or to loan or donate has increased. International 
cooperation in famine relief or prevention has in- 
creased in the past century, as evidenced by such 
organizations as the Red Cross, the China Relief 
commissions, the American Relief Administration 
of World War I, the United Nations Relief and
Rehabilitation Administration following World War II,
and the Food and Agriculture Organization of the 
United Nations. Supplies have been available de- 
spite the huge growth of world population; and 
with the advancement in agriculture everywhere, 
few localities stand in such risk of drought or pests 
or plant disease as prevailed even half a century 
ago. If, nevertheless, natural calamity strikes, the 
network of transport, by ship overseas, by barge on 
canal and river, by rail, by truck on roads, even by 
air, has so grown and is now so far-flung and 
efficient that stricken regions can be reached. Gov- 
ernments have learned how to ration food in short 
supply in a manner more equitable than was pos- 
sible earlier. 

Progress in coping with natural famine is thus 
apparent politically, economically, and socially. 
Until the end of the twentieth century there seems 
no reason why true famine of natural origin should 
be endured in any country, for over so short a time 
world population seems unlikely to outrun food 
supplies. What the more distant future holds is 
purely conjectural. But even in the shorter term, 
it cannot be said that artificial famine, induced by 
war or revolution, may not again appear. 


M. K. BENNETT 


[See also FOOD.]
Frantz Fanon (1925-1961), political theorist,
was born in Martinique. He studied medicine in 
France after World War II, became a psychiatrist,
and served in a government hospital in Algeria. 
In 1956 he resigned from French government serv- 
ice and joined the Algerian National Liberation 
Front (FLN), becoming first an editor of the party's 
journal and then, under the Provisional Govern- 
ment of the Algerian Republic, an ambassador to 
Ghana and a special envoy to the Congo. When he 
died in Bethesda, Maryland, of leukemia, he had 
established a reputation as the leading ideologist 
of the Algerian revolution. Subsequently, he came 
to be regarded as a major theorist of anticolonial- 
ism. His last work, The Wretched of the Earth 
(1961), is the clearest extant statement of this 
ideology. 

Fanon began as a psychiatrist. He argued that 
neuroses can be understood and treated only if 
they are related to the cultural situation that gives 
rise to them, what he called their “sociogenesis.” 
For Fanon, the crucial aspect of a social structure 
is the extent to which it creates institutions that 
fulfill men’s needs. He maintained that “a society 
which forces its members to desperate solutions 
is a nonviable society, which must be replaced”
(Pour la révolution africaine, p. 61). 

Fanon argued that in the eyes of most men the
fundamental conflict of the contemporary world is
the race conflict: Black men and white men con- 
stitute two hostile camps. He was very clear that 
the source of this conflict, as of all conflicts, is 
oppression—in this case, a racial oppression which 
masks the oppression of capitalism and colonial- 
ism. “It is the white man who creates the Negro. 
But it is the Negro who creates negritude” ([1959] 
1965, p. 47). Rooted in a tradition of universalistic 
rationalism, Fanon nevertheless rejected the French 
idea and policy of assimilation which, he felt, could 
never transcend paternalism and which tended to 
justify continued oppression. He defended the wear- 
ing of the veil by Algerian women as a symbol of 
nationalist opposition but thought that the veil 
would disappear after the success of the fundamen- 
tal revolution against the colonizers. 

Fanon sought ways for the colonized to purge 
themselves of the degrading effects of colonialism. 
For an individual or a group, the primary symptom 
of the fact that it has not attained human status 
is that it asks—indeed, appeals—for magnanimity. 
Rights can only be had if they are taken by pur- 
gative acts of violence. In a situation of collective 
denial of rights, there must be “collective catharsis.” 
Violence is the means to this catharsis. 

Fanon believed that renewal is not easy. It re- 
quires violence, which frees the individual from 
the mental deformities imposed by colonialism and 
creates the possibility of renewal. Politicization can 
then follow; for to politicize is to awaken the mind 
to the world. Thus, self-consciousness will be the 
guarantee of cultural openness. The liberation of 
the oppressed individual from the apathy that has 
been encouraged in him—Fanon spoke of the
“cultural mummification of colonialism”—is the only
possible basis of true liberation. 

Neither this individual liberation nor real decol- 
onization can be achieved without violence. Vio- 
lence is “in the atmosphere” and is constantly 
giving rise to spontaneous outbursts of violent pro- 
test. This spontaneity is a great source of strength, 
placing pressure on the inherently reluctant leader- 
ship of protest movements. But it is also dangerous, 
because it is neither disciplined nor farsighted. Suc- 
cessful revolution requires the proper balance of 
spontaneity and self-control. 

Nationalist movements in the contemporary 
world have been the principal vehicle of decoloniza- 
tion. However, for Fanon they are an uncertain 
instrument because they tend to be led by the urban 
middle classes and because they ignore, insofar as 
they can, the peasantry. But only the peasantry in 
the underdeveloped world is really revolutionary, 
for only it has nothing to lose. The urban proletariat,
by contrast, is a privileged group which tends
to look down upon the peasants. The chief char- 
acteristic of the urban nationalist elite is that its 
anticolonialism goes hand in hand with its spirit 
of accommodation. Its propensity to compromise, 
combined with the peasants’ often premature ac- 
ceptance of concessions, constitutes the chief dan- 
ger to the anticolonialist revolution. 

Fanon saw the colonial powers and their agents 
as following a policy of making minimal conces- 
sions which are intended to undermine the unity of 
the oppressed classes. He analyzed the guided steps 
to independence in much of Asia and Africa in 
these terms. Under pressure, the colonial powers
cede authority to each national bourgeoisie, thus 
effectively in most cases creating new allies. The 
single-party systems of the “third world”—the under- 
developed world—have been “dictatorships of the 
bourgeoisie.” 

The struggle for national liberation of colonial 
countries can, according to Fanon, be understood 
only in the context of the struggle of the “third 
world” for full equality and self-realization. Fanon 
drew on Hegel and Marx for his analysis of the 
underlying contradictions of the capitalist world. 
He followed Sorel and Lenin in their emphasis on 
the necessity of organized violence to polarize so- 
ciety. And he drew on a modified version of Freud’s 
theories for his analysis of how changes in the 
social structure lead to shifts in the personality 
structure, which in turn determine the possibilities 
of political action. 

Fanon sought to go beyond nationalism to the 
world revolution of the colonized against the col- 
onizers. He considered the attainment of national 
sovereignty to be insufficient. Fundamental eman- 
cipation of the once-colonial peoples can come 
about, according to Fanon, through their participa- 
tion in this world-wide revolution. He thought that 
a “new man” must be created and that only through 
the reawakened self-consciousness of the oppressed 
peoples can a true international consciousness be 
created and man’s humanity be established. 


IMMANUEL WALLERSTEIN 


[See also COLONIALISM; IMPERIALISM. Other relevant
material may be found in MODERNIZATION.]
FANTASY


Fantasy, man’s capacity to “give to airy nothing 
a local habitation and a name,” has long intrigued 
poets, playwrights, and painters but has only dur- 
ing the twentieth century become a formal area of 
scientific inquiry in psychology. In current usage 
the term is almost synonymous with “daydream.” 
Within the area of experimental or clinical study, 
however, the term “fantasy” has a broader signifi- 
cance since it deals not only with imaginary activ- 
ity spontaneously produced as part of the ongoing 
stream of thought (daydreams) but also with prod- 
ucts of thought elicited upon demand from a clini- 
cian or in response to inkblots or ambiguous pic- 
tures. In effect, whether as a privately occurring 
daydream or as a publicly elicited story or imagina- 
tive response, fantasy generally seems to involve 
a complex associative activity in which events from 
the past are integrated into some more or less 
elaborate ongoing image. This image, usually visual 
but often auditory or verbal, is generally an inner 
“event” in which a new situation, possibility, per- 
sonal role, or sequence of behavior is formulated 
either on a “mental screen” or as a story told to 
an examiner. Deriving from the Greek phantasia 
and still often spelled in Greek style as “phantasy,” 
fantasy refers to man’s remarkable ability to create 
an “as if” world either spontaneously or upon de- 
mand. It is possible that at one time such a capacity 
was not so widely developed, and people were 
more prone to regard their own fleeting imagery or 
brief daydreams as actual visions, omens, or ap- 
pearances of deities, much as they responded to 
nocturnal dreams. Prophetic visions, like Ezekiel’s 
“wheel” or John’s Apocalypse, probably represent 
literary expressions of elaborate daydreams or fan- 
tasies used for expository or hortatory purposes. 


History 

Within psychology the formal study of fantasy 
as a behavioral phenomenon is probably traceable 
to Francis Galton’s classic studies of individual dif- 
ferences in imagery. Although imagery itself as a 
field of inquiry has a somewhat separate history, 
the fact that much fantasy behavior involves some 
form of imagery led Galton to touch on issues
involving the differences between persons in their
internal private constructions. By 1890 William 
James in his great work, The Principles of Psy- 
chology, devoted portions of several chapters, in- 
cluding the famous one on the “stream of thought,” 
to issues closely related to fantasy processes. James 
was among the first great psychologists to sense a 
continuity between normal phenomena character- 
istic of the ongoing self-stimulating properties of 
the living brain and the bizarre delusions or hal- 
lucinations of the insane or the inspirational visions 
of religious mystics. James also called attention to 
what he termed the reproductive and memorial 
facets of imagery, or the degree to which the image 
is of an object recently perceived or one called 
forth from the distant past. The fantasy presum- 
ably may represent a response to a stimulus per- 
ceived momentarily which triggers off a complex 
associative process in the ongoing stream of 
thought. The taste of the madeleine crumb pro- 
duced in Proust a vivid memory of his childhood in 
Combray, a strong memorial association, whereas 
the sound of the song of the nightingale touched 
off a more “fanciful” association in Keats, the song 
that 


oft-times hath 
Charm’d magic casements, opening on the foam 
Of perilous seas, in faery lands forlorn. 


“Ode to a Nightingale,” stanza 7, ll. 8-10


Psychoanalytic contributions. Although the nine- 
teenth and early twentieth centuries were times of 
considerable research and controversy within psy- 
chology with respect to imagery and the possible 
existence of “imageless” thought, little attention 
was paid to the daydream itself as a spontaneously 
produced image complex. Interest in fantasy proc- 
esses and dreams was limited at that period (and 
particularly following the emergence of Watsonian 
behaviorism in America) to the clinical psychi- 
atrist and especially to the psychoanalyst. Freud’s 
elucidation of the structure and interpretative pos- 
sibilities of the phenomena of nocturnal dreaming, 
based largely on remarkable self-observation and 
intensive clinical work, also led quite naturally to 
explorations of other dreamlike phenomena, such 
as daydreams. When Freud realized that patients’ 
accounts of infantile seductions (which he be- 
lieved to be the foundation of neurotic develop- 
ments) could not possibly have occurred, he was 
compelled to pay more attention to the nature of 
childhood fantasy as the source of such material. 
The free-association process of psychoanalysis itself 
also led to frequent reports of fantasies or day- 
dreams, the most important, from a technical 
psychotherapeutic sense, being those involving the 
relationship between analyst and patient. Many 
psychoanalysts, therefore, paid careful attention to 
daydreams of patients and began in some instances 
to experiment with eliciting fantasies when pa- 
tients were resistant to or incapable of recalling 
night dreams. Freud himself during the early part 
of the twentieth century speculated on the psycho- 
logical significance of the daydream in his papers 
“The Relation of the Poet to Day-dreaming” (1908) 
and “Formulations on the Two Principles of Men- 
tal Functioning” (1911). 

Throughout the early decades of the twentieth 
century psychoanalysts made regular use of reports 
of spontaneous fantasies and published many pa- 
pers in which myths or popular stories and liter- 
ature were interpreted as forms of fantasy. 

Projective methods. Some attempts were made 
to study children’s fantasies through stories elicited 
from them or through their drawings, but the most 
influential advances in the psychological study of 
fantasy came from the development of so-called 
projective techniques. 

Rorschach technique. The first and most fa- 
mous of the projective methods was the Rorschach 
Inkblot Technique, developed by Hermann Ror- 
schach, a Swiss psychiatrist. Of particular interest 
to him was his finding that persons who reported 
numerous “human movement” responses to the ink- 
blots, such as “two footmen bowing, two ladies 
dancing,” were characterized on the one hand by 
considerable motor inhibition or awkward motility 
and on the other by a heightened capacity for “inner 
living,” imagination, or fantasy production. In con- 
trast, persons who produced numerous color- 
dominated responses were more likely to be out- 
going, motorically affective, and emotionally labile. 
The juxtaposition of the human-movement and 
color responses provided an indication of what 
Rorschach called the “experience type,” the relative 
degree to which capacity for fantasy behavior and 
inner living were balanced with tendencies toward 
emotionality and direct action. 

Rorschach’s method, somewhat modified, spread 
slowly through Europe and was introduced into 
the United States in the late 1920s and early 1930s 
by David Levy and Samuel Beck. In the United 
States by the time of World War II it had been
adopted by clinical psychologists, who saw the 
method as a means of exploring the private fantasy 
world of patients without direct questioning and of 
eliciting behavioral data that could be quantified, 
thus meeting to some extent the objections of aca- 
demic psychology to the purely qualitative approach 
of clinical psychoanalysis and psychiatry. With the 
introduction of more formal academic courses 
within the universities and private seminars on 
Rorschach technique, a great deal of systematic 
research was generated to check specific features 
of the technique or to compare cultures, psycho- 
pathological groups, or dimensions of individual 
difference through the use of the elicited fantasy 
response to the inkblots. A summary of much of 
the work in this field and of the methodological 
and theoretical issues involved in the use of the 
Rorschach method is available in Rorschach Psy- 
chology, edited by Rickers-Ovsiankina (1960) [see 
PROJECTIVE METHODS, article on THE RORSCHACH 
TEST]. 

Thematic Apperception Test. Beginning in the 
early 1930s at Harvard University’s Psychological 
Clinic, a group of psychologists under the leader- 
ship of Henry A. Murray undertook extensive 
studies of various techniques using elicited fan- 
tasies to study the nature of the normal personality. 
A specific technique that emerged was Murray's 
Thematic Apperception Test, a method for eliciting 
imaginative stories in response to somewhat am- 
biguous pictures, for example, of a boy staring at 
a violin set before him on a table. The stories 
elicited by each of a series of 20 pictures were then 
analyzed in terms of the motivational pattern pre- 
sumably involved in the story content, in effect an 
application of psychoanalytic interpretative meth- 
ods, perhaps more systematic, quantifiable, and 
suitable for rigorous research requirements. With 
various modifications in scoring categories and 
picture content, the basic method is in widespread 
use today for clinical, industrial, and research
purposes. Such scoring of motivational patterns from
elicited fantasy products has been especially instru- 
mental in the careful study by David McClelland, 
John Atkinson, and their numerous students of 
achievement motivation through fantasy. Because 
of the relatively objective scoring techniques avail- 
able and high reliability of trained judges scoring 
for such motives as achievement and affiliation, the 
Thematic Apperception Test or related methods for 
scoring story material as a fantasy expression of a 
need or motive are perhaps more widely used in 
systematic research than the more diffuse
Rorschach method. [For relevant summaries of
literature, see Atkinson 1958; see also ACHIEVEMENT
MOTIVATION; PROJECTIVE METHODS, article on THE 
THEMATIC APPERCEPTION TEST.] 

Other projective methods. Within clinical prac- 
tice and, to a lesser extent, in formal research, a 
large variety of fantasy-eliciting techniques are 
used. These include various drawing procedures or 
such studies of artistic activity as finger painting, 
eliciting responses to incomplete sentences or to
requests like “Name your three wishes,” choices of 
various symbolic objects, organization of mosaic 
patterns, and puppet play. All make some general 
assumptions that a long-standing personality pre- 
disposition or style, as well as a motive pattern or 
aspects of bizarre thought and pathology, will be 
manifested by such elicited quasi-fantasy expres- 
sions. The proliferation in the use of these projec- 
tive techniques has not, however, been the fruit of 
sufficient systematic prior experimentation with 
any one technique as a rule, nor have any really 
significant contributions to systematic psychological
theory resulted from their widespread application.
Individual clinical psychologists may be especially
gifted at eliciting material using a particular
technique with which they have much experience, 
but such personalized effectiveness does not usually 
add to the general body of psychological knowledge. 

Experimental approaches. More recently, as a 
result of particular theoretical developments within 
psychoanalytic theory and progress in research in 
psychology and neurophysiology, greater attention 
has been paid to experimental efforts to study the 
nature of ongoing imagery or fantasy and day- 
dream experience. New techniques—some involv- 
ing electrophysiological measurement or systematic 
inquiry of subjects during sensory isolation, under 
the influence of special drugs, or during monitoring 
or vigilance tasks—have led to some increased 
sophistication and rigor in the study of the “stream 
of thought” in the normal adult. Recent develop- 
ments in this area are summarized in Holt (1964) 
and Singer (1966). 


Current theories and relevant research 


Surprisingly little as yet is known of the range 
and variation of fantasy in normal adults and 
children or of differences in the content, frequency, 
or structure of daydreams among various cultural 
groups (Singer 1966). There has emerged, how- 
ever, chiefly from psychoanalysis, a preliminary 
theoretical formulation of the function of day- 
dreaming in the economy of the personality. Re- 
search reflects efforts to examine and test the im- 
plications of the theory of fantasy, which derives 
chiefly from the specific formulations of Freud. 

Defensive or cathartic models. One of Freud’s 
more fascinating insights was his observation that 
the capacity to delay gratification, a vitally signifi- 
cant step in man’s adaptive development, was 
somehow linked to his imaginative capacity. Two 
general conceptions of the relation between thought 
or fantasy and control mechanisms can be
discerned in Freud’s work and in that of most
psychoanalytically oriented investigators or learning
theorists who have developed the implications of the
notion of the relations of thought and delay. One 
conception stems from Freud's early paper on the 
poet and daydreaming (1908), in which he stated 
that “unsatisfied wishes are the driving power be- 
hind fantasies” and pointed out the degree of sim- 
ilarity between daydreams and night dreams in 
their wish-fulfilling function. The daydreams that 
the poet transforms into artistic productions are 
disguised representations of unfulfilled infantile 
desires, defenses against direct recognition of these 
desires. As Freud put it: “Happy people never 
daydream.” 

This view that the daydream is an outgrowth of 
an unsatisfied wish and is a defense against its 
direct manifestation pervades much psychoanalytic 
thinking. Closely related to the defensive function 
of fantasy but perhaps more precise in formulation 
is Freud’s second conception that all processes of 
thought, including fantasy, by allowing partial 
satisfactions (expenditure of small quantities of 
energy), partially reduce the drive and thereby 
permit delay in gross motor discharge. This theory
of the catharsis or drive reduction obtained through 
play and fantasy in children or daydreams in 
adults has many important implications (see Rapa- 
port 1956). 

In effect, it proposes that where there is a delay 
in the opportunity to discharge a certain amount 
of energy through satisfaction of a need or through 
movements associated with attaining such satis- 
faction, the fantasied reproduction “in the mind’s 
eye” of the gratifying activity or associated
movements will partially reduce the amount of energy,
at least enough, say, so that rash or fruitless action 
will not occur. From a psychoanalytic standpoint 
(where the emphasis is on the inner origin of 
stimulation) such partial discharges may effec- 
tively control maladaptive overt discharge of many 
antisocial or asocial tendencies. Thus, regular
watching of wrestling or boxing on television may 
curb man’s tendency to assault his neighbor upon 
minimal provocation, and the conscious acceptance 
of daydreams of sexual conquest may effectively 
prevent rape, whereas inability to experience such 
thoughts may lead to overt impulse-gratifying
behavior that has unfortunate consequences.

Within academic psychology the learning-theory 
formulation of the catharsis model does not specify 
the amount of instinctual energy but does em- 
phasize that fantasied behavior has, in effect, sec- 
ondary-reinforcing qualities that may lead to a 
lowering of drive strength consequent to engaging
in a daydream about the goal object. It is assumed 
that if a child is hungry he begins to make motor 
reactions associated with searching for food. To 
the extent that he can, at an ideational level, re- 
produce an image of the movements that would 
gratify his hunger or of his mother coming in with 
the food, and if just about that time his mother 
does indeed come in, then the ideational repre- 
sentation acquires secondary-reinforcing properties. 
Provided his mother comes regularly but perhaps 
not invariably (intermittent reinforcement being 
superior in preventing extinction), the fantasy 
activity will be learned as one means of response 
in a state of need. 

A number of interesting lines of research have 
been developed in attempts to test the cathartic 
theory of fantasy and some of the implications of 
this drive-reduction model. The theory also has 
some important implications for the logic of pro- 
jective testing where, for example, one faces the 
interpretative task of deciding whether the person 
who provides a certain frequent response pattern 
to inkblots or photographs is likely to show the 
same or opposite pattern in overt behavior. In 
the case of the Rorschach, for example, there is 
evidence for an inverse relationship between per- 
ception of movement on the blots and overt 
movement, impulsivity, and other ego functions 
(Rickers-Ovsiankina 1960). Most of the evidence 
supports Rorschach’s notion that human-move- 
ment responses are linked with controlled motility, 
on the one hand, and imaginative development 
on the other. An experiment by Page (1957) has 
provided an important link in this chain by dem- 
onstrating that only this human-movement re- 
sponse of all Rorschach factors is significantly 
associated with reported frequency of daydreaming. 

Criticism of the cathartic model. When the 
focus in the projective method is not on the struc- 
tural aspect of fantasy and motor behavior but on 
the relation of specific content to projective fantasy, 
a different finding emerges. The controversy be- 
tween the alternative channel and direct expres- 
sion theories of Thematic Apperception Test inter- 
pretation seems to be resolved by the large majority 
of studies in favor of direct expression. The litera- 
ture on fantasied aggression as summarized by 
Buss (1961) and the extensive work by McClelland 
(1961) and Atkinson (1958) have certainly
supported the idea that persons who produce
aggressive or achievement-oriented themes behave
accordingly in daily life.

An important step in the effort to test the drive- 
reduction model of fantasy came in Feshbach’s 
study (1955) in which it was demonstrated that 
when insulted Ss were given an opportunity to 
express aggression through storytelling they showed 
less aggression in subsequent ratings than did a 
group given no opportunity for fantasied aggressive 
expression. Feshbach also found that viewing a 
filmed prize fight after induction of anger would 
partially reduce later aggressive ratings. It should 
be noted, however, that the very writing of the 
stories is perhaps a more motorically involved ex- 
pression than pure daydreaming, the daily life phe- 
nomenon associated with drive reduction. 

Where experimental efforts have focused atten- 
tion not upon experimentally aroused anger but 
upon the arousal of stress, anxiety, or fear, the 
cathartic significance of daydream or fantasy 
activity is less in evidence or unsupported (Singer 
1966). These results suggest that when we turn to 
stress rather than anger as the drive or affect in- 
volved, the effects of daydreaming are more com- 
plex; engaging in fantasy may enhance anxiety 
when the future situation is unstructured but may 
reduce it when a short-term stress is imminent by 
distracting the person from continued thought. 

Particularly telling evidence against a catharsis 
theory has come in the works of Berkowitz (1964) 
and Bandura, Ross, and Ross (1963). These 
studies, concentrating on aggressive expression, 
seem to offer a body blow to the cathartic theory 
and to point up the importance of considering the 
pattern of environmental stimulation more thor- 
oughly in understanding the meaning of fantasy 
play in relation to direct motor expression. Berko- 
witz, for example, found that Ss who viewed a 
movie scene in which a prize fighter was taking 
a “justifiable” beating showed a heightened tend- 
ency to aggressive behavior toward a fellow student 
soon after the show, compared with Ss who had 
been told the beating was undeserved [see AGGRES- 
SION, article on PSYCHOLOGICAL ASPECTS]. The 
studies by Bandura, Ross, and Ross of modeling 
behavior indicate, too, that young children aroused 
to anger by frustration are likely to imitate directly 
the aggressive behavior of a previously witnessed 
live or filmed model [see IMITATION]. 

The evidence for a catharsis theory of fantasy 
seems weak, at best, in view of recent research and 
a restudying of the problem. The data, of course, 
are chiefly from the study of aggressive behavior, 
and in general there has been little concern either 
with naturalistic daydreaming behavior and its con- 
sequences (as against filmed or thematically stim- 
ulated responses) or with affects other than anger 
and aggressive behavior. 

Fantasy as a cognitive skill. An alternative ap- 
proach may be proposed that at least conceptually 
reconciles the Rorschach data on movement
perception and motor inhibition and the results of the
direct expression or modeling theories of fantasy 
aggression. Heinz Hartmann, while developing his 
concept of the autonomous ego, noted that fantasy 
may serve to prepare the person for a greater 
mastery of the outside world through increased in- 
sight into his own psychic processes. It seems pref- 
erable to go further than Hartmann and begin 
again with a fresh look at the origin of fantasy 
and imagery without the burden of concepts of 
cathectic dynamics or neutralized versus libidinized 
energy. One may start, instead, by regarding the 
development of fantasy behavior or daydreaming 
as a cognitive skill, a capacity for gradual inter- 
nalization of response and for attending to the 
ongoing “reverberatory” behavior of one’s brain— 
responses available to most children. Conceivably, 
children may differ constitutionally in their capac- 
ity to form visual or auditory images, but these 
differences for the most part account for only a 
small amount of the variance in subsequent de- 
velopment of daydreaming skills or styles. Special 
handicaps do exist, and it has been found that 
congenitally blind children are less imaginative in 
their spontaneous play and storytelling than other- 
wise matched sighted children (Singer 1966). 

Fantasy play is thus an important feature in 
the development of children, a part of the
continuous assimilation-accommodation pattern, as
Piaget put it (1945). There are, however, certain 
particular factors that increase the likelihood that 
fantasy play will become an integrated skill, rather 
than a sporadic response pattern, and, thus, a 
capacity to manipulate or to attend to an inner 
dimension, which may be one of man’s greatest 
assets. Similarly, if we recognize more clearly the 
potential of man’s inner skills, we may make pos- 
sible a further advance in man’s fullest use of his 
capacities. 

Factors in development of fantasy. What, then, 
are some circumstances conducive to fantasy de- 
velopment? 

Identification and imitation. The modeling 
theory suggests that imitation of parents is prob- 
ably an important element in fantasy development. 
The child who perceives his parents as benign 
persons will strive for identification more readily 
and learn more of their behaviors, and there is 
some evidence that persons showing more imagina- 
tion on the Rorschach (human-movement re- 
sponses) also report that their parents were benign 
and loving (Singer 1960). Identification with a 
specific adult figure, usually the mother, who repre- 
sents inhibition of impulse, socialization, and story- 
telling, may be an important feature in fostering 
internalization. Several studies on daydreaming
have yielded evidence of more frequent daydream- 
ing by male and female adults who were more 
closely identified with their mother than with their 
father or who perceived their father as further 
from their ideal (Singer 1966). A similar finding 
was reported by Sharef (1959) in a doctoral dis- 
sertation at Harvard: he noted that young men 
who had a history of being chosen as a mother’s 
confidant showed greater imaginative or introcep- 
tive tendencies. Some degree of modeling and im- 
itation caused in part by eagerness to please, by 
the availability of copying of certain types of re- 
sponses, and by the parent's rewarding these re- 
sponses may foster fantasy skill. A well-known 
quotation from Goethe serves to make this point: 


From father I have looks and build
And the serious conduct of living.
My mother gave me gaiety
And zest for fantasizing.


Xenions 


Opportunity for play. Given the closeness to at 
least one adult who encourages verbal interchange 
or fantasy play, a child still requires some op- 
portunity to practice such activities. There are in- 
dications that extensive contact with other children 
is likely to provide less opportunity for such fantasy 
play unless the children are older and play a quasi- 
parental role, for example, a much older sister who 
plays “house” or “school” with a younger child. By 
and large, the extensive kaleidoscopic ebb and flow 
of physical motion and varied external stimulation 
provided by a group of children is most likely to 
involve a child in external behaviors and afford 
him little chance to practice fantasy play. A study 
carried out with children aged six to nine (Singer 
1966) produced evidence that indicated that chil- 
dren reporting more fantasy play had significantly 
fewer older siblings, that is, they tended to be 
either first-born or only children and not members 
of large families. 

One might argue that daydreaming thus be- 
comes a defense against loneliness. At the same 
time many children who take great pleasure in 
elaborate fantasy games and who are eager to share 
these with others are frustrated by the disorganiza- 
tion of the other children and return to solitary 
play or, if they have some influence on the group, 
develop a more organized fantasy game, for ex- 
ample, pirates or detectives. Chance factors of 
being alone, of being encouraged to read, of story- 
telling or fantasy making as part of the family 
tradition as well as cultural aspects that strongly 
emphasize imaginative exchange or intellectual ex- 
ploration, achievement motivation, upward mobility 
in society—all increase the likelihood of extensive 
involvement in fantasy play by a given child 
(Singer 1966). In effect, the child who plays ex- 
tensively by creating fantasy people, situations, 
imaginary worlds, or scientific make-believe is 
practicing a complex series of skills related to what 
Kurt Goldstein has called the “attitude toward the 
possible.” These children have a more complex 
vocabulary, develop structural and organizational 
skills, see relationships that others may have to be 
taught in school, and, perhaps most important, 
carve out for themselves a dimension of experience 
—attentiveness to their own associational flow— 
which may be a realm they can enter or leave at 
will. This dimension can, of course, become the 
basis of an extensive defensive maneuver should 
the outside world become too stressful or should 
the child fail to develop positive social skills with 
his peers. The development of an extensive inner 
life by no means excludes the possibility of enjoy- 
ment and skill in social experience, group games, 
or sports. It merely provides an additional medium 
for play and satisfaction. There is as yet no evi- 
dence that psychotic patients show any excessive 
predisposition toward daydreaming. Indeed, chronic 
patients are often characterized more by an im- 
poverishment of inner living and daydreaming 
than by the excessive involvement in “private 
worlds” so often attributed to them [see DEFENSE 
MECHANISMS]. 

Adaptive aspects of fantasy. It is obvious that 
enhanced vocabulary and abstract skills represent 
great evolutionary advances in man. But let us 
also consider the adaptive manifestations of the 
more “fantastic” aspects of thought, daydreams, 
and reverie. The Rorschach findings about the in- 
verse relationship between imagination and motor 
impulsivity are better explained not by a relatively 
mechanical catharsis theory but by viewing the 
person who has imagination as having a whole set 
of alternative responses in addition to motor 
activity. In a waiting room he need not be so rest- 
less (as was found in several studies) if he can 
provide himself with an inner source of stimulation 
to engage his interest and provide positive affective 
experience. He may voluntarily set up an inner 
sequence (imagining a movie, replaying a
baseball game) or he may permit the ongoing stream
of consciousness, the reverberatory activity of his 
brain, to hold his attention. In situations of re- 
duced sensory cues, this capacity may be an asset, 
as was found in studies of sensory deprivation 
(Holt 1964). Similarly, it was found (Singer
1966, p. 108) that persons required to monitor a 
blinking light in a dark room with all external cues 
reduced were more likely to remain awake and 
comfortable if at the same time they could engage 
in free associative talk. When these same persons 
were limited to monotonous counting, they tended 
to fall asleep and became quite irritable. There is 
also evidence that persons who have a history of 
daydreaming, that is, for whom attention to inner 
experience is a well-established pattern, are less 
susceptible to hallucination. 

The person who has developed a considerable 
ability to picture events or scenes concerning peo- 
ple through reading and imaginative play has 
available a tool for greatly increasing his own re- 
sponse repertory. The ability to imagine a variety 
of social responses or scenes concerning people 
may enhance a person’s ability to deal with novel 
or difficult social demands. Obviously, practice in 
actual situations is also necessary, but it is possible 
that such learning will come more quickly to a 
person who has engaged in some preparatory fan- 
tasy. The fact that fantasy originates in play brings 
it closer to humor and wit and makes it possible 
for the person who employs fantasy when anxious 
or in conflict to see a humorous, bizarre, or unusual 
facet to his dilemma and to obtain at least some 
temporary positive affective experience or to al- 
leviate the stress through a comical fantasy. The 
development of an inner cognitive style of dif- 
ferentiated attentiveness to ongoing thoughts, 
memories, or marginal associations, or of an ability 
to manipulate such material at will, thus can be 
seen as an important feature of the over-all ego 
strength of the personality, a skill, much as is 
motor ability, empathic capacity, or precision in 
communication [see HUMOR]. 


Recent research developments 


In addition to the research centering on theoret- 
ical issues, other areas of significant ongoing ex- 
perimental work in the field of fantasy may be 
cited briefly. The extensive work of McClelland 
interpreting the fantasy content of stories in third- 
grade readers as evidence of achievement motiva- 
tion has been particularly fruitful. McClelland’s 
work has shown, for example, that indications of 
the degree of achievement motivation expressed in
the fantasy products of various countries around 
the world are highly related to the actual economic 
development of the country over a period of years 
(McClelland 1961). These striking results suggest 
either that the inclusion of such achievement- 
oriented materials in the stories used in teaching 
young children stimulates them to become more 
achievement oriented or, what is more likely, 
that an already ongoing national achievement-oriented 
pattern is then consciously or unconsciously 
reflected in the thematic materials provided 
children. Most of the work of Atkinson and 
McClelland has emphasized the nature of motiva- 
tion rather than the study of the fantasy process 
for its own sake. McClelland, however, has at- 
tempted a formulation of the relationship between 
degree of deprivation and the occurrence of a fan- 
tasy expression of the deprived need. 

The greater awareness that during periods of 
reduced sensory stimulation man becomes in- 
creasingly responsive to internally produced stim- 
ulation, as well as the methodological advances in 
the study of night dreaming by electroencephalo- 
graphic patterns and oculoretinograms, have en- 
couraged studies of man’s waking pattern of on- 
going thought. A series of investigations has 
suggested, for example, that adult Ss who are 
engaging in daydreaming show relatively little eye 
movement but that ocular motility increases when 
attempts at suppression of ongoing thought occur 
(Singer 1966). Similarly, experimental studies of 
the relative degree of external stimulation neces- 
sary to prevent subjects from actually engaging in 
passing fantasies also suggest that a remarkable 
degree of ongoing thought activity occurs, at least 
for college adults, despite great demands on atten- 
tion from the outside (Singer 1966). It seems 
likely that man’s relative willingness or ability to 
attend to what appears to be an almost continuous 
ongoing stream of internal reverberatory associative 
activity may be a significant dimension for per- 
sonality study [see DREAMS]. 

As yet, however, relatively little is known of the 
physiological basis of this stream of thought or of 
the factors that lead to relative priorities in attention 
to such internal processes. We also know 
little in a systematic way of individual differences 
in daydreaming and the personality correlates of 
such differences, although some studies have 
opened the way for further work in these areas. 
Formal data on whether there are differences be- 
tween cultural or national groups in frequency, 
content, and structure of daydreaming or resort to 
inner living are scant indeed. Singer and McCraven 
(Singer 1966, pp. 57-63) did find some evidence 
that American-born subcultural groups (Negro, 
Jewish, Italian, Irish, German, and Anglo-Saxon) 
did differ in reported daydream frequency, with 
the more upwardly mobile groups (or less socially 
secure ones) showing more fantasy activity. An 
earlier comparison of American-born Irish and 
Italian schizophrenic adults by Singer and Opler 
also yielded evidence of differences in degree of 
fantasy activity between persons of differing cultural 
background (Singer 1966, pp. 64-65). 


JEROME L. SINGER 


[Directly related are the entries CREATIVITY; STIMULATION 
DRIVES. Other relevant material may be found 
in AESTHETICS; FINE ARTS; LITERATURE; PSYCHOANALYSIS.] 
FASCISM 


“Fascism” is used primarily to identify the po- 
litical system by which Italy was ruled from 1922 
to 1945. It is also used to identify a prototype of 
totalitarianism and is applied to variations of po- 
litical systems thought to parallel the Italian one. 


Historical development 


Historically, fascism has its origins in the crisis 
of Italian parliamentary institutions. This crisis 
was caused in large part by a failure in the process 
of adjustment of the traditional parliamentary 
parties to new mass parties. It occurred at a mo- 
ment of intensified difficulties caused by World 
War I, as deep economic and social upheavals were 
complicated by an upsurge of nationalism and the 
aftermath of the Bolshevik Revolution of 1917. 
Thus, the combination of the weakness of the liberal 
ruling class, the revolutionary aspirations of the 
working classes, the extremism of patriotism, the 
dissatisfaction with the 1919 peace settlements, 
the psychological dislocation of war veterans, the 
fears of property-owning classes, and the political 
role played by the army and the crown produced, 
four years after the armistice of 1918, the episode 
known as the March on Rome and the beginnings 
of a 23-year dictatorship of the Fascist party under 
the leadership of Mussolini. 

As fascism sought, not without difficulty but 
with final success, to organize itself for the seizure 
of power, the clash of different ideological trends 
and the contradictions that were to mark its entire 
life stood out clearly. In the beginning, fascism 
showed a strong socialist inspiration. Many of the 
leaders had come to fascism from socialist and 
syndicalist movements, but they had differentiated 
themselves from orthodox socialists by maintain- 
ing an aggressive nationalistic attitude that had 
caused them to favor Italy’s intervention in World 
War I, to be against what they described as the 
“unjust” peace settlements, and to support the po- 
litical adventure of D’Annunzio's seizure of Fiume. 
Mussolini backed the occupation of the factories 
by the workers as part of a revolutionary program 
intended to give to the industrial working classes 
the political role they did not have. But, by playing 
on the themes of national grandeur and power, he 
was also enlisting the support of nationalistic activists, 
who thought in terms of territorial expansion 
and colonialism. 

By the end of 1920, however, nationalism rather 
than socialism was providing the main driving 
force of fascism. The government of the time, 
headed by the liberal leader Giovanni Giolitti, 
rightly thought that a specific revolutionary danger 
did not exist and that prudent handling of the 
situation would lead, as it did, to a peaceful reso- 
lution of that particular conflict. But Italy’s middle 
classes, the landowners, the business world, the 
army, and the crown saw in fascism the militant 
movement that, properly led, could make Italy safe 
from the Marxist peril. In 1921/1922 the original, 
and small, fascist movement was swamped by 
hundreds of thousands of new members, most of 
them of middle-class provenance, while it received 
subsidies from industrial quarters and weapons 
from the military establishment. 

Fascism reoriented itself along fresh lines, with 
policies that stressed, above all, the need to restore 
the authority of the state at home and abroad. The 
state was conceived as the defender of law and 
order and as the unyielding supporter of the na- 
tional interest in foreign affairs. The younger party 
members were organized into blackshirted squads 
that proceeded to destroy the physical structure 
and to liquidate the leadership of socialism and 
communism, which by then had become the chief 
targets. 

When, after the 1919 elections, Giolitti failed to 
reach agreement with either the Socialists or the 
Christian Democrats, he imagined that fascism 
would see Italy through the political impasse by 
taming these two mass parties and that in the end 
liberalism would return to power. Liberal Italy 
greeted October 28, 1922, the day of the fascist 
seizure of power, as the beginning of an interlude 
that would stabilize political life, restore the au- 
thority of the state, and prepare the return to tra- 
dition shortly afterward. 

The second period in the history of fascism goes 
from 1922 to 1925. In the course of these three 
years the fascist regime sought to answer a num- 
ber of questions about its own direction and pur- 
pose. After the bloody violence of the preceding 
two years, these years appeared mild enough on 
the surface, and some of the liberal leaders even 
thought that their forecasts would be realized. 
Laissez-faire was the prevailing economic policy. 
Parties and the press seemed to function almost 
normally. The word “totalitarian” had not yet been 
invented. 

But the inherent logic of the system was already 
at work. First, the blackshirted army had not been 
dissolved, and the dualism typical of a totalitarian 
state was born, with two sets of institutions—one 
answerable to the government, the other to the 
Fascist party. Second, the use of violence accepted 
as normal since 1922 could not be given up. Force 
was still the foundation of the regime. One after 
the other, the voices of the opposition were stilled. 
The climax was the murder of Matteotti, the So- 
cialist party leader. For the old ruling class, this 
proved to be the final test of its sense of responsi- 
bility and its understanding of the nature of the 
modern political process. In the summer of 1924 
the crown might still have succeeded in obtaining 
Mussolini’s resignation. The advice it received was 
that no change should be attempted. The Vatican 
joined in this appraisal when it drove Luigi Sturzo, 
the leader of the Christian democratic movement, 
into exile. 

In 1925, exploiting this extraordinary vote of 
confidence, fascism built the totalitarian structure. 
The press was silenced or taken over. All parties 
were abolished except the Fascist party. Constitu- 
tional changes were begun that created the unique 
figure of the leader embodying in his person the 
sum total of power. 

The next ten years, from 1925 to 1935, repre- 
sented a period of both practical consolidation and 
theoretical doctrinal development. With power 
safely in his hands, Mussolini began to consider 
the problem of the place of fascism in history. 
Fascism had been criticized as being a naked, 
pragmatic movement. It was necessary to acquire 
ideas and to develop an overview on the nature of 
man and of nations that could promise the recog- 
nition of fascism as one of the important revolu- 
tionary movements of the twentieth century. 

The enemies were identified with great pre- 
cision. Marxism still was the foremost opponent. 
But liberalism became another enemy to be fought. 
Showing little regard for those who had put him 
in office, Mussolini began to cultivate the ideas 
that led by 1927 to the labor charter, by 1930/ 
1931 to the beginnings of the corporate state, and 
by 1934 to the establishment of the corporations 
themselves. 

Some of these economic ideas did provide a cer- 
tain amount of lively discussion at the time among 
those whose aim was primarily to find some third 
way between Marxism and liberalism. The key 
notions were that (a) the community alone was 
to have the right to determine what the national 
interest required; (b) therefore, the conflicting in- 
terests of owners, workers, technicians, and the 
state were to be brought together in a single unit, 
the corporation, operating under public control; 
(c) strikes and lockouts were to be forbidden; and 
(d) the doctrine of the primacy of the politician 
over the expert was to be abandoned. The divisive- 
ness of politics was to be eliminated by the unity 
of expertise. 

The world-wide depression that had hit the West- 
ern world after 1929 facilitated Mussolini’s task. 
By 1931 the industrial and banking systems of Italy 
were in serious trouble. The totalitarian regime 
made possible a quick salvage operation, which 
placed the key industrial and financial sectors of 
the country under direct government ownership or 
control. By 1935 fascism had realized, at least on 
paper, the goals of a state-controlled society. In its 
repression of the individual and of social groups, 
fascism was steadily strengthening and centraliz- 
ing its power, which was exercised in the name of 
an ideology that had become a key operational tool 
in the hands of the new elite. 

In its third period, 1935 to 1943, violence and 
war became the substance of fascism. The first 
important manifestation of this totalitarian charac- 
teristic took place in 1935 with the aggression 
against Ethiopia, which provided the regime with a 
testing ground for its military policies, challenged 
the League of Nations, and furnished German 
Nazism with evidence of the might of fascist Italy. 
The second act was played on the battlefields of 
Spain, where both Nazism and fascism joined 
hands against republican Spain. Historically, this 
armed clash was of great significance, for it gave the 
enemies of Mussolini some idea of their strength 
and of guerrilla-warfare techniques. The third act 
was played in 1940, when Mussolini entered World 
War II on the side of Hitler after the defeat of 
France. Italy’s defeat came soon, and by 1943 the 
fascist regime collapsed, as it had begun, through 
an intervention of the crown. 

From 1943 to 1945 the Fascist Social Republic 
came feebly and fleetingly to life under the control 
of the Germans. The only point worthy of note is 
that Mussolini, on the eve of the final collapse and 
of his own death, tried in a clumsy way to go back 
to his syndicalist origins and appear as a defender 
of the proletariat. Industrial plants were now to be 
turned over to the workers themselves. But it was 
1945 and too late. 


The fascist regime—ideal and reality 


As a movement based on a pragmatic appraisal 
of the conditions necessary to retain power, fascism 
was never too preoccupied with the task of a theoretical 
definition of its own origins and goals. The 
philosopher Giovanni Gentile sought to link it to 
Hegelian idealism, the jurist Alfredo Rocco at- 
tempted to develop a heavy-handed theory of the 
state, while Mussolini himself sought to provide 
the ideology of totalitarianism. 

Instruments of power. But the real drive of 
fascism was in the building of instruments of pow- 
er and not in the building of theory. Between 1925 
and 1939, four main tools of power were developed 
and refined: charismatic leadership, single-party 
rule, terror, and economic controls. 

Leadership. Around the leader, Benito Musso- 
lini, a series of institutional privileges were built, 
intended to make his position unchallenged. Con- 
stitutionally, he was chief of state and, as such, 
was placed in a position that was not subordinate 
to that of the king. Although the monarchy was 
kept, the Great Council of Fascism had been given 
certain rights on questions affecting the succession 
to the throne which placed the monarchy in a de- 
pendent position. Politically, the constant rotation in 
office of Mussolini’s subordinates kept competitors 
out of the way. Psychologically, the unique position 
of the leader was carefully maintained by all the 
devices of communication and propaganda typical 
of totalitarian states. The identification of fascism 
with Mussolini was made compulsory in meetings 
of parliamentary assemblies, of the Fascist party, 
of economic bodies, of schoolchildren, of every 
form of group life. 

Party rule. The party became a capillary instru- 
ment of power going from its highest body, the 
Great Council of Fascism, which met from time to 
time to decide major questions of policy, through 
the secretary of the party and the provincial fed- 
erations to the thousands of party units, which at 
the communal level were the daily instruments of 
propaganda and contact with the country. The 
party reached out in all directions with its subsid- 
iary organizations, affecting the activities of school- 
children and the cultural and sport activities of the 
people. The party became the carrier of the ideology 
and slogans of the leader, and, more important, the 
channel through which most of the life of the 
country had to flow. Jobs, advancement, and pre- 
ferment had to be cleared in most instances through 
the party. Membership in the party was, at first, a 
right belonging to the small elite group that in the 
pre-1922 days had supported the party's fight for 
power; in a second phase which lasted into the 
early 1930s, membership was made available to all 
who applied; in a third and final phase, one could 
become a member only by moving up through the 
youth organizations that by then had been created. 
Thus, membership in the Fascist party was reserved 
at first to the fighters, later to the opportunists, 
and finally to the perfect citizens of a 
fascist state nurtured on the ideals of fascism from 
their most tender age. 

Terror. Although between 1925 and 1943 the 
party was the chief vehicle for the consolidation of 
the regime, it was no longer the chief instrument of 
terror, as it had been between 1920 and 1925. 
“Legalized” repressive functions were carried out by 
the Special Tribunal for the Safety of the State and 
by the secret police under the Ministry of the In- 
terior. But from 1943 to 1945, the years of renewed 
bloody civil war between fascist militias and the 
resistance groups, the party again undertook the 
task of meting out summary justice to the increas- 
ingly rebellious population and was guilty of mas- 
sacres that exceeded in scope anything that had 
been witnessed between 1920 and 1925. 

Economic controls. As the pseudo liberalism of 
the initial years gave way to controls on economic 
life equaling those on political life, fascism devel- 
oped two principal instruments of policy. The first 
was the Institute for Industrial Reconstruction 
(IRI), which, started in 1931 as an antidepression 
device, was soon changed into an agency to pursue 
the military goals that after 1935 became the heart 
of the dictatorship. Controlling all major financial 
institutions and nearly all heavy industry through 
the IRI, the regime transformed it into an ever- 
expanding industrial complex on which fascist war 
production plans were based. The second was the 
corporation, which imposed central controls over 
all forms of economic activity, including all remain- 
ing so-called private activities. Before becoming a 
“political” tool in 1939, with the establishment of 
the Chamber of Corporations, the corporation had 
helped the government control both labor unions 
and employers’ associations by bringing them all 
under the rule of a central bureaucracy. The cor- 
porations were the best example of a basic tenet of 
fascist doctrine: the supremacy of the expert over 
the politician. They were the evidence of the tri- 
umph of economics over politics, as parliamentary 
institutions, made up of representatives of the gen- 
eral interests of the community, were superseded 
by experts talking the language of economics and 
technology, given to the hard-headed discussion of 
facts and not to the empty rhetoric of parliamen- 
tarism. 

Fascism in operation. All told, this structure 
represented something new. As Mussolini said, “A 
party holding ‘totalitarian’ rule over a nation, is a 
new departure in history. There are no points of 
reference nor of comparison” ([1932] 1935, p. 36). 
Later analysts were to agree in large part with this 
statement and to say that totalitarianism did, indeed, 
represent a twentieth-century departure in 
the political evolution of mankind. In addition, the 
essence of totalitarianism was to be found in a com- 
bination of leadership, an ideologically inspired 
mass party, and violence, and in so total a claim by 
the ruling group over the lives of the individuals 
that no separateness, no autonomous legal system, 
and no group life could survive. 

The totalitarian pattern of fascism, however, 
falls short of this model, in part because Mussolini 
gave too much weight to the state: “For Fascism 
the State is absolute, individuals and groups rela- 
tive. Individuals and groups are admissible insofar 
as they [cannot attack the state]. . . . The Fascist 
state . . . has a will of its own. For this reason it 
can be described as ‘ethical’” ([1932] 1935, pp. 37-38). 
And in article 1 of the labor charter of 1927 
the Italian nation was described as an “organism 
having ends, a life and means superior in power 
and duration to the single individuals or groups of 
individuals composing it” (National Fascist Party 
[1927] 1935, p. 53). These were Hegelian influences 
that reflected the early role of the idealism of Gen- 
tile, whose presence could not be fully reconciled 
with the nihilism of authentic totalitarianism. 

The party itself was one additional field in which 
fascist totalitarianism did not ring true. The party 
efforts to maintain ideological coherence, discipline, 
and a sense of mission were thwarted by the Italian 
belief in relativism, the spirit of compromise, and 
the refusal to take ideology seriously. The history 
of other totalitarian parties has shown that the 
party purge is of the essence in a totalitarian sys- 
tem. A domesticated totalitarian party, not wracked 
by fear, not cowed by the brooding image of the 
leader, but rather reduced to a mere vehicle for the 
securing of jobs, is no longer a revolutionary move- 
ment. The events from 1943 to 1945 showed that 
the behavior of blackshirted killer squads, the trial 
and execution of a few high-placed fascist leaders, 
including Mussolini’s own son-in-law, were evidence 
only of the extremes of panic to which the surviv- 
ing fascists were driven under the twin pressures 
of partisans’ attacks and of German controls. The 
ease with which the country shed its fascist trim- 
mings showed that in 25 years an effective hard 
core of fascist militants had not developed. 

Hence, terror on a large scale and for preventive 
or repressive purposes never quite materialized. 
Fascism suffers by comparison with the apocalyptic 
liquidations for which Hitler and Stalin will be 
known to history. Instead of millions, Mussolini 
had on his conscience only a few tens of thousands 
of dead, excluding casualties due to direct military 
action during World War II. The will was lacking. 
Mussolini’s cynical boasting was nearly always ac- 
companied by a most lively sense of his inadequacy. 
The assured appearance of the leader on the public 
square was not matched by equal confidence in his 
private political activities. He had been a member 
of a democratic socialist party for too long to for- 
get entirely the habit of doubt and skepticism. Hid- 
den admiration for certain traditional forms of 
Italian culture stopped him from exercising his 
powers to the fullest extent. His ignorance made 
him avoid direct confrontation with established 
forms of conducting public business, which could 
therefore continue as before. 

Mussolini’s most persistent feeling probably was 
that of the crisis of the modern capitalistic order. 
He kept asking certain questions over and over 
again, without necessarily providing the answers. 
In 1932 he had asked, “Is this crisis, which has 
held us in its grip for the past four years . . . a crisis 
‘within’ the system or ‘of’ the system?” (1935, p. 
10). He felt he could not give an answer then, but 
in November 1933 he was to say, “Today my an- 
swer is: the crisis has sunk so deep into the system 
that it has become a crisis ‘of’ the system. ... We 
can now assert that the capitalistic mode of production 
has been superseded” (pp. 10-11). Why is this 
so? Because by its very size capitalism has turned 
“into a social phenomenon, and it is precisely at 
this moment that capitalistic enterprise . . . falls 
like a dead weight into the arms of the State” 
(p. 16). 

This is a purely Marxian analysis of the problem 
(the dominant socialized characteristics of a highly 
developed capitalistic system, the sudden crisis) 
up to the point at which the heir of capitalism is 
not the armed proletariat sitting on the ruins of the 
institutions of the bourgeois state, but is the “eth- 
ical state itself.” 

Mussolini tried hard, with the labor charter and 
the laws on the corporations, to give some unity to 
the new system. He made large theoretical claims 
for the corporations, which were to unite workers, 
owners, experts, the state, and the party and to 
which powers had been given extending from wage 
fixing to the regulation of production, the settle- 
ment of disputes, the drafting of collective labor 
contracts, and the prevention of strikes and lock- 
outs. 

But in his vaguely socialist dream, he was lim- 
ited, on the one hand, by his recognition of private 
property as necessary to the fulfillment of the human 
personality and, on the other, by the overwhelming 
bureaucratic complexities of the all-or-nothing 
paper structure of the corporate state, and 
this made it necessary for him to appoint to the 
governing boards of the corporations representa- 
tives not only of wine but also of vinegar producers, 
not only of umbrella but also of button manufac- 
turers. The resistance of the property owners who 
had put Mussolini in power was subtle and stub- 
born. They saw, in the immense Roman bureauc- 
racy and in a party where the stout of heart and 
the believers were few, the chance to use the state 
capitalism of fascism in the same way in which 
public systems in other countries have been used 
by anxious capitalists in trouble, that is, as a prop 
to keep them going until better times. 

The easy way out for everybody was military 
adventure. Again, the final flare-up of the Social 
Republic in the spring of 1945 is evidence of the 
decay of a system on the eve of its liquidation. Un- 
til then, the vast structure of the production sys- 
tem had been used only to prepare for war and to 
enable the regime to find overseas the outlets not 
found at home. 


Interpretations 


As a phenomenon that, having spread from Italy 
to other countries, affected the course of history 
between the two world wars, fascism has been sub- 
jected to a 40-year effort at interpretation. The 
variety of analyses has been correspondingly great, 
with sharp contrasts among the points of view de- 
pending on the time, the interests, and the ap- 
proaches of those dealing with it. 

The fascists themselves, those with a more speculative 
frame of mind and able to write with some 
detachment after the event, have tended to see in 
fascism one phase of the world-wide shifting of the 
political discourse from multiparty to one-party 
systems and of the transfer of power from the leg- 
islative to the executive, in which violence was dis- 
cipline and military aggression was reaction to 
foreign hostility. They still believe that some of the 
trends and programs foreshadowed by the fascist 
era should be developed in the future as part of the 
needs of modern government. Fascism without 
ideology, war, and concentration camps could find 
expression in a depoliticized society that would turn 
its back on the rhetoric of the nineteenth century, 
but not on deeply felt national sentiments, and 
seek its way under the guidance of stable and effi- 
cient leadership. 

The parties in power at the end of World War I, 
when the crisis began, saw the fascist movement 
in a different light. To them it was the unavoidable 
reaction of the “healthier” political forces in the 
country to the process of disintegration of the 
community and the constitutional system, caused 
by forces largely identified with Marxism. Marxism, 
in its twin embodiment of socialism and commu- 
nism, loomed as a many-sided assault on the tra- 
ditional institutions. The infrastructure agencies 
(cooperatives, peasants’ leagues, trade unions) 
moved against the state with excessive economic 
claims and with a systematic onslaught on the 
processes of production. The authority of the state 
itself was being weakened by a series of strikes that 
affected vital public services. Fascism was a reac- 
tion of certain social groups, primarily the middle 
classes and the well-educated urban youth, in- 
tended to restore law and order. 

But the undermining of the liberal constitutional 
order was not carried out at this level alone, for at 
the national political level Marxist parties were act- 
ing in alliance with another large and new political 
formation, the Christian Democratic party. Marx- 
ism and Christian democracy as mass parties joined 
hands here in their attempt to deprive the parlia- 
ment and cabinet of their traditional roles by im- 
posing rigid programs, which, the liberals thought, 
were not in keeping with the discussion and com- 
promise typical of a constitutional democracy. 
Party bosses, who were constitutional “outsiders,” 
sought to dominate political life. Fascism was to 
restore the constitutional system through the de- 
struction of mass parties, those intruders which, 
since 1919, had upset the apple cart. Hence, a dual 
purpose was attached to fascism: the immediate 
restoration of normalcy against communist sub- 
version and the long-range return to parliamentary 
government freed from the obnoxious influence of 
mass party rule. In brief, fascism was an interlude 
at the end of which the forward march of liberalism 
could be resumed. 

This view did not survive the events of 1924/ 
1925, when the institutions of totalitarianism were 
set up and a fuller view of fascism stood revealed. 
The realization by the liberals of the illiberal reali- 
ties of single-party dictatorship took place by stages 
between 1923 and 1925. By then, from Giolitti to 
Croce, Italian liberalism presented a united anti- 
fascist front. But what stood out most clearly was 
the liberals’ singular attachment to certain consti- 
tutional values that ruled out modern variations, 
chiefly the constitutional transfer of power to mass 
democratic parties. Liberal elitism was in the end 
confronted and defeated by brutal and stronger 
varieties of fascist elitism. The critics of the lib- 
erals’ position have pointed out that this was a his- 
torical mistake of which they were the first victims. 
Many of the spokesmen for the new mass parties, 
from Sturzo to Tasca, have pointed out that it was 
the refusal of the old ruling class to come to terms 
with the new elite emerging from the mass upheav- 
als of postwar Italy that was at the root of the tri- 
umph of fascism. Socialists and Christian Demo- 
crats, far from seeing in themselves the agents for 
the destruction of the state, saw in their programs 
the only hope for a democratic renovation of Italian 
life and institutions. Their view was that the lib- 
erals were the accomplices of fascism in an attempt 
to stop the normal democratic evolution of a so- 
ciety in rapid transformation. 

The Marxists follow orthodox lines. Fascism was 
the defender of capitalistic society, threatened by 
the steady widening of the power of the Russian 
Revolution and of the influence of Marxism in Italy. 
Fascism was a repressive movement developing 
along the lines Marx had anticipated for the final 
phase of bourgeois society. 

This understanding of fascism as a class phe- 
nomenon, deprived of mystery and uncertainty, 
simplified the task of the Marxist opposition, which, 
after 1925, was to be essentially communist oppo- 
sition. The socialists were no longer an effectively 
organized force, whereas other Marxist groups, 
aware of the evolution of Stalinism, had moved 
toward the center with reformulation of a modern 
liberal-socialist faith. Through their firm rejection 
of fascism, the communists were to derive great 
political benefits after 1945, for they could then 
identify themselves with one clear alternative in 
which they claimed to have believed all along. How- 
ever, the communist interpretation of fascism did 
run into some difficulties. At the beginning the diffi- 
culty lay in the fact that the clear-cut class lines 
which would have had to be present as capitalism 
engaged in a supreme attempt at survival were not 
there at all. Fascism drew mass support from lower 
middle classes, intellectuals, peasants, and workers. 
Ten years later the noncapitalistic inclinations of 
fascism had become apparent, at least in theory, 
even though the fledgling corporate state was sub- 
merged by the requirements of fascist military poli- 
cy. At this stage, however, the communists could 
point to war as the logical outcome of a capitalistic- 
ally inspired tyranny. But at the end the lines were 
confused once more as fascism tried to revert to one 
of its ideological roots, socialism. These difficulties, 
however, did not substantially weaken the appeal 
of the communist interpretation, because its key 
element was the condemnation of the bourgeoisie, 
which it made responsible for fascism. And to this 
analysis many non-Marxists found it possible to 
accede. Moreover, the Marxist interpretation had 
the advantage of appearing to deal seriously with 
the phenomenon of fascism. Whatever it was, it 
was not to be taken lightly. It was a phase in the 
development of certain contemporary societies. 

Such views stand a better chance of withstanding 
the test of history than predominantly literary in- 
terpretations, widely accepted at times, of fascism 
as a bad dream, as an inexplicable and certainly 
short-lived aberration that would fade away with a 
return to rational behavior. This caustically ironic 
attitude was justified perhaps by the frequency with 
which fascism appeared clothed in grotesque gar- 
ments or supported policies that, because they were 
unacceptable on the basis of tradition, could not be 
sustained. But it suffered from an incomplete anal- 
ysis of the crisis of Italian society and from a be- 
lief that Italian history since 1870 had shown noth- 
ing but favorable progress along the lines of modern 
democracy and that the anarchy prevailing deep 
in the hearts of so many Italians had been con- 
quered by a growing sense of community. 

Related to the bad-dream school was the histor- 
ical one of fascism as a periodic phenomenon of 
Italian history. Mussolini was linked to the long 
series of tyrants, large and small, adventurers, and 
Machiavellian princes, who for many centuries had 
dotted the Italian landscape. Italy was the victim 
of one more manifestation of an endemic disease. 
A lack of discrimination, a tendency to vague gen- 
eralizations, and a belief in cyclical recurrence 
afflicted an account that later could not explain the 
unprecedented catastrophic events marking the end 
of fascism and the difficulties that have continued 
to beset Italy since 1945. 


Evaluation 


No adequate review of the fascist era, from the 
point of view of the social scientist, has been under- 
taken since 1945. The literature has tended to be 
reminiscent, episodic, and introspective. At best we 
have detailed narratives of short critical periods 
in the history of fascism. Slowly, however, general 
reflections and lines of agreement appear to emerge 
and suggest some preliminary conclusions. 

The first concerns the weakness of Italy’s pre- 
fascist ruling class, a class whose credit ledger in 
the years from 1848 to 1922 was certainly not a 
mean one. But between 1912 and 1922 that class 
had been guilty of a series of decisions taken out- 
side the liberal constitutional system and against 
the interests of the country. The Libyan war was 
a surrender to nationalistic and colonial interests. 
The parliamentary manipulations of the spring of 
1915 had brought the country into World War I 
under unfavorable conditions and against the incli- 
nations of the country at large. The acceptance of 
the dismal rhetoric of D’Annunzio as the official 
ideology of a country at war released the worst 
aspect of the sentimental patriotism and aggressive 
nationalism that formed such a large part of the 
post-1919 crisis. The failure of the machinery of 
the state and of the administration to maintain 
order and, worse still, the arming by the govern- 
ment of the Black Shirts were the final evidence of 
the liquidation of a ruling class that no longer 
ruled, had no views of what it should do, and was 
ready to step aside in the hope of recovering the 
past sometime in the future. 

The second is the recognition of the depth of a 
phenomenon that today is playing a decisive role 
in the transformation and behavior of social 
groups, that of anomie. World War I had imposed 
an altogether excessive and cruel effort upon Italy. 
The idiocy of generals who sent hundreds of thou- 
sands of young men to a useless death, the social 
upheavals caused by war industries and profiteer- 
ing, the opening up, under the strains of conflict, 
of regions that for centuries had been cut off from 
communication not only with the rest of the world 
but almost with their neighbors, and the lack of 
any recognition after 1918 of the seriousness of 
the problems which peacetime Italy would con- 
front caused an unendurable strain on the weak 
texture of Italian society. This created large and 
vague expectations on the part of millions of un- 
employed and uprooted peasants, war veterans, 
frontline heroes, and dissatisfied students. Liberal 
Italy was not prepared to meet them or even to 
recognize them. 

Under such anomic conditions, the appeal of 
mass movements—Marxism, Christian democracy, 
fascism—was bound to be great. That of Marxism 
was notable, even though the Socialist party dated 
back to the late nineteenth century. Although the 
theoretical weaknesses of the old warrior had been 
exposed, its half-hearted revolutionary enthusiasm 
could not be concealed. The mass appeal of Chris- 
tian democracy was based on yet untested slogans 
and on new men, and it quickly gathered strength. 
But Italian liberalism had lumped Christian de- 
mocracy together with Marxism. Both were forms 
of the revolt of upstart political elites against the 
majesty of the liberal state. The last of the three, 
fascism, made its appearance as something new, 
promising shelter, food, stability, jobs, and a vigor- 
ous political system to those who were looking for 
such assurances. It had the advantage over Marxism 
and Christian democracy of being helped by 
the ruling class and by the weapons placed at its 
disposal. 

Fascism can thus be seen as a mass movement 
to which an anomic society turned in a period of 
crisis for reassurance and the promise of satisfac- 
tion of essential community needs. But the promise 
was not kept. Behind the seemingly innovative 
facade, economic and social stagnation prevailed. 
Perhaps the most typical fascist law was the one 
that attempted to stop internal migrations. By 
freezing population movements, by keeping the 
peasant on the land, fascism strengthened the 
anarchism of individuals and acted directly con- 
trary to the needs of the country, preventing 
the modernization of its ancient, quasi-feudal 
structures. 

Thus, fascism did not resolve the anomic state 
of Italian society that had made its rise possible 
in the first place. The process of integration and 
modernization could start in earnest only after 
1945. Among all the industrialized countries of 
western Europe, Italy is still the one most substan- 
tially removed from the conditions of a modern 
state. This, in part, is the result of the long frost 
of fascism. 

At the same time, however, fascism functioned 
as if it had understood the new conditions of eco- 
nomic life. Bits of the largely unused machinery 
and ideas of the corporate state have been retained 
in post-1945 Italy in such fields as collective bar- 
gaining, where national and compulsory uniformi- 
ties are now imposed. Labor agreements binding 
even on those who have not participated in their 
negotiation and massive state intervention in the 
settlement of labor disputes are part of the prac- 
tice of republican Italy. Equally significant has 
been the resurfacing of fascist corporativism, 
the acceptance of its features in Gaullist France. 
The controlling factor everywhere is found in the 
deep malaise of European capitalism, which in its 
support of monopoly practices, sharp dealings with 
public authority, secrecy of managerial decisions, 
and drive for sheltered markets has deepened many 
of the accepted defects and realized few of the 
expected promises of the industrial revolution. 


MARIO EINAUDI 


[See also CAUDILLISMO; DICTATORSHIP; FALANGISM; 
NATIONAL SOCIALISM; SOCIALISM.] 
FASHION 


Fads and fashion are related yet fundamentally 
different social phenomena. Fashion is the more 
important of the two. Its general nature is sug- 
gested by the contrasting terms “in fashion” and 
“outmoded.” These terms signify a continuing pattern 
of change in which certain social forms enjoy 
temporary acceptance and respectability only to be 
replaced by others more abreast of the times. This 
parade of social forms sets fashion apart from cus- 
tom, which is to be seen as established and fixed. 
The social approbation with which fashion is in- 
vested does not come from any demonstration of 
utility or superior merit; instead, it is a response 
to the direction of sensitivities and taste. 

Although conspicuous in the area of dress, 
fashion operates in a wide assortment of fields. 
Among them are painting, music, drama, architec- 
ture, household decoration, entertainment, litera- 
ture, medical practice, business management, polit- 
ical doctrines, philosophy, psychological and social 
science, and even such redoubtable areas as the 
physical sciences and mathematics. Any area of 
social life that is caught in continuing change is 
open to the intrusion of fashion. In contrast, fash- 
ion is scarcely to be found in settled societies, such 
as primitive tribes, peasant societies, or caste so- 
cieties, which cling to what is established and has 
been sanctioned through long usage. 

The picture of fashion as a distinctive social 
process in which collective judgment of what is 
proper and correct shifts in response to the direc- 
tion of sensitivity and taste sets three major ques- 
tions: What is the nature of the situation in which 
the fashion process operates? What is responsible 
for its operation? What societal role or function 
does the fashion process perform? 

Areas of fashion. Areas amenable to fashion 
are those that have been pulled into an orbit of 
continuing social change. The structuring of social 
life in such areas tilts away from reliance on estab- 
lished social forms and toward a receptiveness to 
novel ones that reflect new concerns and interests; 
thus, these areas are open to the recurrent pres- 
entation of prospective models of new social forms 
that differ from each other and from prevailing 
social forms. These models compete for adoption, 
and opportunity must exist for effective choice 
among them. Most significant in this selective proc- 
ess are prestigeful personages who through their 
advocacy of a model give social endorsement or 
legitimacy to it. Means and resources must be available 
for the adoption of the favored models. 


Explanations of fashion 


Most theoretical analysis of fashion centers on 
the major question of what is responsible for the 
operation of fashion. We may dismiss trivial answers 
such as that fashion is a crazelike outburst 
of collective disturbance or that it is a hoax perpetrated 
by venal-minded sets of persons seeking 
financial or personal gain. The more serious analyses 
fall into two categories. One type seeks to 
account for fashion in terms of psychological 
motives, the other in terms of societal or structured 
processes. 

Psychological theories. Psychological explana- 
tions generally treat fashion as an expression of 
feelings of revolt against the confinement of pre- 
vailing social forms. Scholars identify different 
feelings. Some regard as most important the effort 
to escape from ennui, or boredom, especially in the 
leisure class. Some ascribe fashion to playful and 
whimsical impulses to embroider the routines of 
life. Some attribute major weight to the excitement 
that comes from venturing into novel forms of con- 
duct. Others regard fashion as a symbolic ex- 
pression of hidden sexual interests. Particularly 
important is the view, most clearly expressed by 
Edward Sapir (1931), that fashion is an effort to 
add to the attractiveness of the self, especially 
under conditions which impair the integrity of the 
ego; fashion is seen as a means of rediscovering 
the self through novel yet socially sanctioned de- 
partures from prevailing social forms. Finally, some 
scholars trace fashion to desires for personal 
prestige or notoriety. 

These various psychological explanations are de- 
ficient in that they do not explain how or why the 
various feelings give rise to a fashion process. Such 
feelings are present and operate in societies and 
areas of life in which fashion does not occur. We 
are given no account of why the feelings should 
lead to the formation of fashion rather than taking 
other channels of expression available to them. Instead 
of accounting for fashion, the feelings presuppose 
its existence as a medium for their play. 

Simmel’s view of fashion. Most sociological ex- 
planations center on the idea that fashion is basi- 
cally an emulation of prestige groups. Georg Simmel 
(1904) has given the most sophisticated presenta- 
tion of this view. He contends that in an open-class 
society the elite class seeks to set itself apart 
visibly by distinctive insignia, such as dress and 
modes of living. Members of subjacent classes seek- 
ing higher status adopt these insignia. It is then 
necessary for the elite class to introduce new dif- 
ferentiating insignia, which in turn leads to a new 
wave of emulation. Simmel’s scheme characterizes 
fashion as a recurring process. It provides an ex- 
planation of how new fashions are introduced and 
acquire sanction, an account of their spread, and 
an explanation of their disappearance. It also sup- 
plies an explanation for the absence of fashion 
from folk and caste societies and from certain 
areas in modern society, such as the area of utility 
and that of the sacred, in which status considerations 
are irrelevant. 

However, this scheme fails to see fashion as a 
process that transcends and embraces the elite. The 
elite, although in the vanguard of fashion, is itself 
required to follow fashion’s direction. Its prestige 
does not assure that anything it introduces will be- 
come the fashion; instead, its introductions must 
coincide with the direction of what is acceptable. 
People adopt a new model to be “in fashion” rather 
than to emulate prestige groups. Any concern of 
the elite to set itself apart as a distinctive status 
group takes place within the ongoing process of 
fashion; such concern does not account for the 
process or set it in motion. 


Processes of social change 


Fashion should be seen as a process of reaching 
out for new congenial social forms in an area that 
is a part of a continually changing world. The 
movement of that world introduces new horizons, 
germinates new inclinations and interests, and 
shifts orientation away from the past to the proxi- 
mate future. The fashion process meets this kind 
of developing world through two major stages— 
innovation and selection. In the innovative stage 
new models or proposals—such as new dress de- 
signs, styles of furniture, themes in entertainment, 
approaches in philosophy, or theoretical schemes in 
science—are presented. Such models are geared to 
the current state of their respective fields; each 
seeks to sketch out a prospective line of movement. 
The models appear as rival claimants for adoption 
and thus initiate a selective process, which results 
in a new fashion. Prestigeful individuals and 
groups occupy a key role in the selection; they 
make the initial choices, and they give a stamp of 
endorsement to the model they embrace. To influ- 
ence others, however, they must be qualified to 
give an endorsement. Further, the model they en- 
dorse must be found congenial to current trends 
in order to gain general dissemination. The history 
of fashion shows dramatic instances of the failure 
of a model to become fashionable despite an effec- 
tive marshaling of prestige groups on its behalf, 
for example, the failure of the highly organized 
effort to check the trend toward shorter skirts in 
1922-1923. Fashion leaders are the unwitting sur- 
rogates of the larger body of people sharing in the 
movement of fashion. The vague tastes and pro- 
clivities aroused in such people by their moving 
world are the ultimate source and shaper of fashion. 

Historical continuity. The underlying connec- 
tion between fashion and emerging taste helps to 
explain two important features of fashion: its historical 
continuity and its modernity. The history of 
fashion shows that new fashions are related to and 
grow out of their immediate predecessors. The 
typical picture is that of fashion trends—a feature 
that enables us to identify fashion periods and to 
speak of fashion cycles. Changing tastes and pro- 
clivities, while moving toward something new, 
must also take into account what is currently de- 
fined as proper and correct. Correspondingly, in 
devising their new models, fashion innovators al- 
ways have to consider the prevailing fashion. 
Although the intrinsic nature of the object of 
fashion may set a limit to a trend (as in the case 
of the lengthening or shortening of the skirt), and 
although a trend may reach a point of exhausting 
its possibilities, a reversal or abrupt redirection of 
fashion necessarily has temporal linkage with the 
preceding fashion form. 

Modernity. The feature of modernity in fashion 
is particularly significant. Fashion is always mod- 
ern; it always seeks to keep abreast of the times. 
Fashion is sensitive to the movement of current 
developments not only in its given field but also in 
adjacent fields and, indeed, to general movements 
in the larger social world. Thus, fashion in women’s 
dress is responsive to its own trend, to develop- 
ments in fabrics, ornamentation, and in the fine 
arts, to exciting events such as the discovery of the 
tomb of Tutankhamen, to political happenings, and 
to major social shifts such as the emancipation of 
women or the rise of the “cult of youth.” In an 
indirect and attenuated way, fashion in every field 
responds to the general or over-all direction of 
modernity itself. This responsiveness seems to be 
the chief factor in the formation of a “spirit of the 
times” or Zeitgeist. 


Social functions of fashion 

The remaining major question—what is the so- 
cial role or function of fashion—has not received 
satisfactory consideration. The conventional an- 
swers are that fashion allows for the harmless play 
of fancy and caprice, for a mild and legitimate 
escape from the tyranny of custom, for socially 
sanctioned adventure into an area of novelty, for 
the display and parading of the ego, for a cloaked 
expression of sexual interests, for the invidious 
demarcation of elite classes, and for an external 
and spurious identification by lower status people 
with a higher status group. 

Control functions. Fashions at different points 
in their careers may serve varied purposes; yet, the 
function of the fashion process cannot be reduced 
to such purposes. The functions of fashion derive, 
instead, from the fact that it introduces controlling 
social forms into a moving area of divergent possibilities. 
As such, it performs three significant 
functions. First, it introduces uniformity by select- 
ing from many models one which is to carry the 
stamp of propriety and thus compel adherence. If 
all proposed models were to be followed, social life 
in a given fashion area would become chaotic. In 
this respect, fashion performs in a moving society 
the control function that custom performs in a 
settled society. 

Second, fashion provides for an orderly march 
from the immediate past to the proximate future. 
By presenting new models and subjecting them to 
the process of competition and collective selection, 
the fashion process offers a continuous means of 
adjusting to a changing and shifting world. The 
fashion mechanism detaches social forms from the 
grip of the past, as suggested by the derogatory 
connotation of such expressions as “old-fashioned” 
and “out of date”; yet, in growing out of the pre- 
ceding mode fashion maintains continuity of 
development. 

Third, the fashion process nurtures and shapes 
a common sensitivity and taste, as is suggested by 
the congeniality and naturalness of current fashion 
in contrast to the oddness and incongruity of past 
fashions. This common sensitivity and taste is anal- 
ogous on the subjective side to a “universe of 
discourse.” Like the latter, it provides a basis for 
a common approach to the world and for handling 
and digesting the experiences the world yields. The 
value of a pliable and re-forming body of common 
taste to meet a shifting and developing world is 
apparent. 


Collective taste 


The term “taste,” which is central in the above 
discussion, deserves clarification. It represents an 
organic sensitivity to objects of social experience, 
as when we say, for example, that “vulgar comedy 
does not suit our taste” or that “they have a taste 
for orderly procedure.” Taste has a trifold charac- 
ter: it is like an appetite in seeking positive satis- 
faction; it operates as a sensitive selector, giving a 
basis for acceptance or rejection; and it is a forma- 
tive agent, guiding the development of lines of 
action and shaping objects to meet its demands. 
Thus, it appears as a subjective mechanism, giving 
orientation to individuals, structuring activity, and 
molding the world of experience. 

Tastes are themselves a product of experience; 
they usually develop from an initial state of vague- 
ness to a state of refinement and stability, but once 
formed they may decay and disintegrate. They are 
formed in the context of social interaction, responding 
to the definitions and affirmations given by 
others. People thrown into areas of common inter- 
action and having similar runs of experience de- 
velop common tastes. 

The fashion process involves both a formation 
and an expression of collective taste in the given 
area of fashion. The taste is initially a loose fusion 
of vague inclinations and dissatisfactions that are 
aroused by new experiences in the field of fashion 
and in the larger surrounding world. In this initial 
state, collective taste is amorphous, inarticulate, 
and awaiting specific direction. Through models 
and proposals, fashion innovators sketch possible 
lines along which the incipient taste may gain ob- 
jective expression and take definite form. Collective 
taste is an active force in the ensuing process of 
selection, setting limits and providing guidance; yet, 
at the same time it undergoes refinement and organization 
through its attachment to, and embodiment in, 
specific social forms. The origin, 
formation, and career of collective taste constitute 
the huge problematic area in the study of fashion. 
Major advancement in our knowledge of the fashion 
mechanism depends on the charting of this area. 


Fads 


Fads, like fashion, may occur in widely different 
areas of group life, such as games, recreation, en- 
tertainment, dietary practice, health and medical 
practice, dress, ornamentation, language, and pop- 
ular beliefs. Although superficially fads seem to be 
similar to fashion, they actually constitute a sep- 
arate genre of collective behavior. The most notice- 
able difference is that fads have no line of historical 
continuity; each springs up independent of a prede- 
cessor and gives rise to no successor. This separate, 
detached, and free-floating character signifies that 
fads, unlike fashion, are not part of a regulating 
social process that gives shape and structure to 
group life. The derogatory connotation of the term 
“faddish” points to the alien and questionable status 
of fads. We may note other significant differences. 
Fads do not require endorsement by a qualified 
prestige group in order to gain acceptance; they 
may spread from any section of hierarchized soci- 
ety. Fads are ephemeral, leaving no residue except 
in the occasional remnants of a detached cult. 
Fads follow the pattern of a craze or boom, thriving 
on spectacular and excitatory appearance, suddenly 
riveting attention and inducing a quasi-impulsive 
adoption, only to exhaust their attractiveness and 
undergo a rapid demise. 

Fads, unlike fashion, may occur in any type of 
society, traditional or modern. Their universality 
suggests that they have a natural root in human 
existence. But we know little about the generic 
conditions that bring them into being. Most of the 
psychological explanations advanced to explain 
fashion seem far more appropriate as explanations 
of fads. 

HERBERT G. BLUMER 


[See also COLLECTIVE BEHAVIOR; STYLE.] 
FATIGUE 


Fatigue is an experienced state of discomfort, 
aversion, and inability to perform, otherwise known 
as tiredness or weariness (Bartley 1943; Bartley & 
Chute 1947). The term applies to the total individual 
(the organism-as-a-person) and not properly 
to an organ or tissue. Fatigue is to be distinguished 
from impairment, which refers to the inability of 
cells or tissues to perform. Fatigue is sometimes 
confused with work decrement, for quite often the 
inquirer is actually interested in work output rather 
than in the state of the performer. 

Fatigue is brought about in various ways, one 
common cause being discomfort in muscles in- 
duced by exertion, another being impairment. Very 
often, however, fatigue arises from some sort of 
disorganization within the individual. This is in- 
duced, in part, through confrontation with circum- 
stances that are disliked or considered futile and 
useless. The disorganization referred to here can 
be described at various body-process levels. Disor- 
ganization can involve a disruption in the proper 
timing of various processes or a conflict between 
these processes with subsequent tension, awkward- 
ness, and forgetfulness that result in discomfort 
and the realization of inadequacy. 

Chronic fatigue often represents an unwhole- 
some outlook on daily routine or life in general or 
arises from personality disorganization rather than 
some debilitating body processes at a lower level 
(Muncie 1941). 

Critique of common conceptions. The common 
outlook that confuses fatigue with impairment is 
characterized by several points, some of which are 
acceptable and some not (Bartley 1951). First is 
the view that activity modifies the acting system, 
and that the modification is a chemical one. Fa- 
tigue is taken to be such a modification. However, 
this is really impairment and is a phenomenon 
that takes place at the tissue level, whereas fa- 
tigue is a phenomenon at the personality level— 
the experience, the self-awareness of condition. It 
is also assumed that fatigue can be either a gen- 
eral body condition or specifically localized. The 
slowing down of a process is taken to be the evidence 
of fatigue or, by some writers, to be fatigue 
itself. Here again, it seems to be impairment that 
is at issue. 

Conventional assumptions shift from consider- 
ing fatigue as tissue effects to considering it as the 
way the organism-as-a-person behaves. Even at this 
level compensation is generally overlooked in hu- 
man performance. This is evidenced in the tacit 
assumption made by many that the subject has 
only one way of performing a given task. For ex- 
ample, it is not presumed that one muscle fiber 
group may be recovering while some other group 
is taking its place in carrying on over-all perform- 
ance. But this type of compensation does occur, 
and it makes many of the simple, customarily 
performed “fatigue” experiments inappropriate and 
inadequate. 

It is also false to assume that fatigue developed 
in one kind of performance may be measured by 
using an altogether different performance as a test. 

The concern with fatigue as distinct from work 
output arises most directly and forcefully in medi- 
cal practice and is even encountered in athletics. 
Physicians are being constantly confronted with 
patients who complain of tiredness, sometimes in 
connection with convalescence from debilitating 
disease or simply as a syndrome of its own (Alvarez 
1941; Harms & Soniat 1952; Muncie 1941). On 
the other hand, work output is generally the main 
concern in industry (Dill et al. 1936; Edwards et 
al. 1935). Fatigue is often a term incorrectly 
inferred when an individual notices certain symp- 
toms. In this case, what is called fatigue is self- 
diagnosis rather than something that arises as a 
directly felt inability to perform. Thus, a patient 
may tell an ophthalmologist that he has visual 
fatigue, inferring this (a self-diagnosis) from the 
fact that after reading awhile what he sees becomes 
blurred. This is different from his reporting 
how he feels. Such a usage of fatigue is to be 
avoided if the language on fatigue is to be con- 
sistent. 

Finally, according to the conventional view, 
there are several kinds of fatigue. The more basic 
categorization distinguishes objective fatigue (work 
decrement); subjective fatigue (the experience of 
tiredness); and physiological fatigue (change or 
reduction in a specific body process). 

Although there are various additionally specified 
kinds of fatigue, when fatigue exists it is essen- 
tially the same in all cases. The core of fatigue is 
the self-realization of relative inability to carry on. 
It is a negative orientation toward a task demand. 
Although the stance toward demand is always essentially 
the same, various fringe details (various 
sensory and other components) may vary from 
case to case, but they are mere details in the unique 
over-all syndrome that is distinguishable as fatigue. 

For example, fatigue arising in intellectual tasks 
should not be called mental fatigue, if by that one 
means a special kind of fatigue. Mental fatigue is 
only the fatigue that arises in intellectual (mental) 
tasks. Despite the fact that many kinds of tasks 
may lead to fatigue, no implication that there are 
many kinds of fatigue should be made. 

So many loosely related phenomena have been 
called fatigue that it has been suggested that the 
word be done away with entirely (Muscio 1921). 
Bartley and colleagues (Bartley 1943; 1951; 1957; 
Bartley & Chute 1945; 1947) have chosen rather 
to retain the word in its original meaning (i.e., for 
the phenomena first labeled “tiredness”) and re- 
name other phenomena associated with it. 

Problems in studying fatigue. Many studies 
that are meant to be studies of fatigue begin with 
questions about the experience of tiredness and 
inability to continue performance and end up by 
bypassing these questions and by providing infor- 
mation about other problems. For example, an 
investigator may note that workers in a plant feel 
sluggish and ineffective during mid-morning and 
mid-afternoon. He may suppose that this has some- 
thing to do with blood sugar level. In his study he 
finds that at these times blood sugar level is low- 
est. He may then try a plan whereby the workers 
take in the same total amount of food per day but 
eat varying numbers of meals. Some eat two, some 
three, some four, and some even five meals per 
day. Blood sugar level is then tested periodically 
throughout the day. It is found that where many 
meals are eaten the blood sugar level never falls 
as low at any part of the work day as it does when 
few meals are eaten. At this point, the investigator 
assumes his study is completed. He should continue
and determine whether the feeling of sluggishness
is eliminated or greatly reduced by redistributing
food intake into a greater number of
meals, with the total intake held constant. It is 
only when he has performed this additional step 
that the original questions of sluggishness and 
inefficiency receive an answer. It is only then that 
the investigation becomes a fatigue investigation. 

One of the persistent problems in dealing with 
fatigue is its measurement. It is always expected 
that if an entity exists it can be measured. The 
definition of fatigue as a relation to demand in 
which the individual feels his own inability to go
on makes easy measurement unattainable. When
work output is all that is to be measured, such
measurement is easily made. However, the problem
of measuring fatigue as defined here is no
different from the problem of measuring anxiety, 
depression, boredom, hope, anger, or a virtually 
uncountable number of other states of the organ- 
ism-as-a-person. Thus, the concept of fatigue 
should not be singled out for criticism because it 
refers to a phenomenon difficult to quantify. Never- 
theless, such criticism has been leveled, at times, 
against what is conventionally called “subjective 
fatigue,” for that reason. 

The first point to be recognized is that the phe- 
nomenon in question must exist in order to be 
measured or dealt with in any way. The first step 
in studying it is to produce it. The second is to 
manipulate conditions under which it occurs. The 
next point to realize is that the person fatigued 
is the only one able to state when it exists. The 
observations made by all others have to do with 
overt performance, body process, or both, which 
are not fatigue although they may be closely re- 
lated to it in some way. It is also to be recognized 
that the problem of quantitatively dealing with 
fatigue involves either developing a fatigue scale 
or choosing a fatigue point, such as the point at 
which the performer is no longer able to continue. 

It will become quickly evident that in most cases 
the crucial conditions that lead to fatigue lie within
the subject himself rather than in the external
conditions that produce the demand. Whatever is done
to study fatigue, the organism-centered approach 
is necessary. In fact, it is necessary in trying to 
understand work output as well. 

In certain ways, the phenomena that are per- 
sonalistic rather than environmental can be studied 
more concretely than might be supposed. For ex- 
ample, it certainly is possible to manipulate goals 
(which are personalistic phenomena). In setting a 
task for a subject one can tell him something that 
will indicate the amount of time required to per- 
form the task. One can then vary the amount of 
time so indicated to see what remoteness of goals 
has to do with the self-assessments of the subjects 
at various moments during performance and 
whether this variable has anything to do with the 
point at which “exhaustion” is reached. Other such 
dimensions and their body-process concomitants 
can also be chosen. 

Another problem in studying fatigue has to do 
with how far various subjects will push themselves
either in their use of energy resources or
in experiencing bodily discomfort. This problem is 
related to motivation, and its solution may actually 
be a way of dealing with or measuring motivation. 
This problem is encountered not only in psychological
experimentation but also in the physiological
treadmill experiments. Some subjects “reach
their limits” in muscular exertion before oxygen 
debt occurs, while others do not. No physiologist 
would expect that a subject would be exhausted 
before oxygen debt appears. These results illustrate 
the difference in the amount that individuals will 
push themselves. Thus the problem of subjectivity 
is as inherent in many phases of physiology as in 
psychology. Hence it cannot be disposed of by 
assuming that it occurs only in nonbehavioristic 
types of research. 

Relieving fatigue. Various drugs and other sub- 
stances have been tried by investigators to relieve 
fatigue and improve performance in athletic activi- 
ties (Karpovich 1959; Smith & Beecher 1959). 

Fatigue may often be relieved by inducing a 
change of attitude or attention or by shifting from 
one task to another rather than by rest (inactiv- 
ity). Supposedly the needed change involves reor- 
ganization within the individual rather than a 
replenishment of energy. 


Fatigue and the syndrome called stress by Selye 
(1956) have a lot in common, particularly certain 
factors of origin. Selye’s discoveries and concepts 
go a long way toward providing a bodily basis for 
human incapacity, and when this is coupled with 
a monistic psychology, which attempts to under- 
stand human behavior, considerable progress seems 
to be possible. 

S. HOWARD BARTLEY 


[Other relevant material may be found in ACHIEVE- 
MENT MOTIVATION; INFANCY, article on THE EFFECTS 
OF EARLY EXPERIENCE; SLEEP; STRESS.]


FEBVRE, LUCIEN


Lucien Febvre (1878-1956) was born in Nancy, 
the capital of Lorraine, where his father was, at 
that time, teaching at the university. Both his 
father and his mother, however, came from 
Franche-Comté, and it was to this province that 
Febvre was deeply attached. He spent all his
vacations there; he taught at the lycée at Besançon
from 1907 to 1911; he was always interested in
the life, the scenery, and the people of Franche- 
Comté; and he died there and was buried at Saint- 
Amour, on the border of Bresse and the Jura. Just 
as Michelet, to whom Febvre liked to compare 
himself, was a child of Paris, so Febvre might well 
be called a “peasant” of Franche-Comté—this, in- 
deed, is what his friend, the novelist Léon Werth, 
called him. 

Febvre’s studies took him from the lycée at 
Nancy to the faculty of letters there, then to the 
Lycée Louis-le-Grand in Paris, and finally, from 
1899 to 1902, to the École Normale Supérieure,
from which he was graduated as agrégé in history 
and geography. His closest and most influential 
friend during these formative years was Henri 
Wallon, philosopher, psychologist, and physician, 
who later became his colleague at the Collège de
France.

Febvre attained full intellectual maturity very 
early, well before 1911 when he defended his 
thesis at the Sorbonne. This brilliant thesis, 
Philippe II et la Franche-Comté (1911a), is a broad 
historical and geographical, as well as economic 
and social, study of this province in the second half 
of the sixteenth century. The work may be compared
to the volumes of the Histoire de Belgique
that Henri Pirenne was then writing: while the
Franche-Comté “route” was certainly not as im- 
portant as the “crossroads” of the Low Countries, 
Febvre’s study is more profound and more erudite 
than Pirenne’s and, since it is in the mainstream 
of historiography, has not dated as much. 

At the École Normale Febvre came into contact 
with Paul Vidal de la Blache, who had established 
geography as an autonomous discipline in France, 
and with Lucien Gallois, another geographer. From 
the beginning of his scholarly life Febvre, in effect, 
had two professions, history and geography. His 
enthusiasm for geography was great, as is shown 
by the geographical preface to his thesis, by his 
numerous reviews of geographical books, and by 
his masterly work, A Geographical Introduction to 
History (1922). This volume is a long, vigorous, 
and brilliant argument against narrow determin- 
ism and in favor of a subtle possibilism that en- 
compasses human evolution and history. 

Geography was his first love, but he soon gained 
familiarity with all the sciences of man by spending
time in the office of the new Revue de synthèse
historique, founded in 1900 by Henri Berr and 
edited by him. Berr’s editing communicated his 
own outgoing spirit: he tried to bring together in 
a kind of continuous conversation all the different 
kinds of history—intellectual, cultural, social, eco- 
nomic, institutional, political, and so forth—and 
the new human sciences, especially sociology. Dur- 
ing his stay in Paris at the Fondation Thiers, from 
1903 to 1906, Febvre established close ties with 
Berr and his group, soon becoming a key contributor
to the Revue de synthèse and later to the other
undertakings that arose out of it: the Semaines
de synthèses and the vast series, L’évolution de
l’humanité. Thus, when Febvre wrote in 1911 on
the Franche-Comté, he was already something of 
a historian, a geographer, a sociologist, an econ- 
omist, a psychologist, and a linguist. 

Given this background, there is nothing surpris- 
ing either in the appearance in 1922 of his Geo- 
graphical Introduction to History, a book which 
had a tremendous impact on the small world of 
French historians and geographers, or in the found- 
ing in 1929 of the Annales d'histoire économique 
et sociale, which marked a decisive turning point 
in French historiography. It may seem that these 
achievements came relatively late in Febvre’s 
career, but for five solid years he had been involved 
in World War I, always in the front line, ending
up as captain of a machine gun company. In 1912 
he had been appointed professor in the faculty of 
letters at Dijon and in 1919 he moved to the Uni- 
versity of Strasbourg. There he and Marc Bloch 
became close friends, and it was with Bloch that
he was to found, edit, and, indeed, to a large extent
write the Annales. 

The effect of the Annales was to establish a kind 
of hegemony of history over the other human 
sciences: while in general barriers were eliminated 
between disciplines, these disciplines were not 
considered equal in importance, history being accorded
a preferred position, especially vis-à-vis
social psychology and sociology and even more so 
vis-à-vis economics. This intellectual position might 
be labeled “historicism” if this term did not have 
such pejorative connotations and particularly if it 
were not so easily confused with the Historismus 
of German thinkers. The essential point is to make 
clear the differences between the Revue de synthèse,
whose theme was the colloquy of the human
sciences, and the Annales, which constituted a sort 
of Common Market, with history as the prepon- 
derant power. 

Yet great as is Febvre’s contribution in the 
Annales, it does not represent the true scope of his 
intellectual role. The key to this scope is his gen- 
erosity, his deep-felt need to share his knowledge. 
Just as Paul Langevin (1872-1946) was the “banker”
for the physicists of his generation, that is, the
lender and disseminator of ideas, as Diderot had
been for many of the writers of his time, so also
was Febvre a banker: he never became tired of
having people come in to see him, of listening, of 
guiding. When conversation did not suffice he 
wrote what he called his “letters of guidance.” It 
was in this way that he trained Marc Bloch, his 
junior by eight years, and the present author as 
well. Often his “guidance” consisted of simply sur- 
rendering his own projects to others, especially 
after 1929, when work on the Annales became his 
principal activity. Himself a “peasant” and a mar- 
velous historian of the land, he turned over this 
subject to Bloch, who produced Les caractères
originaux de l’histoire rurale française (1931);
later, after 1946, he turned over to the present 
author the task of establishing relationships be- 
tween historical and economic studies. His own 
relatively brief writings on the work of François
Simiand, Earl J. Hamilton, Giuseppe Parenti, 
Frederic C. Lane, and others, show how receptive 
he was to new ideas in economics. 

Although Febvre thus abandoned some of his 
favorite topics to others, he does seem to have 
found, outside his primary field of intellectual 
activity, even outside the Annales, a kind of sub- 
ject that delighted him: the history of art, of re- 
ligion, or even, in the broad sense, of culture. 
History was one for him and whatever his particu- 
lar concern, he surveyed the entire landscape. In 
1928 he published Martin Luther: A Destiny, a
book that he clearly enjoyed writing and that is
written beautifully. Luther was not for Febvre 
simply a problem in biographical research, nor 
even a difficult subject made attractive by its very 
difficulty; Luther represented the problem of the 
unique individual in history and of the unpredict- 
able power of such an individual. 

All his life, indeed, Febvre liked to consort with 
the great minds of “his” century, the sixteenth. He 
was wonderfully well acquainted with them; to 
hear him read Rabelais was a pleasure. His ex- 
cellent humanist education (which he received 
from his father and from an uncle rather than in 
school) explains his delight in writing his charm- 
ing little book, Autour de l'Heptaméron (1944), 
or that masterpiece of classical erudition, Origène
et Des Périers (1942a). 

Increasingly, he fought for one kind of history 
while writing rather a different kind himself. In 
1942, with the publication of Le problème de
l'incroyance au XVIe siècle: La religion de Rabelais
(1942b), he seems to have emerged victoriously 
from this conflict: this time he had all the reins 
in his hand, and the book is his best one. In the 
first two parts of the work he treated Rabelais in 
the traditional way, using evidence from his life 
and his works, but he devoted the third part to 
the “mental apparatus” of the period—the words, 
the feelings, the concepts that are the infrastruc- 
ture of the thought of the century, the basis on 
which everything was constructed or could be con- 
structed, and which may have prevented certain 
things from being constructed. 

Although the book was much praised, its origin- 
ality limited its appreciation by historians. Febvre’s 
work was ahead of its time, and it is only recently 
that the structuralists of the new literary criticism 
(for example, Michel Foucault, in his Les mots et 
les choses: Une archéologie des sciences humaines, 
1966) have done similar research into the culture 
of a society. 

Yet Febvre did not pursue his ideas, and his 
work remained incomplete. As he worked in the 
area of psychological, social, and cultural history, 
he seems to have had difficulty in stating problems, 
fixing bench marks, and drafting a methodology. 
Thus he was always uncertain about the nature of 
civilization, to use that convenient term. Often he 
insisted that civilization must be located in a par- 
ticular time, that its changeful nature must be 
pinned down, and he protested against the fre- 
quent assertion that “man is always the same.” 
But at the same time his Rabelais presupposes 
the long continuation of what he aptly called the 
mental apparatus of men, something beyond the 
individual or the unique. He might have stressed
this transcendence more had he not been excessively
impressed, as laymen tend to be, by the
implications of the concept of relativity as formulated
by Einstein. He was tormented by the perennial
problem of the objective status of the
“observer” (the historian) and of the events being
observed (history).

He did not doubt, however, that history provides 
an understanding of one’s own time and that such 
an understanding is indispensable to the historian. 
This is why to the very end of his life Febvre never 
left the intellectual battleground. It is no accident 
that he entitled the first volume of his collected 
articles Combats pour l'histoire (1953).

In 1947 he created the section of economic and 
social sciences of the École Pratique des Hautes
Études, serving as chairman of this section until
his death. He also revived the Annales, whose pub- 
lication had been hampered by the war; he even 
tried, without any real success, to revive the 
project of publishing the Encyclopédie française,
begun in 1933 by Anatole de Monzie and himself. 
And once again he did whatever he could to help 
other scholars. Really to appreciate the intellectual 
force of Febvre one must read not only his books 
but also his many articles and reviews and his 
marvelous letters. Those who were privileged to 
enjoy his informal conversation are certain that he 
is among the greatest of modern French historians. 


FERNAND BRAUDEL 


[See also HISTORY; and the biographies of BLOCH;
PIRENNE; VIDAL DE LA BLACHE.]


FECHNER, GUSTAV THEODOR


Gustav Theodor Fechner (1801-1887) is considered
to be the founder of psychophysics and
thus of experimental psychology as a whole. He 
was also the founder of experimental aesthetics. 
Fechner was born in Lower Lusatia, then part of 
German northern Silesia. His father and both of 
his grandfathers were clergymen. Fechner studied 
medicine first at the University of Dresden and 
then at the University of Leipzig, obtaining his 
degree in 1822. His interests turned more and 
more to physics, however, and he was appointed 
instructor of natural philosophy at Leipzig the fol- 
lowing year. During his years as a student, the 
lectures of Lorenz Oken on natural philosophy and 
of E. H. Weber on physiology made the most last- 
ing impressions on Fechner. Oken is now forgotten, 
but he was then a renowned romantic natural 
philosopher and cured Fechner of his “man, the 
machine” approach. Fechner hailed Weber, the 
discoverer of “Weber’s law,” as the real father of 
psychophysics. [See WEBER, ERNST HEINRICH.]

After his appointment to the university faculty, 
Fechner began experimental studies of physical 
and chemical problems and supported himself by 
translating, revising, and publishing reference 
works and textbooks on physics and chemistry. The 
sheer volume of the work he did during these years 
is extraordinary. In addition to 28 investigations 
of his own, published between 1827 and 1840, he 
annually turned out 1,500 to 2,000 printed pages 
of textbooks and reference works between 1822 
and 1838. He even found time to write several 
literary pieces, which he published under the pseu- 
donym of “Dr. Mises.” 

Fechner was married in 1833, and in 1834 he 
was appointed professor of physics at Leipzig. But 
he occupied the chair only until 1839; after repeated
attacks of severe exhaustion he was, for
three years, completely incapacitated by a mysteri- 
ous illness. The major symptoms were disturb- 
ances of vision, with hypersensitivity to bright 
light, sporadic total failure of digestion, obsessions, 
and, finally, more and more terrifying hallucina- 
tions. He recovered quite suddenly. It seems most 
likely that the illness was an atypical form of 
schizophrenia. 

The initial impetus for the work on which 
Fechner’s fame is based doubtless came from 
Oken, who fascinated him so much that at first 
he was resolved to emulate Oken’s unrestrained 
philosophizing. However, Fechner’s empirical re- 
search in physics and physiology saved him, and 
in the guise of “Dr. Mises,” he himself participated 
in mocking this kind of romantic inventing of con- 
cepts, which appears to prove everything, while 
actually it proves nothing. Oken continued to exert 
some influence on Fechner, however, and later he 
returned to this kind of thinking without surren- 
dering to it completely. 


Psychophysics 

The problem that concerned Fechner most was 
the connection between body and mind. His Ele- 
mente der Psychophysik (1860) aims at utilizing 
experimental procedures, such as those employed 
by Weber previously, to explore this connection 
more precisely, arriving, if possible, at mathemati- 
cally formulated laws. Earlier, Johann Friedrich 
Herbart had demanded that psychological laws be 
formulated in mathematical terms, but he de- 
spaired of the likelihood that psychology would 
ever obtain the necessary experimental data. Fech- 
ner saw in Weber's threshold experiments the pos- 
sibility of securing such data. He assumed that the 
difference between two sensations may be defined 
by the number of “just noticeable differences” 
(jnd) between them, and that these jnd’s can be 
represented mathematically. He used the number 
of jnd’s above the lower absolute threshold as a 
measure of the intensity of sensation evoked by a 
stimulus. In order to be able to describe the relationship
between psychic and physical quantities
mathematically, Fechner sought to establish a general
functional relationship between sensations and
stimuli of whatever magnitude. In the range within
which Weber’s law is valid (Weber’s law states
that jnd’s are proportional to the magnitude of the
stimuli), integration yields a function that describes
the intensity of a sensation in terms of the
logarithm of the stimuli, measured from the absolute
threshold.
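
The integration step can be written out as a short sketch. The notation is an assumption introduced here for illustration and is not taken from the article: I stands for stimulus magnitude, I_0 for the absolute threshold, S for the intensity of sensation, and c and k for constants.

```latex
% Weber's law: the just noticeable increment is proportional to the stimulus.
\Delta I = c\,I
% Fechner's assumption: every jnd adds the same small increment of sensation.
dS = k\,\frac{dI}{I}
% Integrating upward from the absolute threshold I_0, where S = 0:
S = k \int_{I_0}^{I} \frac{dI'}{I'} = k \ln\frac{I}{I_0}
```

On this reading, equal ratios of stimulus magnitude correspond to equal differences of sensation, which is the sense in which sensation follows the logarithm of the stimulus.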

Since the acuity of the sensory apparatus fluctuates
from instant to instant and from person to
person, thresholds can be defined only statistically. 
Fechner perfected the experimental strategies that 
had earlier been used by Weber and by Vierordt, 
as well as by such astronomers and physicists as 
Lambert, Steinheil, and Laugier, to determine the 
degree of uncertainty or the accuracy of measure- 
ment in comparisons of stimuli. According to the 
nature of variation of the stimulus in comparisons 
of successive sensations, he distinguished three 
kinds of psychophysical methods of data collec- 
tion: (1) the method of jnd’s, in which the dif- 
ference between the stimuli compared is gradually 
increased until it is perceptible or diminished until 
it is no longer perceptible; this is also called the 
method of limits; (2) the method of right and 
wrong cases, in which a standard stimulus is com- 
pared with randomly varied comparison stimuli; 
this method is also called the constancy method; 
and (3) the method of average error, in which a 
comparison stimulus is adjusted by the subject to 
correspond to a standard; this is also referred to 
as the reproduction method. 
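
The first of these methods can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not a reconstruction of Fechner's procedure: the function names, the integer scale of differences, and the perfectly consistent hypothetical observer are all invented here, and averaging the two transition points is only one common convention.

```python
def ascending_limit(perceives, differences):
    """Increase the difference step by step; return the first one perceived."""
    for diff in sorted(differences):
        if perceives(diff):
            return diff
    raise ValueError("difference was never perceived in this series")


def descending_limit(perceives, differences):
    """Decrease the difference step by step; return the first one not perceived."""
    for diff in sorted(differences, reverse=True):
        if not perceives(diff):
            return diff
    raise ValueError("difference never ceased to be perceived in this series")


def threshold_estimate(perceives, differences):
    """Average the transition points of one ascending and one descending series."""
    up = ascending_limit(perceives, differences)
    down = descending_limit(perceives, differences)
    return (up + down) / 2


# Hypothetical observer who notices any difference of 5 units or more.
observer = lambda diff: diff >= 5
estimate = threshold_estimate(observer, range(0, 11))
```

A real observer's responses fluctuate from trial to trial, so in practice many ascending and descending series would be run and their transition points averaged.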

In calculating threshold values, Fechner em- 
ployed the law of errors formulated by Gauss and 
Laplace, and in his posthumous Kollektivmass- 
lehre (1897) he set forth the applicability of the 
law of errors to many other problems in psychol- 
ogy. The Kollektivmasslehre also contains his 
contributions to correlation statistics. It is the first 
textbook of statistics to be designed especially for 
behavioral scientists. Using carefully planned ex- 
periments, whose layouts resemble those today 
used in the analysis of variance, he endeavored to 
exclude from the calculation of reaction varia- 
bility, which defines the threshold difference, 
those constant errors that arise from the particular 
features of the experimental procedure. 
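
The separation of a constant error from reaction variability can be illustrated with data of the kind produced by the method of average error. The following Python sketch is a simplified illustration, not Fechner's own computation: the function name and the sample settings are hypothetical, and the sample standard deviation is used here as one possible measure of variability.

```python
from statistics import mean, stdev


def average_error_summary(standard, adjustments):
    """Split adjustment errors into a constant part and a variable part."""
    # Signed error of each setting relative to the standard stimulus.
    errors = [setting - standard for setting in adjustments]
    # Constant error: the systematic bias shared by all settings.
    constant_error = mean(errors)
    # Variable error: the scatter of the settings around that bias.
    variable_error = stdev(errors)
    return constant_error, variable_error


# Hypothetical settings made by a subject matching a standard of 100 units.
settings = [102, 104, 103, 101, 105]
constant_error, variable_error = average_error_summary(100, settings)
```

Only the variable part would enter an estimate of the threshold difference; the constant part reflects the experimental arrangement rather than the observer's sensitivity.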

Fechner realized that “psychophysics” (his coin- 
age) was an innovation of great importance: he 
called it a science “in the initial state of becoming,”
and his own work, for all its scope, merely
“a modest beginning of a beginning.” His work 
was soon taken up by some of the greatest sci- 
entists of his time. As far back as 1858, Ernst 
Mach and Hermann von Helmholtz had begun to 
experiment on their own, stimulated by Fechner’s 
preliminary report. Wilhelm Wundt followed in 
1862, A. W. Volkmann in 1864, H. Aubert and 
J. R. L. Delboeuf in 1865, and J. Bernstein and 
Vierordt in 1868. Of the outstanding specialists in 
the field of psychology only one remained a skep- 
tic: William James, who as late as 1890 declared 
that despite all its acumen and all its care, the 
psychological yield of the Elemente was “just
nothing,” not important enough to merit mention in
even a footnote. 

The methods of measurement developed by 
Fechner are now generally adopted in quantitative 
experimental psychology, although some of the 
mathematical implications of his derivations are 
open to serious criticism (Luce & Galanter 1963). 
They are employed with equal success in all sorts 
of fields and for the most diverse problems. In 
recent years, the problem of psychological scaling, 
which Fechner was the first to appreciate and deal 
with, has again come to the forefront; it has been 
remarkably clarified and given new depth, espe- 
cially in the work of S. S. Stevens and G. Ekman 
(for example, Stevens 1934). 

Fechner is remembered almost exclusively as a 
methodologist. The fundamental theoretical postu- 
lates he presented in his psychophysics have gone 
unnoticed and are generally attributed to later 
authors. Most noteworthy is his working hypothesis 
of “concrete parallelism,” which assumes an iso- 
morphism between phenomena of consciousness 
and “psychophysical phenomena,” that is, those 
processes occurring in the cerebrum that are directly
associated with the phenomena of consciousness.
This assumption and its elaboration are usually
attributed to Ewald Hering, Georg Elias
Müller, Max Wertheimer, and, above all, Wolfgang 
Köhler, although it was clearly formulated in Vol- 
ume 2 of Fechner’s Elemente. 


Aesthetics 


Following his work in psychophysics, Fechner 
ventured also to transform the field of aesthetics 
from speculation to exact factual research. He 
summarized the results of his efforts in his Vor- 
schule der Aesthetik (1876). He tried to derive 
the conditions that determine what is pleasing and 
what is displeasing, not from a higher ideal of 
beauty but from below, by systematic empirical 
comparison of the beautiful with the less beautiful. 
This endeavor was not as successful as his work 
in psychophysics. Fechner was the victim of a 
then prevalent confusion: to him, as to his con- 
temporaries, “from below” meant not only “from 
perception to concept,” but also “from the simple 
to the more complex,” and the properties of the 
complex, it was then believed, are those of its 
simple constituents. The study by Christian von 
Ehrenfels, who proved that this postulate is false— 
and thereby not only opened up a new era in psy- 
chology but also made possible a more relevant 
theory of beauty—did not appear until three years 
after Fechner’s death. 

Fechner did, however, conduct a study, in line
with his attempt to establish an experimental
science of aesthetics—which makes him one of 
the earliest forerunners of modern opinion re- 
search. Two nearly identical versions of a Ma- 
donna reputed to have been painted by Hans Hol- 
bein were then being exhibited in Dresden, and 
Fechner asked the visitors to the exhibition to note 
on a sheet of paper which of the two they thought 
was more beautiful. At that time the public was 
unfortunately not accustomed to such question- 
naires, and only 116 of the 11,000 visitors took 
the trouble to write their comments. Of these 116, 
some were art critics who had previously formed 
judgments, while others failed to follow instruc- 
tions, thus rendering the results useless. Nonethe- 
less, the experiment made a contribution to meth- 
odology that might have become as influential as 
psychophysics, had its significance been understood. 


Analysis of systems 


Around 1850, Fechner tackled another funda- 
mental problem, that of the apparent finality of 
particular processes in animate and inanimate na- 
ture. In contrast to the neovitalists, he insisted 
that this finality be explained within the frame- 
work of the principle of causality. He laid down 
the axiom that every system in nature that is de- 
limited with respect to its environment and hence 
is more or less closed, as well as every relatively 
autonomous part of such a system, tends, after 
a longer or shorter period of dislocation, to return 
to its previous state, and he specifically speaks of 
a “tendency toward stability.” From this there is 
a direct road, via Ernst Mach’s observation that 
the stable states of comparatively closed struc- 
tures are formally distinct, to the concepts of 
equilibrium of Wolfgang Köhler and Ludwig von 
Bertalanffy. As Fechner used the concept of “sta- 
bility,” it was broader than it is today; it included 
not only what he called “simple stability” but also 
“complex stability,” examples of which he found 
primarily in organisms. These examples are identical
with Köhler’s “stationary states” and
Bertalanffy’s “steady states.” Thus, Fechner’s works
contain a surprising number of stimulating ideas
which are still influential, though they may some- 
times appear under different names [see PERSON- 
ALITY: CONTEMPORARY VIEWPOINTS; SYSTEMS 
ANALYSIS, article on PSYCHOLOGICAL SYSTEMS]. 


WOLFGANG METZGER 


[For the historical context of Fechner’s work, see the
biographies of GAUSS; LAPLACE; WEBER, E. H.
For discussion of the subsequent development of his
ideas, see AESTHETICS; PSYCHOPHYSICS; QUANTAL
RESPONSE; SCALING; and the biographies of HELMHOLTZ;
THURSTONE; WUNDT.]


FECUNDITY
See FERTILITY. 


FEDERALISM 


Federalism and its kindred terms (e.g., “federal”)
are used, most broadly, to describe the mode of
political organization which unites separate polities 
within an overarching political system so as to 
allow each to maintain its fundamental political 
integrity. Federal systems do this by distributing 
power among general and constituent governments 
in a manner designed to protect the existence and 
authority of all the governments. By requiring that
basic policies be made and implemented through
negotiation in some form, federalism enables all to
share in the system’s decision-making and
decision-executing processes.


Different conceptions 


No single definition of federalism has proved 
satisfactory to all students, primarily because of 
the difficulties in relating theoretical formulations 
to the evidence gathered from observing the actual 
operation of federal systems. Attempts at definition 
have also foundered on the problems of distinguish- 
ing between (1) the federal principle as a broad 
social concept and federalism as a narrower politi- 
cal device; (2) two classic but different conceptions 
of federalism; (3) authentically federal systems 
and political systems which utilize elements of the 
federal principle; (4) mature and emergent federal 
systems; and (5) federalism and “intergovern- 
mental relations” as distinct political phenomena. 

Social and political principle. Federalism, con- 
ceived in the broadest social sense, looks to the 
linkage of people and institutions by mutual consent,
without the sacrifice of their individual identities,
as the ideal form of social organization. First
formulated in the covenant theories of the Bible 
(Kaufman 1937-1948), this conception of feder- 
alism was revived by the Bible-centered “federal” 
theologians of seventeenth-century Britain and New 
England (Miller 1939), who coined the term
“federal” (derived from the Latin foedus, “covenant”)
in 1645 to describe the system of holy and
enduring covenants between God and man which
lay at the foundation of their world view. This 
conception of federalism was given new theoretical 
form by nineteenth-century French and German 
social theorists. Closely related to the various 
theories of social contract, it is characterized by 
the desire to build society on the basis of coordina- 
tive rather than subordinative relationships and 
by the emphasis on partnership among parties with 
equal claims to legitimacy who seek to cultivate 
their diverse integrities within a common social 
order (Boehm 1931). [See SOCIAL CONTRACT.] 

As a political device, federalism can be viewed 
more narrowly as a kind of political order animated 
by political principles that emphasize the primacy 
of bargaining and negotiated coordination among 
several power centers as a prelude to the exercise 
of power within a single political system, and stress 
the value of dispersed power centers as a means 
for safeguarding individual and local liberties. This 
means, in effect, that political institutions common 
to different political systems, when combined with- 
in a federal system and animated by federal prin- 
ciples, are effectively endowed with a distinctive 
character. For example, while political parties are 
common in modern political systems, parties ani- 
mated by the federal principle show unique char- 
acteristics of fragmentation and lack of central 
discipline that increase the power of local groups 
within the system as a whole (Grodzins 1960a). 

Federation and confederation. Federal ideas 
have been systematically conceptualized in two dif- 
ferent ways. On the one hand, federalism has been 
conceived as a means to unite a people already 
linked by bonds of nationality through distribution 
of political power among the nation’s constituent 
units. In such cases, the polities that constitute the 
federal system are unalterably parts of the national 
whole, and federalism invariably leads to the de- 
velopment of a strong national government operat- 
ing in direct contact with the people it serves, just 
as the constituent governments do. On the other 
hand, federalism has also been conceived as a 
means to unify diverse peoples for important but 
limited purposes, without disrupting their primary 
ties to the individual polities that constitute the 
federal system. In such cases the federal govern- 
ment is generally limited in its scope and powers, 
functioning through constituent governments which 
retain their plenary autonomy, and, to a substantial 
degree, is dependent upon them. 

Both conceptions of federalism have evolved 
from early federal experiments. The principles of 
strong national federalism were first applied by the 
ancient Israelites, beginning in the thirteenth century
B.C., to maintain their national unity through
linking their several tribes under a single national 
constitution and at least quasi-federal political in- 
stitutions (Bright 1959). Several centuries later, 
the Greek city-states experimented with federal-style 
institutions as means for the promotion of intra- 
national harmony and cooperation, primarily for 
defensive purposes, through associations (e.g., the 
Achaean League) that came close to what were 
later defined as confederations (Freeman 1863). A 
modified form of the Greek view was developed by 
the sixteenth-century theorists (Gierke 1913). They 
held that federalism meant a permanent league of 
states united through a perpetual covenant, bind- 
ing under international law, in which the constit- 
uent states delegated enumerated powers to a 
general government while retaining full rights of 
internal sovereignty. 

However, when the American system—the proto- 
type of modern federal systems—emerged in the 
late eighteenth century, its architects developed a 
conception of federalism much like that of ancient 
Israel. From the first, American federalism func- 
tioned to serve a people with a single national 
identity and was constituted with a strong national 
government to serve that people on a national basis, 
though, as late as 1789, The Federalist could describe
the new American constitution as “partly
national and partly federal” in deference to the 
then-accepted views. The successful efforts of the 
supporters of that constitution to appropriate 
the term “federalist” for their own use (Main 1961, 
pp. ix-xi) restored to common usage the older con- 
ception of federalism as a noncentralized national 
union bound by municipal law, with a general government
superior to the governments of the constituent
states (Diamond 1963).

Just as the American system became the proto- 
type for other modern federal systems, so the Ameri- 
can conception of federalism became the generally 
accepted one. The other conception was ultimately 
subsumed under the word “confederation” and its 
kindred terms. The two systems described by these 
different conceptions reflect, in part, the distinc- 
tions implied in the German Staatenbund (con- 
federation) and Bundesstaat (federation), terms 
developed in the mid-nineteenth century (Mogi 
1931). A certain degree of confusion remains be- 
cause the terms invented to describe both systems 
were used indiscriminately for many years. 

Though the American conception of federalism 
is today almost universally accepted as the most 
accurate usage, the confederal conception remains 
a living and legitimate aspect of the federal idea in 
its largest political sense. Today, the latter is most
prominent among certain advocates of limited European
union (the Common Market exemplifies a
confederal form) and among many so-called world 
federalists. [See INTERNATIONAL INTEGRATION.]

Federalism and related systems. Federal sys- 
tems are often confused with four other forms of 
political order which make use of specific federal 
principles. The use of some federal principles in 
multiple monarchies, legislative unions, empires, 
and decentralized unitary systems can have im- 
portant consequences similar to those in authen- 
tically federal systems. But the fact that such 
principles do not permeate the four systems makes 
the distinctions between them and true federations 
extremely important. 

Federal systems differ from multiple (or dual) 
monarchies in two essential ways. The central con- 
stitutional characteristic of the multiple monarchy 
is that union exists only in the person of the 
sovereign and is maintained only through the ex- 
ercise of executive power in his name. No signifi- 
cant common institutions exist to unite the 
constituent polities—no common legislatures, no 
common legal system, and little in the way of a 
common political substructure. On the contrary, 
each constituent polity maintains its own political 
system, which the monarch guarantees to support 
under the terms of his compact with the realm. 
Multiple monarchies have historically been less 
than democratic regimes. Even where there have 
been tendencies toward democratization, the very 
fact that union exists only by virtue of the common 
sovereign has tended to elevate the position of the 
monarch to one of real power. Attempts to transfer 
sovereignty or the attributes of sovereignty else- 
where by their very nature stimulate the division 
of this kind of association of civil societies into 
separate polities. Thus, the Austro-Hungarian Em- 
pire was held together by the Hapsburg emperors 
and disintegrated when that family ceased to rule 
(Sharma 1953, chapter 7). The dual monarchy of 
Sweden and Norway ceased to function when dem- 
ocratic government was introduced, transferring 
the attributes of sovereignty from the monarch to 
the nation(s). In Spain, on the other hand, the 
inability of the Spaniards to transform a multiple 
monarchy into a federal system, in a locale which 
by nature demanded peninsular union of some sort, 
led to the consolidation of the constituent polities 
into something approximating a unitary state which 
remained highly unstable because of the local bar- 
riers to consolidation that could neither be ac- 
commodated nor eradicated (Elliott 1964). [See 
MONARCHY.]

Multiple monarchies have been transformed into 
stable and unified polities through legislative
union. The United Kingdom is a case in point. The 
centrifugal tendencies of the seventeenth-century 
dual monarchy linking England and Scotland were 
finally eliminated through a legislative union of the 
two nations in 1707. Legislative union bears very 
close resemblance to federal union at several cru- 
cial points. Though designed to direct public al- 
legiance to a single national authority, the terms 
of the union encourage the political system to re- 
tain certain noncentralizing elements. The govern- 
ment of the nation remains national rather than 
central in character, since it is created by a per- 
petual covenant which guarantees the constituent 
parties their boundaries, representation in the na- 
tional legislature, and certain local autonomies, 
such as their own systems of municipal law. Legis- 
lative unions usually unite unequal polities. The 
centralizing tendencies induced by this are some- 
what counterbalanced by the residual desire for 
local self-government in the constituent states. 
Thus, in the United Kingdom the cabinet has ac- 
quired a supremacy not foreseen in 1707, but 
within the framework of cabinet government Scot- 
land has acquired a national ministry of its own 
with a separate administrative structure, based in 
Scotland, for most of its governmental programs 
(Milne 1957). 

Federal systems also differ from empires allow- 
ing cultural home rule. Such empires have often 
been termed federal—in some cases because they 
claim to be. The Roman Empire was the classic 
example of this kind of political system in the 
ancient world, and the Soviet Union may well be its 
classic modern counterpart. In both cases, highly 
centralized political authorities possessing a virtual 
monopoly of power decide, for reasons of policy, to 
allow local populations with different ethnic or cul- 
tural backgrounds to maintain a degree of cultural 
home rule, provided that they remain politically 
subservient to the imperial regime. While this often 
appears to offer a substantial degree of local au- 
tonomy, its political effects are purposely kept 
minimal. Any local efforts to transform cultural
home rule into political power are invariably met 
with suppressive force from the central government, 
even to the point of revoking cultural rights, as 
examples from the history of both empires reveal. 

Federal systems are clearly different from de- 
centralized unitary states, even though such states 
may allow local governments considerable auton- 
omy in some ways. In such states local powers are 
invariably restricted to local matters, as determined 
by the central authorities, and are subject to national
supervision, restriction, and even withdrawal,
though tradition may militate against precipitous
action by the central government in areas
where local privileges have been established. Still,
as the English experience has shown, even power- 
ful traditions supporting local autonomy have not 
stood in the way of great reconcentration of power 
by democratically elected parliaments when such 
action has been deemed necessary by a national 
majority.

Mature and emergent federal systems. Several 
recent studies (Macmahon 1955; Wheare 1946) 
have attempted to draw distinctions between ma- 
ture and emergent federal systems. The thrust of 
their argument is that federalism, when used to 
unify separate political systems to form a new 
nation, and federalism, as a form of decentralized 
government in an established nation, encourage 
markedly different kinds of political behavior. In 
the former case, federalism serves as a means to 
bring tenuous unity to nations composed of highly 
autonomous polities, with the locus of power re- 
maining among the constituent units. As federal 
systems mature, so the argument goes, power is 
increasingly concentrated at the center, and fed- 
eralism remains only to promote a certain amount 
of decentralization within an otherwise highly uni- 
fied political system. Wheare goes so far as to 
argue that federalism is a transitional phenomenon 
useful in promoting progressively larger polities 
which is then gradually discarded (in fact, if
not in form) as an unnecessary encumbrance. This 
argument may have some validity in describing the 
history of nonfederal political systems which have 
utilized federal principles to promote national unity. 
For example, it can be used to describe the evolution
of the United Kingdom into its present constitutional
state. It cannot be applied, however, to
any of the three exemplary federal systems— 
Canada, Switzerland, and the United States. Their 
national ties existed from the first, and their na- 
tional governments were granted broad powers at 
the outset. Nor has federalism declined in impor- 
tance as those nations have matured. 

There are undoubtedly differences between ma- 
ture and emergent federal systems, but those dif- 
ferences are more likely to relate to the character 
of conflict and negotiation between the general and 
constituent governments than to their relative 
strengths. 

Federalism and intergovernmental relations. Be- 
cause the study of federalism at its most im- 
mediately empirical level heavily stresses the study 
of intergovernmental relations, the two are often 
considered to be synonymous. Federalism, however, 
is something much more than the relationships between
governmental units, involving as it does
principles which are designed to establish the 
proper character of those relationships and which 
must also affect the character of other political 
institutions within federal systems. As already indicated,
federalism concerns the way in which federal
principles influence party and electoral systems
in federal polities just as much as it concerns
the way in which local governments relate to their 
regional or national ones, or to each other. More- 
over, the study of intergovernmental relations exists 
apart from the study of federalism, since such 
relationships are to be found in all political sys- 
tems, federal or otherwise, where there is more 
than one government extant within a given polity. 


Characteristics and operational principles 


The most useful way to attempt to understand 
federalism as a political phenomenon is to under- 
take a survey of the basic characteristics of federal 
systems, principles, and processes in order to un- 
derstand both the manner and the direction of 
their development. 

As a first step it seems necessary to identify the 
various federal systems that exist today or have 
existed in the past; only then can we analyze them 
as operating political systems. However, identifying 
federal systems is no simple matter, as we have 
just seen. The difficulties are heightened by the 
wide functional differences easily observed in the 
various political systems which call themselves 
federal and by the often greater operational similar- 
ities between self-styled “federal” and “unitary” 
systems. Contrast, for example, the political sys- 
tems of Australia and the Soviet Union, Canada 
and Mexico, Switzerland and Yugoslavia, or com- 
pare the United States and Great Britain. 

Moreover, federal systems have historically been 
marked by great internal distinctions between 
theory and practice, perhaps more so than other 
political systems. In the United States, the measure 
of the maintenance of federalism was long con- 
sidered to be the degree of separation of govern- 
ment activities by level, because it was generally 
believed that such separation actually existed. In 
fact, American federalism from the first had been 
characterized by extensive intergovernmental func- 
tional collaboration within the framework of sepa- 
rate governmental structures (Elazar 1962). Simi- 
larly, the Canadian federal system has always been 
described as one in which the federal government 
is clearly dominant—the repository of all powers 
not explicitly granted to the provinces. Yet since 
the brief period of federal supremacy in the years 
immediately following confederation, the provinces
have consistently gained power at federal expense
(Smiley 1965). The Russian federal constitution 
goes so far as to grant each Soviet republic the right 
of secession—a patent impossibility under the 
realities of the Russian political system. 

Nevertheless, some basic characteristics and 
operational principles common to all truly federal 
systems can be identified, and help us to define 
such systems. These may be divided into three
essential elements and a number of supplementary 
ones. 

Written constitution. First, the federal rela- 
tionship must be established or confirmed through 
a perpetual covenant of union, inevitably embodied 
in a written constitution that outlines, among other 
things, the terms by which power is divided or 
shared in the political system and which can be 
altered only by extraordinary procedures [see CON- 
STITUTIONS AND CONSTITUTIONALISM]. Every ex- 
isting federal nation possesses a written constitu- 
tion, as do most of the other nations incorporating 
elements of the federal principle. Juridically, fed- 
eral constitutions are distinctive in that they are 
not simply compacts between the rulers and the 
ruled but involve the people, the general govern- 
ment, and the polities constituting the federal 
union. Moreover, the constituent polities retain 
local constitution-making rights of their own. 

Noncentralization. The political system must 
reinforce the terms of the constitution through an 
actual diffusion of power among a number of sub- 
stantially self-sustaining centers that are generally 
coincident with the constituent polities established 
by the federal compact. Such a diffusion of power 
may be termed noncentralization. It differs from 
decentralization—the conditional diffusion of spe- 
cific powers to subordinate local governments by a 
central government, subject to recall by unilateral 
decision. It is also more than devolution—the spe- 
cial grant of powers to a subnational unit by a 
central government, not normally rescindable. Non- 
centralization ensures that no matter how certain 
powers may be shared by the general and constituent
governments at any point in time, the authority
to participate in exercising them cannot be
taken away from either without mutual consent. 
Constituent polities in federal systems are able to 
participate as partners in national governmental 
activities and to act unilaterally with a high degree 
of autonomy in areas constitutionally open to them 
—even on crucial questions and, to a degree, in 
opposition to national policies, because they possess 
effectively irrevocable powers. 

Areal division of power. A third element that 
appears to be essential in any federal system is 
the internal division of authority and power on an
areal basis (Maass 1959), what in the United 
States has been called territorial democracy. It is 
theoretically possible to create a federal system 
whose constituent units are fixed but not terri- 
torially based. There were premodern protofedera- 
tions of nomadic tribes, and some observers have 
seen federal elements in nations constitutionally 
structured to accommodate social and political di- 
visions along ethnic, religious, or even ideological 
lines. Nevertheless, no authentic federal system has 
existed without an areal basis for the federal di- 
vision. Historically, when areal divisions of power 
have given way to divisions on the basis of func- 
tional interest, federalism has been replaced by 
pluralism. In modern democratic theory the argu- 
ment between federalists and antifederalists has 
frequently revolved around the respective values of 
areal and functional diffusions of power. Theorists 
who have argued the obsolescence of federalism 
while endorsing the values used to justify its ex- 
istence have generally based their case on the 
superior utility of pluralism (Mogi 1931, pp. 1059-1115).
Proponents of the federal-areal division
argue that the deficiencies of territorial democracy 
are greatly overshadowed by the neutrality of areal 
representation of functional interests, and they 
argue further that any other system devised for 
giving power to these interests has proved unable 
to cope with the complexities and changes of in- 
terest endemic in a dynamic age while certainly 
limiting the advantages for local differentiation in- 
herent in the areal system. 

Studies of federal systems indicate the existence 
of other elements that supplement the three basic 
ones. While not all of them are present in
every federal system, their near universality leads 
to the conclusion that they serve important func- 
tions in the maintenance of federalism in each. 
Similarly, while many of them are found individ- 
ually in various kinds of political systems, it is 
their combination within a single system structured 
around the basic elements that is characteristic of 
federalism. 

Maintaining union. Generally characteristic of 
modern federal systems are direct lines of com- 
munication between the public and both the gen- 
eral and the constituent governments, which allow 
the public to exert direct influence on both gov- 
ernments and permit them to exercise direct au- 
thority over a common citizenry. The people may 
(and usually do) elect representatives to all gov- 
ernments which serve them. All of the governments 
may (and usually do) administer programs so as 
to serve the individual citizen directly. The courts 
may serve both levels of government, applying the
relevant laws directly. 

The existence of those direct lines of communica- 
tion—one of the major features distinguishing 
federations from leagues—is usually predicated on 
the existence of a sense of common nationality 
binding the constituent polities and peoples of fed- 
eral nations together, another element requisite 
for the maintenance of a successful federal system. 
In some countries this sense has been inherited, 
but in most it has had to be invented. Federalism 
in Germany has been based on a common sense 
of an inherited German nationhood. In the United 
States, Argentina, and Australia a sense of nation- 
hood had to be at least partly invented. National 
consciousness soon became second nature in those 
countries, since none of their constituent states 
ever had much more than a partially developed na- 
tional consciousness of its own. Canada, Switzer- 
land, and Yugoslavia have had to invent a sense 
of common nationality strong enough to embrace 
“nationality groups” whose intense national feelings 
are rooted in the constituent polities. In such newly 
formed federal systems as India, Malaysia, and 
Nigeria, the future of federalism is endangered by 
the absence of a common sense of nationality. Con- 
trary to some theories, federalism has not proved 
to be a particularly good device for integrating di- 
verse nationalities into a single political system un- 
less it has been accompanied by other factors com- 
pelling integration. 

Geographic necessity has been a major factor 
promoting the maintenance of union within federal 
systems, even in the face of strong pressures toward 
disunion. The Mississippi Valley in the United 
States, the Alps in Switzerland, the island character 
of the Australian continent, and the mountains and 
jungles surrounding Brazil have served as direct 
geographic influences promoting unity. More politi- 
cal than “natural,” but no less compelling geo- 
graphically, have been the pressures for Canadian 
union generated by that country’s neighbor to the 
south or for the federation of the German states 
generated by their neighbors to the east and west. 

Maintaining noncentralization. It has been well 
demonstrated that the constituent polities in a 
federal system must be fairly equal in population 
and wealth or at least balanced geographically or 
numerically in their inequalities, if noncentraliza- 
tion is to be maintained. The United States has 
been able to overcome its internal inequities be- 
cause each geographic section has included both 
great and small states. In Canada, the ethnic dif- 
ferences between the two largest provinces have 
served to inject balance into the system. The existence
of groups of cantons in different size categories
has helped maintain Swiss federalism. Similar
distributions exist in every other system whose
federal character is not in question. 

The existence of a large polity dominating 
smaller states with which it is nominally federated 
on equal terms has often been one of the major 
reasons for the failure of federalism. In the Ger- 
man federal empire of the late nineteenth century, 
Prussia was so obviously dominant that the other 
states had little opportunity to provide national 
leadership or even a reasonably strong hedge 
against the desires of its king and government. 
Similarly, even without the problem of the Com- 
munist party, the existence of the Russian Soviet 
Federal Socialist Republic, which occupies three- 
fourths of the area and contains three-fifths of the 
population of the Soviet Union, would have severely 
crippled the possibilities of maintaining authentic 
federal relationships in that country. 

Successful federal systems have also been char- 
acterized by the permanence of the boundaries of 
their constituent units. This does not mean that 
boundary changes cannot occur, but it does mean 
that as a matter of constitutional law such changes 
can be made only with the consent of the polities 
involved and that, as a matter of political policy, 
they are avoided except in the most extreme situa- 
tions. Boundary changes have occurred in the 
“classic” federal systems—the United States divided 
Virginia during the Civil War, Canada has en- 
larged the boundaries of its provinces, and Switzer- 
land has divided cantons—but they have been the 
exception rather than the rule, and in every case 
at least the formal consent of the constituent 
polities was given. Even in weaker federal systems, 
such as those of Latin America, state boundaries 
have tended to remain relatively secure. When 
boundary changes have been made, as in the post- 
war redrawing of Länder boundaries in West Ger- 
many to account for the diminished territory of 
the Federal Republic and the alteration of state 
lines to recognize linguistic unities in India, the 
essential heartlands of the polities involved have 
been preserved. 

In a few very important cases, noncentralization 
is both reflected and supported through the constitutionally
guaranteed existence of different systems
of law in the constituent polities. Though the
differences in those systems are likely to be some- 
what eroded over time—the extent of their pres- 
ervation varying from system to system—their 
continued existence as separate systems and the 
national mixture of laws which their existence promotes
act as great bulwarks against centralization
[see LEGAL SYSTEMS]. In the United States, each
state’s legal system stems directly and to a certain 
extent uniquely from English law, while federal 
law occupies only an interstitial position binding 
the systems of the fifty states together insofar as 
necessary. The resulting mixture of laws keeps the 
administration of justice, even in federal courts, 
substantially noncentralized (Macmahon 1955, 
chapter 11). In Canada, the existence of common 
law and civil law systems side by side is one con- 
stitutional guarantee of French-Canadian cultural 
survival. Noncentralized legal systems, a partic- 
ularly Anglo-American device, are often used in 
legislative as well as federal unions. They are rare 
in other political cultures and have become less 
common in all federal systems established since 
1900. More common is the provision for modifica- 
tion of national legal codes by the subnational 
governments to meet special local needs, as in 
Switzerland. 

The point is generally well taken that unless the 
constituent polities have substantial influence over 
the formal or informal amending process the fed- 
eral character of the system is open to question. 
Since many constitutional changes are made with- 
out recourse to formal constitutional amendment, 
the position of the constituent polities must be 
additionally protected by a constitution designed so 
that any serious changes in the political order can 
be made only by the decision of dispersed majori- 
ties which reflect the areal division of powers. This 
protection, which federal theorists have argued is 
important for popular government as well as for 
federalism (Diamond 1963), is a feature of the 
most truly federal systems. 

Noncentralization is strengthened in all federal 
systems by giving the constituent polities guaran- 
teed representation in the national legislature and, 
often, by giving them a guaranteed role in the na- 
tional political process. In some federal systems, 
notably those of the United States and Switzerland, 
the latter is guaranteed in the written constitution. 
In others, such as Canada and those in Latin 
America, certain powers of participation have been 
acquired and have become part of the traditional 
constitution. 

Recent studies have shown that the existence of 
a noncentralized party system is perhaps the most 
important single element in the maintenance of 
federal noncentralization (Macmahon 1955). Noncentralized
parties initially develop because of the
constitutional arrangements of the federal com- 
pact, but once they have come into existence, they 
tend to be self-perpetuating and to function as 
decentralizing forces in their own right. 
The United States and Canada provide two examples
of the different forms which can be assumed
by a noncentralized party system. In the United 
States, where party responsibility is minimal and 
virtually nonexistent on the national level, a two- 
party system has developed, with the parties 
actually coalitions of the several state or, in some 
cases, local party organizations functioning as na- 
tional units only for the quadrennial presidential 
elections or for purposes of organizing the national 
Congress. Party financing and decision making are 
functions which are dispersed either among the 
state organizations or among widely divergent factions
operating nationwide. In Canada, on the other
hand, the parliamentary form of government, with 
its concomitant requirement of party responsibility, 
means that at the national level considerably more 
party cohesiveness must be maintained simply in 
order to gain and hold power. 

The noncentralized party system in Canada has 
developed through a fragmentation of the parties 
along regional or provincial lines. The parties with 
nationwide bases are still divided internally along 
provincial lines, with each provincial organization 
autonomous. Individual provinces are frequently 
dominated by regional parties that send only a few 
representatives to the national legislature, adding 
to the fragmentation of the system. Very often, the 
party victorious in national elections is the one 
which is briefly able to expand its base to most 
nearly national proportions. 

European-style federal systems where parlia- 
mentary government is the norm follow the Cana- 
dian model. Australia and Switzerland come closest 
to paralleling it, and traces of it can be found in 
the German Federal Republic. A more centralized 
variation of the same pattern exists in countries 
like India, in which the national government is 
dominated by one very large and diffuse national 
party which is held together nationally by personal 
leadership but is quite factionalized in the states 
where it must share the governing power with other 
parties. 

Federal nations with less developed party sys- 
tems frequently gain some of the same decentral- 
izing effects through what the Latins call caudil- 
lismo—noncentralized personal leadership systems 
which diffuse power through strong local leaders 
operating in the constituent polities. Caudillistic 
noncentralization is most characteristic of Latin 
American federal systems but apparently exists in 
such new federations as Nigeria and Malaysia as 
well [see CAUDILLISMO]. 

The importance to federalism of a noncentral- 
ized party system is well illustrated by contrast 
with those formally federal nations dominated by 
one highly centralized party, such as the Soviet 
Union, Yugoslavia, and Mexico. In all three cases, 
the dominant party has operated to limit the power 
of the constituent polities in direct proportion to 
the extent of its dominance. 

Ultimately, however, noncentralization is main- 
tained to the extent that there is respect for the 
federal principle within each federal system. Such 
respect is necessarily reflected in the immediate 
recognition by the decision-making publics that the 
preservation of the constituent polities is as impor- 
tant as the preservation of the nation as a whole. 
In the words of the American Chief Justice Salmon 
P. Chase, federalism looks to “an indestructible 
Union, composed of indestructible States” (Texas 
v. White, 7 Wallace [1869]). This recognition may 
be based on loyalty to particular constituent polities 
or on an understanding of the role played by fed- 
eralism in animating the political system along 
certain unique lines. Thus, those who value govern- 
ment by conciliation and partnership, with em- 
phasis on local control, are likely to have respect 
for the federal principle. 

Citizens of a federal nation must show that 
respect in two ways, by showing self-restraint and 
by cultivating the political art of negotiation. Fed- 
eralism can exist only where there is considerable 
tolerance of diversity and willingness to take politi- 
cal action through conciliation even when the 
power to act unilaterally is available. The usual 
prerequisite to action in federal systems is the 
ability to build consensus rather than the power to 
threaten coercion. Western federal nations can 
furnish many examples of the exercise of national 
self-restraint in dealing with difficult federal prob- 
lems. Even in a federal system as centralized as 
that of India, the constitutional right of the na- 
tional government to assume control of the state 
governments is exercised as little as possible, 
notably when the communists win local elections, 
and is then clearly a temporary action. 

The historical record indicates that the dual pur- 
pose implied in Chase’s dictum has been at least as 
responsible for the creation of federal systems as 
has the single interest in political unification. The 
Canadian confederation came into being not only 
to create a new nation out of the British North 
American colonies but also to give Ontario and 
Quebec autonomous political systems of their own. 
Similarly, every move toward greater union in the 
Swiss confederation has been made in order to 
preserve the independence of the cantons from 
both outside encroachment and revolutionary cen- 
tralism (Sharma 1953, pp. 269-275). A good case 
can be made that similar motivations were impor- 
tant in the creation of Australia, Malaysia, Nigeria, 
and the United States. 

Maintaining the federal principle. Several of 
the devices commonly found in federal systems 
serve to maintain the federal principle per se and 
are consequently supportive of both the national 
government and the constituent polities. Two of 
these are particularly common and important. 

The maintenance of federalism requires that the 
nation and its constituent polities each have a sub- 
stantially complete set of governing institutions of 
their own with the right—within limits set by the 
compact—to modify those institutions unilaterally. 
Separate legislative and administrative institutions 
are both necessary. This does not necessarily mean 
that all governmental activities must be carried out 
by separate institutions at each level. It is possible 
for the agencies of one government to serve as 
agents of the other by mutual agreement. But each 
government must have the needed institutions to 
function independently in the areas of its authority 
and the structural resources to cooperate freely 
with the other government's counterpart agencies. 

In this regard, the contractual sharing of public 
responsibilities by all governments in the system 
appears to be a central characteristic of federalism. 
Sharing, broadly conceived, includes common in- 
volvement in policy making, financing, and admin- 
istration of government activities. In contemporary 
federal systems, it is characterized by extensive 
intergovernmental collaboration. Sharing can be 
based on highly formal arrangements or informal 
agreements. In federal systems, it is usually con- 
tractual in nature. The contract—politically a lim- 
ited expression of the compact principle—is used 
in formal arrangements as a legal device to enable 
governments responsible to separate polities to 
engage in joint action while remaining independ- 
ent entities. Even where government agencies co- 
operate without formally contracting to do so, the 
spirit of federalism that pervades ongoing federal 
systems tends to infuse the participating parties 
with a sense of contractual obligation. 

In any federal system, it is likely that there will 
be continued tension between the federal govern- 
ment and the constituent polities over the years 
and that different “balances” between them will 
develop at different times. The existence of this 
tension is an integral part of the federal relation- 
ship, and its character does much to determine the 
future of federalism in each system. The question 
of federal-state relations which it produces is 
perennially a matter of public concern because vir- 
tually all other political issues arising in a federal 
system are phrased in terms of their implications 
for federalism. In this way federalism imposes a 
way of looking at problems that stands apart from 
the substantive issues raised by the problems them- 
selves. This is particularly true of those issues 
which affect the very fabric of society. In the United 
States, for example, the race question is a problem 
of federal-state as well as Negro-white relations, 
and the same is true of the cultural question in 
Canada and the linguistic question in India. 

The end product. The very terminology of fed- 
eralism is characterized by a revealing ambiguity 
that is indicative of the end product of federal 
systems. The word “federalize” is used to describe 
the unification of “sovereign” states into a federal 
polity and also the permanent devolution of au- 
thority and power within a nation to subnational 
governments. In this ambiguity lies the essence of 
the federal principle—the perpetuation of both 
union and noncentralization. 

Viewed from the top, the combination of the 
elements discussed above results in a federal rather 
than a central government—i.e., a government 
composed of a nationwide coalition of political 
institutions, some with predominantly local power 
bases (such as the national legislature), others 
with predominantly national power bases (such as 
the national bureaucracy). This government, whose 
power is thus diffused vertically and laterally, 
functions in cooperation with the constituent poli- 
ties which it must conciliate in order to act. Deci- 
sion making is characterized by heavy reliance 
upon negotiation and bargaining and by minimal 
reliance upon the exercise of force. Operations are 
characterized by a measure of disorder, since non- 
centralization breeds multiple power centers located 
at or cutting across all levels of government. Each 
of these centers seeks to keep open routes of access 
to the others, usually succeeding because it is in 
the best interests of all to maintain this kind of 
disorder as part of the “rules of the game.” 

Viewed locally, a federal system consists of gov- 
ernmental inputs from different sources whose 
local connections normally serve to fragment local 
authority. However, because such a system rewards 
those who actively seek to reconcile the diffuse ele- 
ments and bind them together for a larger purpose, 
local political leaders can control these inputs to a 
great extent. While this may not prevent the na- 
tional government from exercising great power at 
any given time or from increasing its total power 
over time, it does mean that as long as the federal 
principle remains operative, the public can and 
almost invariably does limit certain kinds of na- 
tional government actions or guide such actions 
into particular channels (often directed toward 
strengthening the constituent governments) by in- 
voking the terms of the compact. 

Viewed theoretically, these patterns of behavior 
and the arguments advanced to justify them serve 
to reaffirm the fundamental principles that (1) the 
strength of a federal polity does not stem from 
the power of the national government but from the 
authority vested in the nation as a whole; (2) both 
the national government and the governments of 
the constituent polities are possessed of delegated 
powers only; and (3) all governments are limited 
by the common national constitution. 

All this should make it apparent that federalism 
is a form of popular government embodying ele- 
ments of both republicanism and democracy. The 
federal structures occasionally adopted by nondem- 
ocratic systems must generally be considered “win- 
dow dressing” except insofar as the injection of 
the federal principle may serve as a democratizing 
force in itself. In Yugoslavia, for example, the 
existence of a federal superstructure has proved 
useful in fostering such decentralization as the 
Communist party leadership wished to allow and 
may even have played a role in stimulating decen- 
tralizing tendencies. 


Empirical and theoretical development 


Ancient protofederal systems. Long before the 
term “federal” was invented, there were political 
systems that embodied elements of the federal prin- 
ciple. The Israelite political system was probably 
the first example in recorded history of a union of 
constituent polities based on a sense of common 
nationality, with national and tribal political insti- 
tutions and some division of functions between the 
two partly formalized by a written constitution. As 
a republic it was never able to overcome the prob- 
lems of national executive leadership and succes- 
sion and, after some two hundred years, revised its 
constitution to superimpose a limited monarchy on 
its federal institutions. Still, as many of the seven- 
teenth-century federalists noted, it came closer to 
resembling a modern federal system than any com- 
parable premodern nation. Its classic intellectual 
product, the Bible, was the first book to discuss the 
problems of a federal polity. 

Permanent leagues of independent states united 
by a sense of common need but without any sense 
of common nationhood were found in various parts 
of the Greek world. They were entrusted with cer- 
tain matters in the realm of foreign affairs and 
defense but were in every respect accountable to 
their member states. The classic example of this 
system was the Achaean League (251-146 B.C.), 
a protofederal system often erroneously considered 
to be the first federal polity (Freeman 1863). The 
Greeks left some descriptions of their leagues but 
no theoretical discussions of the league as a politi- 
cal system. Except for Aristotle’s criticisms, the 
great Greek political theorists ignored federalism 
as a political principle because the very idea con- 
tradicted their conception of the small, unified polis 
as the only basis upon which to build the good 
regime. 

Several of the great ancient empires, notably the 
Persian, Hellenic, and Roman, structured their 
political systems around the principle of cultural 
home rule. Since political life was virtually insep- 
arable from the religious and cultural aspects of 
society in the ancient world, imperial recognition of 
local constitutions offered a measure of contractual 
devolution of political power; however, as in more 
recent examples of this form of imperialism, such 
home rule was not a matter of local right but repre- 
sented a conditional grant subject to unilateral 
revocation by the imperial rulers. 

Medieval experiments. Elements of the federal 
principle are foreshadowed in medieval feudalism 
through its emphasis on essentially immutable con- 
tractual relationships that permanently link the 
contracting parties while guaranteeing their rights. 
However, the hierarchical character of these rela- 
tionships, coupled with the lack of practical mech- 
anisms to maintain the terms of the contracts, led 
to the degeneration of those elements in most 
feudal societies. Another movement in the direc- 
tion of federalism grew out of the development of 
medieval commercial towns in central Europe which 
formed leagues for mutual defense and assistance 
following the Greek model. The most important 
development in this period was the first confedera- 
tion of Swiss cantons in 1291 for mutual aid in 
defense of their independence. The success of this 
effort was in no small measure due to its connec- 
tion, from the beginning, with quasi-popular gov- 
ernment. These embryonic federal experiments all 
proceeded pragmatically while federal theory was 
confined to juridical discussions of the corporate 
relationships between polities in the Holy Roman 
Empire. 

Ultimately a fusion of contractual elements from 
feudalism with political mechanisms from the com- 
mercial confederacies gave rise to the immediate 
antecedents of modern federalism. The Christian 
states on the Iberian Peninsula created a political 
system which in its most advanced stages came 
very close to authentic federalism. During the 
years of the reconquest, most of the peninsula was 
reorganized under the fuero system, which estab- 
lished local governments with relatively liberal 
political institutions in order to encourage resettle- 
ment. New states were formed through feudal-style 
contractual relationships designed to protect local 
rights. Three of these states joined in a quasi- 
federal arrangement under the crown of Aragon, 
each of them (plus several in Italy added later) 
retaining its own constitution and governing insti- 
tutions as well as acquiring representation in the 
over-all Aragonese government. Unification of Spain 
under a multiple monarchy in 1469 left most of 
these federal elements intact for the next two and 
a half centuries, but the demands of the monarchy 
ultimately subverted them, transforming Spain into 
a precariously centralized state. 

In the sixteenth century, certain emergent civil 
societies, influenced by the Reformation to return 
to Scripture as a political source and by the Span- 
ish system of political organization, as well as by 
local necessity, began to apply federal principles 
for state-building purposes. The Hapsburg heirs to 
the Spanish crown had applied Iberian principles 
to the organization of their other European posses- 
sions. Their governmental reforms in the Nether- 
lands provided an organizational basis for the fed- 
eration of the United Provinces in the late sixteenth 
century. When that country gained its independ- 
ence, it established a political system which, while 
unable to solve the most crucial technical problems 
of federalism, maintained itself in federal style for 
two hundred years, until Napoleon put an end to 
its existence, leaving a residue of noncentraliza- 
tion that marks the Netherlands today. 

The Swiss, in the meantime, were developing 
their own techniques for combining feudal and 
commercial elements to create a loose confedera- 
tion of cantons, which was also influenced by 
Biblical ideas and, perhaps negatively, by contacts 
with Hapsburg Spain. Achieving full independence 
in 1648, the Swiss confederation remained loosely 
leagued for two centuries (except for the Napole- 
onic interlude), until it adopted a federal constitu- 
tion in 1848. 

First modern formulations. The protofederalism 
of the United Provinces and the Swiss cantons, 
coming at the outset of the age of nationalism, also 
stimulated the first serious efforts to formulate fed- 
eral theories based on modern political ideas. Jean 
Bodin analyzed the possibilities of federation in 
light of the problem of sovereignty. Hugo Grotius 
and Samuel Pufendorf examined federal arrange- 
ments as aspects of international law. These the- 
orists all treated federalism as an aspect of inter- 
national law. Johannes Althusius (1603), analyzing 
the Dutch and Swiss constitutions, was the first to 
perceive that federalism was really concerned with 
problems of national unity. The real father of mod- 
ern federal theory, he was also the first to connect 
federalism with popular sovereignty and to distin- 
guish between leagues, multiple monarchies, and 
confederations. His retention of hierarchical prin- 
ciples and his emphasis on the corporate organiza- 
tion of society both flawed the federal character of 
his work and reflected the empirical roots of his 
analysis. 

Thus the rise of the nation-state in the sixteenth 
and seventeenth centuries stimulated federal solu- 
tions to the problems of national unification. In all 
but a few countries on the periphery of western 
Europe, the application of federal principles foun- 
dered on three problems: (1) the conciliation of 
feudally rooted hierarchies with a system demand- 
ing fundamental social equality in order to facili- 
tate the sharing of power; (2) the reconciliation of 
local autonomy with national energy in an era 
of political upheaval that required most nations to 
maintain a state of constant mobilization basically 
incompatible with the toleration of local differ- 
ences; and (3) the problem of executive leadership 
and succession, which is particularly complex in 
federal systems and was not solved until the 
United States invented the elected presidency. 

Modern federalism. The rise of modern im- 
perialism also contributed to the emergence of fed- 
eralism, as indicated by the works of the important 
prerevolutionary political theorists of the eighteenth 
century, e.g., Montesquieu and Adam Smith. Here, 
too, the Spanish experience was influential, but it 
remained for the British to create the requisite pop- 
ular institutions in their colonization of North 
America and for the Biblically influenced colonists 
to create the theoretical justification for these in- 
stitutions. The theoretical ambiguity of those quasi- 
federal institutions led Americans to assume that 
their relationship to the British government was 
federal, while London entertained no such notion 
(Becker 1922). The Americans’ response to their 
view of the imperial system helped them develop 
the federal ideas they were later to use so creatively. 

The founders of the United States of America 
can be said to have transformed and organized the 
principles of federalism into a practical system of 
government. They were able to do so partly be- 
cause their nation developed without the disadvan- 
tages that plagued earlier federal systems. As a 
postfeudal society, the United States had no serious 
problem of coping with hierarchies. As a relatively 
isolated nation, it faced no external pressures for 
centralization for nearly 150 years. American 
political inventiveness took care of the internal 
problems of applying the federal principle, though 
not without having to fight a major civil war to 
resolve some of them. Though the specific forms 
of American federalism were not widely imitated 
with success, its basic principles of organization 
were emulated by almost every other nation at- 
tempting the federal solution to the problems of 
popular government in a pluralistic civil society. 
The creation of the theoretical framework for those 
principles was part and parcel of the invention of 
federalism. Set forth in its basics in the debate over 
ratification of the constitution, that framework had 
at its core The Federalist (1787-1788), the classic 
formulation of the principles of modern federalism. 
Equally important to the evolution of federal sys- 
tems, however, were the arguments of those who 
wished to preserve even greater state autonomy; 
many of these arguments were transformed into 
tools to promote extraconstitutional decentraliza- 
tion during the nineteenth century. 

From the first, American contributions to federal 
theory—even those of the few theorists not actively 
involved in politics—have been rooted in the prac- 
tical concerns of maintaining a federal system. 
Most of these contributions have, accordingly, been 
formulated as discussions of constitutional law. 
The courts, particularly the federal Supreme Court, 
have conducted continuing debate on the meaning 
and character of federalism through the medium of 
case law. Leading political figures, such as Gallatin, 
Calhoun, Lincoln, Wilson, and the Roosevelts, have 
made real contributions through their state papers. 
The pragmatic orientation of those contributions, 
however, has tended to obscure their more lasting 
theoretical importance (Anderson 1955). 

The French Revolution, while stimulating the 
development of popular government, was essen- 
tially hostile to the spirit and institutions of federal- 
ism. Its immediate heirs tried to destroy federal 
institutions in western Europe in the name of 
democracy, and the subsequent bearers of its tradi- 
tion have proved equally hostile to federal ideas— 
except insofar as some of them have equated fed- 
eralism with decentralized government. 

In the nineteenth century, several of the new 
Latin American nations, following the United States 
example and also influenced by the federal ele- 
ments in the Hispanic imperial tradition, experi- 
mented with federalism, with distinctly mixed 
results. Even where federalism survived in theory, 
the instability of Latin American governments and 
the frequent recourse to dictatorial regimes ham- 
pered its effective operation. Even so, the three 
largest Latin American nations—Argentina, Brazil, 
and Mexico—retain federal systems of varying 
political significance; federal principles are also in- 
cluded in the political systems of Colombia and 
Venezuela. 

In the mid-nineteenth century European politi- 
cians and political theorists, stimulated by neces- 
sity, the American example, and the very influential 
studies of Tocqueville (1835), turned to consider 
federalism as a form of democratic political organ- 
ization. Though practical applications remained 
few, numerous works were produced, primarily in 
the German-speaking countries, where doctrinaire 
and metaphysical analyses of federalism in relation 
to the problems of nationalism, sovereignty, and 
popular consent were in vogue. The most important 
of these works were the theoretical formulations 
of Bluntschli (1849-1852), based on his observa- 
tions of federal reorganization in Switzerland, and 
the historical studies of Gierke. In the end, federal 
principles were used in the unification of Germany, 
and Switzerland adopted a modern federal consti- 
tution. Fully federal solutions were rejected in 
other nations, but several adopted quasi-federal 
institutions to meet particular problems of unifica- 
tion and decentralization. 

During the late nineteenth century, British inter- 
est in imperial federalism was manifested in sev- 
eral ways. Canada and Australia were given federal 
constitutions and dominion status in 1867 and 
1901, respectively, and the foundations were laid 
for the federal unification of India. British political 
theorists interested in imperial unity and internal 
devolution explored contemporary (Bryce 1888) 
and historical (Freeman 1863) federal experiments 
and presented arguments of their own as to the 
utility and proper organization of federal systems 
(Labillière 1894). 

Whereas in the nineteenth century federalism 
was used to abet ethnic nationalism, in the twen- 
tieth it has been used as a means to unify multi- 
ethnic nations. Several of the ethnically hetero- 
geneous nations created or reconstructed after 
World War I, including the Soviet Union and Yugo- 
slavia, formally embraced federalism as a nominal 
solution to their nationality problems. The United 
Kingdom added a federal dimension at the same 
time to accommodate the Irish. Extension of nation- 
building activities to Asia and Africa, where ethnic 
diversity is even greater than in Europe, has led to 
new efforts in the same vein. In nations outside of 
the totalitarian orbit, such as India and Malaysia, 
federalism has been used to secure political and 
cultural rights for the larger ethnolinguistic groups. 
In Africa, where the survival of separate ethnic 
groups has been called into question by the native 
nationalists, federalism has been applied in several 
nations, including Nigeria and Cameroon, as a 
device for sharing political power rather than a 
way to maintain cultural autonomy. 

The contemporary study of federalism. The 
emergence of political science as a discipline in the 
late nineteenth century stimulated a shift from an 
explicitly normative to a predominantly empirical 
interest in federalism. Such noted British scholars 
as Bryce (1901) and Dicey (1885) were the first 
to study federalism as part of their general interest 
in political systems. American scholars began their 
work in the 1870s, as the Civil War generation was 
passing into history, but their first works still re- 
flected the issues of the war. Thus Burgess (1886) 
concluded that the utility of the states was dissi- 
pated by modern technology, just as their power 
was destroyed by the war, while Wilson (1885) 
accepted the view that the war had wrought great 
changes but still saw federalism as alive and vital. 

Though these men and their colleagues laid the 
foundations for the empirical study of federal sys- 
tems with the tools of contemporary political sci- 
ence, federalism as a field of study was neglected 
for many years. The rise of other problems to 
attract the attention of scholars, the negation of 
earlier legalistic and metaphysical approaches, and 
the decline of normative interest in the federal 
principle combined to dissuade younger political 
scientists from examining questions of federal gov- 
ernment, except incidentally, until the twentieth 
century was well advanced. 

Renewed interest in the field first developed 
when American students of public administration 
found themselves confronted with problems of 
intergovernmental relations at nearly every turn. 
The study of intergovernmental relations in the 
administrative realm brought about significant 
gains in the understanding of the process of fed- 
eral government, not the least of which was a 
growing recognition that the assumptions about 
federalism underlying their work, borrowed whole 
from nineteenth-century theorists, needed serious 
re-examination. In the 1930s and 1940s, 
American and British political scientists began to 
raise fundamental questions about the nature of 
federal systems and the interrelationships of their 
governmental components (Anderson 1946). In 
the 1950s these questions were expanded to in- 
clude, among others, problems of political influ- 
ence, the role of political parties, the historical 
development of federal systems, and the meaning 
of earlier federal theories (Bachelder & Shaw 1964). 
By the early 1960s, students of existing federal 
governments were rediscovering the need to clarify 
the principles of federalism in order to understand 
the operation of those governments. Students of 
comparative government were also becoming in- 
creasingly interested in problems of political inte- 
gration, centralization, and decentralization—all of 
which stimulated new interest in the systematic 
study of federalism. 


Evaluation 

While many attempts to establish federal sys- 
tems have ended in failure, such systems, once 
established, have proved to be most durable. No 
authentic federal system that has lasted for even 
15 years has ever been abandoned except through 
revolutionary disruption (as in the case of Ger- 
many), and in every such case federalism—show- 
ing remarkable resilience—has ultimately been 
restored. Certain theories to the contrary, there is 
no evidence that federalism represents a transi- 
tional stage on the road to unitary government. No 
federal system in history has ever “evolved” into a 
unitary one, nor has any established system been 
structurally consolidated by internal decision. On 
the contrary, federal devices to conciliate minority 
populations have been used in place of force to 
maintain unity even in consolidated systems. More- 
over, federal systems or systems strongly influ- 
enced by the federal principle have been among 
the most stable and long-lasting of polities. 

At the same time, relatively few cultures have 
been able to utilize federal principles in govern- 
ment. Anglo-American civil societies have done so 
most successfully. Even those not fully committed 
to federalism have, without exception, included 
elements of the federal principle in whatever sys- 
tems they have chosen, no doubt because both 
constitutionalism and noncentralization rate high 
on the scale of Anglo-American political values. 

Of the 16 formally federal nations that exist in 
the world today, Australia, Cameroon, Canada, 
India, Malaysia, Nigeria, and the United States 
were created under British colonial tutelage. These 
seven include all the nations established since 
World War I that have been able to maintain fed- 
eral systems, and they provide most of the success- 
ful examples of federalism in operation. Of the 
nine remaining federal nations, Argentina, Brazil, 
and Mexico fall directly within the Hispanic politi- 
cal tradition and Austria, Germany, and Switzer- 
land, though they follow the Germanic political 
tradition, were also influenced by Hispanic ideas 
at some point in their development. Both political 
traditions have been influential in stimulating fed- 
eral inclinations in many of the nonfederal nations, 
but they have been notably less successful in fos- 
tering lasting federal institutions; the Hispanic 
tradition has failed to combine federalism and 
stability, while the Germanic has tended toward 
authoritarian centralization. (The three remaining 
nations, Libya, the Soviet Union, and Yugoslavia, 
are federal in name and formal structure but hardly 
in any meaningful sense of the term.) 

The successful operation of federal systems re- 
quires a particular kind of political environment, 
one which is conducive to popular government and 
has the strong traditions of political cooperation 
and self-restraint that are needed to maintain a 
system which minimizes the use of coercion. Be- 
yond the level of tradition, federal systems operate 
best in societies with sufficient homogeneity of 
fundamental interests—or consensus—to allow a 
great deal of latitude in political operations and to 
place primary reliance upon voluntary collabora- 
tion. The existence of severe strains on the body 
politic which lead to the use of force to maintain 
domestic order is even more inimical to the suc- 
cessful maintenance of federal patterns of govern- 
ment than of other forms of popular government. 
Moreover, federal systems are most successful in 
civil societies with the human resources to fill many 
public offices competently and with material re- 
sources plentiful enough to allow a measure of 
economic waste in payment for the luxury of 
liberty. 

DANIEL J. ELAZAR 


[See also CENTRALIZATION AND DECENTRALIZATION; 
CONSTITUTIONAL LAW, article on DISTRIBUTION OF 
POWERS; PRESIDENTIAL GOVERNMENT. Other rele- 
vant material may be found in GOVERNMENT; INTER- 
NATIONAL INTEGRATION; NATION; STATE.] 
Sandor Ferenczi, one of the earliest psychoanalysts,
was born in Miskolc, Hungary, in 1873
and died in Budapest, in his house in the Buda 
hills, in 1933. He was for a time Sigmund Freud's
closest friend and collaborator. 

Early life. Ferenczi was one of 11 children. His 
parents, whose surname had originally been 
Fraenkel, were well-to-do, owning the largest book- 
store in Miskolc. A biographer, Izette De Forest 
(1954), tells us that Ferenczi’s parents were un- 
usually enlightened in the freedom and com- 
panionship they gave to their children and that the 
life of the family was gay and spontaneous, de- 
voted to the enjoyment of scholarship, music, 
drama, poetry, and languages. The famous psychoanalyst
Ernest Jones (1953-1957), who knew Ferenczi well,
tells us, however, that Ferenczi was
haunted throughout life by a quite inordinate and 
insatiable longing for his father’s love. Ferenczi 
himself said in a letter to Freud that his feelings 
about Freud and other colleagues were bound up 
with his childhood difficulties with his father and 
brothers. 

After graduating from the local Gymnasium, 
Ferenczi studied medicine in Vienna, receiving his 
degree in 1894. He had a year of compulsory serv- 
ice as a physician in the Austro-Hungarian army 
and then interned in various hospitals in Budapest, 
notably St. Rókus Korhaz. There he specialized in 
neurology and neuropathology, also developing his 
skill in hypnotism. In 1900 he opened an office in 
Budapest for the practice of neurology. 

Development of interest in psychoanalysis. When 
Ferenczi first read Freud’s The Interpretation of 
Dreams he was not able to assimilate its teachings. 
It was his rereading of Breuer and Freud's Studies 
on Hysteria that impressed him with the claims of 
psychoanalysis. He wrote to Freud in 1907 and, 
accompanied by Dr. F. Stein of Budapest, who in- 
troduced him, called on Freud in February 1908. 
The impression that he made brought an invitation 
to spend a fortnight in August with the Freud 
family at their summer place. He became a mem- 
ber of the Vienna Psychoanalytic Society, the small 
group that met on Wednesday evenings with Freud. 

Later career. Ferenczi and Freud traveled to- 
gether extensively, and between 1908 and 1933 
they exchanged more than a thousand letters. In 
1909 Ferenczi accompanied Freud on a trip to the 
United States, where Freud lectured at Clark Uni- 
versity. In 1910, at Freud’s suggestion, Ferenczi 
proposed the founding of the International Psycho- 
analytic Association; and in 1913 he founded the 
Hungarian Psychoanalytic Society. In 1916 he un- 
derwent a brief personal analysis with Freud; it 
lasted only three weeks, for two hours a day, since 
Ferenczi had to return to his military duties as a 
medical officer in Hungary. According to Freud, 
the analysis was continued later. In 1918 Ferenczi 
was elected president of the International Psychoanalytic
Society; he relinquished the presidency in
1919. His failure to be invited again to be president 
of the society was a keen disappointment to him 
and may have been a consequence of his advocacy 
of lay analysis.

In 1919, after 18 years of indecision, Ferenczi 
married. The marriage was a very happy one, and 
although there were no children, Ferenczi delighted 
in his relationship with his wife’s daughters by a 
former marriage. 

Ferenczi came to New York City for eight 
months in 1926-1927 to give a course of lectures 
at the New School for Social Research. During his 
visit he gave analytic training to eight or nine 
people, mostly laymen. 

As Ferenczi’s views on psychoanalytic technique 
diverged more and more from Freud’s in later 
years, he gradually withdrew from Freud. Although 
there was never a sharp break, a sense of estrange- 
ment developed between them. 

Personality. Ernest Jones described Ferenczi as 
follows: 

He [Ferenczi] had an altogether delightful personality 
which retained a good deal of the simplicity and a still 
greater amount of the imagination of the child; I have 
never known anyone better able to conjure up, in 
speech and gesture, the point of view of a young child. 
. . . He had a very keen and direct intuitive perception,
one that went well with the highest possible measure 
of native honesty. . . . He had an exceptionally original 
and creative mind. . . . As is often the way, however, 
with people who throw out masses of ideas like sparks, 
their quality was very uneven, since judgment—objec- 
tive and critical judgment—was the one gift denied to 
Ferenczi. . . . (1959, pp. 199-200)

At times, Jones says, Ferenczi could be somewhat 
masterful, and even dictatorial; and toward the end 
of his life, especially during his final illness, he 
“lost most of his old cheerfulness and vitality, became
heavy, depressed, and ungracious, withdrew
from his friends, and—most serious of all—allowed
his scientific judgment to be gravely deflected”
(Jones 1959, p. 200).

Early contributions to psychoanalysis. Freud al- 
ways gave Ferenczi generous credit for ideas the 
two had worked out in common, and at times it 
is not possible to distinguish their respective shares. 
Among Ferenczi’s important early papers are 
“Stages in the Development of the Sense of Reality” 
(1913); a paper on the relationship of homo- 
sexuality and paranoia (1912a); and a paper de- 
scribing transitory symptom formation during psy- 
choanalytic treatment (1912b). He worked out in 
detail the implications of Freud's explanation of
hypnosis as depending on the masochistic attitude
of the person hypnotized; and he discriminated be- 
tween the different forms of homosexuality. 

In the paper on the development of the sense
of reality, praised by Jones as “the best he
[Ferenczi] ever wrote” (Jones 1933), Ferenczi
listed four stages through which the child passes 
in moving from domination by the pleasure prin- 
ciple to acceptance of the reality principle. These 
four periods are the period of unconditional om- 
nipotence (intrauterine), the period of magical- 
hallucinatory omnipotence (earliest infancy), the 
period of omnipotence by the help of magic ges- 
tures, and the period of magic thoughts and magic 
words. He showed that the child at first has all his 
wishes gratified and believes himself omnipotent. 
Later, when the child finds that wishes are no 
longer immediately gratified and has to accept some 
limitations upon his power, he attempts to cir- 
cumvent these frustrations by imagining the grati- 
fication of his wishes, or by making magical ges- 
tures, or by using magical thoughts and words. 
When at last the individual realizes that the en- 
vironment does not conform to his wishes, that 
gratification is entirely conditional upon work, and 
that even work does not always lead to success, 
then the sense of reality may be said to be fully 
developed. 

Theory of genital sexuality. Influenced by Dar- 
win and Lamarck as well as by psychoanalytic 
thought, Ferenczi sought in Thalassa: A Theory 
of Genitality (1924) to explicate the relationship 
of sexuality to the evolution of the race. Ferenczi 
pointed out that pregenital sexual impulses can be 
expressed and gratified by genital activity. For in- 
stance, a man can express urethral or anal tend- 
encies in the way he performs and experiences 
sexual intercourse. Such a mixture of partial sexual 
drives, and indirect expression of them, Ferenczi 
chose to call “amphimixis.” Sexual intercourse can 
also express various fantasies, including the fan- 
tasy of reunion with the mother through return to 
the womb, a fantasy revealed in the course of psy- 
choanalysis of patients. Since the penis goes into 
the birth canal during intercourse and the seminal 
fluid does go into the uterus, sexual intercourse is
well suited to the acting out of this fantasy. 

Assuming a phylogenetic origin for this fantasy, 
Ferenczi speculated that it derives from the marine 
background of the race; mankind’s ancestors were 
fishes. Thus, in wishing to return to the womb, man
is also expressing the wish to return to the sea, the 
origin of all life. 

In general, psychoanalysts have ignored these
speculations. They believe that Ferenczi did not
have a satisfactory way of testing his speculations 
by an appeal to evidence. 

Contributions to technique. Ferenczi is acknowl- 
edged to have been a gifted therapist. He proposed 
a number of innovations in technique; at first 
these centered on the so-called “active” technique; 
later they had to do with a greater participation by 
the therapist in the psychoanalytic interaction, a 
participation that included the granting of more— 
and more direct—gratifications to patients than 
most analysts believed advisable. These later ex- 
periments led Freud to write, “He was persuaded 
that we could accomplish far more with our patients 
if we gave them enough of the love they had 
longed for in childhood” (1933, pp. 297-299). 
Most analysts reject Ferenczi’s belief that granting 
more gratification to patients is therapeutic. 
Ferenczi’s strong interest in the reactions of dis- 
appointment and mistrust that the child suffers in 
his relationship with his parents has led a few of 
his pupils, notably Alice Balint (1949), to inves- 
tigate early parent-child relationships. 

Ferenczi’s early contributions to psychoanalysis 
have been so fully assimilated that their origin is 
often forgotten, although his later writings, which 
were more speculative and deviated from Freudian 
orthodoxy, have been less widely accepted. 


FRANK AULD, JR. 


[For the historical context of Ferenczi’s work, see
PSYCHOANALYSIS and the biography of FREUD. For
discussion of the subsequent development of his ideas,
see SEXUAL BEHAVIOR, articles on SEXUAL DEVIATION: 
PSYCHOLOGICAL ASPECTS and SEXUAL DEVIATION: 
SOCIAL ASPECTS.] 
FERGUSON, ADAM


“I hear he [Ferguson] is about to publish . . . a 
natural history of man: exhibiting a view of him 
in the savage state, and in the several successive 
states of pasturage, agriculture and commerce” 
(Reid [1764-1788] 1895, p. 42). Thus wrote 
Thomas Reid on December 20, 1765, to his friend 
Dr. David Skeene. Reid had just succeeded Adam 
Smith in the chair of moral philosophy at Glasgow. 
Nearly a century later, Karl Marx singled out Fer- 
guson, although not with complete accuracy, as the 
first in modern times to develop the theory of the 
division of labor in economy and society. Still later, 
Gumplowicz, distinguished exponent of the conflict 
theory of society, saw in Ferguson’s History of Civil 
Society “the first natural history of society” and in 
Ferguson himself “the first sociologist” (Gumplo- 
wicz 1892, p. 67). William Dunning found in him 
one who combined “the critical spirit of Hume and
the historical spirit of Montesquieu” most attrac- 
tively and “studied society and its institutions . . . 
to determine by the light of history whither society 
was moving, not by superhuman wisdom to fix its 
course” (Dunning [1920] 1936, p. 63). 

These brief characterizations may serve to epito- 
mize the thought, and the place in the history of 
social thought, of one of the outstanding “moral 
philosophers” of Scotland of the later eighteenth 
century. 

Adam Ferguson was born in the manse of Lo- 
gierait, Perthshire, in 1723, and was educated for 
the Christian ministry (like so many leaders of 
thought in the Scotland of his day) at St. Andrews 
and Edinburgh. At the age of 21, he was ap- 
pointed deputy-chaplain—and later chaplain—of 
the famous Highland “Black Watch” regiment. In 
1757, Ferguson succeeded Hume as keeper of the 
Advocates’ Library in Edinburgh, and in 1759 he 
was appointed to the chair of natural philosophy 
in the university there; he was transferred five 
years later to the chair of pneumatics and moral 
philosophy. He filled this post with distinction until 
1785, when for reasons of health he resigned it. He 
continued literary activities at a more leisurely pace 
in retirement. 

In 1767, he published his Essay on the History 
of Civil Society, using “history” in the then current 
static-descriptive sense as well as in a “temporalizing”
sense. This work strongly reflects, by Ferguson’s
own admission, the influence of Montesquieu.
Two years later, Ferguson published the
Institutes of Moral Philosophy, an outline of the 
course of lectures that he followed, with variations, 
throughout his teaching career. In 1783, he published
The History of the Progress and Termination
of the Roman Republic, a work that reached many 
editions. His two-volume Principles of Moral and 
Political Science, “being chiefly a Retrospect of Lec- 
tures delivered in the College of Edinburgh,” was 
published in 1792. This was in reality only a 
greatly expanded and more finished version of the 
Institutes, with some variations. Most of these 
works appeared almost immediately in translation 
in Germany and France, and some also in Italy and 
Russia. He was also widely known and read in 
America. 

Ferguson was a principal founder and charter 
member of the Royal Society of Edinburgh; on a 
visit to the Continent in 1793, he was made an 
honorary member of the Berlin Academy of Sci- 
ence. He was closely associated, through personal 
friendship, various literary clubs, and correspondence,
with others from Edinburgh, such as Hume,
Lord Kames, William Robertson, John Home, Adam
Smith (with whom he had a temporary falling- 
out), and Edward Gibbon, all of whom had high 
regard for Ferguson. Abroad he received the ac- 
claim of men like Baron d'Holbach in France and 
Schiller, Jacobi, Herder, and others in Germany. 
He spent his last years, with failing eyesight, at 
St. Andrews, where he died in 1816, at the age of 
92. He was buried in the old cathedral churchyard, 
and the beautiful epitaph that Sir Walter Scott, 
boyhood playmate of the Ferguson children, wrote 
for him is still to be seen there. 

Whereas Ferguson’s fame at home and abroad 
rested chiefly on his Civil Society, this work is best 
understood in the broader setting of his entire social 
and moral philosophy, which is more adequately 
stated in his Principles. “Moral philosophy” covers 
comparative anthropology, individual and social 
psychology, and “natural religion” and is applied to 
jurisprudence, economics and politics, and ethics. 
The basic concern of moral philosophy is human 
nature itself. “Human nature,” Ferguson once ob- 
served, “is my trade”—meaning, no doubt, the busi- 
ness of his chair. Man is viewed as a rationally 
intelligent, morally evaluating, active, and there- 
fore progressive animal, always and everywhere 
living in society. He is to be studied, therefore, not 
only as an individual but also as a member of the 
community. The community is at once the source 
of his being, the field of his operations, the end of 
most of his actions, and the indispensable means— 
through the invention and use of language as his 
principal instrument of communication—to the 
realization of his essential humanity. 

Key elements in Ferguson’s social theory are a 
clear-cut distinction between “physical law,” which 
he saw as a generalization of behavioral fact on 
every level, and “moral law,” which he saw as a 
generalization of values and norms of conduct; a 
sharp differentiation of the cultural and traditional 
from the biological; a juxtaposition of the princi- 
ples of “union” and of “rivalship” or conflict; the 
division of labor in economy and society; and a 
consistently evolutionistic viewpoint skeptical of 
planned progress (evolution, however, usually in
an ontogenetic and cultural, rather than a biological,
application).

Ferguson’s moral philosophy in particular was 
essentially that of the ancient Stoics: Man is by his 
very nature an active being and a social being; all 
action involves choices, both of ends and of means 
to ends; choices are made between moral values 
and result in either happiness or misery, both for 
the actor and for his fellow men. If norms of conduct
and systems of such norms are to be effective,
they must be rationally based on empirical knowledge
both of human nature and “of the situation
in which [man] is placed.” “Men of real fortitude, 
integrity and ability are well placed in every scene” 
([1767] 1819, p. 506). “Everyone indeed is answerable
only for himself; and, in preserving the integrity
of one citizen, does what is required of him
for the happiness of the whole” (1792, vol. 2, 
p. 512). 

Like most of his Scottish contemporaries, Fer- 
guson was averse to “metaphysics,” or abstract 
philosophizing; yet he was not without philosoph- 
ical acumen, a regard for system, a sense of 
“wholism,” and a clear understanding of basic 
methodology. In this sense, he was an exponent 
of the “common-sense” philosophy and one of its 
founders. 

WILLIAM C. LEHMANN 


[For the historical context of Ferguson’s work, see the 
biography of HUME.] 
FERTILITY


Demographers distinguish fertility (actual repro- 
duction) from fecundity (the capacity for reproduc- 
tion). The distinction is important, because in all 
societies actual reproduction is less, often much 
less, than the potential maximum. The rather wide 
variations in actual fertility among societies have
no known general relation to differences in fecun- 
dity. In specific instances, low fertility may be ex- 
plained by such biological factors as venereal dis- 
ease. However, it is likely that group fertility levels 
vary generally in relation to social factors that 
affect either the social norms about the proper 
number of children or a limited number of means 
of fertility control. These means are intermediate 
between the social organization or social norms, 
on the one hand, and actual fertility, on the other 
hand. 

Davis and Blake (1956) have provided the fol- 
lowing useful classification of these “intermediate 
variables”: 


Factors affecting exposure to intercourse:
  (A) Those governing the formation and dissolution
      of unions in the reproductive period:
      (1) Age of entry into sexual unions.
      (2) Permanent celibacy: proportion of women
          never entering sexual unions.
      (3) Amount of reproductive period spent after
          or between unions.
          (a) When unions are broken by divorce,
              separation, or desertion.
          (b) When unions are broken by death of
              husband.
  (B) Those governing the exposure to intercourse
      within unions:
      (4) Voluntary abstinence.
      (5) Involuntary abstinence (from impotence,
          illness, unavoidable but temporary separations).
      (6) Coital frequency (excluding periods of
          abstinence).

Factors affecting exposure to conception:
      (7) Fecundity or infecundity, as affected by
          involuntary causes.
      (8) Use or non-use of contraception:
          (a) By mechanical and chemical means.
          (b) By other means.
      (9) Fecundity or infecundity, as affected by
          voluntary causes (sterilization, subincision,
          medical treatment, etc.).

Factors affecting gestation and successful parturition:
      (10) Foetal mortality from involuntary causes.
      (11) Foetal mortality from voluntary causes.


Most of the intermediate variables are regulated 
by a complex structure of normative prescriptions 
(Nag 1962; Freedman 1963). For example, how 
many children a couple should have is a problem 
so recurrent and with so many social ramifications 
in each society that it would be a sociological 
anomaly if appropriate social norms did not de- 
velop as a social solution to guide behavior. More- 
over, different combinations of values for these 
variables may produce identical fertility levels, 
whereas societies or groups with very different fertility
levels may have similar values on some of the
intermediate variables. Of course, the connection 
between some of these intermediate cultural varia- 
bles and a society's fertility level need not be ap- 
parent to members of the society. 


Measurement of fertility 


Fertility can be defined as the number of births 
occurring in a population unit during a specified
time period. The unit may be the individual male 
or female, the male-female couple, the social 
stratum or other groupings of individuals or 
couples, or the whole society. For any of these 
levels, birth rates or frequency distributions may 
be computed separately or adjusted for a large 
variety of social, economic, or demographic characteristics.
In most official statistics, the time period
is short, giving a cross-section picture of what
happens in a particular year or period. An im- 
portant departure from cross-section rates is the 
measurement of fertility over the childbearing 
span of historical cohorts of women [see COHORT 
ANALYSIS]. 

“Period” measures. The most common fertility 
measure is the crude birth rate, or number of 
births per thousand population per annum. It is 
usually computed for whole nations or for major 
demographic strata within nations, such as the 
urban population. More refined measures are im- 
portant for some purposes, but the crude birth rate 
is still often used in evaluating the contribution of 
fertility to population growth in relation to mor- 
tality and migration. Table 1 shows the consider- 
able variation in crude birth rates among major 
world regions. 

Birth rates are called age-specific when they are 
computed for fairly narrow age groups (usually 
one or five years), usually with reference to the 
female population only. Since the crude birth rate 
is a function of both the age-sex distribution of the 
population and of the age-specific fertility rates, 
various methods are used to eliminate the effect of 
varying age-sex distributions in a comparison of 
different populations. This may be done either 
by standardization (in this case, the use of a com- 
mon age-sex distribution to weight the different 
age-specific rates) or by the computation of one of 
the four remaining measures in common use by 
demographers:


(1) Births per annum per thousand women in the 
childbearing years (the general fertility rate). 


(2) Births per annum per thousand women in speci- 
fied age groups (age-specific birth rates). 


Table 1. Crude birth rates and gross reproduction rates
for less and more economically developed
regions of the world, 1963

                                       Crude birth   Gross reproduction
                                       rate (a)      rate (a)
Less developed regions                 41-42 (b)     2.6-2.7 (b)
  Africa                               48            3.0
    North                              46            2.9
    West                               54            3.4
    South and east                     45            2.7
  Asia (excluding Soviet Union)        40-41 (b)     2.5-2.6 (b)
    Southwest                          45            3.0
    South central                      44            2.9
    Southeast                          49            2.9
    East                               35-37 (b)     2.1-2.3 (b)
  America (excluding North America)    41            2.8
    Middle                             45            3.0
    South                              40            2.7
More developed regions                 22            1.4
  North America                        24            1.8
  Europe                               19            1.3
    Northern and western               18            1.3
    Central                            18            1.2
    Southern                           21            1.3
  Oceania                              24            1.8
  Soviet Union                         25            1.4
World total                            35-36 (b)     2.25-2.31 (b)

a. Provisional weighted averages of most recent rates available in 1963 for
countries within each region.
b. No absolute figure obtainable because of varying estimates for mainland
China; figures given here correspond to range of best alternative
estimates.

Source: Adapted from “Conditions and Trends . . .” 1965.


(3) The gross reproduction rate or the “total fertility rate”
(a simple summation of the age-specific birth rates, counting
female babies only for the former and all babies for the latter).


(4) The net reproduction rate (an adjustment of the 
gross reproduction rate to take into account the effect 
of mortality on reproduction rates). 
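
The measures listed above can be sketched in a few lines of Python. This is an illustration only: every figure below is invented for the example and none comes from Table 1, the function names are hypothetical, and the gross and net reproduction rates are simplified by assuming that the same proportion of births is female at every age of mother.

```python
# Illustrative period fertility measures for one hypothetical population-year.
# All numbers are invented for the example.

WIDTH = 5  # years covered by each age group (15-19, 20-24, ..., 45-49)

women = [50_000, 48_000, 45_000, 42_000, 40_000, 38_000, 35_000]  # women by age group
births = [2_500, 9_600, 9_000, 6_300, 3_200, 1_140, 175]          # births by age of mother
total_population = 1_200_000
share_female = 0.488  # assumed proportion of births that are girls
# Assumed probability that a newborn girl survives to each age group
survival = [0.90, 0.89, 0.88, 0.87, 0.86, 0.85, 0.84]


def crude_birth_rate(births, total_population):
    """Births per thousand total population per annum."""
    return 1000 * sum(births) / total_population


def general_fertility_rate(births, women):
    """Births per thousand women in the childbearing years."""
    return 1000 * sum(births) / sum(women)


def age_specific_rates(births, women):
    """Births per thousand women in each age group."""
    return [1000 * b / w for b, w in zip(births, women)]


def total_fertility_rate(asfr, width=WIDTH):
    """Children per woman: summation of the age-specific rates (all babies)."""
    return width * sum(asfr) / 1000


def gross_reproduction_rate(asfr, share_female, width=WIDTH):
    """Daughters per woman: the same summation restricted to female babies."""
    return share_female * total_fertility_rate(asfr, width)


def net_reproduction_rate(asfr, share_female, survival, width=WIDTH):
    """Gross reproduction rate adjusted for the mortality of women."""
    weighted = sum(rate * s for rate, s in zip(asfr, survival))
    return width * share_female * weighted / 1000


def standardized_birth_rate(asfr, standard_women, standard_population):
    """Crude birth rate the age-specific rates would yield in a standard population."""
    return sum(rate * w for rate, w in zip(asfr, standard_women)) / standard_population


asfr = age_specific_rates(births, women)
tfr = total_fertility_rate(asfr)
grr = gross_reproduction_rate(asfr, share_female)
nrr = net_reproduction_rate(asfr, share_female, survival)
```

Applying `standardized_birth_rate` to two populations with the same `standard_women` and `standard_population` removes the effect of differing age-sex distributions from the comparison; applied to a population's own distribution it simply returns the crude birth rate.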


These are all “period” measures—that is, meas- 
ures based on the births and deaths of a particular 
year or other time period considerably shorter than 
the reproductive span of the couples in the popu- 
lation. The rates do not take into account the 
earlier reproductive history of the units measured; 
they may therefore be misleading about long-run 
reproductive trends, especially for populations in 
which efficient family-planning methods are widely 
used. Short-run variations in conventional period 
rates may result from variations in the number of 
women marrying at each age and in child-spacing 
patterns, rather than from the number of children 
born over the childbearing span. For example, mar- 
riage rates and the birth of a first child are rather 
closely correlated with the business cycle in highly 
industrialized countries (Kirk & Nortman 1959). 
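
A toy calculation may make the point about timing concrete. In the sketch below, every cohort of women is assumed to bear exactly three children over ten years; the only change is that cohorts born in 1930 or later start two years later. The schedule, the dates, and the function names are all hypothetical and chosen for simplicity.

```python
# Toy model: a shift in child-spacing lowers the period rate for some years
# even though completed family size never changes. All values are assumed.

COMPLETED = 3.0  # children per woman in every cohort
SPAN = 10        # years of childbearing per cohort


def start_age(birth_year):
    """Cohorts born in 1930 or later begin childbearing two years later."""
    return 20 if birth_year < 1930 else 22


def cohort_rate(birth_year, age):
    """Births per woman at a given age for a given birth cohort."""
    first = start_age(birth_year)
    return COMPLETED / SPAN if first <= age < first + SPAN else 0.0


def period_rate(year, ages=range(15, 50)):
    """Sum of the age-specific rates observed in one calendar year."""
    return sum(cohort_rate(year - age, age) for age in ages)


def cohort_completed(birth_year, ages=range(15, 50)):
    """Children per woman over the whole childbearing span of one cohort."""
    return sum(cohort_rate(birth_year, age) for age in ages)


# Period rates by calendar year, rounded for display
series = {year: round(period_rate(year), 2) for year in range(1945, 1965)}
```

In this model the period rate stands at about 3.0 before 1950, falls to about 2.4 while the postponing cohorts pass through the childbearing ages, and returns to about 3.0 in the early 1960s, although `cohort_completed` is the same for every cohort.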


Fertility measures of historical cohorts. A major 
methodological advance in recent years is the 
measurement of the cumulative fertility of his- 
torical cohorts of women, taking into account the 
family-building sequence of marriage and the spac- 
ing of successive children (France . . . 1953). 
Ideally the data needed for each historical birth 
cohort include the percentage of each cohort dying, 
the percentage of the single marrying, the percent- 
age with one previous birth having another, and 
so on. From annual empirical probabilities of this 
type, it is possible to reconstruct, to date, the cumu- 
lative fertility of a particular cohort or of the whole 
population. But only a few countries are beginning 
to collect these data. 
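
One simple way to see how probabilities of this kind combine is sketched below. It is a minimal illustration, not the method of any statistical office: the function name and the probabilities are hypothetical, all births are assumed to occur within marriage, and the mortality of women is ignored.

```python
def completed_fertility(prop_marrying, progression):
    """Mean number of children per woman in a cohort.

    prop_marrying -- share of the cohort's women who ever marry
    progression   -- progression[i] is the share of married women with
                     i previous births who go on to have another
    """
    mean_children = 0.0
    reaching = prop_marrying        # share of all women at the current stage
    for p in progression:
        reaching *= p               # share who go on to the next birth
        mean_children += reaching   # each of them adds one birth
    return mean_children


# Hypothetical cohort: 90 per cent marry; 90 per cent of the married have a
# first child, 80 per cent of those a second, 50 per cent a third, and so on.
example = completed_fertility(0.9, [0.9, 0.8, 0.5, 0.4])
```

Truncating `progression` at the stages a cohort has reached so far gives its cumulative fertility to date rather than its completed fertility.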

In cohort analysis, the stages of family building 
are related to specific historical dates, not just to 
abstract age categories. It is therefore possible to 
link variations in family-building patterns for par- 
ticular cohorts to time series for economic or other 
social phenomena. 

Data for fertility studies. The newer, more 
sophisticated measures cannot be used in most 
countries at present, because they lack even the 
elementary data required for the simpler conventional
measures. However, special efforts are being
made to improve fertility statistics in some of
the developing countries in order to measure the 
success of programs to reduce the birth rate [see 
POPULATION, article on POPULATION POLICIES]. 

Sample surveys are used increasingly to esti- 
mate fertility and other related phenomena, espe- 
cially where other statistical resources are lacking. 
For example, a series of sample surveys is the 
principal basis for our knowledge of African popu- 
lations. The sample survey is also often used to 
collect information about social factors that may 
affect fertility, since some of the most important 
intermediate variables, such as age at marriage 
or use of contraception, do not appear at all in the 
official series. 


Fertility in preindustrial societies 


Despite the lack of statistical data, it can be 
asserted with some confidence that fertility rates 
have been moderately high to very high in almost 
all preindustrial societies throughout history. With 
the high mortality levels that have prevailed in 
such societies, fertility had to be moderately high 
for the society to survive. 

An increasing volume of data for contemporary
preindustrial societies indicates, without excep-
tion, that fertility is at least moderately high (see
Table 2). With the exception of the Tikopia, the
only substantiated instances of relatively low fer-
tility in preindustrial societies are those subject to
extreme social disorganization (Lorimer 1954,
chapter 3). From the point of view of a preindus-
trial society, of course, high fertility is a functional
adjustment to high mortality; but from the point
of view of the reproducing couple, high fertility is
motivated by the central importance of familial
and kinship ties in their lives. Motivations for
larger numbers of children are increased when
children enhance the ability of the familial unit to
attain socially valued goals, and when such goals
are attained through kinship and familial ties
rather than through other social relationships.

Table 2. Distribution of less and more economically
developed countries by level of crude birth
rate, 1963

Crude birth rate (per       Less developed   More developed
thousand population)        regions (b)      regions (b)
Under 15.0                        -                3
15.0-19.9                         1               17
20.0-24.9                         4               10
25.0-29.9                         1                3
30.0-34.9                         3                -
35.0-39.9                        11                -
40.0-44.9                        21                1
45.0-49.9                        28                1
50.0-54.9                        12                -
55.0-59.9                         5                -
60.0 and over                     2                -
Total countries (c)              88               35

a. Wherever a crude birth rate had not yet been published for 1963, the most
recent available rate was used.
b. See Table 1 for geographical composition of these regions.
c. Excluding countries having fewer than 250,000 inhabitants in 1960, countries
having no satisfactory data, and a few countries for which examination
of available data had not been completed in 1963.

Source: Adapted from “Conditions and Trends . . .” 1965.

Fertility and social values. Although a reason-
ably large number of children may be vitally im-
portant to parents in a preindustrial society, this
does not necessarily mean that the normative value 
is for the highest possible fertility. But evidence on 
the existing norm in preindustrial societies is less 
than satisfactory. Ford (1945) has indicated that 
the normative pressures against childlessness and 
against very small families are great and probably 
universal in preindustrial societies. However, he 
has also concluded that mothers are ambivalent 
about childbirth in many circumstances and that 
considerable social pressure may be required to 
ensure adequate reproduction. Davis and Blake 
(1956) have pointed out that since to have some 
children is very important, a society with high and 
variable mortality is likely to have built into its 
structure very strong pressures for bearing children
early in marriage, before the death of one or
both parents, and also to have additional children 
as a safeguard against the catastrophic loss of the 
essential minimum number. On the other hand, if 
unfavorable conditions develop, this may result in 
“too many” children. Therefore, there may be a 
delicate balance between pressures that tend 
toward higher fertility, to ensure at least a certain 
minimum number of children, and contrary pres- 
sures that encourage abortion or infanticide in 
order to minimize or eliminate what, under difficult 
subsistence conditions, becomes an intolerable sur- 
plus. In any case, although there is general agree- 
ment that fertility is likely to be high in almost all 
preindustrial societies, there is also a body of evi- 
dence about the existence of control practices that 
keep fertility below a potential biological maxi- 
mum, and about cultural and social factors that pro- 
duce variations in fertility through these practices. 

Cultural effects on intermediate variables. On 
the question of control, there is evidence that the 
“natural fertility” of man is probably higher than 
that reported for most preindustrial societies (Lori- 
mer 1954, chapter 1; Henry 1961), but genetic 
differentials in fecundity cannot be ruled out. 
Health conditions reducing fecundity and increas- 
ing fetal mortality may constitute a functional, 
if unintended, adjustment to keep fertility below 
maximum levels. 

Evidence has been assembled on the use of a 
wide variety of control measures—including con- 
traception, abstinence, abortion, and infanticide— 
in many preindustrial societies (Himes 1936; 
Devereux 1955; Nag 1962). Unfortunately, the 
evidence usually demonstrates the existence of cer- 
tain practices in a culture without specifying either 
the extent of use or the effect on fertility. Never- 
theless, there is a basis for the tentative generaliza- 
tion that more or less effective methods of control 
potentially were available in many preindustrial 
societies, probably affected fertility levels in some, 
and, presumably, might have had a much wider 
use and effect were it not for the rewards derived 
from having children in such societies and the risk 
that these rewards would be lost because of unpre- 
dictably high mortality [see FERTILITY CONTROL]. 

There are no empirically validated general ex- 
planations for the variations in fertility from high 
to very high among preindustrial societies. It is
often possible to point to particular intermediate 
variables as the immediate causes of less than 
maximum fertility, but even in these instances the 
level of the intermediate variables is a topic that
so far has given rise to explanations that are at
best speculative.

Among the most important attempts at general 
explanation are those linking variations in kinship 
structure to variations in fertility (Lorimer 1954, 
chapter 2). A specific example of importance is 
the hypothesis that neolocal nuclear family sys- 
tems and related economic arrangements led to 
late marriage or nonmarriage in preindustrial 
Europe, producing relatively low birth rates (Evers- 
ley 1963). If this is true, it runs counter to the 
most commonly held theories, which associate such 
birth rates with industrialization. It has also been 
suggested that some institutions may function to 
limit fertility, although this is not their deliberate 
purpose. For example, many religious systems pre- 
scribe periods of sexual abstinence, which usually 
(although not always) have the indirect effect of 
limiting fertility (Nag 1962). 

Another unintended restraint on fertility is the 
possible reduction in fecundity resulting from poor 
nutrition and health conditions, whether these 
affect the whole society or only the lowest strata. 
This is one possible explanation for the fact that 
in some Indian studies the poorest rural workers 
have the lowest fertility. Another finding is that the 
period of amenorrhea and temporary sterility fol- 
lowing a pregnancy apparently is lengthened if an 
infant lives and is breast-fed. This suggests that 
variations in infant mortality and in weaning prac- 
tices may also affect fertility (Potter et al. 1966). 

Lack of an adequate contraceptive technology 
has been suggested as an explanation for high fer- 
tility in preindustrial societies (Davis & Blake 
1956, p. 223). An opposing point of view is that 
contraceptive practices are not developed or widely 
disseminated when available, because the struc- 
ture of such a traditional society provides little 
motivation for small rather than large families. 
Coitus interruptus, probably the principal means 
for modern fertility decline in England and France, 
has been practiced for centuries by some couples 
within many preindustrial societies. If sufficient 
motivation for small families had been present, 
such a method might have been adopted more 
widely, in at least a few preindustrial societies. 

These are only illustrations of theories and 
studies indicating that a variety of cultural factors 
in preindustrial societies affect the intermediate 
variables in such a way as to keep fertility below 
its maximum biological potential. Since many of 
the limits on fertility are based on cultural patterns 
without consciously recognized links to family size, 
changes in such practices initially may lead to
higher fertility before modern family-planning practices
take their place. For example, this might be
the result of a further relaxation of traditional 
sanctions against the remarriage of widows in 
India. 


Fertility in industrial countries 

Transitional decline of fertility. The large de- 
cline in fertility in economically developed coun- 
tries in the nineteenth and twentieth centuries is 
unprecedented. Most sociologists and demogra- 
phers would probably agree that there have been 
two basic causes for the general decline. In the 
first place, there has been a major transfer of func- 
tions from the family to other specialized institu- 
tions, so that fewer children are required to achieve 
socially valued goals. Second, a sharp reduction in 
mortality has reduced the number of births neces- 
sary to have any desired number of living children. 
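
The second point is a matter of simple arithmetic. As a rough sketch, assuming that every child survives to the relevant age with the same probability, the expected number of births needed is the desired number of living children divided by that probability. The function name and the survival figures below are illustrative assumptions, not data from the studies cited.

```python
def births_needed(desired_living_children: float, survival_probability: float) -> float:
    """Expected births required so that, on average, the desired number survive."""
    if not 0 < survival_probability <= 1:
        raise ValueError("survival_probability must be in (0, 1]")
    return desired_living_children / survival_probability


# Hypothetical survival probabilities, chosen only for illustration.
for label, survival in [("high mortality", 0.5), ("low mortality", 0.9)]:
    print(label, round(births_needed(3, survival), 2))
```

Under these assumed figures, a couple wanting three living children would need about twice as many births at the higher mortality level as at the lower one.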

The historical timing of fertility decline in most 
countries, rather long after the beginning of the 
modern process of economic development and 
social change, has been explained in terms of a 
descriptive model of the “demographic transition.” 
This model assumes that fertility and mortality are 
both high in the preindustrial period. The rapid 
population growth associated with modern eco- 
nomic development is then attributed to a decline 
in mortality that occurred while fertility remained 
relatively stable at rather high levels. It is further 
assumed that after a considerable time lag, fer- 
tility begins to fall, reducing the rate of population 
growth as it approaches the level of mortality. The 
usual explanation of this time lag is that whereas 
low mortality is always positively valued, there are 
no low fertility norms to carry over from the pre- 
industrial period; they must be developed gradually 
in a trial-and-error process, under the influence of 
lower mortality and the changing consequences of 
varying numbers of children. 
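
The time lag in this descriptive model can be sketched numerically. The example below is a stylized illustration only: the starting levels, the length of the lag, the linear shape of the declines, and the function name are all assumptions made for the sketch, not estimates for any country.

```python
def transition_rates(year, mortality_start=50, fertility_lag=50, duration=75,
                     high=40.0, low_death=10.0, low_birth=15.0):
    """Return (birth_rate, death_rate) per 1,000 for a stylized transition.

    Mortality starts to fall at `mortality_start`; fertility starts to fall
    `fertility_lag` years later. Both declines are assumed to be linear.
    """
    def decline(start, start_level, end_level):
        # Fraction of the decline completed by `year`, clamped to [0, 1].
        progress = min(max((year - start) / duration, 0.0), 1.0)
        return start_level + (end_level - start_level) * progress

    death_rate = decline(mortality_start, high, low_death)
    birth_rate = decline(mortality_start + fertility_lag, high, low_birth)
    return birth_rate, death_rate


# Natural increase (births minus deaths) widens during the lag, then narrows.
for year in range(0, 201, 25):
    births, deaths = transition_rates(year)
    print(year, round(births, 1), round(deaths, 1), round(births - deaths, 1))
```

In this sketch the gap between the two rates is zero at the outset, widens while mortality falls and fertility stays high, and narrows again once fertility declines toward the level of mortality.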

This simple model of the demographic transition 
is under revision as the result of a number of im- 
portant studies. Some have shown a considerable 
variability in the actual course of the demographic 
transition (Hatt et al. 1955; Ryder 1957). Espe- 
cially significant has been the work of a number 
of economic historians (for instance, Eversley 
1963) probing, in detail, European data for the 
medieval period and for the seventeenth and 
eighteenth centuries. These studies have tried to 
relate changes in economic organization and in 
family structure to changes in age at marriage, 
rate of marriage, and illegitimacy. An important, 
if controversial, hypothesis emerging from this
work is that a rise in fertility rather than a fall in
mortality may be responsible for the population 
growth during the early stages of industrialization. 

Explanations for the decline in fertility that 
eventually occurred throughout the Western world 
sometimes concentrate on the intermediate varia- 
bles and sometimes on the changes in social struc- 
ture, which first produced the small family norms 
and then affected the intermediate variables. 

The role of contraception. Most scholars prob- 
ably would agree that the mass adoption of contra- 
ception is the most important change immediately 
responsible for the decline in fertility in modern 
industrial countries. Notable exceptions are Ire- 
land, where men have tended to marry late or not 
at all, and Japan and the countries of eastern and 
central Europe, where abortion has been widely 
practiced since World War II.

It has generally been assumed that contracep- 
tion was the most important factor in the develop- 
ment of the small family that gradually became 
the norm for western Europe. But this assumption 
must remain, at best, a plausible hypothesis, since 
there is an almost complete lack of systematic com- 
parative data for many of the other intermediate 
variables affecting fertility. The only study provid- 
ing a series of historical statistics on the use of 
contraception is that by Lewis-Faning for England 
(Great Britain . . . 1949). He demonstrated that 
as the use of contraception increased, fertility de- 
clined, and that contraception was adopted first in 
those social classes whose fertility fell first. The 
practice was found to have spread to other social 
classes in reasonably close correspondence to ob- 
served patterns of differential fertility. There is 
also illustrative evidence about the adoption of con- 
traception—especially coitus interruptus—by sig- 
nificant elements in the French population in the 
late eighteenth and early nineteenth centuries; and 
it so happens that this was the period in which 
fertility in France began to fall (France . . . 1960).

Since the war several studies have provided sta- 
tistical evidence of the mass use of contraception 
in a number of Western countries and have related 
the practice to fertility levels (e.g., Freedman 1961, 
pp. 116-118). These studies do indeed demon- 
strate that many populations with a history of 
marked secular decline in fertility now practice 
family planning on a massive scale; but another 
finding that they also have in common is that con- 
traceptive practice is not necessarily highly rational 
and effective, even when the average number of 
children per family is limited to two or three. Even 
in the United States, where contraception is virtually
universal for the fecund at some time in married life,
many couples use ineffective means,
many do not begin contraception early in marriage 
(although they do not want all their children im- 
mediately), some begin only after having more 
children than they want, and many conceive de- 
spite contraceptive efforts to postpone pregnancies 
(Freedman et al. 1959, chapters 3-6). In Japan 
and a number of east European countries, the fail- 
ure of contraception as practiced to produce the 
small families desired has been followed by massive
supplementary use of legal abortion.

Biological theories. Before World War II, it was
not uncommon to attribute the decline in fertility 
to a decrease in biological fecundity. The main 
argument was that development of industrial- 
urban society brought about a decrease in fecun- 
dity. This was held to be a consequence either of 
genetic selection or of the stresses supposedly 
associated with urban life—especially for the high- 
er status groups, whose fertility fell fastest. Al- 
though such explanations may be valid for special 
situations, the prevailing scholarly opinion does 
not assign them great importance in the over-all fertility
decline in Western countries. Kiser and
Whelpton, in the Indianapolis study, demonstrated 
both that a modern urban population can have a 
high fertility rate during periods when contracep- 
tion is not used and that fecundity differences are 
not associated with socioeconomic status in the 
way that they should be if the biological argument 
about fecundity decline is true (see Whelpton 
1943-1958, vol. 2, pp. 303-416). This argument 
has also been weakened by the difficulty of recon- 
ciling it with the temporal changes in fertility that 
have been observed to follow changes in economic 
conditions.

Whatever combination of intermediate variables 
was responsible, there is no question that fer- 
tility fell rapidly in all the developed Western 
countries, so that just before World War II small
families averaging closer to two than three children 
were characteristic. In many countries, fertility was 
below the level required for replacement even with 
the very low prevalent mortality, and concern 
about a declining and aging population led to 
national investigations and official pronatalist 
policies. 

Sociological theories. Many explanations were 
advanced for these unprecedented fertility declines. 
The dominant ideas before World War II, however,
were all related to the changing functions of family 
and children in an urban-industrial society. To 
summarize the theories: Industrial urbanization 
was associated with a complex division of labor in
all spheres of life; with the associated high rate of
social and physical mobility, this inevitably led to 
a growth of secularism and rationalism, the de- 
clining influence of such traditional forces as re- 
ligious faith, and the shattering of traditional 
family and other primary group associations. An 
essential element in this view of urban life was the 
idea that the family would lose its functions to 
other specialized institutions. On the one hand, 
children would cease to be productive assets in a 
familially based economy, and, on the other hand, 
they would be impediments to active participation 
in the larger organizations from which the rewards 
of an urban society would come. The dominant 
view was that as whole populations became in- 
volved in the urban market and society, family 
planning would become universal and the size of 
family planned would continue to decline. 

It is plausible that a shift in functions from the 
family to other institutions is one basic explana- 
tion for the secular decline in fertility in the de- 
veloped countries. But in retrospect there is little 
systematic evidence in prewar studies linking 
specific changes in the functions of the family to 
the decline of fertility at specific times and places. 
There are still no satisfactory answers to some im- 
portant comparative historical questions, for ex- 
ample: Why did fertility begin to decline much 
earlier in France than in England, where urban- 
industrial development was earlier and more in- 
tense? Why has fertility remained higher in the 
Netherlands than elsewhere, despite that country’s 
early involvement in the nexus of international 
trade? 

One line of theory and research has stressed the 
joint role of social mobility and rising standards of 
living in motivating couples to restrict family size. 
This is one explanation offered for the early de- 
cline of fertility in France (Blacker 1957). Banks 
(1954) has linked the onset of the fertility decline 
in England with a change in economic conditions 
that made restriction of family size necessary for 
the maintenance and advance of living standards 
in the rising middle class. More recent statistical 
studies produce conflicting results, with some indicating
no correlation between mobility and fertility
in the United States (Westoff et al. 1961)
and others finding a relationship in France (Bresard 
1950), England (Berent 1952), and Brazil (Hutch- 
inson 1961). Even in those cases in which a rela- 
tionship is found, it is possible that the distinctive 
fertility of the mobile population is simply an aver- 
aging of the fertility levels of their positions of 
origin and destination, and not a distinctive conse- 
quence of social mobility itself. 
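
The averaging interpretation can be stated as a simple check. The sketch below assumes an equal-weight average of the fertility levels at origin and destination, which is only one way to read "averaging"; the function names and the figures are hypothetical.

```python
def averaging_expectation(origin_fertility: float, destination_fertility: float) -> float:
    """Fertility expected for mobile couples if mobility itself has no effect."""
    return (origin_fertility + destination_fertility) / 2


def mobility_residual(observed: float, origin_fertility: float,
                      destination_fertility: float) -> float:
    """Observed fertility of the mobile group minus the averaging expectation."""
    return observed - averaging_expectation(origin_fertility, destination_fertility)


# Hypothetical mean family sizes: stratum of origin 3.5, stratum of destination 2.5.
print(mobility_residual(3.0, 3.5, 2.5))            # no distinctive mobility effect
print(round(mobility_residual(2.6, 3.5, 2.5), 2))  # below the averaging expectation
```

Only a residual clearly different from zero would point to a distinctive consequence of mobility itself.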


Postwar fertility increase in the West. A de- 
crease in childlessness is only one of the important 
postwar demographic trends that direct attention 
to the need for a re-evaluation of the role of family 
and children in society. In most of the developed 
countries in which fertility reached very low levels 
before the war, there has been an increase in the 
proportion of people marrying, a decrease in the 
age at marriage, and an increase or stabilization 
in the average family size of those who marry. 
There is, however, little evidence of a return to 
large families since the war; the shift is from hav- 
ing no children or one child to having from two to 
four. In a number of countries, an increase in the
number of marriages, rather than in the number 
of children per marriage, accounts for a significant 
part of the postwar rise in fertility. 

Since the war, urban sociologists have given in- 
creasing recognition to the persistence and even 
the resurgence of primary groups, including the 
family, as the means by which stable individual 
personality organization is maintained in imper- 
sonal specialized societies. Part of this revaluation 
of urban society involves more attention to the per- 
sistent influence of traditional ideologies in re- 
ligious and other associations with ends that are 
not primarily economic or political. Such associa- 
tions had been doomed to extinction by prewar 
sociologists, who were fond of stressing the growth 
of secular rationalism. By contrast, a community 
study by Lenski (1961) shows the persistence of 
Catholic ideology in many areas of life, including 
fertility, even after the consideration of the influ- 
ence of rising education, socioeconomic status, and 
urbanization. These results are confirmed, with 
more demographic detail for fertility, in the Prince- 
ton study (Westoff et al. 1961). 

Some sociologists and demographers have sug- 
gested that the leveling off in the long-run secular 
decline in fertility may mean that family size has 
reached a level appropriate to its functions under 
urban-industrial conditions. From this viewpoint, 
considerable experimentation may be required be- 
fore a stable social norm is developed governing 
the size of such an important unit as the family. 
The very low fertility of the 1930s is seen as an 
experimental “overshooting” of an “equilibrium 
solution,” with a readjustment following when the 
dysfunctional consequences were widely felt. The
increasing convergence of family size in many 
countries to a small range of one to four children 
may be additional evidence that a family-size pattern
appropriate to urban-industrial societies is
developing (Universities-National Bureau Committee
for Economic Research 1960, pp. 36-76).


Differential fertility


Differential fertility refers to variation in fer- 
tility among significant subgroups in a population. 
In principle, almost any classification of the popu- 
lation may be a basis for measuring fertility dif- 
ferences. Scholarly attention, however, has cen- 
tered on groups of people whose differing positions 
in the society give them different resources, styles 
of life, and power. Such groups frequently are dis- 
tinctive in social norms and culture traits affecting 
fertility. 

General principles. Classifications used in stud- 
ies of differential fertility include those based on 
education, occupation, income, wealth, landhold- 
ing, caste, social class, labor-force status of the 
wife, religion, ethnic-racial groupings, regional divisions,
bureaucratic positions, intelligence, and
size of place of residence. Occupation and educa- 
tion have been the most frequently used bases for 
classification, both because they are readily avail- 
able in censuses and in official vital statistics and 
because they are significant indicators of status 
and style of life in most modern societies. 

Social groups or strata that differ in their fer- 
tility differ also on one or more of the intermediate 
variables affecting fertility. Differentials may result 
from deliberate controls, such as contraception, or 
from the unintended consequences of other varia- 
tions. For example, in some places, the rate of 
spontaneous abortions is higher in lower-status 
than in higher-status groups, and this has an un- 
intended effect on the status differentials. 

For a number of contemporary societies, there 
is evidence that the low-fertility subgroups have 
used deliberate family-limitation practices—mainly 
contraception and abortion—more extensively and 
effectively than other groups. Substantiating data 
are available, for example, for the United States, 
England, Japan, Hungary, Czechoslovakia, Puerto 
Rico, Turkey, India, and Taiwan. 

It is more difficult to establish unequivocally 
whether the higher fertility subgroups have larger 
families because they want them or because they 
do not have the knowledge or means to restrict 
family size. Low motivation toward a small-family 
goal may reduce interest in learning and practicing 
effective family-planning methods. In the United 
States, there is evidence that the practice of contra- 
ception for spacing births becomes more effective 
as the number of children born to a couple ap- 
proaches the desired total. This suggests that in- 
effective family planning may reflect a desire for 
more children rather than inability to use the proper 
means. On the other hand, the important Indianapolis
study by Kiser and Whelpton in 1940 found
that low-status and high-status groups expressed 
similar attitudes about desired family size but that 
the low-status groups had more children because 
they practiced contraception less effectively (Whelp- 
ton 1943-1958). 

At least in societies in which socially significant 
groups of people deliberately practice family limita- 
tion, differential fertility levels will be affected by 
distinctive subgroup attitudes about how many 
children a family should have. Presumably, fer- 
tility will be higher and family limitation practices 
less prevalent or effective in subgroups in which 
larger numbers of children are rewarding in vari- 
ous ways. The relatively high fertility of the agri- 
cultural sector of many populations, for example, 
can be linked to the utility of children, whether as 
laborers in the family enterprise, as an alternative 
to urban social-security programs to meet the risks 
of illness, disability, and old age, or as the members
of essential primary social groups. Similar func- 
tions have been ascribed to larger numbers of 
children among low-status urban groups during the 
period of transition to a modern economy. 

Fertility differentials often do not correspond to 
the differentials in the rate of reproduction—a 
measure that also takes mortality into account. For 
example, in countries as different as India and 
England, the mortality rate is considerably higher 
in lower-status than in higher-status groups. There- 
fore, by the time a given cohort of women has 
reached the end of the childbearing period, differ- 
ences between social strata in the number of living 
children per mother often vary considerably from 
differences in the average number of children ever 
born to each mother.
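
A small numerical sketch shows how a differential in children ever born can shrink or vanish once mortality is taken into account. The figures are hypothetical and the calculation assumes a single survival probability per stratum; neither comes from the studies cited.

```python
def living_children(children_ever_born: float, child_survival_probability: float) -> float:
    """Average number of surviving children per mother in a stratum."""
    return children_ever_born * child_survival_probability


# Hypothetical strata: (children ever born per mother, child survival probability).
strata = {"lower status": (6.0, 0.5), "higher status": (4.0, 0.75)}

born_gap = strata["lower status"][0] - strata["higher status"][0]
living_gap = (living_children(*strata["lower status"])
              - living_children(*strata["higher status"]))
print(born_gap, living_gap)
```

With these assumed values the lower-status stratum has two more births per mother, yet the two strata have the same average number of living children.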

Fertility differentials by status measures. Such 
stratification measures as education, occupation, or
income have been studied most extensively in rela- 
tion to fertility, partly because data are available 
in standard demographic sources, but also because 
stratification is important in social and economic 
theory (United Nations . . . 1953, chapter 5). So- 
cially patterned differences in life style and eco- 
nomic activity presumably may influence either the 
social norms about family size or the variables 
immediately determining fertility, such as fetal 
mortality and effective access to contraception. But 
there is no single, universally valid relation between 
status measures and fertility, because such particu- 
lar indications of status as education or land- 
ownership vary in their significance in different 
societies and because the value attached to large 
families is greater in some societies than in others. 
Indeed, the resources that go with high status are
likely to be used to achieve large families in some
societies and small families in others. 

For preindustrial societies, there is evidence in 
some studies of a positive correlation between fer- 
tility and status, especially among agricultural pop- 
ulations (Stys 1957). The nature of the evidence 
varies. Fertility has been found to increase with 
size of landholding; agricultural tenants or workers 
have been found to have lower fertility than land- 
holders; persons with low-status occupations have 
been reported to have lower-than-average fertility.
The evidence for a positive correlation is reported 
in regional studies for England, Germany, Poland, 
China, the United States, India, and the Philippines 
for various periods between the seventeenth and 
twentieth centuries. 

A plausible explanation of positive correlations 
between fertility and status in preindustrial soci- 
eties is that dependence on relatives and children 
for a variety of social needs increases the social 
valuation of large families, so that those with 
higher status use their power to increase family 
size (for example, by early marriage). It is also 
plausible that in some instances poor health, poor 
nutrition, the absence of males seeking work, or 
high maternal mortality among fertile lower-status 
mothers may explain the lower fertility for lower-status
groups. The thesis that higher fertility invariably
goes with higher status in all preindustrial
societies is certainly not supported by the evidence. 
For example, in some contemporary regional studies 
of peasant populations in India, there is no cor- 
relation between status measures and fertility 
(Chandrasekaran 1963). 

A negative correlation between occupational 
status and fertility in Western urban populations 
appears with the earliest pertinent census data in 
the nineteenth century (United Nations 1953). In 
the period of transition to a mature industrial- 
urban society, negative correlations between broad 
status measures and fertility are found wherever 
systematic data are available. Although the evidence
is mainly for Western countries, similar
negative relationships have been reported for the 
advanced changing urban sectors in such diverse 
places as India, Taiwan, Chile, Puerto Rico, and 
Lebanon. Plausible explanations for these negative 
correlations center on the idea that high-status 
groups live first in the developing urban-industrial 
sectors in which they enjoy a number of advantages.
These would include superior access to information
about the means for fertility control,
later marriage because of higher educational stand- 
ards, and lower mortality rates (making fewer 
births necessary for any desired number of children).
Such families, it is argued, would find the
economic value of children relatively small in the 
course of acquiring higher living standards for 
themselves and their children. They would also be 
likely to participate in extrafamilial activities, 
which could occupy time that might otherwise have 
been given to additional children. Whatever the 
explanation, the existence of the negative relation 
between status and fertility is well documented for 
many places. 

The lower fertility of high-status groups in the 
period of transition to advanced industrialization 
may be the result of both the smaller proportion 
marrying at important child-bearing ages and the 
use of contraception to limit family size within 
marriage. Historical data for status differentials in 
the use of contraception corresponding to differen- 
tials in fertility are available only for England, as 
already mentioned (Great Britain . . . 1949). However,
similar differentials in use of contraception
have appeared later in other countries, as data have
become available. More recently, there is evidence
that the differential use of legal abortion is pro- 
ducing fertility status differentials in Japan and in 
some countries of eastern Europe. 

At least in the United States, the higher fertility 
of low-status groups is partly a consequence of 
their rural origin rather than of their status in the 
urban community (Goldberg 1959). There is a 
high proportion of farm-reared persons in low- 
status groups, whether these are defined in terms 
of occupation, income, or education. When only 
couples living in cities for at least two generations 
are considered, either the status differentials be- 
come slightly positive or the negative correlation is 
reduced. This is consistent with the thesis that the 
negative correlation is part of the transition to a 
mature urban society rather than an intrinsic char- 
acteristic of such a society. 

The negative relation of status to fertility is most 
pronounced in data for the period before World 
War II. However, even at that time there were
many exceptions within broad occupational groups. 
For example, low fertility is reported among service 
workers, such as barbers and waiters, having fre- 
quent contact with higher-status groups. For the 
period between the world wars, the highest income 
groups sometimes had higher fertility rates than 
the next income group, especially when compari- 
sons are made for persons of high educational or 
occupational status. 

There was a sharp contraction in fertility dif- 
ferentials in many countries in the “baby boom” 
following World War II. Fertility generally increased
most in those social strata which had the
lowest fertility before the war. However, even in
the United States, where this phenomenon was 
very pronounced, there continued to be significant 
negative correlations between various status meas- 
ures and fertility (Grabill et al. 1958). 

Postwar studies, mainly in the United States, 
indicate that there is now very little relationship 
between measures of social status and the number 
of children desired and expected. It is not yet 
known whether the expectations and desires of 
each stratum will be realized in action. Apart from
possible changes in what is desired, the present 
evidence is that lower-status groups still are less 
effective in planning their fertility to attain desired 
family size. 

Some scholars believe that in a mature urban- 
industrial society fertility eventually will be posi- 
tively correlated with status—and especially with 
income (Universities-National Bureau . . . 1960,
pp. 209-240). A basic assumption is that effective 
use of contraception, together with very low mor- 
tality rates, will diffuse throughout the society, so 
that the fertility of each stratum will directly re- 
flect the value of varying numbers of children in 
that stratum. Children, it is argued, having lost 
their differential utility as laborers or as social 
security resources, will be valued by all social 
classes for direct satisfactions, such as those 
yielded by durable consumer goods. The higher- 
income groups will then be able to afford and will 
choose to have more children than low-income 
groups. There is some support for this argument 
in data showing that there is a small positive cor- 
relation of income and fertility among couples who 
plan family size effectively and that this correla- 
tion is highest for couples with an urban back- 
ground (Whelpton 1943-1958, vol. 3, pp. 360- 
415). 

This projection of a positive correlation in the 
mature urban society makes assumptions, still un- 
verified, about the elasticity of demand for children. 
One could just as well argue that the higher educa- 
tional status generally found with higher income 
may lead to a broadening of interests, which will 
compete for attention with the demands of larger 
families. The theory also assumes that in the mod- 
ern welfare state even the lowest status groups can 
receive from nonfamilial institutions assistance for- 
merly received from children in meeting the risks 
of illness, disability, unemployment, old age, and 
loneliness. 

Differentials by city size and farm residence. In 
industrialized countries, fertility is almost always 
highest in the farm sector and tends to decrease as 
the size of the community increases. The explanation
generally offered is twofold: it is said that
contraception diffused from larger to smaller cities, 
and from there to the farms, and that the func- 
tional advantages of larger families are greatest, 
and their costs least, in smaller communities and 
on farms. 

With the postwar rise in birth rates, fertility dif- 
ferentials by city size probably decreased in many 
countries, since fertility increased most in the 
status groups concentrated in large cities. Never-
theless, the traditional differentials persist in most
Western countries for which there are adequate 
data. It is not yet clear whether the remaining in- 
verse relation of fertility to city size in developed 
countries is a permanent feature of their structure, 
or a transitional phenomenon. 

In the United States, at any rate, urban-rural 
fertility differentials have existed from a very early 
time. Declines in fertility in the rural and urban 
sectors followed parallel courses until about 1940, 
but since then urban fertility rates have risen more 
than the rural rates, thus reducing the differential. 
In a number of preindustrial countries (India, for 
example) urban-rural fertility differentials either 
do not exist or are minimal (Robinson 1961). In 
such societies, cities are often administrative and 
political centers in which familial organization does 
not differ greatly from that in rural areas; the city 
lacks the specialized institutions that take over 
many family functions in industrial societies.

Religious differentials. Most research on reli- 
gious fertility differentials deals with the specific 
influence of the Roman Catholic church in limiting 
the practice of family planning (except by periodic
abstinence) and in supporting norms for larger 
families. The doctrines of the church do not re- 
quire large families, but they do restrict the means 
that may be used to achieve smaller families. The 
fact that empirical findings differ with the country 
studied suggests that the influence of the church 
depends on other variables in the society. In the 
United States, close attachment to the church is 
associated with higher fertility and with lesser use 
of the most effective fertility-control practices, even 
among higher-status groups. Lenski (1961) ex- 
plains this result by an emphasis in Catholicism 
on familial rather than economic mobility values. 
But studies in Puerto Rico and other Latin Amer- 
ican areas find no relation between attachment to 
the church and fertility. Indeed, international com- 
parisons of fertility do not show any significant 
correlation between fertility and the percentage of 
Catholics in the population, when the level of gen- 
eral social development is taken into account. 


Most of the other major world religions do not 
have specific doctrines about family limitation.
However, religious practices frequently have an in- 
direct effect on fertility. For example, in both India 
and China major religious traditions support a 
strong preference for at least one and preferably 
two sons, thus increasing the pressures for addi- 
tional children. A pervasive fatalism in a religious 
ideology may have a greater effect than a specific 
injunction against contraception in impeding ra- 
tional planning of fertility. 

Other differentials. In industrial societies, wives 
who work away from home are found to have fewer 
children than those who stay at home. Some women 
work because they cannot have children, but the 
lower fertility of working women persists even 
when only fecund women are considered. Where 
the wife’s work is closely tied to the home, lower 
fertility may not result. In Japan and Puerto Rico, 
women who worked at home or in small family 
enterprises were found to have higher fertility than 
other working wives. 

Fertility differentials between ethnic, racial, or 
regional populations are extensively studied, be- 
cause of their practical economic or political sig- 
nificance. But the meaning of the differentials 
depends on their specific historical and cultural 
context. Sometimes it is possible to explain such 
differentials in terms of status or rural-urban dif- 
ferences; otherwise, no successful analytic general- 
ization of these studies has been made. 

Consequences of differential fertility. Much of 
the early interest in differential fertility was eu- 
genic. It was feared that the genetic quality of the 
population would deteriorate if low-status groups 
had higher-than-average fertility. This assumes that 
lower-status groups are more poorly endowed ge- 
netically—for example, with respect to intelligence 
—but this has not been scientifically established. 
Certain types of mental abnormality have a known 
genetic basis, but they are not sufficiently numer- 
ous to affect broad status differentials. It is likely 
that high or low fecundity itself may have a genetic 
basis, but this also is not known to have any spe-
cific connection to broad socially significant strata.
Whether there are important genetic differences 
between social strata is still an open question. 
Probably, there is greater consensus that differen- 
tial fertility may produce a social problem, regard- 
less of the genetic origin of the problem, if fertility 
is high in groups with limited personal or social 
resources for child-rearing. There is some evidence 
that intellectual development is greater for children 
reared in small families; but the apparent contrac-
tion of fertility differentials in the postwar period
in industrial countries has reduced the earlier con- 
cern about the social and biological consequences 
of differential fertility. 

Differential fertility may affect the rate and 
character of social mobility. Varying rates of repro- 
duction of different social strata affect the relative 
number of opportunities available in each stratum, 
depending on the extent of social inheritance of 
position. More generally, the rate of recruitment of 
new members is one important determinant of the 
continuity, structure, and power of any subgroup 
in the population. Recruitment from other groups 
by adoption or invasion is an alternative to repro- 
duction, but differential fertility is often the prin- 
cipal determinant of the relative growth of popu- 
lation subgroups from generation to generation. 
Where success of groups contesting for political 
power depends on their size, a lively concern about 
comparative reproduction rates often is evident. 


RONALD FREEDMAN 


[Directly related are the entries GENETICS, article on 
DEMOGRAPHY AND POPULATION GENETICS; POPULA- 
TION. Other relevant material may be found in MI-
GRATION; MORTALITY; NUPTIALITY.]
“Fertility control,” as the term is used in this
article, refers to patterns of human behavior that 
have as their primary objective the prevention of 
unwanted pregnancies and births. Individuals and 
couples adopt these patterns in accordance with 
their cultural values, reinforced by formal or in- 
formal social pressures. 

The methods of fertility control are traditionally 
grouped into four categories: abstinence, contra- 
ception, sterilization, and induced abortion. The 
boundaries between these categories, however, are 
not clearly delineated and will become even less so 
in the future. The rhythm method, for example, is 
a method of contraception, but it also requires ab- 
stinence during a portion of the menstrual cycle. 
Oral contraception interferes with reproduction for 
periods of time and is sometimes referred to as 
temporary sterilization, while surgical sterilization 
has been called permanent contraception. 

The term “birth control,” coined by Margaret
Sanger in 1914 (Lader 1955), is generally used 


as a synonym for contraception. From the begin- 
ning, the term was intended to exclude abortion, 
and, in general, sterilization and abstinence have 
also been excluded. A number of terms substituted 
for birth control, such as family limitation, child 
spacing, family planning, and planned parenthood, 
appear, for the most part, to be intended to pro- 
mote social acceptance rather than to improve 
communication. 

History. Under some circumstances pregnancy 
and/or childbearing have been considered undesir- 
able or have actually been proscribed in human so- 
cieties since the dawn of history. Anthropologists 
have noted the practice of abortion among many 
preliterate peoples throughout the world. It is also 
probable that abstinence and coitus interruptus 
became known as methods of fertility control at 
the same time as the connection of coitus with 
pregnancy. It would appear, however, that fertility 
control was practiced only sporadically throughout 
most of man’s long history. Since levels of mor- 
tality were generally high in premodern societies, 
only those societies survived where the level of 
natality was also high. 

The practice of fertility control on a scale suffi- 
cient to influence the trend of the birth rate began 
in France toward the end of the eighteenth century 
(France . . . 1960) and in other countries during 
the nineteenth century (Himes 1936). Generally, 
it spread from the well-to-do and educated classes 
to the underprivileged, and from the cities to the 
countryside. In some countries, however, farm 
owners were among the first to restrict the size of 
their families in order to avoid property division. 

At present, fertility control is practiced in one 
form or another and with varying degrees of suc- 
cess by most couples in the industrialized coun- 
tries of the world and by social minorities, mainly 
urban, in nonindustrialized countries. However, 
even in societies where fertility control is not 
widely practiced, many recent surveys have shown 
that the majority of couples approve the idea of 
family limitation (see, for example, Berelson 
1966). The preferred size of family, as indicated 
by the respondents, tends to be substantially 
smaller than the average number of children per 
family in the community. 


Methods of contraception 


Contraception, according to the usage of most 
writers on the subject, includes all nonpermanent 
measures to prevent coitus from resulting in con- 
ception. In older demographic literature, it has 
been customary to distinguish between “appliance 
methods,” that is, mechanical devices and spermi-
cides, and “nonappliance methods,” such as with-
drawal or the rhythm method. More recently, the 
major distinction is between the “traditional 
methods” and the “modern methods,” the latter 
referring to oral and intrauterine contraceptives. 
Still another classification is made by the Roman 
Catholic church, which distinguishes between “nat- 
ural birth control,” that is, the rhythm method (see 
below), and “artificial birth control,” comprising 
all other methods. 

Folk methods. The principal folk methods of 
contraception are coitus interruptus and the 
douche. Coitus interruptus or withdrawal of the 
male prior to ejaculation is the oldest contraceptive 
procedure known to man. It appears in the Old 
Testament (Genesis 38) and has been noted by 
anthropologists in many parts of the world. In 
western and northern Europe, where relatively late 
marriage has long coexisted with close and fre- 
quent social contacts between unmarried adults 
and with a strong condemnation of pregnancy out 
of wedlock, withdrawal appears to have been the 
method by which premarital pregnancies were 
averted. At a later time, the practice was trans- 
ferred into married life. There can be no doubt 
that coitus interruptus was the principal method 
by which the historical decline of the birth rate 
in the West was achieved. Statistical studies sug- 
gest a level of effectiveness similar to that of 
vaginal methods (see Table 3). 

Postcoital douches with plain water, vinegar, 
and various products advertised under the name 
of “feminine hygiene,” long used for purposes of 
family limitation, have lost popularity in recent 
decades. Since sperm has been found in the mucus
of the cervical canal within 90 seconds after ejacu- 
lation, medical opinion considers the contraceptive 
effectiveness of douching as quite unsatisfactory. 

Vaginal methods. Vaginal contraceptive meth- 
ods are designed to prevent the entry of sperm 
into the uterus by a mechanical barrier and/or 
to kill the sperm by chemical action. The condom 
or sheath, a cover for the penis during inter- 
course, made its first appearance in England in 
the eighteenth century. Since the latter part of the 
nineteenth century, the early “skin” condoms, made 
from the intestines of sheep and other animals, 
have gradually been replaced by the cheaper and 
more convenient rubber sheaths. A large propor- 
tion of the condoms sold in the United States prior 
to 1938 were inferior in quality. Since supervision 
over condoms has been assumed by the Food and 
Drug Administration, their quality has improved 
greatly. 

The condom offers protection not only against 
unwanted pregnancy but also against venereal
disease. It can be used without special instruction 
and elaborate preparation in any situation where 
coitus is possible. 

The diaphragm, known in the United Kingdom 
as the Dutch Cap, was invented by a German phy- 
sician, Wilhelm P. J. Mensinga, prior to 1882. 
Before the advent of oral contraceptives, it was 
the contraceptive device most often recommended 
by physicians in private practice and in birth con- 
trol clinics throughout the United States and 
Europe. The diaphragm, inserted into the vagina 
and covering the cervical os, prevents the entry of 
sperm into the uterus. To fit a diaphragm, a pelvic 
examination by a physician or other trained health 
worker is necessary. Because of this requirement, 
the method is not suitable for general use in coun- 
tries where medical personnel and medical facili- 
ties are scarce. 

The contraceptive effectiveness of the condom 
and diaphragm is high and appears to be approxi- 
mately equal for the two methods. Occasional 
failures occur, for example, if a condom tears or 
a diaphragm is displaced during coitus. A rate of 
2 to 3 pregnancies per 100 women per year would 
seem to be a high estimate for consistent users. If 
motivation is weak, much higher pregnancy rates 
must be expected. 

Chemical contraceptives consist of a spermicidal 
compound and a vehicle for its introduction and 
distribution within the vagina. Various creams and 
jellies and, more recently, foams with a highly 
spermicidal action, have been developed by the 
pharmaceutical industry. Insertion into the vagina 
is accomplished, prior to intercourse, by means of 
an applicator. The effectiveness of these methods 
seems to be lower than that of the diaphragm or 
the condom. Because vaginal tablets can be manu- 
factured easily and cheaply and because no pelvic 
examination, fitting, or apparatus is needed, they 
have been considered eminently suitable for pro- 
grams of population control in the economically 
underdeveloped regions of the world. However, the 
results of clinical and field trials have not been 
encouraging, and it does not appear likely that this 
method will prove sufficiently acceptable and effec- 
tive to be of value in reducing the rate of popula- 
tion growth. 

Rhythm method. The rhythm or safe period 
method of contraception, also known as periodic 
continence, is based on the fact, recognized inde- 
pendently in the early 1920s by Ogino in Japan 
and Knaus in Austria, that conception is possible 
during only a small fraction of each menstrual 
cycle. The fertile days can be determined either 
by the application of a simple formula to the men-
strual history of the woman or by the observation
of changes in basal body temperature during the 
menstrual cycle, or by a combination of both. 

Rhythm is at present (that is, in 1966) the only 
method of fertility control, other than complete 
abstinence, sanctioned by the Roman Catholic 
church. Its contraceptive effectiveness has been the 
subject of much controversy. The theoretical effec- 
tiveness of the method, correctly taught and under- 
stood, and consistently practiced according to the 
Ogino formula, is roughly comparable to that of 
vaginal methods. The use-effectiveness, on the 
other hand, which may be modified by errors of 
instruction or comprehension and by the couple’s 
willingness to take chances, has been consistently 
less than that of vaginal methods in comparable 
situations. 

Oral methods. The contraceptive “pill,” long 
thought of as an attractive solution to the problem 
of fertility control, became a reality in the late 
1950s. Since 1956 several oral contraceptives have 
been extensively tested in many countries. No per- 
manent or serious side effects were noted in any 
of the clinical trials, but laboratory studies of vari- 
ous organ systems have in some instances revealed 
deviations from normal values. The significance 
of these changes cannot yet be evaluated. Women 
who discontinue the medication conceive promptly. 
Oral contraceptives have established themselves 
not only as virtually 100 per cent effective, if taken 
according to prescription, but also as highly ac- 
ceptable to most users, including many women in 
the lower socioeconomic strata who had been un- 
successful with other contraceptive methods. 

Intrauterine methods. Intrauterine contracep- 
tive devices (IUDs) are small, variously shaped 
objects (rings, spirals, loops, etc.) which are in- 
serted into the uterus by a physician. The pro- 
cedure takes only a few minutes and requires no 
anesthesia. The devices are made of chemically 
inert materials, such as polyethylene or stainless 
steel, and may remain in the uterus for an indefi- 
nite period. The mechanism of action of the IUDs 
is not fully understood. The best available evidence 
suggests that the presence of a foreign body in the
uterus reduces the time required for the movement 
of the ovum through the Fallopian tube from three 
to four days to one day or less and that the fer- 
tilized ovum, therefore, reaches the uterus at a time 
when neither it nor the endometrium is ready for 
implantation. There is some evidence, however,
that the ovum may not be fertilized at all. 

Since 1960 several types of IUDs have been care- 
fully studied in the United States and elsewhere 


(Tietze 1966; International Conference . . . 1964). 
In some cases the uterus expels the device, usually 
during a menstrual period and occasionally with- 
out the woman being aware of it, or the device 
must be removed because of side effects such as 
bleeding and/or pain. No evidence has been pro- 
duced that the IUDs are carcinogenic, nor does 
their use interfere with fertility after they have 
been removed. During use, 2 to 3 accidental preg- 
nancies per 100 women per year must be expected 
even with the more successful types of IUDs. 
These failures may occur after an unnoticed ex- 
pulsion or, more frequently, with the device in situ. 
No case of fetal malformation attributable to an 
IUD has been reported. 

The sociological importance of the IUDs lies in 
the fact that they offer the only fully reversible 
method of birth control now available which re- 
quires a single decision rather than sustained mo- 
tivation on the part of the users. The insidious 
dissipation of user interest, which has played so 
frustrating a role in all other methods of contra- 
ception, is of little importance with the IUD. Re- 
gardless of apathy, fatigue, or sexual excitement, 
the contraceptive device is in place and no further 
action is required of either the man or the woman. 


Extent of contraceptive practice 


The first nationwide investigation of the extent 
of contraceptive practice in the United States was 
the Growth of American Families (GAF) Study, 
conducted in 1955 (Freedman et al. 1959). A sec- 
ond survey along similar lines was carried out in 
1960 (Whelpton et al. 1965). According to these 
surveys, the proportions of white couples with 
wives 18-39 years of age who had taken up the 
practice of birth control prior to the interview were 


Table 1. Per cent of white couples ever using contraception: United States, 1955 and 1960

                        1955    1960
Religion of wife:
  Catholic                57      70
  Protestant              75      84
  Jewish                  86      95
Education of wife:
  Grade school            48      66
  High school, 1-3        65      78
  High school, 4          74      83
  College                 84      88
Income of husband:
  Under $3,000            58      70
  $3,000-$3,999           69      77
  $4,000-$5,999           74      80
  $6,000 or more          79      85
Entire sample             70      81

Source: Adapted from Whelpton et al. 1965.


70 per cent in 1955 and 81 per cent in 1960. 
Among couples who had never tried to prevent 
conception, many were found to be sterile or sub- 
fecund. The extent and type of contraception 
varied with religious affiliation and with such in- 
dications of socioeconomic status as wife's educa- 
tion and husband's income (see Table 1). 

Among nonwhite couples, included for the first 
time in the 1960 GAF Study, only 59 per cent re- 
ported use of contraception, a substantially lower 
proportion than among the lowest educational and 
income groups of white couples. 

The great majority of couples using birth con- 
trol relied on one or more of five methods (see 
Table 2). It should be noted that the 1960 survey 
was completed before oral contraceptives came into 
general use. 

Close correlations were found in the GAF sur- 
veys between choice of contraceptive method and 
level of education, as well as other indicators of 
socioeconomic status. Use of the rhythm method 
by Catholics was strongly associated with advanced 
education. A similar association was found for the 
diaphragm among Protestants and among those 
Catholics who had experience with “artificial birth 
control.” Conversely, use of the douche and es- 
pecially of withdrawal were inversely associated 
with educational level, while the condom was a 
popular method at all socioeconomic levels. The 
findings of the GAF Study have been confirmed, to 
a large extent, by the Family Growth in Metro- 
politan America (FGMA) Study (Westoff et al. 
1961). According to a nationwide survey taken 
in late 1965 (Ryder & Westoff 1966), about 


Table 2. Per cent distribution of users by last method used: United States, 1955 and 1960

                                      RELIGION OF WIFE
Last method used    Entire sample    Protestant    Catholic    Jewish
1955
  Condom                  29             31           16         56
  Diaphragm               25             29           12         37
  Rhythm                  24             14           56          2
  Douche                   9             11            6          -
  Withdrawal               8              8            8          3
  All other                5              7            2          2
  Total                  100            100          100        100
1960
  Condom                  28             30           17         49
  Diaphragm               22             26            7         35
  Rhythm                  23             14           55          5
  Douche                   9             11            5          2
  Withdrawal               8              8           10          2
  All other               10             11            6          7
  Total                  100            100          100        100

Source: Adapted from GAF Study, 1955 and 1960 (special unpublished tabulations, privately communicated).
Table 3. Per cent of married women, under 45 years of age, ever using and currently using oral contraception: United States, 1965

                       Ever using    Currently using
Age:
  Under 20                 42              28
  20-24                    46              30
  25-29                    37              22
  30-34                    24              13
  35-39                    15               8
  40-44                     9               5
Education:
  Grade school             12               7
  High school, 1-3         26              15
  High school, 4           26              16
  College, 1-3             32              19
  College, 4               37              22
Race and religion:
  White                    27              16
    Catholic               21               1
    Non-Catholic           29              18
  Negro                    19              12
Entire sample              26              15

Source: Adapted from Ryder & Westoff 1966.


6.4 million married women under 45 years of age 
(26 per cent) had used oral contraception, while 
3.8 million were relying on the “pill” at the time of 
the interview. Use was found inversely associated 
with age and directly with education (see Table 3). 
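
The survey figures just cited can be cross-checked with simple arithmetic. The short Python sketch below is an illustration added for that purpose and is not taken from the survey report; the variable names are hypothetical. It derives the implied number of married women under 45 from the 6.4 million ever-users and their 26 per cent share, then expresses the 3.8 million current users as a share of that total.

```python
# Figures quoted in the text (Ryder & Westoff 1966).
ever_users_millions = 6.4      # married women under 45 who had ever used the pill
ever_share = 0.26              # the same group as a share of all such women
current_users_millions = 3.8   # women relying on the pill at the interview

# Implied size of the population of married women under 45, in millions.
implied_population_millions = ever_users_millions / ever_share

# Current users as a share of that implied population.
current_share = current_users_millions / implied_population_millions

print(f"implied population: {implied_population_millions:.1f} million")
print(f"currently using: {current_share * 100:.1f} per cent")
```

The resulting share of current users, about 15 per cent, agrees with the "Entire sample" row of Table 3.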

In the United Kingdom the first major investi- 
gation of the contraceptive habits of the population 
was conducted in 1946-1947 on behalf of the 
Royal Commission on Population. In 1959-1960 
new data were obtained from the Population In- 
vestigation Committee’s marriage survey (Rown- 
tree & Pierce 1961). In terms of religious affiliation 
and socioeconomic status, the patterns of contra- 
ceptive practice were found to be similar to those 
reported in the United States. In regard to methods, 
there are important differences. While the condom 
is the most widely used method in both countries, 
British couples tend to rely far more often on 
coitus interruptus and far less often on the dia- 
phragm than American couples. Catholic couples 
in the United Kingdom showed less preference for 
the rhythm method than Catholic couples in the 
United States. 

In Japan ten surveys on birth control were 
conducted between 1950 and 1965 (Muramatsu 
1966). During this period the proportion of couples 
who reported current use of contraception in- 
creased from 19 per cent to 52 per cent. The 
proportion of couples practicing contraception was 
higher in the six major cities of Japan than in 
the other urban areas and was lowest in the rural 
districts. The familiar socioeconomic gradient was 
also present. The condom, the most widely used 
method of contraception in Japan, was reported
by a majority of users in each survey since 1952. 
Next in popularity was the rhythm method, an 
example of a prophet (Ogino) honored in his own 
country. 

The extent of contraceptive practice and the 
methods used have also been studied in a number 
of other countries throughout the world. Generally 
speaking, the practice of contraception is inversely 
correlated with the level of the birth rate and rarely 
reaches 10 per cent of all couples in the rural areas 
of the developing countries. The proportion tends 
to be higher in towns and among the better edu- 
cated. It has been suggested that the more wide- 
spread practice of contraception in industrialized 
societies is connected with the decline, in these 
societies, of the extended family and the corre-
spondingly greater opportunities afforded to young,
unmarried adults for training in contraceptive 
practices (Ryder 1959). 


Effectiveness 


Study of the effectiveness of contraception re- 
quires a distinction between “theoretical effective- 
ness” and “use-effectiveness.” Theoretical effective- 
ness is the effectiveness of a method under ideal 
conditions, that is, used consistently and according 
to instructions, without any omissions or errors of 
technique. Use-effectiveness refers to the contra- 
ceptive practice of a particular population and re- 
flects such variables as the socioeconomic and cul- 
tural characteristics and the degree of motivation 
of the couples concerned. 

Theoretical effectiveness is not accessible to 
measurement and can only be inferred from the 
performance of the most successful group of users. 
Use-effectiveness is measured in terms of failure 
rates per 100 woman-years of use, according to the 
formula first proposed by Raymond Pearl (1932), 
or preferably by means of cumulative failure rates, 
according to the life table procedure described by 
R. G. Potter (1963).
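
The idea of a cumulative failure rate can be illustrated with a minimal Python sketch. It assumes the general life table approach, in which a conditional probability of accidental pregnancy is computed for each successive month of use and the monthly probabilities of remaining free of pregnancy are multiplied together. It is not a reproduction of Potter's published procedure, and the function name and the sample counts are invented for illustration.

```python
def cumulative_failure_rate(intervals):
    """Return the cumulative proportion of users with an accidental pregnancy.

    `intervals` is a sequence of (women_at_risk, accidental_pregnancies)
    pairs, one pair per successive month of use.
    """
    still_protected = 1.0
    for women_at_risk, pregnancies in intervals:
        if women_at_risk <= 0:
            continue  # nobody was observed in this month; skip it
        monthly_failure = pregnancies / women_at_risk
        still_protected *= 1.0 - monthly_failure
    return 1.0 - still_protected


# Hypothetical counts for the first three months of use.
sample_intervals = [(100, 2), (90, 1), (80, 1)]
three_month_rate = cumulative_failure_rate(sample_intervals)
print(f"{three_month_rate * 100:.1f} per 100 women")
```

Because each month's rate is based only on the women still under observation, couples who stop using the method for other reasons do not distort the estimate, which is the usual argument for preferring this approach to a single pooled rate.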

Comprehensive information on the effectiveness 
of contraception, as practiced by young urban 
couples in the United States during the 1950s, was 
obtained during the course of the FGMA Study 
(Westoff et al. 1961, pp. 83-101). During the 
28,607 months of exposure to the use of contra- 
ception reported by the couples included in the 
survey, 534 accidental pregnancies occurred, cor-
responding to a failure rate of 22.4 per 100 years
of exposure. Failure rates for individual methods 
ranged from about 14 per 100 years of exposure 
for the condom and the diaphragm to about 40 for 
the rhythm method and the postcoital douche 
(ibid., Table B-2). These rates reflect use-effective-
ness, not theoretical effectiveness. Some of the
couples admitted that they were “taking chances” 
in their contraceptive practice. Without such omis- 
sions, both admitted and unadmitted, it must be 
presumed that the failure rate for each method 
would have been much lower. 
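
The overall FGMA failure rate can be reproduced from the exposure and pregnancy counts reported above. The Python sketch below assumes the usual form of Pearl's formula, in which the number of accidental pregnancies is multiplied by 1,200 (12 months times 100 women) and divided by the total months of exposure; the function name is illustrative.

```python
def pearl_rate(pregnancies, months_of_exposure):
    """Accidental pregnancies per 100 woman-years of exposure."""
    if months_of_exposure <= 0:
        raise ValueError("months_of_exposure must be positive")
    return pregnancies * 1200 / months_of_exposure


# Counts reported for the FGMA Study (Westoff et al. 1961).
fgma_rate = pearl_rate(pregnancies=534, months_of_exposure=28_607)
print(f"{fgma_rate:.1f} per 100 years of exposure")
```

The computed value rounds to the 22.4 per 100 years of exposure cited in the text.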

This conclusion is strongly supported by the 
results of reinterviews with the same couples about 
three years later. It was found that the couples 
who had used contraceptive measures since mar- 
riage and already had the number of children they 
desired experienced a failure rate of only 3.7 per 
100 years of exposure for all contraceptive methods 
combined and a rate of 2.6 if they used the con- 
dom, the diaphragm, or withdrawal (Westoff et al. 
1963, pp. 38-44). Couples who intended to have 
additional children and who, therefore, wanted to 
postpone pregnancy rather than prevent it alto- 
gether had much higher failure rates, similar to 
those experienced prior to the first interview. 


Sterilization 


Surgical sterilization was originally used to pro- 
tect women whose life or health was threatened 
by pregnancy. Dr. James Blundell of London is 
credited with having first proposed the procedure 
in 1823. Effective techniques were developed in 
the latter part of the nineteenth century when 
aseptic surgery and anesthesia became available. 
At about the same time, sterilizing operations 
began to be widely used on males, mainly in con- 
nection with operations on the prostate. Growing 
confidence in the efficacy and safety of surgical 
sterilization led to its use for eugenic purposes, 
that is, to prevent persons suffering from heredi- 
tary disabilities, especially mental deficiency, var- 
ious psychoses, and idiopathic epilepsy, from hav- 
ing offspring. 

In the United States, 32 states adopted legisla- 
tion between 1907 and 1937 regulating the prac- 
tice of eugenic sterilization. Some of these laws 
have been declared unconstitutional, but the ma- 
jority are still in force. However, few states have 
used their authority extensively, and the total num- 
ber of persons sterilized under these eugenic laws 
from their inception up to the end of 1965 was 
65,000. During the early 1960s the number of 
eugenic sterilizations averaged about 500 per year. 

Outside the United States sterilization for eugenic 
reasons is legally regulated in a few countries, 
notably in Scandinavia and in Japan. In Germany 
eugenic sterilization, involving a minimum of 
200,000 persons, was practiced under the Eugenic 
Law of July 14, 1933, until 1945. This estimate
does not include inmates of concentration camps 
who were sterilized without legal authorization. 


In recent years discussion has centered on the 
legality and/or propriety of voluntary sterilization 
as a method of family limitation and on the use 
of sterilization in countries where high birth rates 
and rapid population growth threaten to produce 
serious economic and social difficulties (Blacker 
1962). 

Permanent sterilization in women is ordinarily 
accomplished by salpingectomy, that is, by the cut- 
ting, ligation (tying), and partial removal of the 
Fallopian tubes. In the male, the sterilizing opera- 
tion (vasectomy) consists of the cutting, ligation, 
and removal of a portion of the spermatic duct. 
Since sterilization does not involve removal of the 
sex glands, it does not produce loss or impairment 
of sexual desire or of capacity for sexual response. 
Restoration of fertility by a second operation is 
possible but not certain. This fact restricts steriliza- 
tion to persons permanently ineligible for parent- 
hood and to mature couples who have all the chil- 
dren they want or are likely to want in the future. 
According to a number of studies, the great major- 
ity of sterilized patients are satisfied with the result, 
feel relieved from the nagging fear of pregnancy, 
and have no complaints. Regrets and more serious 
psychological side effects have, however, been oc- 
casionally noted in sterilized persons of both sexes, 
who were poorly chosen and/or not suitably pre- 
pared for the operation. Such undesirable side ef- 
fects are most likely to occur if the marriage is 
unhappy or on the brink of breaking up, or when 
the operation is accepted reluctantly, under pres- 
sure from a spouse or other persons. 

The number of voluntary sterilizations in the 
United States during the late 1950s is estimated at 
110,000 annually, including 65,000 operations on 
women and 45,000 vasectomies. Later estimates 
are not available. Sterilization of women has been 
popular in Puerto Rico since the 1940s and has also 
gained wide acceptance in Japan (Koya 1961). In 
India the government encourages voluntary sterili- 
zation as a method of population control. By mid- 
1965 about 900,000 operations had been reported, 
two-thirds on men and one-third on women. South 
Korea, too, has included vasectomy in its national 
family planning program, and 49,000 vasectomies 
were performed from 1962 to 1964. 


Abortion 


Induced abortion, that is, termination of un- 
wanted pregnancy by destruction of the fetus, has 
been performed throughout man’s history. Some 
cultures prescribe abortion under specific circum- 
stances, others tolerate it, still others condemn it. 
In the United States most state laws stipulate prevention
of the death of the pregnant woman as
the sole ground on which pregnancy may be interrupted.
In practice, many (but by no means all)
reputable hospitals permit therapeutic abortion in 
cases where a serious threat to health is to be 
averted and for certain eugenic reasons, as when 
a pregnant woman catches a disease that is likely 
to cause malformations in the fetus. In New York 
City the number of therapeutic abortions equaled 
about one-fifth of one per cent of the number of 
live births in 1960-1962 (Gold et al. 1965). In a 
few countries, such as Denmark and Sweden, the 
interpretation of medical necessity for abortion is 
far more liberal, since it takes into account less 
serious threats to the woman's physical or mental 
health, as well as her economic situation. In 1963 
legal abortions amounted to about 5 per cent of 
the number of live births in Denmark, and to about 
3 per cent in Sweden (Tietze 1966). 

In still other countries, abortion is legally avail- 
able either at the request of the pregnant woman 
or on broadly interpreted social indications. In the 
Soviet Union abortion was legalized after World 
War I. In 1936 this policy was reversed, and for 
the next two decades abortion was permitted on 
medical grounds only. In 1955 another reversal 
took place, and abortion on request was once more 
legalized. The adoption of similar legislation in 
most countries of eastern Europe has been followed 
by a very rapid increase in legal abortions. In Hungary
legal abortions have outnumbered live births
since 1959 (Mehlan 1966). In Japan the Eugenic 
Protection Law of 1948 authorized interruption of 
pregnancy for economic as well as medical reasons. 
The subsequent interpretation of this law has been 
tantamount to making abortion available uncondi- 
tionally. 

In several countries where the practice of abor- 
tion is restricted by law, estimates of its incidence 
have been obtained by the survey method (see, for 
example, Armijo & Monreal 1965; Hong 1966). 
This approach has not yet been used successfully 
in the United States. In 1959 a committee appointed 
by the Conference on Abortion at Arden House, 
New York, concluded that a “plausible estimate of 
the frequency of induced abortion . . . could be as 
low as 200,000 and as high as 1,200,000 per 
year...” (Planned Parenthood . . . 1958, p. 180). 
No new data which would permit a more precise 
evaluation are available. 


Methods of the future 

Continuing research in the field of human and 
animal reproduction promises new methods of fer- 
tility control, which may become available over the 
coming decade. Among the following procedures, 
some are now undergoing clinical evaluation,
others are being studied in selected human volunteers,
and still others are in various stages of development
with laboratory animals: long-acting
ovulation suppressants administered by injection; 
compounds suppressing spermatogenesis, for use 
as oral or injectable contraceptives in males; 
immunization of females against sperm; immuni- 
zation of males against sperm, resulting in suppres- 
sion of spermatogenesis; blocking of the spermatic 
duct by injections of plastic material which can be 
easily removed; compounds acting on the fertilized 
ovum prior to implantation; and compounds with 
destructive action on the young embryo. 


CHRISTOPHER TIETZE 


[See also FERTILITY; POPULATION, articles on POPULATION
GROWTH and POPULATION POLICIES.]


FEUD


Feud has been defined by Lasswell as “. . . rela- 
tions of mutual animosity among intimate groups 
in which a resort to violence is anticipated on both 
sides” (1931, p. 220). This definition includes two 
important concepts—“violence” and “intimate [or 
related] groups”—that require amplification. 

Nature of violence. Concerning the concept of 
violence, the question arises, Is any type of violence 
between intimate groups a feud, or are there spe- 
cific characteristics that denote the violence of a 
feud? The concept is commonly refined in terms 
of intensity and duration. Thus, there appears to 
be general agreement that the violence typical of 
a feud may range from beating, which leaves only 
slight injuries, to killing several members of the 
opposite group. A feud involves prolonged and 
intermittent hostilities. As a logical consequence, 
a single fight or a single killing cannot be defined 
as a feud. Long intervals of relative peace some- 
times elapse between the fights and slayings 
(Lowie [1920] 1947, p. 414). Bohannan says that 
“feud occurs when the principle of self-help gets 
out of hand” (1963, p. 290), implying that if an 
injury is redressed through violence and the self- 
redress is final and more or less accepted by the 
other party, such violence does not merit the term 
“feud.” Lasswell states that feuds often continue 
so long after they begin that the precipitating epi- 
sodes are even forgotten (1931, p. 221). Evans- 
Pritchard (1940, p. 293) agrees with the criterion 
of prolonged violence. However, he points out that 
a feud cannot go on indefinitely; otherwise the 
relationship of the fighting groups (among the 
Nuer, their membership in the same tribe) would 
be severed, and further hostilities, not occurring 
between related groups, could no longer by implica- 
tion be called feud (1940, pp. 279, 283). 

We may conclude, then, that feud involves pro- 
longed, often intermittent violence which must end 
at some point short of the obliteration of the second 
criterion of feud—the intimate relationship of the 
feuding groups. Of course, concluding a feud does 
not necessarily mean that mutual hostility is trans- 
formed into indifference or friendship. A new 
feuding cycle most likely erupts between the old 
combatants any time that a new crime or injury 
is committed by an individual against a member 
of the other side. 

The chain of violent acts that marks the feud 
is initiated in order “to secure revenge, reprisal, 
or glory for a particular individual or family within 
the group” (Wright 1942). Another common char- 
acteristic of the violence that may be classified as 
feud is the claim that the actual acts of hostility
are regulated by customs shared by the two fighting
groups (Radcliffe-Brown 1952, p. 215). In
other words, the hostilities of the two groups are 
patterned upon, and subject to, rules which both 
sides observe. Furthermore, the initial act of vio- 
lence is regarded as injury to the whole group to 
which the victim belongs (family, clan, or village), 
and the members consequently stand under an 
obligation to avenge the injustice (Radcliffe-Brown 
1952, p. 215; 1940, p. xx; Nadel 1947, p. 151). 
Paraphrasing Durkheim, Radcliffe-Brown calls their 
duty an expression of “collective solidarity.” How- 
ever, this principle works also with regard to the 
opposite party, where it produces a group liability, 
with the effect that any member of the offender's 
group may be slain for the crime of his relative, 
his friend, or a coresident (Radcliffe-Brown 1952, 
p. 215; Nadel 1947, p. 151). 

According to Radcliffe-Brown (1952, p. 215; 
1940, p. xx), another aspect of the violence of 
feuds is that it is justified by “public sentiment.” 
Unfortunately, he fails to specify whether this pub- 
lic sentiment pertains to the group of avengers, to 
both of the groups in conflict, or to the society 
at large. If we recall his statement that the acts 
of violence are regulated by custom, we may con- 
clude that this public sentiment pertains to the 
larger society of which the two fighting groups are 
constituent segments. Not all acts of violence jus- 
tify development of such a sentiment: in order 
that a violent revenge be considered a justifiable 
act, its magnitude should be valued as an equiva- 
lent to the injury suffered (Radcliffe-Brown 1952, 
p. 215; 1940, p. xx; Nadel 1947, p. 151). Who, 
however, determines the equivalent? Implied in the 
writings of Radcliffe-Brown and Nadel is the no- 
tion that the criteria for equivalence are sufficiently 
objective, i.e., part of a general custom known to 
all, so that it is often unnecessary for a specified 
authority to deliver an opinion on the balance be- 
tween the injury and revenge. Among the Nuba, 
for example, “equivalence” is so specific that not 
only must a man be killed for a man and a woman 
for a woman, but the age of the person killed in 
revenge should approximate that of the original 
victim. For death in excess of requirement a com- 
pensation must be offered in the form of a person 
who is adopted into the offended clan (Nadel 1947, 
pp. 151-152). In most other primitive societies 
such compensation is usually tendered in terms of 
a payment known in the literature as “blood 
money.” Bearing in mind Bohannan’s important 
point that feud occurs when “self-help gets out of 
hand,” Radcliffe-Brown’s and Nadel’s principle of 
equivalence of revenge applies to feud only when 
such equivalence is not achieved by the first
retaliation of the injured party. There have to be
more than two acts of violence to justify the ap- 
plication of the term “feud.” 

The characteristics of violence that form one of 
the two major criteria of feud may be summed up 
as follows: (1) the violence of a feud ranges in 
intensity from injury to killing; (2) it is initiated 
on behalf of a particular individual or family that 
is a member of the more inclusive “injured group”; 
and (3) it is of long duration, involving at least 
three instances of violence—injury, revenge, and 
counterrevenge. Hostile acts consisting of an in- 
jury and of an equivalent revenge that is accepted 
as final by both parties do not merit the term feud 
and should more properly be called self-redress. 
The nature of self-redress is, in most cases, basi- 
cally different from the prolonged violence called 
feud. 

Nature of feuding groups. The second major 
criterion of feud, which requires the committed 
violence to occur between two “intimate [or related] 
groups,” is far more important and complex (Lasswell
1931). It is almost universally agreed that the
two groups fighting each other must be related in 
order to qualify such hostilities as a feud. However, 
various authors differ in their explanation of the 
nature of this relationship. An imprecise position 
is assumed, for example, by Wright, who states 
that “most primitive groups observe different war 
practices toward a related group with which friend- 
ship normally exists and toward a wholly alien 
group. Hostilities of the first type, although group 
sanctioned, are usually of the nature of a feud to 
secure revenge, reprisal, or glory for a particular 
individual or family within the group” (1942). 
Others who have studied feuds, for example Malinowski
([1941] 1964, p. 261), go further, claiming
that relationship between the two groups obtains 
from the fact that they both belong to “the same 
larger cultural unit.” Similarly, those who use the 
phrase “members of the same society” do not 
necessarily identify the political unit involved. For 
example, Max Gluckman (1940, p. 41) speaks of 
a type of intertribal feud within a larger nation, 
while Hobhouse and his associates restrict the 
relationship of feuding groups to membership in 
the same tribe: “Feuds would thus also be the 
appropriate name for reprisals exercised by one 
branch of a community upon another, e.g., as be- 
tween two clans or two local groups within a tribe” 
(Hobhouse et al. [1915] 1930, p. 228). Besides this 
type of an “internal self-redress” called feud, these 
authors recognize another type of redress which 
exists between two segments of otherwise unrelated
communities and is labeled “external self-redress.”
A similar limitation of the application of
the concept of feud to the fighting done by group- 
ings that belong to the same tribe is upheld quite 
explicitly by Evans-Pritchard. He contrasts these 
hostilities with the intertribally organized violence 
which he calls war: “Thus, if a man of one tribe 
kills a man of another tribe, retribution can only 
take the form of intertribal warfare” (1940, p. 
278). “Between segments of the same tribe, op- 
position is expressed by the institution of the feud” 
(1940, p. 283). 

Some authors theorize that marriage ties constitute
the link between the two feuding groups.
Accordingly, hostilities in a society with exogamous 
subgroups such as clans, lineages, or local com- 
munities are all regarded as feuds and not as wars 
(Schneider 1964, p. 282; also implied by Colson 
1962, p. 120). 

The relationship between the hostile groups is 
far less nebulous and more related to the feuding 
itself than the marriage-tie hypothesis would im- 
ply. Evans-Pritchard has the following to say about 
those Nuer who engage in intratribal fighting: 
“Then either the contradiction of feuds is felt and 
they are settled, the unity of the tribe being main- 
tained thereby, or they remain so long unsettled 
that people give up all hope and intention of ever 
concluding them and finally cease to feel that they 
ought to be concluded, so that the tribe tends to 
split and two new tribes come into being” (1940, 
p. 279). This statement suggests that the nature 
of the relationship of two feuding groups lies in 
the fact that the more inclusive group of which the 
two are members is, at least to some degree, polit- 
ically organized. The political organization may 
mean that there exists a formally designated au- 
thority with jurisdiction over both feuding groups; 
in other cases it may consist of only the most 
informal arbiters or go-betweens, who customarily 
settle internal political problems. Gluckman ([1956] 
1959, p. 20) went so far as to claim that feuding, 
by creating a necessity for the existence of such 
go-betweens, tends to unite the members of the 
larger grouping. There are good examples of formal 
political authority terminating feuds between sub- 
groups by the use of force, in the event that the 
customary exchange has failed. The present author 
would hesitate to designate as a feud fighting which 
occurs between politically unrelated groupings and 
would criticize Wagner (1940, p. 223) and 
Schneider (1964, p. 279) for holding that an over-all
political organization is not necessary to qualify
a condition of strife as a feud. Wherever feuds do 
occur, there exists a politically influential authority 
(a formal chief, an informal headman, a council
of important men, or an individual of very limited
power, such as a go-between), who is usually too
weak to control his constituents or uninterested in
doing so, with the result that prolonged fighting is not
prevented. The weaker the political control, the 
longer the feuds last and the harder it is to con- 
clude them (see especially Evans-Pritchard 1940, 
pp. 278-279; and Gluckman [1956] 1959, p. 19). 

Feud and war. The criterion of an over-all 
political organization that relates the two feuding 
groups may also be used to mark the boundary 
between feud and war. Whereas feuds can occur 
only within a politically organized whole, war oc- 
curs beyond such an organization and always in- 
volves two groups that are politically unrelated 
(Nadel 1947, p. 301). One should not, however, 
go to the extreme and claim that it would be 
“advisable to include all external, group-sanctioned 
violence against other human beings in the con- 
ception of primitive war” (Schneider 1964, p. 276). 
There seem to be two kinds of hostility external 
to politically organized groupings: hostility which 
is “exercised by a part of a community only upon 
members of another community” and which should 
properly be called “external self-redress,” or retalia- 
tion; and hostility which means “an operation con- 
ducted in the name of the community as a whole,” 
and which thus deserves to be regarded as war 
(Hobhouse et al. [1915] 1930, p. 228). 

These structural criteria that distinguish feud, 
external redress, and war appear to be more politi- 
cally relevant and suitable than the following, 
which are of only circumstantial nature. For exam- 
ple, the claim that only publicly initiated hostilities 
should be called war, while the privately initiated 
ones should be labeled feuds, constitutes a most 
impractical criterion. However, Schneider's criti- 
cism of anthropologists for misusing the term 
“war” is erroneous; it is based on his contention 
that hostilities within a society that is segmented 
into exogamous groups (such as clans, lineages, 
or communities) cannot be called war (Schneider 
1964, pp. 279, 282). He disregards the fact that 
exogamy is a social-structural feature and war is 
a political phenomenon and that, therefore, presence 
or absence of exogamy is irrelevant to this problem. 
To make the term “war” cross-culturally applicable 
and meaningful, one should define it in terms of 
political-structural features. Data on the Kapauku 
Papuans of New Guinea bear out the fallacy of
Schneider's analysis (Pospisil 1958). These Papuans
live in localized exogamous lineages. Several line- 
ages, belonging to different sibs, unite for defense 
purposes into a political confederacy. Beyond this 
unit no political organization exists. Interconfederational
strife, which is true war, is marked by
organization, by participation of whole confederacies
as units in the fighting, and by ferocity involving
slaughter, arson, rape, etc. By contrast,
fighting within a confederacy is done, as a rule, 
with sticks, not with bows and arrows, and does 
not result in death. No raping of women, burning 
of houses, killing of pigs, or destruction of gardens 
accompany these internal hostilities. To consider 
both types of fighting as feud would be to obscure 
rather than clarify the ethnological reality. Bearing 
the political-structural criteria of feud and war in 
mind, one can readily see how erroneous are the 
claims that war is uncommon or nonexistent in 
“the lowest stages” of social development (Hob- 
house et al. [1915] 1930, p. 228) and that it re- 
quires for its existence a “certain development of 
social organization.” Indeed, it is the feud that 
requires a complex social development—a large 
social entity with an over-all political organization 
that is segmented into subgroups. 

Feud and law. The fact that feuds are fought 
between subgroups of a more inclusive grouping 
possessing an over-all network of political relations 
has led to the conclusion that feud is a primitive 
juridical mechanism and that it is an expression, 
or manifestation, of primitive law. Accordingly, 
Malinowski writes, “Fighting, collective and organ- 
ized, is a juridical mechanism for the adjustment 
of differences between constituent groups of the 
same larger cultural unit” ([1941] 1964, p. 261). 
Spencer contends that after a north Alaskan Es- 
kimo was killed, “. . . his own kin became em- 
broiled and the legal mechanism of the feud was 
put into motion” (1959, p. 161). The notion that 
feuding is a manifestation of primitive law led 
Lasswell to the conclusion that there must be two 
types of feuds: “While the blood vengeance feud 
was itself the expression of primitive law, the mod- 
ern feud is at least formally illegal and character- 
istically fills the interstices left in the functioning 
of the prevailing system of legal organization” 
(1931, p. 220). Many anthropologists have dis- 
agreed with the idea that feud is the expression of 
primitive law and have proposed or implied a uni- 
tary definition of feud. Hoebel (1949, p. 3) con- 
siders that feud lies outside the sphere of law. He 
bases this distinction on the fact that the counter- 
killings do not stop and that there is nothing one 
may regard as a mutually recognized coercive 
sanction against the killer and his group. Similarly, 
Bohannan (1963, p. 290) calls feud “a faulty 
jural mechanism” because it does not lead to a 
final settlement—to peace and rectitude. Radcliffe- 
Brown, on the other hand, refuses to regard feud as
law, not because of its functional aspect, but because
it lacks “the exercise of recognized authority
in settling disputes” (1940, p. xx). 

Pospisil (1956) defines law by means of four 
criteria, none of which is inherent in the phenom- 
ena of feuds: (1) law is manifested in a decision 
made by a political authority; (2) it contains a 
definition of the relation between the two parties 
to the dispute (obligatio); (3) it has a regularity 
of application (intention of universal application); 
(4) it is provided with a sanction. Law, which is 
characterized by these four criteria, is present in 
all societies—indeed, in every functioning group or 
subgroup of people—that he has investigated. In 
the primitive societies in which feud is endemic, 
law exists in those subgroups which have developed 
leadership, thus coexisting with feuds without in- 
corporating them into the jural mechanism. Where- 
as law presupposes decisions passed by an author- 
ity holding jurisdiction over both litigants and a 
regular respect for, and compliance with, these 
decisions by the parties to the dispute, a feud 
represents an intergroup fight in which all the 
participants ignore and even defy the jurisdiction 
of the over-all political authority, which is usually 
weak or uninterested. Consequently, whereas law 
is a means of intragroup settlement of disputes, a 
feud—because of the feuding parties’ defiance of 
the superordinated political structure—is basically 
an intergroup phenomenon, although it occurs 
within a more inclusive, politically organized unit. 
The difference between law and feud is again of 
political-structural nature, as are the differences 
between feud, war, and external redress. That law 
stands in opposition to feud and that the latter is 
actually the antithesis of the former rather than 
its manifestation, is well documented in those so- 
cieties in which feuds are stopped by legal deci- 
sions of the over-all authority, who either has 
enough power to enforce his will or possesses the 
skill to persuade the quarreling parties to accept 
his solutions (see especially Nadel 1947, p. 154; 
Evans-Pritchard 1940, pp. 278-279). 

Finally, it should be pointed out that not every 
case of “internal self-redress” constitutes a feud. 
When a counterkilling is accepted by the over-all 
political authority as a just punishment of the 
culprit or of his subgroup and the subgroup of 
the defender is induced or forced to accept such a 
verdict and to refrain from further counterkillings, 
the sanctioned reprisal constitutes a case of legal 
self-redress that is true law (Hoebel 1949, p. 3; 
Pospisil 1958, p. 256; Bohannan 1963, p. 290). 
Hoebel, for example, writes, “If the kin group of the 
original killer customarily accepts the action of the
avengers as just, and stays its hands from further
counterkilling, then we have legal law” (1949,
p. 3).


In conclusion, we may summarize the salient 
features of feud and set it off from the related con- 
cepts of law, war, external self-redress, and legal 
self-redress. The essence of feud has been found 
to be a series (at least three instances) of acts of 
violence, usually involving killings, committed by 
members of two groups related to each other by 
superimposed political-structural features (often 
involving the existence of an over-all political 
authority) and acting on the basis of group soli- 
darity (a common duty to avenge and a common 
liability). This definition sets feud apart from war 
and external self-redress because the last two terms 
refer to acts of violence committed by members of 
politically unrelated groups: in war both combat 
groups act as units in the organized fighting; in 
external self-redress members of two subgroups 
only, each belonging to a different, politically unrelated
group, participate in the hostilities. This definition
also distinguishes feud from law. Feud is
an internal affair, conducted by members of the 
subgroups of an over-all political organization who 
ignore or even defy its political authority. In its 
conduct, then, feud refers to intergroup phenom- 
ena. Law, on the contrary, is an intragroup affair 
in the full sense of the term: a decision of the 
authority who holds jurisdiction over both parties 
to the dispute is passed, and both disputing parties 
are induced or forced to comply with its provisions. 


LEOPOLD POSPISIL 


[See also WAR. Other relevant material may be found
in JUDICIAL PROCESS, article on COMPARATIVE ASPECTS;
LAW, article on LAW AND LEGAL INSTITUTIONS;
POLITICAL ANTHROPOLOGY, article on POLITICAL
ORGANIZATION.]


FEUDALISM


Feudalism conventionally denotes the type of so- 
ciety and the political system originating in western 
and central Europe and dominant there during the 
greater part of the Middle Ages. However, the term
is also applied to other societies and systems of
government with similar characteristics, in antiq- 
uity and in modern times; in the Marxist usage it 
refers to a type of society and economy charac- 
terized by serfdom, generally succeeding the eco- 
nomic systems based on slavery and preceding 
capitalism. 

The word derives from the Germanic fehu-od (from
which the English and French fief is also derived),
that is, “property in cattle” and, later, “tenure” or
“property in land,” and stresses the importance, in the
system, of land tenure and the rights and privileges 
attached to it. Since the seventeenth century, the 
complex of tenurial and personal relationships and 
economic, social, and political dependencies that 
centered on the fief has increasingly been regarded
as a scaffold of social stratification and
political organization. This view, often reflecting 
actual political and social problems in eighteenth- 
century England and France, created the notion of 
a period dominated by “feudal laws” (Montesquieu) 
that were comprehensive enough to denote a regime 
and to dominate and rule a society. The later 
meaning of the word, although basically rooted in 
eighteenth-century usage, came to denote, through 
abuse of language, such social realities as the po- 
litical predominance of a landholding aristocracy 
and the exploitation of the small and weak by the 
powerful. It also came to denote any political sys- 
tem in which the power of the state was weakened 
or paralyzed by the privileges of the few and made 
inefficient by the fractioning of political power, or 
by the opposition of powerful political or economic 
aristocratic factions. 

Historical scholarship since the nineteenth cen- 
tury has brought to light more and more of the 
variety of economic, social, and political forms to 
be found in feudal societies at any one time, as well 
as the changes inevitable in any social and politi- 
cal framework lasting over five hundred years. 
Nevertheless, some major features do recur, and 
a certain rhythm of evolution seems to have been 
common to rather large areas as they reacted to 
similar economic, social, and political changes. 
Hence, it is possible to speak about feudal institu- 
tions without implying that all aspects of economic, 
social, and political life predominant in the greater 
part of the European Middle Ages were always 
present. Such institutions can also be found in 
other societies; sometimes they evolve from similar 
conditions, but often they are isolated phenomena 
in different frameworks or without the interrela- 
tions deemed essential in the European system. (In 
these cases the term “feudal tendencies” might be
a better description.)




Despite the great variety of definitions of feudal- 
ism, some minimal common characteristics of a 
fully developed feudal system would be accepted 
by most scholars. These include: (1) lord-vassal 
relationships; (2) a personalized government that 
is most effective on the local level and has rela- 
tively little separation of political functions; (3) a 
system of landholding consisting of the granting 
of fiefs in return for service and assurance of fu- 
ture services; (4) the existence of private armies 
and a code of honor in which military obligations 
are stressed; and (5) seignioral and manorial 
rights of the lord over the peasant (see Coulborn 
1956; Hall 1962). 

Perhaps the fullest definition of feudalism in the 
political sphere was given by Weber ([1922] 1957,
pp. 375-376), who considered feudalism one type
of “patriarchal authority.” According to Weber: 
(1) The authority of the chief is reduced to the like- 
lihood that the vassals will voluntarily remain faith- 
ful to their oaths of fealty. (2) The political corpo- 
rate group is completely replaced by a system of 
relations of purely personal loyalty between the 
lord and his vassals and between these, in turn, and 
their own subvassals (subinfeudation). (3) Only 
in the case of a “felony” does the lord have a 
right to deprive his vassal of his fief. (4) There is 
a hierarchy of social rank, corresponding to the 
hierarchy of fiefs, but it is not a hierarchy of au- 
thority in the bureaucratic sense. (5) The elements 
in the population who do not hold fiefs with some 
political authority are “subjects”—that is, patri- 
monial dependents. (6) Powers over the individual 
budgetary unit (domains, slaves, and serfs), the 
fiscal rights of the political group to the receipt of 
taxes and contributions, and powers of jurisdiction 
and compulsion to military service are all objects 
of feudal grants. 

In the social sector an important element of 
feudalism is the bearing of arms as a class-defining 
profession. Here feudalism is distinguished by a 
relative closing of the social status system in which 
(for the groups dependent primarily on the land) 
the distribution of goods and services is closely in- 
tegrated with the hierarchy of social statuses. 
Within the economic sector feudal government and 
society appear uniformly to rest upon a landed, or 
locally self-sufficient, economic base as distin- 
guished from a pastoral, commercial, or industrial 
one. The merchant community, although it may 
play a significant role in the economy, is essentially 
outside the feudal nexus. The appearance of certain 
technological features of government and economy, 
notably centralized communications and means of 
large-scale political organization, serves to undermine
the feudal institutions (Hall 1962).


Whatever the variations within the economic, 
social, or political sphere, perhaps the most im- 
portant problem in the analysis of feudal societies 
or systems is the extent to which in any given place 
we can find these feudal characteristics developing 
or coexisting in all the major institutional spheres. 
The classical age of feudalism is usually dated from 
the eleventh to the thirteenth centuries and located 
in northern France. Other societies in different his- 
torical periods, whether European or non-European, 
are compared to this northern French society to 
determine the extent to which feudal institutions 
and tendencies developed within them. 


Feudalism in western Europe 


The specific features of feudalism were the out- 
come of the encounter of two types of society, the 
Romanized and the Germanic. Their fusion into a 
new society, the Romano-Germanic, was accompanied
by a merging and reshaping of their respective
institutions. Neither the German nor the
Roman traditions were homogeneous, and throughout
central and western Europe they differed according
to the strength of the local (often pre-Roman,
Celtic) institutions and the effectiveness
of Romanization, on the one hand, and the dis- 
tance of the new Germanic societies from their 
earlier, preinvasion habitats, on the other. 

At the time of their encounter, both societies 
were in a state of transition. The late Roman or 
Romanized West was passing through the profound 
crisis of a disintegrating empire, a weakening of 
central power, and a dislocation of the bureaucratic 
state machinery; the economic breakdown was seen 
in the diminishing importance of cities as centers 
of administration and of specialized economic ac- 
tivities, in the process of devaluation, and in the 
slowing down of the money economy. State and 
society were groping for new norms of existence. 
Public authority was delegated to the great land- 
owners, who already exercised some authority over 
their immediate dependents; economic life was 
shifting from city to countryside and was concen- 
trated on the larger estates, which tried to achieve 
autarchy in supplying their needs; insecurity was 
creating private warrior bands; freed slaves were 
being absorbed into the peasantry, who lost their 
status as free men to become the dependent semi- 
servile “colonate.” The Germanic tribes (Sippen), 
through migration and settlement, had loosened or 
lost their tribal ties. There remained the cohesion 
of families and of the newer and weaker village 
communities, which in time came to represent 
territorial units rather than strong kinship relations.
The transition from tribal to state organization
continued in the fifth and sixth centuries, but
the lack of a competent administration combined
with an extremely low level of literacy and re- 
stricted money circulation helped to weaken the 
traditional units; nowhere was a state structure 
able to take over and to fulfill its public duties. 

The early medieval state, like that of the Frank- 
ish Merovingians (end of the fifth to beginning of 
the eighth century), presents, consequently, a jux- 
taposition of divergent elements of state and society 
(hardly ever integrated into a coherent whole). 
From this point of view, the features associated 
with feudalism are the direct outcome of a society 
striving for patterns of organization and cohesion 
in a period of declining state power and the disrup- 
tion of traditional kinship security groups. 

The most striking feature of the developing sys- 
tem is the new stratification of society. Roman so- 
cial hierarchy was far more polarized than that of 
the Germanic tribes. The latter, although not egali- 
tarian, as some nineteenth-century historians 
claimed, was basically a society of free men with 
a charismatic and hereditary chieftainship. The 
new administrative and military needs had already 
singled out the royal Merovingian entourage of 
warriors and officials and had sanctioned their 
standing by a higher Wergeld. At the beginning of 
the eighth century, however, the permanent need 
for professional, highly trained military men 
(mounted warriors) brought about a radical change
in society. The former peasant-warrior lost his
military value. Private bands of warriors, a phe- 
nomenon that had its antecedents as much in the 
imperial bodyguard and in the private armies of 
the Roman senatorial class as in the ancient Ger- 
manic followers (Gefolgschaft) of the chieftain, 
sprang up around the king and local magnates. 

Vassalage. The nexus between the chieftain 
and his free followers was taken over by the insti- 
tution of vassalage (although the word itself points 
to a more humble origin, as “vassal” derives from 
the Celtic gwas, meaning “youngster” or “servant”).
Beginning in the early Carolingian period (eighth 
century), the new institution was integrated into 
the framework of state and society until it became 
official, recognized and sanctioned in public law 
and put to the service of the state. With the tre- 
mendous expansion of the empire of Charles the 
Great and for two centuries thereafter, vassalage 
as a type of social cohesion became the normal way 
of assuring not only military service but also public
authority. Although the ancient oath of fealty
of subjects to the ruler remained, it was felt that 
it did not sufficiently assure either loyalty or political
allegiance. Consequently, an oath of vassalage,
more binding and directly linked with the
ruler, was demanded from appointed officials. The 
heads of military and administrative circumscriptions
(dukes, marquises, and counts) became vassals
of the king. This new type of relation, which
abandoned the charismatic character of the ear- 
lier period, was based mainly on the notions of 
fealty and absolute loyalty, strengthened by the 
religious element inherent in the oath itself, and 
it bound the contracting parties in a contractual 
relation. 

The principles of vassalic relations, first applied 
at the highest state level, spread rapidly to the 
lower rungs of the social ladder. Magnates and
royal officials assured their own standing and the 
performance of the services of their office by con- 
tracting vassals, and the same process continued 
downward to the simple warrior and local admin- 
istrative officer. Thus, a pyramidal structure of 
bonds and dependencies arose, a scaffold of state 
structure and state machinery, the apex of which, 
ideally, was the king. 

Economic and social relations. The economic 
premises of the new social order were rooted in 
early medieval economy and grew out of the same 
social changes that made vassalic relations pos- 
sible. The weakening of the Sippe not only created 
insecurity but also changed the economic bases of 
existence. The village community, far weaker than 
the Sippe organization, could not offer adequate 
security, and social cohesion took the new form of 
individuals seeking the protection of the powerful 
man in their vicinity, drawing both on the patron-client
pattern of the Roman tradition and on the
Germanic notion of Grundherr, the rich and strong 
proprietor, whose influence transcended the bound- 
aries of his property and his direct dependents. 
Such proprietors included ecclesiastical institutions 
as well as secular lords. The peasants—and often 
whole villages—commended themselves into the 
protection of the powerful, relinquishing their 
property and receiving it back as a “precarium” 
(from precor, “to beg for”), a possession (later,
hereditary tenure) burdened by certain economic 
obligations. Conversely, they received the protec- 
tion of the establishment or the lay lord. This pro- 
tection against outside (fiscal, administrative, mili- 
tary, or juridical) pressures not only made the 
peasant economically dependent but also initiated 
the process through which he lost his standing as 
free man and citizen. His dealings with state au- 
thority were henceforth channeled through his 
overlord. In this sense, the king, who combined 
competences of state sovereignty (often theoreti- 
cal in the ninth and tenth centuries) and vassalic 
suzerainty, lost his subjects, whom he could reach 
only through the mediation of their overlords. 

The material basis of the vassalic contract was 
the fief. This was usually an agricultural territory
(but there existed also money fiefs) granted by 
the lord to the vassal at the “homage” (from homo,
“man”) ceremony when the vassal swore to serve 
the lord as his “man.” At the highest level of the 
feudal echelon the fief was usually a seigniory— 
that is, an economic and political entity invested 
with public powers of administration, taxation, and 
jurisdiction. A seigniory might comprise anything 
from a single village to a large complex of villages. 
It was the degree of public authority and the degree 
of immunity from the interference of an overlord 
which differentiated it from a simple fief and fixed 
its place in the hierarchy of fiefs in the kingdom. 
The seigniory comprised, as a rule, a large terri- 
tory where the exercise of public rights was shared, 
in different degrees, by the lord and the men who 
became his vassals (“subvassals” of the overlord)
through enfeoffment and homage. Public power 
became an object of inheritance, since it accom- 
panied the inheritance of the fiefs and seigniories. 

At the bottom of the feudal ladder was the simple 
knight who owed to the overlord his own service 
and was supported by a fief just large enough to 
assure him a living in keeping with the standards 
of his class. Such a fief could coincide with a vil- 
lage or part of it, and its economic organization was 
usually described as a manorial economy. The lord 
of the manor also had noneconomic rights over 
the tenants on his manor, the most characteristic 
being the rights of jurisdiction deriving from land 
tenure. 

The movement of commendation, common to 
all strata of society, brought about a complete 
transformation of its social stratification and co- 
hesion and, finally, of the concepts of the state and 
its authority. Thousands of links of dependence ran 
from the apex to the lowest echelons of society. 
Their scope, meaning, and aim changed from step 
to step. Whereas in higher echelons commendation 
created a professional caste of warriors soon to 
become the nobility, in the lower echelons it cre- 
ated a class of people serving the lords in different 
capacities. As long as the service was basically 
military, the link of commendation created vassal- 
age, which had come to be regarded as the only 
condition fitting a free man. Lower down, commen- 
dation created serfdom of varying degrees, but al- 
ways connoting economic dependence, social deg- 
radation, and exclusion from the community of 
free men and subjects. 

The hierarchic principle of cohesion and depend- 
ence was sustained economically by the legal hier- 
archy of land and by the fixed relation of men to 
land. Only where feudalization did not penetrate 
the depth of society were there free communities,
direct subjects of royalty, and allodial (entirely in- 
dependent) property. Ireland and Scotland pre- 
served clannish cohesion; Frisia preserved inde- 
pendent communities; in Saxony and parts of Spain 
there were free men; and German nobility kept 
allodial property late into the twelfth century. In 
all other territories all land except the royal do- 
main had the legal status of tenure or dependent 
possession. 

The main economic feature of the fief was the 
holder’s privilege not to work the land himself but 
to receive income in specie, money, and work from 
the peasant population. The peasants themselves 
held their land as servile tenures astricted as to 
payments and services, which varied widely ac- 
cording to the type of servile tenure. But it is a 
striking feature of the system that the obligations 
of the peasant were those deriving from his own 
legal status and that of the land he held. The theo- 
retical symmetry between the status of a man and 
that of his holding was soon destroyed by marriage 
and inheritance. A serf might, for example, be the 
tenant of a “free mansus” (mansus, “a unit of fam- 
ily holding”), his duties deriving from his status 
as serf and the obligations inherent in the free 
mansus. 

Stabilization of the system. Around 1100 the 
major features of feudalism began to stabilize and 
integrate into a coherent politico-economic system.
Yet, complete integration was never achieved. 
Rights of possession, economic privileges, and pub- 
lic authority often remained undefined, conse- 
quently competing and overlapping. Starting in the 
second half of the twelfth century, political theo- 
reticians with legal training tried to describe the 
institutions of government and society as forming 
a logical whole. One of the stabilizing factors was 
the general rule linking vassalage with fiefs and 
their regular, hereditary transmission. Occurring 
on all levels of the feudal hierarchy, it assured a 
solid scaffold of social structure. Not only were the 
simple knight, his immediate overlord, and every 
lord up to the apex of the feudal hierarchy hence- 
forth concerned with fiefs and seigniories, as pure 
vassalage links would have postulated, but the fam- 
ily as a whole became a major factor in the feudal 
mechanism. On the upper level of the hierarchy, 
that of the great tenants-in-chief of the crown with 
quasi-state authority, it was the dynasty that 
counted. Below them, the traditional vassals of the 
dynasty were often regarded not only as members 
of the household (maisnie) but as a part of the 
noble lineage (lignage). The relations between 
lords and vassals were often conceived in terms 
of family relations, and the competences of the 
lord were not unlike the Germanic mundeburdum
or the Roman patria potestas. The custom of send- 
ing the vassals’ children to be raised at the court 
of the overlord strengthened this type of relation, 
as did the meetings of the vassals at the lord's court 
in times of festivity, which were held as much for 
business reasons as for socializing. 

Rise of the nobility. In the twelfth century a 
two-hundred-year-old process of class formation 
came to an end, producing a class of nobility. The 
old warrior class of the eighth century was by then 
a class pursuing the profession of arms, which as- 
sured it a privileged place in society and a major 
share in political power; moreover, it was a class 
which could transmit its economic, social, and po- 
litical standing to its descendants, becoming, con- 
sequently, a hereditary nobility. Despite the marked 
differences within the class itself, differences based 
primarily on the extent of political power and the 
control of economic resources, all fief holders re- 
garded themselves, and were regarded by others, as 
the highest class in society. 

The most characteristic feature of the military 
nobility was its new warrior ideal—the knight. 
“Knighthood” was a designation of rank and dig- 
nity; it was, by implication, the expression of the 
new ethos—chivalrousness. Fusing ancient Ger- 
manic ideals of the “heroic age” with newer con- 
cepts of ecclesiastical origin, chivalry (from che- 
valier, “a mounted warrior”) expressed the worldly 
ideals of the fighting class and the new ethical 
teachings of the church. Fighting should not be an 
end in itself but should serve social and religious 
ideals in a basically other-world-oriented society. 
Biblical virtues—the protection of women, the 
weak, and the poor and the defense of religion— 
were the aims that enabled the church to sanction 
war and bloodshed. The ideal of the “Christian 
knight” (miles Christianus), which represented the
ethos of the warrior caste, imprinted its character 
on the period. Its early, extreme theoretical formu- 
lation was by Bernard of Clairvaux, who regarded 
the knight as a permanent candidate for martyr- 
dom, and its early institutionalization was in the 
military orders created at the time of the Crusades 
in the Holy Land and the Christian reconquest 
of Spain. 

The ideals of monasticism and warriorship 
merged into the ideal of the Christian knight par 
excellence. Chivalry became institutionalized, 
adopting a military-ecclesiastical initiation rite
(“dubbing”) and elaborating a code of behavior and 
a set of virtues fitting a member of the class. 
Henceforth, membership in the nobility depended 
not only on origin but on the formal act of “knight- 
ing.” The chivalrous virtues and rules of behavior 
and the image the class had of itself were perpetuated
by upbringing and education. The noble child
passed a period of graded apprenticeship, living 
with a noble family (very often the vassal’s over- 
lord) before dubbing, which could be given only by 
someone who was himself a knight. The introduc- 
tion of chivalric rites and what became in the later 
part of the thirteenth century a formal code of chiv- 
alrous behavior made the noble class more exclu- 
sive, thus affecting social mobility. The code 
became, especially after the fourteenth century, 
extremely formalized and served to exclude non- 
members who acquired economic position in non- 
noble pursuits (commerce and banking) and who, 
by buying fiefs, tried to penetrate the ranks of no- 
bility. It also excluded knights who engaged in 
commercial pursuits. 

While the nobility was guarding its ranks against 
outsiders, its own internal differentiation proceeded 
swiftly. The baronial class, in many cases, split 
into magnates, “greater barons,” or grandes; beneath
them “smaller barons,” or hidalgos; and below
them simple knights. Although social mobility
existed, it tended to be rather limited. Marriages 
and dowries were usually contracted in a closed 
class market, and marriage with a lower-born noble 
was regarded with disdain. Local variations always 
existed—for example, social mobility was greater 
in England than on the Continent, and German 
ministeriales (sometimes serfs but in any case not 
nobles) in royal military service were ennobled and
could exercise the highest state functions, even at 
the end of the twelfth century (although Germany 
at this time was not yet entirely feudalized). The 
features and ideals of the nobility that are de- 
scribed above survived long after the class lost its 
political standing and parts of its economic posi- 
tion or even economic privileges. 

Growth of political units. As the links of co- 
hesion strengthened, the administrative framework, 
grouping fiefs and seigniories into larger political 
units, became clearer. Generally speaking, there 
were two main lines of development. One was the 
creation of strong local principalities (Anjou, Nor- 
mandy, Flanders), which at the turn of the eleventh 
century succeeded in dominating the different 
seigniories in their territories, recapturing some of 
the public authority (control of castles and mints 
—in some places a monopoly of the princely dy- 
nasty), and often developing princely bureaucratic 
administrations. This process built up the strong 
centralized provinces, which during the next hun- 
dred years were taken over by the Capetians and 
became the foundations of the kingdom of France. 

The second line was followed by Germany. In 
twelfth-century Germany, less feudalized than 
France, public authority was often still in the 
hands of local princely dynasties with allodial possessions,
who exercised their competences not as
the king’s vassals but, theoretically, as his officials. 
Their power was strengthened at the beginning of 
the century when the “quarrel of investiture” weak- 
ened the standing of royalty. To create stronger 
cohesion and forge links of dependence, the crown 
tried to bring the highest nobility into direct vas- 
salic dependence, in the process resigning to it pub- 
lic authority in the principalities. The principalities, 
by forging vassalic links with the local nobility, 
were supposed to become well-ordered administra- 
tive units directed by the crown. The principalities 
achieved, indeed, strong governments, but the 
crown never succeeded in bringing them into a 
rigid state framework. Germany, especially after 
the interregnum at the end of the Hohenstaufen 
dynasty (middle of thirteenth century), was made 
up of principalities and their rulers (Länder and 
Landesherren) within a loose framework of the em- 
pire. Legislation forced the emperor to enfeoff noble 
escheats, which could otherwise have enlarged the 
royal domain and thus strengthened his position 
at the expense of the princely class. Finally, the 
principle of election of the emperor by the imperial 
electors (Kurfürsten) assured their dominance.
Consequently, Germany never reached any degree 
of state unity. On the contrary, the principalities 
became independent, strongly organized states, with 
princely power based on authority delegated by the 
emperor and on vassalic links obligatory within 
their territories. In England, after the Norman conquest,
sovereignty and suzerainty assured a preponderant
power to the crown. Feudal particularistic
tendencies, brought to light in the middle of
the twelfth century by rival claims to the throne, 
were quickly checked, leaving royalty in full pos- 
session of its powers. In Italy the development fol- 
lowed the lines of Germany, but the place of the 
principalities was taken by the emerging cities, the 
“communes,” which created territorial units virtu- 
ally independent of the central power. 

The decline of feudalism. The decline of feu- 
dalism was a general phenomenon of European 
history that owed as much to the economic trans- 
formations of the twelfth and thirteenth centuries 
as it did to features inherent in the feudal system 
itself. The economic transformations were the result
of the twelfth-century “urban revolution.” The re- 
vival of money economy, the renewal of city life 
with its more complex division of labor, the rise of 
the new social stratum of burgesses—all proclaimed 
new needs and new possibilities. They enabled the 
state to perform and enlarge its functions without 
constant recourse to feudal services. The new market
situation enabled the peasants to accumulate
money from the sale of surplus production and 
initiated the commutation of manorial services into 
money payments. The final result was the disrup- 
tion of the manorial economy and a profound 
change in the standing of the nobility. 

Insecurity decreased in the far better policed 
states of the central Middle Ages, and the rural 
population did not depend for its survival or de- 
fense on the local magnate. The political power he 
wielded could be, and was, more efficiently used 
by state officials. Inherited political power conse- 
quently lost its practical and moral justification. 

The change in the position of the feudal lord is 
even more marked when compared with the all-important
lord-vassal relations of the earlier period.
As already mentioned, the inheritance of fiefs
greatly contributed to the solidity of the system. At 
the same time, it brought with it a notable change 
in the feudo-vassalic establishment. As heredity 
was the rule and the renewal of the vassalic oath 
usually only a formality, the economic element in 
the relationship overshadowed the personal and inti- 
mate elements. Previously undefined and unlimited 
duties of service were replaced by fixed and meas- 
ured obligations. Thus, the military service was 
fixed for 40 days yearly; other aids and services 
were measured in stereotyped proportions accord- 
ing to the size of the fief. The fact that from the 
end of the tenth century a vassal could hold fiefs 
from different lords created a problem of multiple, 
often opposed, loyalties. 

The weakening of the ties of dependence in the 
upper strata of society and the process of dissolu- 
tion on the manorial level brought about a com- 
plete transformation in patterns of social cohesion 
and state organization. Different strata of society 
became crystallized in the pattern of “estates.” The 
estate grouped people of the same social class, who 
had a similar economic standing and enjoyed the 
same privileged position in the state in relation to 
the crown and to other estates. Unlike the former 
feudal links of cohesion, which were vertical, the 
new links binding man to man were horizontal. 
Men joining others of their own class sought assurance
and confirmation of their privileged position
more than security and protection. A man’s stand- 
ing was no longer described in terms of dependence 
on a feudal overlord, but in terms of his belonging 
to a given “estate.” The hierarchic pattern continued
to exist but as a hierarchy of strata of society
rather than a hierarchy of individuals. Moreover,
there were no formal links of dependence
between the different estates. In a sense, all were 
in direct relation to the crown, and all claimed a
share in political power, whether on the national
or the local level. 


Feudalism in other areas 


Japan. Outside western Europe, the greatest 
convergence of feudal characteristics in the various 
institutional spheres probably occurred in Japan, 
where it developed at the end of the twelfth century 
and persisted in its “pure” form until the Tokugawa 
regime. Here we may follow Hall's analysis (1962). 

The origin of feudalism in Japan seems to have 
coincided with the establishment of the Kamakura 
shogunate by Minamoto Yoritomo in 1192. Al- 
though vassalage and enfeoffment may have ex- 
isted even before the twelfth century, only a small 
portion of Japanese society was organized around 
these practices by 1192. In Japan during the Kama- 
kura period (1185-1333) the legal government 
was still centered on the emperor. It operated 
through the traditional civil administration (greatly 
weakened) and an expanding system of semipublic
domains (shōen). Independent of these administrative
and fiscal relationships, there were numerous
more informal hierarchies based upon clan ties
and military allegiances. Military hierarchies tended 
to form around the local magnates. It was primar- 
ily through the development of hierarchies of such 
allegiances as they came to center upon the office 
of shogun, or military dictator (or on certain 
other high military posts), that feudal institutions 
crystallized. 

Yoritomo’s importance to the development of 
feudalism in Japan lay in regularizing and extend- 
ing the practice of pledges of military allegiance 
combined with protection of landholdings. Yori- 
tomo’s authority to appoint shugo, or “constables,” 
and jitō, or “stewards,” and to interfere in the shōen
system was based on his assertion of supreme mili- 
tary command in a time of national crisis. Through 
such appointments and through the increase of 
legal powers, the feudal nexus in government and 
society steadily encroached upon the imperial-shōen
complex, giving rise to a new type of institutional
nexus.

At the apex of the state structure military author- 
ity gradually overshadowed civil authority, and dur- 
ing the thirteenth century the balance between civil 
and military power shifted steadily in the direction 
of the latter. Similarly, at the provincial level, mil- 
itary interests gained over civilian as the shugo in- 
creasingly took on the stature of military governors. 
Locally, the shugo were able to build up their eco- 
nomic support largely through the plural holding 
of jitō rights to numerous shōen. They used their
superior status in the shogunal hierarchy to assert 
their influence among local bushi, or members of
the military class. Before long the shugo had ab- 
sorbed many civil administrative powers at the 
same time that they achieved personal leadership 
of province-wide military bands, which they organized
increasingly on a lord-vassal basis. Below the
shugo the step-by-step expansion of the jitō’s land
rights among the bushi also served to extend the 
feudal element in Japanese society. 

As local bushi became ryōshu, or landed proprietors,
they began to divide these lands among
family members or retainers, extending the prac- 
tice of combining grants of land with ties of mili- 
tary loyalty. The new military bonds forged between 
shugo and proprietary jitō or between jitō and vassal
families became the basis of this ever-widening
feudal system of social and political organization. 

The warfare that embroiled most of Japan dur- 
ing the middle of the fourteenth century hastened 
feudal trends in all parts of the country. Under the 
Ashikaga shogunate (1338-1573) the imperial 
center lost all of its effective power. The sho- 
gunate, now located at the very seat of the imperial 
court in Kyoto, absorbed most of the powers and 
functions of the civil government, although even 
now the emperor continued to play a crucial role 
as the ritual symbol of sovereignty and the source 
of the shogun’s delegated authority. 

In the provinces the key figures were the shugo, 
who by the end of the fourteenth century had de- 
veloped into true regional overlords, having ac- 
quired the combined powers of the former civil and 
military governors. They held title, under the 
shogun, to territories the size of entire provinces, 
serving as the ultimate authority in both civil and 
military affairs. 

By 1500, however, most of the jurisdictional ter- 
ritories of the shugo had been broken into frag- 
ments and a wave of new magnates of local origin 
had inherited the pieces. The shugo had disappeared 
and with them not only a generation of bushi 
leaders but also the last remnants of imperial law 
and civil land management based on the shōen.

The end of this relatively “pure” type of feudal- 
ism came in Japan with the more centralized Toku- 
gawa regime (1603-1867). Although based on the 
feudal structures and to some extent perpetuating 
them, this regime, through its policy of centraliza- 
tion, in fact froze the feudal institutions, depriving 
them of vitality and autonomy. 

Japanese feudalism differed from the European 
pattern in several important respects: (1) the con- 
tinuous importance of the imperial center in spite 
of its loss of political function; (2) the weak- 
ness, perhaps even total absence, of contractual 
elements in the relations between lords and vassals;
(3) the full, personal, familistic expression
of these relations; and (4) the lack of any repre- 
sentative institutions. Nevertheless, like the Euro- 
pean pattern, it is a major example of feudalism, 
since it clearly demonstrated a relatively high de- 
gree of convergence of feudal characteristics in the 
different institutional spheres. 

Russia. In other societies the extent of such 
convergence was smaller. The regime of the feudal 
(patrimonial) principality in medieval Russia was 
accompanied by a certain immunity from political 
authority, conferred by private possession of land. 
The connection became firmly established because 
of the importance of military functions in local 
politics in pre-Muscovite central Russia and, later, 
its national importance in Muscovy. Whenever pos- 
session of land was hereditary, the authority con- 
nected with it was also hereditary. This was the 
normal pattern in pre-Muscovite times, and it again 
became general in the seventeenth century, the 
nonhereditary pomest’e (“benefice” or “military 
holding”) being merely a historical interlude, even 
if a rather long one. In pre-Muscovite Russia the 
essential sociopolitical relation was not between 
lord and vassal but between the votchinnik (“patri- 
monial lord”) and the population of his votchina 
(“landed possession” or “patrimony”), which came 
close to that of ruler and subject. There was no 
link between the prince’s service and possession of 
land, and although there was hereditary landhold- 
ing, the prince’s service was not hereditary, and 
subjects were free to leave their principalities. Yet, 
even though the pomest’e was not hereditary, there 
was a connection between military function and 
possession of land. It was based not on a feudal 
contract involving mutual fealty between a suzerain 
and a vassal, but rather on the absolute sovereignty 
of the tsar, who, requiring service from any of his
subjects, granted a pomest’e in return for such
service (Szeftel 1956).

Three distinct types of sociopolitical structure 
are relevant to Russian feudalism: the votchina re- 
gime, the pomest’e regime, and Western feudalism. 

The votchina regime was characterized by the 
growth of the manorial power of the lord of the 
estate over the population laboring on it or merely 
settled in its vicinity. Such power could be enforced
by immunity privileges. The votchina estates were
owned by political rulers (princes), by private persons,
or by the church. Although it represented, to
a certain extent, the social aspects of feudal tendencies,
the votchina system did not contain a counterpart
to the political aspects. There was no formal
political connection between the vassal’s service
and the control of the land.

The pomest’e regime tended to make the control 
of the land depend on service rendered to the state
by the landholder. There was no dispersion of po- 
litical power in this regime as it grew up in the 
Muscovite state of the sixteenth and seventeenth 
centuries. The power was concentrated in the person 
of the supreme ruler, the tsar. 

In the standard type of feudalism (Western feu- 
dalism) some characteristics of both the votchina 
and the pomest’e regimes are combined. However, 
for this type to develop, certain traits which were 
lacking in either or both of these regimes are es- 
sential. Like the votchina regime, the standard type 
of feudalism presupposes the expansion of the 
manor and the growth of the manorial rights of 
the lord. On the other hand, like the pomest’e
regime, feudalism of the standard type is charac- 
terized by the conditionality of rights on the land. 
The control of the land by the lower-class landlord 
depends on the service he renders to the seignior. 

The important point of difference between the 
pomest’e regime and feudalism of the standard type
is that while in the former political power is con- 
centrated in the hands of the supreme ruler, in 
the latter the political authority is usually dispersed. 
Thus, no lord-vassal relationship of the western
European type could develop in pre-Muscovite 
Russia, no code of chivalry was based on it, and 
there could be no consistent heredity of functions. 

The key to understanding the differences between
the Russian and Western developments is
the great migratory and resettlement movement in 
medieval central Russia. This mobility of the rural 
population was fundamentally caused by the rapid 
exhaustion of soil that was not too fertile to begin 
with and by extensive primitive agriculture. Al- 
though the movement produced some feudal traits 
in Russian life, it was also the source of instability 
in social relationships. In Russia the shifting local 
population did not provide the “free servant” with 
many bases to rely on, and there was no other pro- 
tection for his liberty than the temporary charac- 
ter of his service and the right of free departure. 

Byzantium. The constellation of feudal characteristics
in the Byzantine Empire was rather different
from that found in Russia, centering primarily
on the system of the pronoia (“providence,”
“foresight,” “care”). To give lands to a person in
pronoia is to give lands into his care. In practice 
it meant that estates were given for administration 
to high officers of the state or army, to monasteries, 
and to private persons, as a reward for services. 


The grants differed from simple donations in that 
the pronoia land was bound to the recipient, the 
pronoiarios; that he received it for a definite period 
only, usually for life; that he could not sell the 
pronoia estate; and that it was not hereditary. 

The system developed under the eleventh-century 
Byzantine rulers who tried to reduce the power of 
the military class and to increase that of the civil 
bureaucracy by demilitarizing the administration. 
This policy clearly reflected the decay of the former 
organization of the military-peasant colonies
(themes). The military commander (strategos),
who usually served as governor of a province, was
replaced by the praetor, who had been the supreme
justice on the staff of the strategos. The praetor,
of course, was a civilian, and thus, the primacy of 
the military command in the themes gave way to 
the primacy of a civilian administration based upon 
the new aristocracy of scholars and civilians in the 
capital. 

But the preponderance of the civilian aristocracy 
in the capital did not lead to a strengthening of 
the central power in the rural districts. Generals 
and great landowners outweighed the civilians. 
The emperors of the Ducas dynasty had already 
been compelled to give great privileges both to their 
civilian adherents and to their military or landown- 
ing adversaries; with the accession of Alexius 
Comnenus, 1081-1118, the military aristocracy 
took over the state. It was under the Ducas that 
the pronoia system was first developed and that 
Byzantium approached quasi feudalization. The 
new class of the pronoia owners became liable for 
military service, replacing the former class of peas- 
ant soldiers of the decaying system. The owner of 
a pronoia estate, when summoned, had to appear 
with a certain number of horsemen, according to 
the size of the pronoia. 

Since within the pronoia the formerly free peas- 
ants became more or less serfs, they came under 
the jurisdiction of the pronoiarios, although this ju- 
risdiction was restricted. The central government, 
thus, gave up many of its prerogatives including 
that of direct taxation, and the pronoiarios became 
small rulers, whose estates appeared as little king- 
doms within the empire. The crown became more 
and more dependent on them, which contributed 
to the weakening of the central government and 
to the decline and disintegration of the empire 
(Ostrogorski 1940; Kantorowicz 1956). 

In sum, Byzantine feudalism was characterized 
by the relative predominance of economically in- 
dependent small estates combined with a growing 
political decentralization—without, however, the 
concomitant development of an over-all system of
vassalage, a feudal-chivalrous military class, or
special feudal political institutions. 

Parallel cases. The Byzantine type of feudalism 
is found in many other societies, especially in pe- 
riods of the decline of great empires—to some ex- 
tent at the end of the Roman Empire, in the later 
Sassanid period in Iran, and in the aftermath of 
Aśoka's kingdom in India. In many cases institutions
of this type of feudalism developed when
officials abused their rights to collect taxes and 
turned their offices into hereditary fiefs. In other 
cases the political traits of feudalism (usually 
many politically self-sufficient patrimonial units 
having some interrelations and an orientation 
toward one budding center) were more highly de- 
veloped than feudal economic characteristics. Such 
cases can be found in China under the Shang and, 
even more clearly, under the Chou; in ancient 
Mesopotamia under the Kassites, in Mittani, in the 
Iran of the Parthian regime, in the iqṭāʿ institution
of medieval Islam, and possibly in ancient Egypt. 

In none of these cases, however, was there a 
fully developed system of vassal-lord relations or 
a full-fledged social organization of a military-political
class. At most, only rudiments of each
existed. 


Emergence and demise of feudal systems 


In spite of all the differences in their origins and 
features, the feudal systems of the various societies 
analyzed above—and many more could be included 
—manifest some common characteristics. Perhaps 
most important is that they played a major role 
in the development of “high” cultures or civiliza- 
tions. Feudal systems can be found, even if in vary- 
ing degrees, in almost all of the great civilizations 
of the past, where they were central in keeping 
and developing great traditions under circum- 
stances often inimical to their maintenance. 

The importance of this characteristic can best 
be seen by examining the varying conditions under 
which feudal institutions develop. One such set of 
conditions is the partial dismemberment of rela- 
tively comprehensive, widespread sociopolitical sys- 
tems (Hintze 1929; Coulborn 1956). The reasons 
for such dismemberment may vary greatly: the 
clash of cultures, the invasions of nomads, or the 
development of internal contradictions that cause 
the imperial system to lose its effectiveness and its 
essential resources. However, the dismemberment 
is not by itself crucial to the development of feu- 
dalism; rather, it is the combination of the dismem- 
berment and the persistence or development of the 
ideals of a “great empire” and of orientations toward
broader societal frameworks among some of the 
elite groups (such as the church or the new mili- 
tary class) who gain control over the governmental 
and economic functions and the contradictions be- 
tween the idea of an empire and the lack of mate- 
rial and administrative positions to administer one. 
In some cases, such as that of Chinese feudalism, 
these orientations were developed by active groups 
that were unable to establish any viable broader 
system but, nevertheless, developed some vision of 
such a system. [See EMPIRES.]

Within most feudal systems, ideological orienta- 
tions to such broader frameworks were of great im- 
portance, even if they were only partially institu- 
tionalized. Any feudal system is, thus, always 
characterized by some inherent imbalances in its 
structure, as it contains more and less differentiated 
centripetal and centrifugal structures and orienta- 
tions. However, the exact location of such institu- 
tional imbalances in any feudal system—whether 
in the economic, political, or cultural sphere— 
varies greatly. 

The demise of the feudal system is predicated 
on changes in those conditions—technological, po- 
litical, and economic—that increase the effective- 
ness of the wider frameworks and that may enable 
the restoration or the establishment of unitary 
frameworks and of central powers within them. In 
less differentiated societies this can give rise to a 
restoration of patrimonial or imperial systems. In 
more differentiated societies—as in western Eu- 
rope and in Japan—the feudal background made 
the later transition to modernity easier and more 
stable, and in some cases, it might have facilitated 
—after a period of the “estate” system or of absolut- 
ism—the development of a relatively pluralistic 
system. 

JOSHUA PRAWER AND 
SHMUEL N. EISENSTADT


[See also BUREAUCRACY; EMPIRES; MANORIAL ECONOMY;
VILLAGE. Other relevant material may be
found in EVOLUTION, article on SOCIAL EVOLUTION;
KINSHIP; STATUS, SOCIAL; and in the biographies of 
BLOCH; BÜCHER; FUSTEL DE COULANGES; GIERKE.] 
FIDUCIAL INFERENCE


R. A. Fisher (1930) proposed a statistical method
for obtaining from observed data a probability
distribution concerning a parameter value; he 
called the distribution a fiducial probability dis- 
tribution. The theory of confidence intervals, as 
developed by J. Neyman, was initially presented in 
the literature as a clarification and development of 
fiducial probability [see ESTIMATION, article on
CONFIDENCE INTERVALS AND REGIONS]. Fisher de- 
nied the equivalence and in his subsequent theo- 
retical papers developed and extended fiducial 
probability. 

As an example, consider a sample of independent
measurements, $X_1, \ldots, X_n$, on a physical
characteristic $\mu$, and suppose that the measurement
error is normally distributed with mean 0 and
known variance $\sigma_0^2$. Fisher requires that fiducial
inference be based on the simplest statistic containing
all the information about the parameter, in
this case, on the sample mean $\bar{X}$. [The sample
mean here is a minimal sufficient statistic; see
SUFFICIENCY.] The expression $W = \bar{X} - \mu$, involving
the variable $\bar{X}$ and the characteristic $\mu$, has a known
distribution: normal with mean 0 and variance
$\sigma_0^2/n$. In an application of the method, the value of
$\bar{X}$ is obtained and substituted in the expression
$W = \bar{X} - \mu$; the expression is solved for $\mu$ in terms
of $W$, giving $\mu = \bar{X} - W$; the fiducial distribution of
$\mu$ then derives from the known distribution of $W$:
$\mu$ is normal with mean $\bar{X}$ and variance $\sigma_0^2/n$. A 95
per cent fiducial interval is $\bar{X} \pm 1.96\,\sigma_0/\sqrt{n}$.
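
The steps of this first example can be sketched in a few lines of Python. This is a minimal illustration rather than anything given in the article: the function name, the four measurements, and the value 0.8 for the known error standard deviation are all hypothetical, chosen so that the result lands close to the interval 163.9 ± 0.8 discussed in the text.

```python
from math import sqrt
from statistics import NormalDist


def fiducial_interval_known_variance(xs, sigma0, level=0.95):
    """Fiducial interval for mu when the error standard deviation sigma0 is known."""
    n = len(xs)
    xbar = sum(xs) / n
    # W = xbar - mu is normal with mean 0 and variance sigma0^2 / n.
    # Solving for mu gives mu = xbar - W, so the fiducial distribution of mu
    # is normal with mean xbar and the same variance.
    fiducial = NormalDist(mu=xbar, sigma=sigma0 / sqrt(n))
    tail = (1 - level) / 2
    return fiducial.inv_cdf(tail), fiducial.inv_cdf(1 - tail)


measurements = [163.1, 164.4, 163.8, 164.3]  # hypothetical data
low, high = fiducial_interval_known_variance(measurements, sigma0=0.8)
print(f"95 per cent fiducial interval: {low:.2f} to {high:.2f}")
```

The same numbers also form the usual 95 per cent confidence interval; only the interpretation differs.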

Fisher claimed that a fiducial probability state- 
ment has the same meaning as an ordinary prob- 
ability statement. In the example, suppose that the 
95 per cent fiducial interval as calculated in a 
specific application is 163.9 ± 0.8. The fiducial
statement is that there is 95 per cent probability
that the unknown value of $\mu$ lies in this interval.
The interval 163.9 ± 0.8 is also a 95 per cent confidence
interval, but as a confidence interval its
interpretation is different [see ESTIMATION, article
on CONFIDENCE INTERVALS AND REGIONS]. Confidence
methods and fiducial methods do not, however,
always lead to the same numerical results.

The proponents of the confidence method claim 
that in this context probability statements concerning
$\mu$ cannot be made; the value of $\mu$ is something
that exists: either it is in the interval or it
is not, and we don’t know which. 

The proponents of the fiducial method reply that 
probabilities concerning realized values are com- 
monplace: in the play of card games, for example, 
a player may observe his own hand and perhaps 
other cards (say, those already played) and make 
a probability statement concerning the distribution 
of cards in the concealed hands. 

The rejoinder is that $\mu$ did not arise from a random
process such as the card shuffling and dealing.
The relevance of this rejoinder is perhaps the key 
element to criticisms of the fiducial method. 

In more complex problems the fiducial method 
may give a result different from that of the confidence
method. In one prominent problem mentioned below,
the Behrens-Fisher problem, the fiducial
method gives an answer where confidence methods 
have not yet produced an entirely satisfactory 
result. 

In his original paper on the fiducial method,
Fisher (1930) considered a statistic, $T$, obtained
by the maximum likelihood method for estimating
a parameter, $\theta$. Let $F(T, \theta)$ be the cumulative
distribution function for the statistic $T$. The probability
density function for $T$ is obtained by differentiating
with respect to $T$: $f(T, \theta) = \partial F(T, \theta)/\partial T$.
Correspondingly, the fiducial density function for $\theta$,
given an observed value for $T$, is obtained by differentiating
with respect to $\theta$: $g(\theta, T) = \partial F(T, \theta)/\partial \theta$.
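
As an illustrative check (a sketch, not a derivation taken from Fisher's paper), the differentiation rule can be applied to the earlier normal-mean example, with the sample mean as the statistic, the characteristic being measured as the parameter, and a known error variance. The symbols $\Phi$ and $\varphi$ for the standard normal distribution and density functions are notation assumed here. Because the distribution function in this case decreases as the parameter grows, the derivative is negative and the density is read off as its magnitude; that handling of the sign is an assumption of this sketch rather than something stated in the text.

```latex
\begin{align*}
F(T,\theta) &= \Phi\!\left(\frac{\sqrt{n}\,(T-\theta)}{\sigma_0}\right),\\
f(T,\theta) &= \frac{\partial F}{\partial T}
  = \frac{\sqrt{n}}{\sigma_0}\,
    \varphi\!\left(\frac{\sqrt{n}\,(T-\theta)}{\sigma_0}\right),\\
\frac{\partial F}{\partial \theta}
  &= -\frac{\sqrt{n}}{\sigma_0}\,
    \varphi\!\left(\frac{\sqrt{n}\,(T-\theta)}{\sigma_0}\right),\\
g(\theta,T) &= \left|\frac{\partial F}{\partial \theta}\right|
  = \frac{\sqrt{n}}{\sigma_0}\,
    \varphi\!\left(\frac{\sqrt{n}\,(\theta-T)}{\sigma_0}\right).
\end{align*}
```

Read as a function of the parameter, the last line is the normal density with mean $T$ and variance $\sigma_0^2/n$, which agrees with the fiducial distribution obtained in the first example.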


Fisher illustrated the method with the correla- 
tion coefficient r of a sample from a bivariate 
normal distribution with population correlation 
coefficient $\rho$.

For more complex problems, Fisher proposed the
use of a pivotal quantity, $W = h(T, \theta)$, a function
of the statistic $T$ and the parameter $\theta$ that has a
fixed known distribution regardless of the value of
$\theta$. For the first example, $W = \bar{X} - \mu$ is a pivotal
quantity. In an application, the observed value of
the statistic $T$ is substituted in the expression
$W = h(T, \theta)$; the parameter $\theta$ is expressed in terms
of $W$; and the fiducial distribution of $\theta$ is obtained
from the known distribution of $W$.

Fisher’s original method for obtaining a fiducial 
distribution is a special case of the pivotal method. 
As a function of a continuous statistic, $T$, the cumulative
distribution function $W = F(T, \theta)$ has a uniform
distribution on the interval (0, 1); this relationship
is called that of the probability integral
transformation. The fiducial density of $\theta$ for fixed
$T$ is obtained from the uniform distribution of $W$,
in the same way as the density of $T$ for fixed $\theta$ is
obtained by differentiation.
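
The uniformity of the cumulative distribution function, evaluated at the statistic, can be checked by simulation. The sketch below assumes the normal-mean setting of the first example; the parameter values, the number of repetitions, and the function name are hypothetical and serve only to show the probability integral transformation at work.

```python
import random
from math import sqrt
from statistics import NormalDist


def pivot_values(mu, sigma0, n, reps, seed=0):
    """Simulate W = F(T, theta) for T = sample mean of n normal measurements."""
    rng = random.Random(seed)
    sampling = NormalDist(mu, sigma0 / sqrt(n))  # distribution of the sample mean
    values = []
    for _ in range(reps):
        xbar = sum(rng.gauss(mu, sigma0) for _ in range(n)) / n
        values.append(sampling.cdf(xbar))  # W = F(T, theta)
    return values


ws = pivot_values(mu=163.9, sigma0=0.8, n=4, reps=20000)
mean_w = sum(ws) / len(ws)
share_below_quarter = sum(w < 0.25 for w in ws) / len(ws)
print(f"mean of W: {mean_w:.3f}; share of W below 0.25: {share_below_quarter:.3f}")
```

If W is uniform on (0, 1), its mean should sit near 0.5 and about a quarter of the simulated values should fall below 0.25, whatever parameter values are supplied.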

As a second example, consider a random sample,
$X_1, \ldots, X_n$, from a normal distribution with mean
$\mu$ and variance $\sigma^2$, both unknown, and suppose
that interest centers on the parameter $\mu$. The quantity
$t = \sqrt{n}(\bar{X} - \mu)/s_X$, using the sample mean $\bar{X}$
and sample standard deviation $s_X$, has a known
distribution, the t-distribution on $n - 1$ degrees of
freedom. In an application, the values of $\bar{X}$ and $s_X$
are substituted and the parameter is solved for:
$\mu = \bar{X} - t s_X/\sqrt{n}$. This equation gives a fiducial
distribution for $\mu$ that is of t-distribution form ($n - 1$
degrees of freedom), located at $\bar{X}$ and scaled by the
factor $s_X/\sqrt{n}$.
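
One way to see this second example in action is to draw from the fiducial distribution by simulation: generate t values, then locate and scale them by the observed sample mean and standard error. The sketch below uses only the Python standard library, builds each t draw from a normal and a chi-square variate, and relies on hypothetical data and function names; it is an approximation by Monte Carlo, not an exact computation.

```python
import random
from math import sqrt
from statistics import mean, stdev


def draw_t(df, rng):
    """One Student-t draw: a standard normal over sqrt(chi-square / df)."""
    z = rng.gauss(0.0, 1.0)
    chi2 = rng.gammavariate(df / 2.0, 2.0)  # chi-square with df degrees of freedom
    return z / sqrt(chi2 / df)


def fiducial_draws_mu(xs, reps=20000, seed=0):
    """Draws of mu = xbar - t * s / sqrt(n), with t on n - 1 degrees of freedom."""
    rng = random.Random(seed)
    n = len(xs)
    xbar, s = mean(xs), stdev(xs)  # stdev uses the n - 1 divisor
    return [xbar - draw_t(n - 1, rng) * s / sqrt(n) for _ in range(reps)]


data = [163.1, 164.4, 163.8, 164.3]  # hypothetical measurements
draws = sorted(fiducial_draws_mu(data))
low, high = draws[int(0.025 * len(draws))], draws[int(0.975 * len(draws))]
print(f"approximate 95 per cent fiducial interval: {low:.2f} to {high:.2f}")
```

With so few degrees of freedom the interval is noticeably wider than the known-variance interval for the same data, reflecting the heavier tails of the t-distribution.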

The Behrens-Fisher problem is an extension of
this example. Consider a first random sample,
$X_1, \ldots, X_n$, from a normal distribution with mean
$\mu_X$ and variance $\sigma_X^2$, and a second independent
random sample, $Y_1, \ldots, Y_m$, from a normal distribution
with mean $\mu_Y$ and variance $\sigma_Y^2$. The
Behrens-Fisher problem concerns inference about a
parameter difference, $\mu_X - \mu_Y$. The fiducial method
gives a distribution described by $\bar{X} - t_1 s_X/\sqrt{n}$ for
$\mu_X$ and a distribution described by $\bar{Y} - t_2 s_Y/\sqrt{m}$ for
$\mu_Y$. (Here $t_1$ and $t_2$ are independent t variables with
$n - 1$, $m - 1$ degrees of freedom.) The fiducial distribution
for $\mu_X - \mu_Y$ is that of the difference,
$\bar{X} - \bar{Y} - (t_1 s_X/\sqrt{n} - t_2 s_Y/\sqrt{m})$; some general
points are given in Fisher and Yates ([1938] 1949,
p. 44).
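
The fiducial distribution of the difference has no simple closed form, but it is easy to approximate by simulation: draw the two t variables independently and combine them as in the expression above. The following is a minimal sketch under that reading; the two samples, the function names, and the number of draws are hypothetical, and the percentile interval it prints is a Monte Carlo approximation rather than a tabulated value.

```python
import random
from math import sqrt
from statistics import mean, stdev


def draw_t(df, rng):
    """One Student-t draw: a standard normal over sqrt(chi-square / df)."""
    z = rng.gauss(0.0, 1.0)
    chi2 = rng.gammavariate(df / 2.0, 2.0)  # chi-square with df degrees of freedom
    return z / sqrt(chi2 / df)


def behrens_fisher_draws(xs, ys, reps=20000, seed=0):
    """Draws from the fiducial distribution of mu_X - mu_Y."""
    rng = random.Random(seed)
    n, m = len(xs), len(ys)
    se_x, se_y = stdev(xs) / sqrt(n), stdev(ys) / sqrt(m)
    centre = mean(xs) - mean(ys)
    # (xbar - ybar) - (t1 * s_X / sqrt(n) - t2 * s_Y / sqrt(m)),
    # with t1 and t2 independent on n - 1 and m - 1 degrees of freedom.
    return [
        centre - (draw_t(n - 1, rng) * se_x - draw_t(m - 1, rng) * se_y)
        for _ in range(reps)
    ]


xs = [10.2, 9.8, 10.5, 10.1, 9.9, 10.4]  # hypothetical first sample
ys = [9.1, 9.6, 8.8, 9.4, 9.3]           # hypothetical second sample
draws = sorted(behrens_fisher_draws(xs, ys))
low, high = draws[int(0.025 * len(draws))], draws[int(0.975 * len(draws))]
print(f"approximate 95 per cent fiducial interval: {low:.2f} to {high:.2f}")
```

No assumption of equal variances is needed, which is the feature that distinguishes this problem from the ordinary two-sample t procedure.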

For many problems involving normal and chi-square
distributions, the fiducial distribution has
the form of a Bayesian posterior distribution as 
based on a prior distribution with uniformity char- 
acteristics [see BAYESIAN INFERENCE]. 

Fisher (1956) considers a wide range of statis- 
tical problems and derives the corresponding fidu- 
cial distributions. 

A central criticism of the fiducial method has 
been concerned with whether fiducial probabilities 
are in fact probabilities in an acceptable sense. 
Some recent analysis, mentioned later, has clarified 
this question. 

Other criticism seems to fall under three head- 
ings. First, fiducial probabilities in some examples 
may not add or integrate to a total of 1. James 
(1954) and Stein (1959) produce examples that 
can yield fiducial distributions that do not integrate 
to 1. Second, in some examples more than one 
reasonable pivotal quantity may be present; these 
can lead to several inconsistent fiducial distribu- 
tions (see Creasy 1954; Fieller 1954; Mauldon 
1955). In other examples no reasonable pivotal 
quantity may be present. Third, if a fiducial dis- 
tribution from a collection of data is used as a 
prior distribution for a Bayesian analysis on a sec- 
ond collection of data, the resulting distribution 
may be different from the fiducial distribution 
based on the combined collection of data [see Lind- 
ley 1958; see also BAYESIAN INFERENCE]. 

Fraser (1961) uses transformations to investi- 
gate fiducial probability. The transformation ap- 
proach applies to a large proportion of Fisher's 
examples, and it introduces an additional range 
of problems for which fiducial distributions can be 
obtained. 

In a later paper (Fraser 1966) the emphasis in 
the transformation approach is focused on error 
variables. Consider the example involving a sample
of measurements, $X_1, \ldots, X_n$, on a physical
quantity, $\mu$. Let $e$ be a variable describing the error
introduced by the measuring instrument: in the
example, $e$ is normally distributed with mean 0
and variance $\sigma_0^2$. A measurement, $X_i$, can then be
expressed in the form $X_i = \mu + e_i$. Correspondingly,
the sample mean takes the form $\bar{X} = \mu + \bar{e}$, where
$\bar{e}$ is normally distributed with mean 0 and variance
$\sigma_0^2/n$. Now consider an application and suppose
there is no information concerning $\mu$. With no
information concerning $\mu$, there is no information
concerning $\bar{e}$ other than that describing its distribution.
Probability statements can then be made
concerning the unknown $\bar{e}$ just as the card player
makes statements concerning the realized but unrevealed
cards in his opponents’ hands.

Suppose the normal distribution of $\bar{e}$ with variance
$\sigma_0^2/n$ gives a 95 per cent probability for $\bar{e}$
lying in the interval 0.0 ± 0.8. Then, with an observed
$\bar{X} = 163.9$, the probability statement concerning
$\bar{e}$ is equivalent to the statement that $\mu$
is in the interval 163.9 ± 0.8 with probability 95
per cent.

This analysis involving error variables applies 
to many of Fisher's examples, and it extends to 
other problems. The name structural probability 
has been introduced (Fraser 1966) to distinguish 
it in cases where the method conflicts with the 
fiducial method. None of the criticisms mentioned 
concerning fiducial probability apply to structural 
probability. 

D. A. Sprott (1964) uses a more general class 
of transformations to analyze a wider range of 
fiducial distributions. 

Some alternative methods have been proposed 
for obtaining probability distributions concerning 
parameter values: Dempster (1963; 1966) pro- 
poses direct probabilities, and Verhagen (1966) 
proposes induced probabilities. 

D. A. S. FRASER 


[See also ESTIMATION, article on CONFIDENCE INTERVALS
AND REGIONS.]
FIELD THEORY


In psychology, the term “field theory” is used 
primarily to designate the point of view of Kurt 
Lewin and his co-workers. Although the term has 
its origin in physics, where it is employed to 
refer to the conceptualization of electromagnetic 
phenomena in terms of fields of electromagnetic 
forces, field theory in psychology is not an attempt 
to explain psychological events in terms of physical 
processes. Rather, it refers to a “method of analyz- 
ing causal relations and of building scientific con- 
structs” (Lewin 1943a), which Lewin felt could 
be applied as fruitfully in psychology as it had been 
in physics. [See LEWIN.]

Lewin’s early interests and writings—much in- 
fluenced by the German philosopher Ernst Cassirer 
—were concerned with the nature of theory in 
science. Much of his own research orientation and 
of his great impact on psychology arose from his 
way of thinking about theorizing. His approach 
to theorizing in psychology was characterized by 
several major themes: 

(1) An emphasis on the psychological explana- 
tion of behavior focuses on the purposes which 
underlie behavior and the goals toward or away 
from which behavior is directed; it stresses that 
one has to deal with what exists psychologically, 
what is real for the person being studied. 

(2) An emphasis on the total situation stresses 
that the most fundamental construct is that of 
the psychological “field,” or life space. All psycho- 
logical events are conceived to be a function of 
the life space, which consists of the person and the
environment viewed as one constellation of interdependent
factors. Lewin made it evident that it
is meaningless to explain behavior without reference
to both the person and his environment and
that psychological events are not determined by
the characteristics of the individual—“instincts,” 
“heredity,” “needs,” “habits”—acting independently 
of the situation. 

(3) An emphasis on systematic rather than 
historical causation indicates that psychological 
events have to be explained in terms of the proper- 
ties of the field which exist at the time when the 
events occur. Lewin rejected the notion of “action 
at a distance” and stressed that past events can 
have a position only in the historical causal chains 
whose interweaving creates the present situation. 

(4) The dynamic approach, like that of the 
gestalt school, accepts the view that living systems 
tend to maintain a dynamic equilibrium in relation 
to their environments. Related to this view was 
Lewin’s interest in the processes by which equi- 
librium is restored when it is disturbed and his 
interest in such motivational processes as the 
arousal of need tensions, the setting of goals, goal- 
directed action, and the release of tension. 

Lewin was not only a metatheoretician but also 
a bold and original experimentalist. However, the 
themes underlying Lewin’s approach to theorizing 
have played a central role in his empirical work 
on psychology. These themes are reflected in his 
research preoccupation with the psychological
environment (the person’s perception of the situation
confronting him), his emphasis on social and 
group influences upon behavior, his stress on the 
contemporaneous rather than the historical de- 
terminants of behavior, and his focus on dynamics 
(motivation, conflict, and change). The range and 
impact of the work of Lewin and his associates 
is indicated by a list of some of the research areas 
in psychology which they opened up for experi- 
mental investigation: dynamic studies of memory, 
resumption of interrupted activities, substitute ac- 
tivity, satiation, level of aspiration, studies of 
different types of group leadership, group decision. 
Many of the terms associated with Lewin are now
part of the common vocabulary of psychologists,
e.g., “life space,” “valence,” “locomotion,” “overlapping
situation,” “cognitive structure,” “action
research.” In the space available here, it will be
impossible to do more than sketch some of Lewin’s
central theoretical notions. This is done in five
sections which deal with (1) dynamic concepts,
(2) structural concepts, (3) socially induced
change, (4) level of aspiration, and (5) group
dynamics. 


Dynamic concepts 

After several years of work at the University of 
Berlin on the more traditional problems of per- 
ception and learning, Lewin turned to the study 
of motivation. In 1926 he published the first of a 
series of over twenty brilliant articles by himself 
and his students, “Untersuchungen zur Handlungs- 
und Affektpsychologie” (“Investigations Into the
Psychology of Behavior and Emotion”), which ap- 
peared in Psychologische Forschung. Here most of 
the concepts which later became so famous first 
appeared. Among them are the concepts “tension,” 
“valence,” “force,” and “locomotion,” which play 
a key role in Lewin’s theorizing about motivation. 

A system in a state of tension is said to exist 
within the individual whenever a psychological 
need or an intention (sometimes referred to as a 
quasi need) exists. Tension is released when the 
need or intention is fulfilled. Tension has the fol- 
lowing conceptual properties: (a) it is a state of 
a region in a given system which tries to change 
itself in such a way that it becomes equal to the 
state of surrounding regions, and (b) it involves 
forces at the boundary of the region in tension. A 
positive valence is conceived as a force field in 
which the forces are all pointing toward a given 
region of the field (the valent region which is the 
center of the force field); all the forces point away 
from a region of negative valence. The construct 
force characterizes the direction and strength of 
the tendency to change at a given point of the life 
space. Change may occur either by actual locomo- 
tion (i.e., change in position) of the person in his 
psychological environment or by a change in the 
structure of his perceived environment.

There exists a definite relation between tension 
systems of the person and certain properties of the 
psychological environment. In particular, a tension 
may be related to a positive valence for activity 
regions in the psychological environment which 
are perceived as tension reducing, and a negative 
valence for the region in which the behaving self 
is at present. However, the existence of a region 
of positive valence (a goal region) depends not 
only upon the existence of tension but also upon 
whether there are perceived possibilities for reduc- 
ing the tension. 

When a goal region which is relevant to a sys- 
tem in tension exists in the psychological environ- 
ment, one can assert that there is a force acting 
upon the behaving self to locomote toward the 
goal. A tension for which there is a cognized goal 
leads not only to a tendency to actual locomotion 
toward the goal region but also to thinking about 
this type of activity. This may be expressed by saying
that the force on the person toward the goal
exists not only on the level of doing (reality) but 
also on the level of thinking (irreality). 

The Zeigarnik quotient. From the foregoing 
assumptions about systems in tension it is possible 
to make a number of derivations. Thus, it follows 
that the tendency to recall or resume interrupted 
activities should be greater than the tendency to 
recall or resume finished ones. Zeigarnik (1927) 
and many others have conducted experiments in 
which subjects are given a series of tasks to per- 
form and are then prevented from completing half 
of them. Later, the subjects are asked to recall the 
tasks they had performed. The results are pre- 
sented in the form of a quotient, commonly called 
the Zeigarnik quotient (ZQ): 

$$ZQ = \frac{\text{uncompleted tasks recalled (RU)}}{\text{completed tasks recalled (RC)}}$$

Zeigarnik predicted a quotient greater than 1.
The obtained quotient was approximately 1.9,
clearly supporting Lewin’s assumptions. However, 
since many completed tasks were also recalled, it 
was obvious that additional factors were involved. 
Analyzing the situation of the subject at the mo- 
ment of recall, Zeigarnik concluded that in addi- 
tion to the force on the person to think about, and 
hence to recall, the uncompleted tasks, there was 
also present a force to recall both uncompleted 
and completed tasks exerted upon the person by 
the experimenter’s instructions to “try to recall 
the tasks you worked on earlier.” The Zeigarnik 
quotient could be viewed as a function of the rela- 
tive strengths of the induced force to recall all 
tasks and of the force to recall the uncompleted 
tasks. As the strength of the force induced by the 
experimenter increases in relation to the force 
toward the task goals, the quotient should decrease 
toward 1; as it decreases in relative strength, the 
quotient should increase beyond 1. These addi- 
tional predictions, which follow from an analysis 
of the situation at recall, were borne out in ex- 
periments by Zeigarnik and others. Thus, if the 
strength of motivation associated with the inter- 
rupted task is relatively high, or the strength of 
the experimenter’s pressure to recall is low, or if 
the task is interrupted near its end, the Zeigarnik 
quotient will be high. 
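
The quotient itself is a simple ratio of recall counts, as the following sketch shows. The function name and the counts are hypothetical, and the sketch assumes, as in the design described above, that equal numbers of tasks were interrupted and completed, so that raw counts can be compared directly.

```python
def zeigarnik_quotient(recalled_uncompleted, recalled_completed):
    """ZQ = uncompleted tasks recalled (RU) / completed tasks recalled (RC)."""
    if recalled_completed == 0:
        raise ValueError("ZQ is undefined when no completed tasks are recalled")
    return recalled_uncompleted / recalled_completed


# Hypothetical subject: 7 interrupted and 4 completed tasks recalled.
zq = zeigarnik_quotient(recalled_uncompleted=7, recalled_completed=4)
print(f"ZQ = {zq:.2f}")
```

A value above 1 indicates better recall of interrupted tasks; a value below 1 indicates the reverse.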

A number of more recent experiments have in- 
dicated that the situation of recall is frequently 
even more complex than indicated above. When 
not finishing a task is interpretable as a personal 
failure (e.g., in an experiment where the tasks 
are presented as measures of a socially esteemed 
ability) and when recall of failure threatens one’s
self-esteem, or when the recall of success raises a 
lowered self-esteem, the Zeigarnik quotient tends 
to be less than 1. 

Substitute value. Käte Lissner initiated the 
study of the value which one activity has for re- 
ducing a tension originally connected with another 
activity by a technique involving resumption of 
the interrupted task; some of the other experi- 
menters have employed the technique of recall. 
The substitute value is measured by the amount 
of decrease in resumption or recall of the inter- 
rupted original activity after a substitute activity 
has been completed. The results of the experi- 
ments on substitute value can be summarized as 
follows: (1) Substitute value increases with the 
perceived degree of similarity between the original 
and the substitute activity and with the degree of 
difficulty of the substitute activity (Lissner 1933). 
(2) Substitute value increases with increasing 
temporal contiguity between the original and the 
substitute activity and with the attractiveness of 
the substitute activity (Henle 1942). (3) The sub-
stitute value of an activity (e.g., thinking, talking,
or doing) depends upon the nature of the goal of 
the original task. Tasks that are connected with 
the goal of demonstrating something to another 
person (e.g., the experimenter) require an observ- 
able substitute activity (not merely “thinking” 
without social communication); “realization tasks,”
in which the building of a material object is the 
goal, require “doing,” not only telling how it can 
be done; for intellectual problems, talking (or tell- 
ing how it can be done) can have a very high 
substitute value (Mahler 1933). (4) “Magic solu- 
tions,” “make-believe solutions,” or solutions which 
observably violate the requirements of the task 
have little substitute value for tasks at the reality 
level. However, if the situation is a make-believe 
or play situation, make-believe substitutions will 
have substitute value (Sliosberg 1934; Dembo
1931). (5) A substitute activity which is identical 
with the original activity will have little substitute 
value if it does not serve the same goal. Thus, 
building a clay house for Tony will have little sub- 
stitute value for building a clay house for Nicky. 
If the emphasis is on building a clay house and 
not upon the “for somebody,” then, of course, 
substitution will occur (Adler & Kounin 1939). 
(6) Having someone complete the subject’s inter- 
rupted task tends to have little substitute value, 
particularly when completion of the task is related 
to self-esteem. However, when pairs of individuals 
work cooperatively on a task, the completion of the 
task by one’s partner has considerable substitute 
value (e.g., Lewis & Franklin 1944). 


The research findings with respect to substitute 
value have implications for a wide range of prob- 
lems in psychology—from the relative gratification 
value of individual versus socially shared projec- 
tive or fantasy systems to the development of 
specialized roles within a group. Let us briefly 
illustrate with the very important finding that 
the actions of another person can be a substitute 
for one’s own actions if there is a cooperative 
relationship. The fact of substitutability enables 
individuals who are working cooperatively on a 
common task to subdivide the task and to perform 
specialized activities, since none of the individuals 
in a cooperative situation has to perform all the 
activities by himself. In contrast, the individual 
in a competitive situation is less likely to view the 
actions of others as substitutable for similarly 
intended actions of his own. Thus, when a com- 
petitive situation exists in a group, specialization 
of activities is less likely to develop (Deutsch 
1949a; 1949b). 

Satiation. The concept of tension systems has 
also been fruitfully employed in experimental 
studies of satiation. With regard to most needs, 
one can distinguish a state of deprivation, of sati- 
ation, and of oversatiation. These states correspond 
to a positive, a neutral, and a negative valence of 
the activity regions which are related to a par- 
ticular need or tension system. Karsten (1928) has 
studied the effect of repeating over and over again 
such activities as reading a poem, writing letters, 
drawing, and turning a wheel. The main symp- 
toms of oversatiation appear to be (1) the appear- 
ance of subunits in the activity that lead to the 
disintegration of the total activity and a loss of 
meaning of the activity; (2) increasingly poorer 
quality of work and greater frequency of errors in 
performing the task; (3) an increasing tendency 
to vary the nature of the task, accompanied by a 
tendency for each variation to be quickly satiated; 
(4) a tendency to make the satiated activity a
peripheral activity by attempting to concentrate 
on something else while doing the task—this is 
usually not completely successful, and the mind 
wanders; (5) increasing dislike of the activity and 
of similar activities, accompanied by an increased 
valence for different tasks; (6) emotional out- 
bursts; (7) development of “fatigue” and similar 
bodily symptoms which are quickly overcome when 
the individual is shifted to another activity. 

Satiation occurs only if the activity has, psycho-
logically, the character of marking time or of
getting nowhere. If the activity can be viewed as
making progress toward a goal, the usual symp-
toms of satiation will not appear. Embedding an
activity in a different psychological whole, so that
its meaning is changed, has practically the same
effect on satiation as shifting to a different activity. 
The rapidity with which satiation occurs depends 
upon factors such as the nature of the activity— 
with increasing size of its units of action and with 
increasing complexity satiation occurs more slowly 
—and the state of the person—the more fluid the 
state of the person, the more quickly he is satiated. 
The rate of satiation and cosatiation of similar 
activities (i.e., the spread of satiation effects from 
one activity to similar activities) decreases with 
age and with lack of intelligence (Kounin 1941). 


Structural concepts 


Lewin attempted to develop a geometry, which 
he termed “hodological space,” to represent a per- 
son’s conception of the means-end structure of 
the environment, of what leads to what. Although 
Lewin’s “hodological space” was never developed 
adequately from a mathematical viewpoint, it 
served to highlight the necessity of considering a 
person’s conception of his environment in analyz- 
ing his behavior possibilities and in characterizing 
the direction of his behavior. For example, an in- 
dividual who walks around a fence to get to a ball 
behind it is, psychologically, walking toward the 
ball as he physically walks away from it. 

Cognitive structure. The view that direction in 
the life space is dependent upon cognitive structure 
has been applied to provide insights into some of 
the psychological properties of situations which are 
cognitively unstructured. 

Most new situations are cognitively unstruc- 
tured, since the individual is unlikely to know 
“what leads to what.” As he strikes out in any di- 
rection, he does not know whether he is going 
toward or away from his goal. His behavior will 
be exploratory, trial-and-error, vacillating, and 
contradictory rather than efficient and economical. 
If reaching the goal has positive significance and 
not reaching it has negative significance for the 
individual concerned, then being in a region that 
has no clear cognitive structure results in psycho- 
logical conflict, since the direction of the forces 
acting upon him is likely to be both toward and 
away from any given region. There will be evi- 
dences of emotionality as well as cautiousness in 
such situations. In addition, the very nature of an 
unstructured situation is that it is unstable; per- 
ception of the situation shifts rapidly and is readily 
influenced by minor cues and by suggestions from 
others.

Lewin has employed his characterization of cog- 
nitively unstructured situations to give insight 
into the psychological circumstances of the adoles- 
cent (1939-1947) as well as those of minority 
group members (1935), of people suffering from
physical handicaps (Barker 1946), of the nou- 
veaux riches, and of other persons crossing the 
margins of social classes. It may be applied to any 
situation in which the consequences of behavior 
are seemingly unpredictable or uncontrollable; in 
which benefits and harms occur in an apparently 
inconsistent, fortuitous, or arbitrary manner; or in 
which one is uncertain about the potential reac- 
tions of others. 

Conflict. Lewin has brilliantly employed his 
structural concepts, in conjunction with his dy- 
namic concepts, to give insight into the nature of 
conflict situations. He distinguishes three funda- 
mental types of conflict: 

(1) The individual stands midway between two 
positive valences of approximately equal strength. 
This type of conflict is unstable. As the individual, 
due to the play of chance factors, moves from the 
point of equilibrium toward one goal region rather 
than the other, the resultant force toward that 
region increases, and, hence, he will continue to 
move toward that region. This follows from the 
assumption that the strength of force toward a 
goal region decreases with increasing distance 
from the goal. 

(2) The second fundamental type of conflict 
occurs when the individual finds himself between 
two approximately equal negative valences. The 
punishment situation is an example. This type of 
conflict is very much influenced by the structure of 
the situation. Let us illustrate three varieties of this 
type of conflict: (a) the individual is between two 
negative valences, but there are no restraints keep- 
ing him in the situation—e.g., a girl who will have 
to marry an unpleasant suitor or become an im- 
poverished spinster if she remains in her village 
(there is nothing to prevent her from leaving her 
village); (b) the individual is between two nega- 
tive valences, but he cannot leave the field—e.g., a 
group member who is faced with the prospect of 
losing social status or of performing an unpleasant 
task (he cannot leave the group); (c) the individ- 
ual is in a region of negative valence and can leave 
it only by going through another region of negative 
valence—e.g., a man is cited for contempt of Con- 
gress for not testifying whether or not some of his 
acquaintances were members of the Communist 
party (to purge himself of contempt he must be- 
come an informer). 

It is evident that the situation depicted in (a) 
will lead the person to go out of the field. Only if 
restraints prevent the individual from leaving the 
field will such a situation result in more than mo- 
mentary conflict. Restraints as in (b) introduce a 
conflict between the driving forces related to the 
negative valence and the restraining forces related
to the barrier. There is a tendency for a barrier to 
acquire a negative valence which increases with 
the number of unsuccessful attempts to cross it
and which, finally, is sufficiently strong to prevent 
the individual from approaching it (Fajans 1933). 
Thus, the conflict between driving and restraining 
force is replaced by a conflict between driving 
forces, as in (c) above. This fact is particularly im- 
portant for social psychology since, in many situa- 
tions of life, the barriers are social. When a person 
turns against the barrier, he is in effect directing 
himself against the will and power of the person 
or social group to whom the erection of the barrier 
is due. 

(3) The third fundamental type of conflict sit- 
uation occurs when the individual is exposed to 
opposing forces deriving from a positive and a neg- 
ative valence. One can distinguish at least three 
different forms of this conflict: (a) a single region 
has both positive and negative valence. (The Freud- 
ian concept of ambivalence is subsumable under 
this variety of conflict.) For example, a person 
wishes to join a social group but fears that being 
a member will be too expensive. (b) A person is 
encircled by (but not actually in) a negative or a 
barrier region and is attracted to a goal which is 
beyond it—e.g., a person who has to go through the 
unpleasant ordeal of leaving his home or his group 
in order to pursue some desired activity. In this 
situation, the region of the person’s present activity 
tends to acquire negative valence as long as the 
region which encircles the person hinders locomo- 
tion toward desired outer goals. Thus, being a mem- 
ber of a minority group or being in a ghetto or in 
a prison often takes on negative valence, apart 
from the inherent characteristics of the situation, 
because one can get to desired goals from the re- 
gion only by passing through an encircling region 
of negative valence. (c) A region of positive valence 
is encircled by or is accessible only through a region 
of negative valence. This type of situation differs 
from (b) in that the region of positive valence 
rather than the region in which the person is to be 
found is encircled by the negative valence. The
“reward situation,” in which the individual is grant- 
ed a reward only if he performs an unpleasant task, 
is an example of this type of conflict. 

Lewin ([1939-1947] 1963, p. 259), as well as 
Miller (1944), has pointed out that the forces cor- 
responding to a negative valence tend to decrease 
more rapidly as a function of psychological dis- 
tance than do the forces corresponding to a posi- 
tive valence. The amount of decrease depends also 
upon the nature of the region which has a positive
or negative valence. It is different in the case of a
dangerous animal which can move about than it 
is in the case of an immovable unpleasant object. 
The difference in the gradients of decrease for 
forces deriving from positive and negative valences 
accounts for the apparent paradox that a strong 
fear or a strong tendency to withdraw may be taken 
as evidence of a strong desire for the goal. Only 
with a very attractive goal will the equilibrium 
point between approach and avoidance tendencies 
be close enough to the negatively valent region to 
produce a strong force away from the goal. On the 
other hand, strengthening the negative valence of 
a region may very well have the effect of weaken- 
ing the forces in conflict, since the equilibrium 
point may be pushed a considerable distance away 
from the valent regions. [See CONFLICT, article on
PSYCHOLOGICAL ASPECTS.]
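
The argument about gradients can be followed with a small numerical sketch. It is not taken from Lewin or Miller: it assumes, purely for illustration, that the positive and negative valences belong to the same region, that both forces fall off as straight lines with psychological distance, and that the avoidance force is the stronger one at the region itself. The function `equilibrium` and all parameter names and values are hypothetical.

```python
def equilibrium(approach_at_goal, approach_slope, avoid_at_goal, avoid_slope):
    """Return (distance, force) at which approach and avoidance forces balance.

    Each force is modeled (an assumption) as a straight line that falls
    off with distance d from the valent region:
        force(d) = strength_at_goal - slope * d
    """
    if avoid_slope <= approach_slope:
        raise ValueError("avoidance must fall off more steeply than approach")
    if avoid_at_goal <= approach_at_goal:
        raise ValueError("avoidance must exceed approach at the region itself")
    distance = (avoid_at_goal - approach_at_goal) / (avoid_slope - approach_slope)
    force = approach_at_goal - approach_slope * distance
    if force < 0:
        raise ValueError("forces vanish before they balance; no conflict point")
    return distance, force


baseline = equilibrium(6.0, 0.5, 10.0, 2.0)
more_attractive_goal = equilibrium(9.0, 0.5, 10.0, 2.0)
stronger_negative_valence = equilibrium(6.0, 0.5, 16.0, 2.0)

for label, (d, f) in [
    ("baseline", baseline),
    ("more attractive goal", more_attractive_goal),
    ("stronger negative valence", stronger_negative_valence),
]:
    print(f"{label}: equilibrium distance {d:.2f}, opposing forces {f:.2f}")
```

With these illustrative numbers, making the goal more attractive moves the equilibrium point closer to the region and raises the strength of the opposed forces there, whereas strengthening the negative valence pushes the equilibrium point farther away and lowers them.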


Socially induced change 


It should be noted that Lewin’s motivational con- 
cepts do not presuppose that motivation is primarily 
induced by physiological deficits; his conceptualiza- 
tion suggests that tension and valences may be 
aroused socially (e.g., by an experimenter’s instruc- 
tions to perform a task); he also indicated that the 
forces acting upon an individual may be “imposed” 
or may directly reflect the individual's own needs. 

The concept of power field is relevant in this con-
nection. It is an inducing field; it can induce
changes in the life space within its area of influ- 
ence. The distinction between own and induced 
forces has been found to be useful in explaining 
some of the differences in behavior under auto- 
cratic and democratic leadership (Lippitt & White 
1943). Thus, children in a club led by authoritarian 
leaders (who determined policy, dictated activities, 
and were arbitrary and personal in evaluation of 
activities) tended to develop little of their own 
motivation with respect to club activities. Although 
the children worked productively when the leader 
was present (i.e., when his power field was psycho-
logically effective), the lack of personal motivation
toward group goals clearly evidenced itself in 
(1) change of behavior when the leader left the 
club, (2) absence of motivation when the leader 
arrived late, (3) lack of carefulness in the work, 
(4) lack of initiative in offering spontaneous sug-
gestions in regard to club projects, (5) lack of pride
in the products of club effort. [See LEADERSHIP.]

The distinction between own and induced forces
has also been employed to explain why workers
are usually happier and more productive when they
can participate in the decisions which affect their
work. Participation in goal setting is more likely
to create own forces toward the goal and thus re-
duce the necessity to exert continuous social influ-
ences toward the same end (Coch & French 1948).


Level of aspiration 

Perhaps no other area of research that Lewin 
and his students have opened to experimental in- 
vestigation has been the object of so many studies 
as that of level of aspiration. The level of aspiration 
may be defined as the degree of difficulty of the 
goal toward which the person is striving. The con- 
cept of level of aspiration is relevant only when 
there is a perceived range of difficulty in the attain- 
ment of possible goals and a variation in valence 
among the goals along the range of difficulty. 

In discussing the level of aspiration, it may be 
helpful to consider a sequence of events which is 
typical for many of the experimental studies in this 
area: (1) A subject plays a game (or performs a 
task) in which he can obtain a score (e.g., throw- 
ing darts at a target); (2) after playing the game 
and obtaining a given score, he is asked to tell what 
score he will undertake to make the next time he 
plays; (3) he then plays the game again and 
achieves a different score; (4) he reacts to his sec- 
ond performance with feelings of success or failure, 
with a continuing or new level of aspiration, etc. 
In the foregoing sequence, point (4) (reaction to 
achievement) is particularly significant for the dy- 
namics of the level of aspiration. 

In outline form, the theory of the level of aspira- 
tion is rather simple (Lewin et al. 1944). It states 
that the resultant valence of any level of difficulty 
will be equal to the valence of achieving success 
times the subjective probability of success, minus 
the valence of failure times the subjective prob- 
ability of failure. The level of aspiration (i.e., the 
goal an individual will undertake to achieve) will 
be the level of difficulty that has the highest result- 
ant positive valence. The subjective experience of 
success or failure is determined by the relation of 
the individual's performance to his level of aspira- 
tion (providing, of course, that the performance 
is seen to be self-accomplished) and not simply by
his absolute accomplishments. 
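
The rule stated above can be written out as a short computation. The sketch below follows the verbal formula directly, treating the valence of failure as a positive magnitude that is subtracted; it additionally assumes that the subjective probability of failure is one minus that of success. The class, the function names, and the numerical values are hypothetical and serve only to illustrate the rule.

```python
from dataclasses import dataclass


@dataclass
class DifficultyLevel:
    name: str
    valence_success: float  # attractiveness of succeeding at this level
    valence_failure: float  # unpleasantness of failing, as a magnitude
    p_success: float        # subjective probability of success (0..1)


def resultant_valence(level: DifficultyLevel) -> float:
    """Valence of success times P(success) minus valence of failure times P(failure)."""
    p_failure = 1.0 - level.p_success  # assumption: the two probabilities sum to 1
    return level.valence_success * level.p_success - level.valence_failure * p_failure


def level_of_aspiration(levels):
    """Return the difficulty level with the highest resultant valence."""
    return max(levels, key=resultant_valence)


# Illustrative values only: harder goals are more attractive to reach and
# less painful to miss, but are also less likely to be reached.
levels = [
    DifficultyLevel("easy", valence_success=1.0, valence_failure=8.0, p_success=0.95),
    DifficultyLevel("moderate", valence_success=5.0, valence_failure=3.0, p_success=0.60),
    DifficultyLevel("hard", valence_success=10.0, valence_failure=1.0, p_success=0.10),
]

for level in levels:
    print(level.name, round(resultant_valence(level), 2))
print("level of aspiration:", level_of_aspiration(levels).name)
```

With these values the intermediate level is chosen, since the easy goal offers little valence for success and the hard goal is discounted by its low subjective probability of success.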

Experimental work on the level of aspiration has 
brought out the variety of influences which affect 
the positive and negative valences of different levels 
of difficulty. It has indicated that cultural and group 
factors establish scales of reference which help to 
determine the relative attractiveness of different 
points along a difficulty continuum. Some of these 
influences are rather stable and permanent in their 
effects. It has been found, for example, that most 
people in Western cultures, under the pervasive 
cultural pressures toward “self-improvement,” when
first exposed to a level of aspiration situation ini- 
tially give a level of aspiration which is above the 
previous performance score, and under most condi- 
tions they tend to keep their level of aspiration 
higher than their previous performance. In addi- 
tion to broad cultural factors, the individual's level 
of aspiration in a task is likely to be very much in- 
fluenced by the standards of the group to which he 
belongs. The nature of the scales of reference set 
up by different group standards may vary. Refer- 
ence scales do not derive solely from membership 
in a definitely structured social group; they also 
may reflect the influence of one’s self-image, of 
other individuals, or of groups that either establish 
certain standards for performance or that serve as 
models for evaluating self-performance. Thus, the 
level of aspiration of a college student with respect 
to an intellectual task will vary, depending upon 
whether he is told that a given score was obtained 
by the average high school student, the average 
college student, or the average graduate student. 

Research has given some insight into the factors 
determining the values on the scale of subjective 
probability. A main factor which determines the 
subjective probability of future success and failure 
is the past experience of the individual in regard to 
his ability to reach certain objectives. If the individ- 
ual has had considerable experience with a given 
activity, he will know pretty well what level he can 
expect to reach, and the gradient of values on the 
subjective probability scale will be steep. However, 
it is not only the average of past performances 
which determines an individual's subjective prob- 
ability scale but also the trend—whether he is im- 
proving, getting worse, or remaining the same. 
Furthermore, there is experimental evidence to in- 
dicate that the last or most recent success or failure 
has a particularly great influence on the individual's 
expectation of his future achievement level. In ad- 
dition, there is evidence that the subjective prob- 
ability scales, as well as the performance of others, 
can influence the subject’s own probability scale.
Personality factors—for example, self-confidence— 
may also influence subjective probability. 

The level of aspiration theory has widespread im- 
plications for many social phenomena. It gives in- 
sight into the reasons for social apathy in the face 
of pressing political and international problems. 
People are not likely to attempt to achieve even 
highly valued objectives when they see no way of 
attaining them. Similarly, it sheds some light upon 
why social revolution tends to occur only after there 
has been a slight improvement in the situation of 
the oppressed groups—the improvement raises 
their level of aspiration, making goals which were
once viewed as unattainable now perceived as real- 
istic possibilities. 


Concepts of group dynamics 


Apart from papers dealing with group decision 
and social change (Lewin 1935-1946; 1947a; 
1947b), Lewin actually wrote very little on the theory
of group dynamics. However, from the research in- 
vestigations of his colleagues and students at the 
Research Center for Group Dynamics a formidable 
array of concepts has emerged. [See GROUPS.]

Let us begin our discussion of the concepts of 
group dynamics by briefly considering the concept 
of group. Lewin ([1935-1946] 1948, p. 48) wrote: 


The essence of a group is not the similarity or dis- 
similarity of its members, but their interdependence. A 
group can be characterized as a “dynamic whole”; this 
means that a change in the state of any subpart 
changes the state of any other subpart. The degree of 
interdependence of the subparts of members of the 
group varies all the way from a loose “mass” to a com- 
pact unit. 


French (1944) pointed out that in addition to in- 
terdependence, membership in a group presupposes 
identification with the group. Deutsch (1949a) 
indicated that the interdependence is promotive or 
cooperative rather than, for example, competitive. 
Thus, a group may be tentatively defined as being 
composed of a set of members who mutually per- 
ceive themselves to be cooperatively or promotively 
interdependent in some respect(s) and to some 
degree. 

Cohesiveness. One of the key concepts, which 
has been the subject of much experimental investi- 
gation, is that of cohesiveness. Intuitively, cohesive- 
ness refers to the forces which bind the parts of a 
group together and which, thus, resist disruptive 
influences. Hence, the study of conditions affecting 
group cohesiveness and of the effects upon group 
functioning of variations in group cohesiveness is 
at the heart of the study of group life. Festinger, 
Schachter, and Back (1950) have defined cohesive- 
ness, in terms of the group member, as “the total 
field of forces which act on members to remain in 
the group.” The nature and strength of the forces
acting upon a member to remain in the group may
vary from member to member.

Various measures of individuals’ “cohesiveness” 
to their groups have been employed in experimental 
investigations: desire to remain in the group, the 
ratio of “we” remarks to “I” remarks during group 
discussion, ratings of friendliness, evaluations of 
the group and its sociometric choice versus out-
group sociometric choice, and others. Deutsch
(1949a), in a theoretical paper, provides a ration-
ale for the use of a wide variety of measures by
developing the hypotheses that members of more 
cohesive (cooperative) groups, as compared with 
members of less cohesive (competitive) groups,
would, under conditions of success: (a) be more 
ready to accept the actions of other group members 
as substitutable for similarly intended actions of 
their own (and therefore would not have to per- 
form them also); (b) be more ready to accept in- 
ductions (i.e., be influenced) by other members; 
and (c) be more likely to cathect positively the 
actions of other group members. From these core 
hypotheses, with the addition of more specific as- 
sumptions, it is possible to derive the influence 
of more or less cohesiveness upon many aspects of 
group functioning. Thus, from the substitutability 
hypothesis, it is possible to predict that more spe- 
cialization of function, more subdivision of activity, 
and more diversity of membership behavior would 
occur in the more cohesive groups. The inductibility 
hypothesis leads to the prediction that members of 
more cohesive groups would be more attentive to 
one another, be more understood by one another, 
be more influenced by one another, and be more 
likely to change and have more internalization of 
group norms than members of less cohesive groups. 
The cathexis hypothesis leads to predictions of 
greater friendliness, greater ratio of in-group socio-
metric choices, etc., in the more cohesive groups.
Data in a variety of experiments (e.g., Deutsch 
1949b; Schachter 1951; Back 1951) support the 
foregoing predictions. [See COHESION, SOCIAL.]

Communication. In a well-integrated program
of research, Festinger and his co-workers have de- 
veloped a series of fertile hypotheses (Festinger 
1950) and have conducted some ingenious exper- 
iments on the communication process within 
groups. In brief, these investigators have been con- 
cerned with three sources of pressures to commu-
nicate within groups: (a) communications arising
from pressures toward uniformity in a group (Back 
1951; Festinger & Thibaut 1951; Schachter 1951); 
(b) communications arising from forces to loco- 
mote in a social structure (Kelley 1951; Thibaut 
1950); and (c) communications arising from the 
existence of emotional states (Thibaut 1950; Thi-
baut & Coules 1952). [See INTERACTION.]

Pressures toward uniformity. Festinger (1950;
1954) has indicated two major sources of pressures
toward uniformity in a group: social reality and
group locomotion. He indicates that when there is
no single objective basis for determining the valid-
ity of one’s beliefs, one is dependent upon social
reality to establish confidence in one’s beliefs. Thus,
to evaluate the validity of his opinions or to eval-
uate his abilities or even the appropriateness of his
emotions (Schachter 1959), he will compare them 
with those of others. Festinger (1954) posits that 
to facilitate accuracy of evaluation of their opinions 
or abilities, people will seek out others with similar 
opinions or abilities for comparison and will avoid 
dissimilarity or attempt to reduce it when it exists. 
Also, lack of agreement among members of a group 
provides an unstable basis for beliefs which depend 
upon social consensus for their support, and hence 
—in line with Heider’s discussion of the tendency 
toward cognitive balance (1958) or Festinger’s 
theory of cognitive dissonance (1957)—forces will 
arise to produce uniformity. [See CONFORMITY.] 

Pressures toward uniformity among members of 
a group may also arise because such uniformity is 
desirable or necessary in order for the group to lo- 
comote toward some goal. Greater uniformity in 
opinion within a group can be achieved in either of 
the following ways: (a) by actions (i.e., communi- 
cations) that are directed at changing the views of 
others or one’s own views or (b) by actions to make 
others incomparable in the sense that they are no 
longer effective as a comparison for one’s opinions 
—e.g., by rejecting or excluding people with deviat- 
ing opinions from the group. 

Experiments have shown that increasing the at- 
traction to the group, and thus increasing the im- 
portance of the group as a comparison object, in- 
creases the amount of influence which is attempted 
and the amount of opinion change which occurs 
when there is opinion discrepancy within a group 
(Back 1951). It has also been demonstrated that 
the more relevant or important the opinion is for 
the functioning of the group, the more pressure there 
will be for uniformity in a group (Schachter 1951; 
Festinger & Thibaut 1951). Also, as may be ex- 
pected from theoretical considerations, there is evi- 
dence that when pressures toward uniformity exist, 
the concern is mainly with those members of the 
group who have opinions most divergent from one’s 
own. Thus, members exert influence mainly on 
those whose opinions are most divergent from their 
own. 

Pressures toward change. The experiments re- 
vealing the tendency to direct communication to- 
ward deviants in the group, and through communi- 
cation to exert social pressure upon them, provide 
support for Lewin’s theory of group decision and 
social change. Lewin (1947b) began his analysis
of change by pointing out that the status quo in
social life is not a static affair but a dynamic proc-
ess that flows on while still keeping a recognizable
form. He borrowed the term “quasi-stationary equi-
libria” from physics to apply to such ongoing proc-
esses, which are kept at their present level by fields
of forces preventing a rise or fall. The field of 
forces in the neighborhood of the level of equili- 
brium presupposes that the forces against ex- 
ceeding the equilibrium level increase with the 
amount of raising and that the forces against 
lowering increase (or remain constant) with the 
lowering. Thus, if we assume that a group stand- 
ard is operating to determine the level of worker 
productivity in a factory, any attempt upon the 
part of the worker to deviate from the standard 
by higher productivity will only result in stronger 
forces being induced upon him by his co-workers 
in order to push him back into line. That is, as 
the Festinger experiments have demonstrated, the 
deviant will be exposed to stronger pressures to- 
ward uniformity the more he deviates. However, 
as Lewin points out, the gradient of forces may 
change at a distance from the equilibrium level, 
so that after an individual has gone a certain dis- 
tance from the equilibrium level, the forces may 
push him away from rather than pull him toward 
the group standard. 

Lewin’s analysis of the status quo as a quasi- 
stationary equilibrium has two major implications. 
First, it points out that change from the status 
quo can be produced either by adding forces in 
the desired direction or by diminishing opposing 
forces. However, the two methods of producing 
change have different consequences: if forces are 
added, the process on the new level will be ac- 
companied by a state of relatively high tension, 
since the strength of forces in opposition will be 
greater; if the opposing forces are decreased, the 
new level will be accompanied by lower tension. 
Second, it highlights the difficulties of attempt- 
ing to change group-rooted individual conduct and 
attitudes through efforts directed at the individual 
and not his group. If one endeavors to change 
the prejudices of an individual without changing 
the prejudice of the group to which he belongs, 
the individual will either be estranged from his
group or be under pressure from his group to re- 
vert to his initial attitudes. Isolated individuals 
may perhaps change their attitudes because of 
their individual experiences, but the person who 
is deeply enmeshed in the social life of his com- 
munity is unlikely to be able to resist the pressures 
to conform on matters of community importance 
if he wishes to continue in good standing. [See 
ATTITUDES, article on ATTITUDE CHANGE.] 

Considerations such as the foregoing have led 
to a series of experiments in various settings (in the
school, with neighborhood groups, in industry, in an
interracial workshop) on the relative
efficacy of changing behavior by efforts directed 
at individuals or at a group (Lewin 1935-1946). 
A typical procedure has been to compare the 
results of a lecture or individual instruction in 
changing behavior regarding the use of certain 
foods with the results of a group decision favoring 
the use. Results have clearly indicated that the 
group decision method produces more change. 


The present status of field theory 


In previous sections of this article, work that 
was directly initiated or stimulated by Lewin or 
his immediate associates has been described. The 
large sweep of this work, the brilliant innovations, 
the experimental ingenuity cannot help but be 
impressive. Even though Lewin died in 1947, his 
impact on psychology continues to be felt in the 
extension and application of his ideas by others. 
We turn, briefly, to a discussion of some current 
work in psychology—a theory of achievement mo- 
tivation, balance and dissonance theory, and T- 
group training (laboratory training groups) which 
extends Lewin’s ideas. 

A theory of achievement motivation. Atkinson 
(1957; 1964) has developed a theory of achieve- 
ment motivation that is an extension and elabora- 
tion of ideas advanced in the theory of level of 
aspiration developed by Lewin, Festinger, and 
Sibylle Escalona. Atkinson’s theory attempts to 
account for the determinants of the direction, mag- 
nitudes, and persistence of achievement-motivated 
performance. Achievement motivation is the resultant
of two opposed tendencies: $T_s - T_f$ (the
tendency to achieve success minus the tendency
to avoid failure; in Atkinson’s notation the latter is
represented as $T_{-f}$).

The tendency to achieve success is assumed
to be a multiplicative function of the motive to
achieve success ($M_s$), which the individual carries
about with him from situation to situation, the
subjective probability of success ($P_s$), and the
incentive value of success at a particular activity
($I_s$): $T_s = M_s \times P_s \times I_s$. Similarly, the tendency to
avoid failure is assumed to be a multiplicative function
of the motive to avoid failure ($M_{af}$), the subjective
probability of failure ($P_f$), and the negative
incentive value of failure ($I_f$): $T_f = M_{af} \times P_f \times I_f$.

So far, Atkinson’s analysis parallels the level 
of aspiration theory, if one assumes, quite properly, 
that the Lewinian concept of valence is equivalent 
to Atkinson’s motive times incentive: namely, that
the valence of success $= M_s \times I_s$ and that the
valence of failure $= M_{af} \times I_f$. He, however, introduces
the additional assumptions that $I_s = 1 - P_s = P_f$
and that $I_f = -P_s$. In other words, his theory details
more unequivocally the relationship between the 
perceived level of difficulty and the incentive values 
of success and failure. It states, in effect, that the 
value of success increases with the perceived diffi- 
culty of the task, while the displeasure at failure 
increases with the perceived ease of the task. 
(Notice that in Atkinson’s theory $P_s \times I_s$ must
equal $-P_f \times I_f$ and, hence, that whatever differences
in strength there are between $T_s$ and $T_f$ will
be due solely to the differences between $M_s$ and
$M_{af}$.) In addition, he has specified methods of
measuring $M_s$ and $M_{af}$: $M_s$ is measured by the
methods developed by McClelland and his associates
(1953) to measure need achievement, and
$M_{af}$ is measured by the Mandler–Sarason Test of
Anxiety (1952).
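
The relations just stated can be checked numerically. The following Python sketch is an illustration only and not Atkinson's own procedure: the function names are hypothetical, the motive strengths in the usage note are arbitrary, and the failure tendency is computed with its sign (a negative quantity, since the incentive value of failure is negative) and then added to the success tendency, which is equivalent to subtracting its magnitude.

```python
def tendency_to_succeed(m_s: float, p_s: float) -> float:
    """Success tendency: motive * probability * incentive, with incentive = 1 - P_s."""
    return m_s * p_s * (1.0 - p_s)


def tendency_to_avoid_failure(m_af: float, p_s: float) -> float:
    """Signed failure tendency, with P_f = 1 - P_s and incentive of failure = -P_s.

    The result is zero or negative: it works against undertaking the task.
    """
    p_f = 1.0 - p_s
    return m_af * p_f * (-p_s)


def resultant_motivation(m_s: float, m_af: float, p_s: float) -> float:
    """Sum of the success tendency and the signed failure tendency.

    Algebraically this reduces to (M_s - M_af) * P_s * (1 - P_s), so the
    sign of the resultant depends only on which motive is stronger.
    """
    return tendency_to_succeed(m_s, p_s) + tendency_to_avoid_failure(m_af, p_s)
```

With the arbitrary strengths 2.0 for the motive to achieve success and 1.0 for the motive to avoid failure, `resultant_motivation(2.0, 1.0, 0.5)` evaluates to 0.25, and no other probability of success yields a larger value. Reversing the two strengths makes the resultant negative at every intermediate probability.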

Atkinson (1964) summarizes a considerable 
amount of research that is consistent with his 
theory. The wide applicability of the theory is indi- 
cated by the range of content to which the theory 
has been applied: motivational effect of ability 
grouping in schools, strength of achievement mo- 
tive and occupational mobility, fear of failure and 
unrealistic vocational aspiration, preferences for 
degrees of risk, and the effects of success and 
failure. [See ACHIEVEMENT MOTIVATION.]

Cognitive balance. One of the notions implicit 
in Lewin’s view of the life space as cognitively 
structured, and generally in the gestalt view that 
organization tends to be as “good” as possible, is 
that when a structure of beliefs and attitudes is 
imbalanced or disharmonious, a tendency will arise 
to change one’s beliefs and attitudes until they are 
balanced. Lewin’s view is brought out most clearly 
in his discussion of cognitively unstructured situa- 
tions and in his discussion of the cognitive struc- 
ture of different types of conflict situations. Heider 
(1946; 1958), one of Lewin’s most brilliant asso- 
ciates, has extended this notion in his theory of 
cognitive balance. [See THINKING, article on COGNITIVE
ORGANIZATION AND PROCESSES.]

Heider points out that cognitive stability requires
a congruence among causal expectations with respect
to related objects. That is, for a state of complete
cognitive harmony to exist, the various implications
of a person’s expectations or judgments
of any aspect of the cognized environment cannot
contradict the implications of his expectations or
judgments of any other aspect of the cognized
environment. Thus, if a person judges X to be of
potential benefit to his welfare, he cannot at the 
same time judge that Y (which is also judged to 
be of benefit to his welfare) and X are antagonistic 
and still have a stable or balanced cognitive structure.
(Let X and Y be things, people, the products
of people, or the characteristics of people.) When 
the cognitive structure is in a state of imbalance 
or is threatened by imbalance, forces will arise to 
produce a tendency toward locomotion that will 
change the psychological environment or produce 
a tendency toward change in the cognition of the 
environment. Under conditions which do not per- 
mit locomotion, the tendency for cognitive change 
is enhanced. 

Heider specifies further that (a) in regard to 
attitudes directed toward the same entity, a balanced
state exists if positive (or negative) attitudes
go together; a tendency exists to see a person as 
being positive or negative in all respects; (b) in 
regard to attitudes toward an entity combined with 
belongingness, a balanced state exists if a person 
is united with the entities he likes and if he likes 
the entities he is united with; the converse is true 
for negative attitudes; (c) if two entities are seen 
as parts of a unit, a balanced state will exist if the 
parts are seen to have the same dynamic character 
(positive or negative). If the two entities have 
different dynamic characters, a balanced state can 
exist if they are seen to be segregated (i.e., by 
breaking up the unit). 

Heider’s theory has had wide ramifications in 
social psychology. Cartwright and Harary (1956) 
extended balance theory to cover a greater range 
of phenomena and also removed some of its am- 
biguities by using the mathematical theory of 
linear graphs. Newcomb (1953; 1961) has applied
Heider’s theory to the “balancing” of interpersonal 
relations, particularly in his analysis of communi- 
cative acts and the development of friendships. 
Rosenberg and Abelson (1960) have applied a 
modification of Heider’s theory to the problems of 
attitude change. 
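
The graph-theoretic reading of balance is often summarized by a sign rule: relations are marked positive or negative, and a cycle of relations is balanced when the product of its signs is positive. The Python sketch below illustrates that rule for three-entity structures only, which is a simplification of the general treatment; the function names and the encoding of relations as +1 and -1 are assumptions made for the illustration.

```python
from itertools import combinations


def triad_is_balanced(sign_ab: int, sign_bc: int, sign_ac: int) -> bool:
    """A triad is balanced when the product of its three signs is positive."""
    return sign_ab * sign_bc * sign_ac > 0


def all_triads_balanced(signs: dict) -> bool:
    """Check every fully specified triad in a signed structure.

    `signs` maps a frozenset of two entity names to +1 (a positive relation)
    or -1 (a negative relation). Triads lacking any of their three
    relations are skipped rather than judged.
    """
    entities = sorted({name for pair in signs for name in pair})
    for a, b, c in combinations(entities, 3):
        pairs = (frozenset((a, b)), frozenset((b, c)), frozenset((a, c)))
        if all(pair in signs for pair in pairs):
            if not triad_is_balanced(*(signs[pair] for pair in pairs)):
                return False
    return True
```

Applied to the earlier example of a person who judges both X and Y to be of benefit while seeing them as antagonistic, the structure has two positive relations and one negative one, so the check reports imbalance; it reports balance once X and Y are seen as compatible.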

Dissonance theory. Leon Festinger, one of 
Lewin’s most renowned students, has developed a 
theory (1957) which is similar to balance theory in 
its stress on the need to avoid cognitive inconsistency 
(dissonance) but differs from it in emphasizing 
postdecisional processes. Lewin (1939-1947), in a 
discussion of the behavior of a housewife who 
buys food, had suggested that behavior before and 
after decisions about purchasing food differed: 
prior to the decision, the more expensive the food, 
the less likely it was to get to the family table; 
after the decision, the higher its cost, the more 
likely it was to be used. Festinger’s theory gen- 
eralizes the idea that the postdecision situation 
may differ from the predecision situation. It makes 
the original assumption that making a decision 
per se arouses dissonance and pressure to reduce
the dissonance. [See THINKING, article on COGNITIVE
ORGANIZATION AND PROCESSES.]

Postdecision dissonance results, according to 
Festinger, from the fact that the decision in favor 
of the chosen alternative is counter to the beliefs 
which favor the unchosen alternative(s). To sta- 
bilize or freeze the decision after it has been made, 
the individual will attempt to reduce dissonance 
by changing his cognitions so that the relative 
attractiveness of the chosen, as compared with 
the unchosen, alternative is increased, or by de- 
veloping cognitions which permit the alternatives 
to be substitutable for one another, or by revoking 
the decision psychologically. In Festinger’s view 
(1964), the crucial difference from the predecision 
state of dissonance is that the predecisional con- 
flict is “impartial” and “objective”; it does not lead 
to any spreading apart of the attractiveness in 
favor of the to-be-chosen alternative. Festinger 
writes: “Once the decision is made, however, and 
dissonance-reduction processes begin, one should 
be able to observe that the differences in attrac- 
tiveness between the alternatives change, increas- 
ing in favor of the chosen alternative” (1964, 
pp. 8-9). 

A variety of interesting and ingenious experi- 
ments have been stimulated by Festinger’s view 
of the postdecision process. (See Brehm & Cohen 
1962, for a detailed review.) These experiments 
have often involved “nonobvious” predictions which 
appear to defy common sense. Many of these pre- 
dictions derive from the notion that if a deci- 
sion produces insufficient rewards, the person will 
change his beliefs so as to make the decision seem 
more rewarding. Festinger (1961, p. 11) writes: 
“Rats and people come to love the things for which 
they have suffered.” Presumably they do this to 
reduce the dissonance induced by the suffering, 
and their method of dissonance-reduction is to 
enhance the attractiveness of the choice which led 
to their suffering in order to justify it. 

The theory of dissonance has stimulated exten- 
sive research into many aspects of social psychol- 
ogy: attitude and behavioral change, perceptual 
distortion, selective exposure to information, work 
productivity, and so forth. Recently, dissonance 
theory has been the subject of methodological criti- 
cism (Chapanis & Chapanis 1964) and theoretical 
criticism (Deutsch et al. 1962; Rosenberg 1965) 
which pose some difficult but perhaps not unre- 
solvable problems for the theory. 

T-group training. T-group training (laboratory 
training in groups) utilizes the experiences and 
interpersonal relations which occur in a temporary, 
laboratory-created group to help the participants
develop insights into group processes, insights into
the way that they themselves function in groups, 
and skills of participation in groups. The National 
Training Laboratories, which has become one of 
the key institutions concerned with the application 
of behavioral science to social practice, was estab- 
lished in 1947 with the cosponsorship of Lewin’s 
Research Center for Group Dynamics and has been 
very much influenced by Lewin’s ideas. His articles 
“Conduct, Knowledge, and Acceptance of New 
Values” (Lewin & Grabbe 1945) and “Frontiers in 
Group Dynamics” (1947) present the intellectual 
base for the development of the conception of labo- 
ratory training groups. These articles highlight the 
importance of the group in the process of individual 
re-education and change. In recent years, a con- 
siderable literature has been developed in connec- 
tion with the laboratory method of training indi- 
viduals in groups (see Bradford et al. 1964). While 
some of this literature is evangelistic in tone, this, 
too, is consonant with Lewin’s active and continu- 
ing concern that social science be used to make the 
world a better place in which to live. 


Although it cannot be said that Lewin’s specific 
theoretical constructs (his structural and dynamic
concepts) are central in current research in psychology,
his impact on psychology has been a major
one. His impact is reflected in the general orienta- 
tions which, today, are more and more taken for 
granted. These are that psychological events have 
to be explained in psychological terms; that central 
processes in the life space (e.g., distal perception, 
cognition, motivation, goal-directed behavior)
rather than the peripheral processes of sensory in- 
put or of muscular action should be the focus of 
investigation; that psychological events have to be 
studied in their interrelations with one another; that 
the individual has to be studied in his interrela- 
tions with the group to which he belongs; that the 
attempt to bring about change in a process is the 
most fruitful way to investigate it; that important 
social psychological phenomena can be studied ex- 
perimentally; that the scientist should have a so- 
cial conscience; and that a good theory is valuable 
for social action as well as for science.


Morton DEUTSCH 


[Directly related are the entries GESTALT THEORY;
PERSONALITY; PHENOMENOLOGY; SOCIAL PSYCHOLOGY;
THINKING, article on COGNITIVE ORGANIZATION
AND PROCESSES; and the biography of Lewin. For
comparison, other relevant approaches to psychological
phenomena may be found in PERSONALITY:
CONTEMPORARY VIEWPOINTS; PSYCHOANALYSIS. Relevant
also are ACHIEVEMENT MOTIVATION; ATTITUDES;
COHESION, SOCIAL; CONFLICT; GROUPS;
LEADERSHIP; SYSTEMS ANALYSIS, article on PSYCHOLOGICAL
SYSTEMS; and the biographies of BRUNSWIK;
CASSIRER; HULL; TOLMAN.]


FIELD WORK


Field work is the study of people and of their 
culture in their natural habitat. Anthropological 
field work has been characterized by the prolonged 
residence of the investigator, his participation in 
and observation of the society, and his attempt to 
understand the inside view of the native peoples 
and to achieve the holistic view of a social scien- 
tist. A society can be said to provide a ready-made 
laboratory for the social scientist in somewhat the 
same way that the human organism serves the 
biologist. 

Field studies have long been the mark of cultural 
anthropologists and to a somewhat lesser degree 
of sociologists; they are increasingly being done by 
political scientists, social psychologists, and other 
social scientists. A difference exists between anthro- 
pology and sociology in the history, problems, and 
methods of field work. Today, however, the trend 
is toward similarity. 

Anthropologists began to do field studies toward 
the end of the nineteenth century, although Mor- 
gan’s study of the Iroquois appeared somewhat 
earlier. The major emphasis on anthropological field 
work has been primarily British and American, with 
such notable exceptions as the work of Thurnwald 
in Germany and of Lévi-Strauss in France. Among 
the first expeditions were those of Boas in British 
Columbia and of Haddon with his associates in the 
Torres Strait region of the Pacific. Their goal was 
to extend the boundaries of the knowledge of man 
by studying wide variations in his cultures. 

The publication of Malinowski’s Argonauts of 
the Western Pacific in 1922 revealed the great po- 
tentialities of field work. This study of Trobriand 
Islanders, among whom Malinowski had lived for 
almost three years, set new standards for field 
workers which continue to operate. Field work 
came to mean immersion in a tribal society—learn- 
ing, as far as possible, to speak, think, see, feel, and 
act as a member of its culture and, at the same 
time, as a trained anthropologist from a different 
culture. It is significant that this method was forged 
in the study of small, homogeneous tribal societies 
in which it would have been difficult for the investigator
to have avoided face-to-face relations. In recent
years the range in type and size of societies
studied by anthropologists has been much ex- 
tended. With this extension, new problems and new 
methods have developed, but certain others have 
remained constant. 


Beginning field work 


The anthropologist chooses the geographical and 
cultural area for his field project, studies the literature,
and, if the language has been recorded, learns
as much of it as possible before going into the field. 
Today, he will most likely also be interested in a 
specific problem and in social or culture theory, and 
he will endeavor to be au courant with the liter- 
ature. 

He then has to consider whether he goes to the 
field alone, with his family, or as part of a team. 
In the past the norm was the lone field worker; 
today it is common for the investigator's spouse 
and children to travel into the field. The research
team—a number of anthropologists or scientists 
representing different disciplines—is a new trend. 
Each method has its advantages and disadvantages. 
Being alone gives a greater intensity to the field 
experience and may provide more intimate data, 
since the field worker is thrown upon the native 
people for company. On the other hand, it has the 
disadvantages of loneliness and of being limited to 
what one person can accomplish. The family has 
an advantage in reducing loneliness, and a mate 
and children may be useful entering wedges into 
the community and in securing data from their 
own sex and age groups. A team obviously extends 
the range of data which can be collected. A disad- 
vantage of a family or team is that relationships 
with native peoples may be more difficult if one 
is quickly accepted and the other (or others) dis- 
liked. 

Whether the field worker goes alone or with his 
family or as a member of a team, approval for the 
project and, if possible, cooperation must be se- 
cured from those who have authority in the society. 
Authority may reside in a group of clan elders, a
chief, a tribal king, officials of a colonial govern- 
ment or an independent nation, the general man- 
ager of an industrial company, or the leaders of 
any other dominant economic, social, or political 
group. The questions of those in power, whoever 
they be, are usually concerned with whether the 
investigator threatens the status quo and whether 
he can be trusted not to reveal any information 
which they would consider harmful to the individ- 
ual or the community. The traditional position of 
the anthropological field worker is that he is there 
as scholar to learn rather than as an instrument of 
change and that he never betrays an informant. 
(In applied social science, anthropologists, sociologists,
economists, and others sometimes both
study and help to implement social change, but this
is a quite different type of field work [see, for example,
ANTHROPOLOGY, article on APPLIED ANTHROPOLOGY].)

The nod of approval from those in power is only 
the first step. The field worker must then gain the 
good will of the people he wishes to study. He has
to explain his presence to them, as he did to those
in authority, as simply and as honestly as possible. 
In general there is no substitute for honesty. In 
some societies, both tribal and modern, people may 
be flattered by someone being interested enough to 
travel a long way to find out about their customs 
and to learn their language. The anthropologist 
must from the beginning differentiate himself from 
other aliens of his race or culture, such as mission- 
ary, government official, trader, planter, or work 
supervisor, whom the indigenous people may have 
known. It is extremely important that they have 
opportunities to observe and to know the field 
worker. The nearer his house is to the hub of activ- 
ities, the easier it is for reciprocal observations and 
for easy social relations. The location of the house 
may also symbolically establish an important dif- 
ference between the field worker and other aliens. 

Gradually the field worker's role evolves. It usu- 
ally has many facets and will not be the same in all 
situations or for all field workers. He may begin by 
rendering certain services. In a tribal society he 
often brings material goods ranging from orna- 
ments to useful spades and knives. He may dis- 
pense simple remedies, such as aspirin and anti- 
malarial medicine, to those needing them. In 
modern societies he often accepts invitations to 
speak to meetings of teachers, to church congrega- 
tions, a chamber of commerce, and any other sig- 
nificant group that invites him. He is not only being 
helpful; he is making it easy for people to see and 
to know him. 


The first steps 


During the first month or so the field worker 
proceeds very slowly, making use of all his sensory 
impressions and intuitions. He walks warily and 
attempts to learn as quickly as possible the most 
important forms of native etiquette and taboos. 
When in doubt he falls back on his own sense of 
politeness and sensitivity to the feelings of others. 
He likewise has to cope with his own emotional 
problems, for he often experiences anxieties in a 
strange situation. He may be overwhelmed by the 
difficulties of really getting “inside” an alien culture 
and of learning an unrecorded or other strange 
language. He may wonder whether he should in- 
trude into the privacy of people’s lives by asking 
them questions. Field workers vary in their degree
of shyness, but most people of any sensitivity ex- 
perience some feelings of this type when they first 
enter a new field situation. 

Sooner or later, when the field worker has es- 
tablished rapport with the people and has learned 
how to handle his own anxieties, he establishes a 
routine of work. It is usually wise to begin with
relatively impersonal tasks. If he has not already
done so, learning the language is among the first 
essentials. Field workers naturally vary in their 
linguistic abilities, and languages differ in degrees 
of complexity. But learning any unrecorded lan- 
guage presents manifest difficulties for anyone. 
Margaret Mead (1939) has made the useful dis- 
tinction between using and learning the native lan- 
guage. The field worker frequently learns to speak 
and understand a language without learning it as 
a linguist would. In parts of Oceania where pidgin 
English is common, a combination of it and the 
native language may be the method of communica- 
tion. In Africa, the field worker may have to know 
the language of the former or present colonial 
power, a lingua franca such as Swahili, and as 
much as possible of a native language. Often he 
works through an interpreter, either all the time or 
on certain problems where his fluency in the appro- 
priate language is not sufficient for understanding. 
The status, sex, and personality of the interpreter 
may present difficulties in certain situations, and 
it may be useful, therefore, to have several inter- 
preters. No easy solution to the language problem 
exists. Ideally, the field worker should be an expert 
in the native language. Usually he simply does the 
best he can within the limitations of time and the 
inherent difficulties of the situation. 

Making a census is another early task which has 
the value of being impersonal and essential. It is 
necessary to locate people in space and to know 
the composition of households. As the field worker 
goes from one household to another, he also gets 
acquainted with the occupants. (In modern large 
societies, a government census is usually available.) 


Participation in the culture 


Participating and observing become an ever 
more important part of the routine. The field 
worker somewhat resembles a natural historian: 
he observes and notes, as far as possible, whatever 
comes within his range, even though he may not 
always know the relevance of all his observations. 
He follows long and devious sequences, such as 
those involved in initiation, marriage, and death 
rituals, which may be six or more months in prep- 
aration, and, at the same time, he observes the 
minutiae of daily life in which they occur. He ac- 
companies the people on their economic tasks— 
hunting, fishing, planting, cooking, and others. He 
listens to them converse and gossip when they are 
at ease and picks up new clues which he later fol- 
lows up. A woman anthropologist alone in the field 
can usually participate more fully than a man 
alone, because a man is generally restricted to 
being with other males. But even where social segregation
between the sexes is strict, a woman can
work with men as well as with women. No one 
fears her. 

The intensity of the field worker's participation 
varies from one situation to another and between 
investigators. Among the Nuer, Evans-Pritchard 
was given little choice. The Nuer were “persistent 
and tireless” visitors, in and around his camp all 
the time, and he suffered from a lack of privacy 
(Evans-Pritchard 1940). Some anthropologists par- 
ticipate in ritual dances, feasts, and similar social 
events; others limit themselves to taking notes. The 
field worker’s sense of the social situation and his 
personal desires and limitations dictate how much 
and when he will participate. 

Whatever the degree of the field worker’s partici- 
pation in the whole society, friendships with a few 
people develop, and they help him to find a niche 
in the community. It is these friends who often 
become his best informants. They may include 
servants or other employees, a chief, medicine man, 
and many others. Often, too, an informal adoption 
as a “son,” “daughter,” “sister,” or “brother” helps 
the anthropologist find his place and extends or 
deepens his range of knowledge. In the Company 
of Man (Casagrande 1960) provides profile sketches 
of key informants who were close associates and 
important teachers for each of the contributing 
anthropologists. 

When the field worker has become familiar with 
the social contours and feels more or less accepted, 
he begins to work systematically on such anthropo- 
logical problems as kinship systems, forms of mar- 
riage and residence, economic and political organi- 
zation, witchcraft and magical beliefs and practices, 
or any other aspect of life which is significant in 
the society and interesting to him. He asks ques- 
tions in structured and unstructured interviews 
and notes the measure of agreement or disagree- 
ment between the patterns which emerge from the 
answers and the actual behavior he observes. 
Usually, a discrepancy exists between the ideal and 
the real. Another method, used effectively by Oscar 
Lewis (1961), is the tape recording of long inter- 
views with different members of one family, result- 
ing in a significant humanistic account. Hortense 
Powdermaker (1962) had inter-African conversa- 
tions recorded by an African assistant, which re- 
vealed spontaneous tones of feeling and subtleties 
of African life. 

Unexpected and chance events, such as a quar- 
rel, often reveal facets of a culture which the field 
worker had not known existed or, perhaps, only 
dimly suspected. Then, too, lies may be as informative
on certain levels as the truth. For instance, in
a study in Mississippi, one of the major values of
middle-class white people—the straining for an 
aristocratic family background—was revealed in 
their consistent lying about their backgrounds 
(Powdermaker 1939). Failures and newcomers, 
who reveal the pressures which they resist or try 
to conform to, may be another excellent source of 
data. 

Whomever the anthropologist is interviewing or 
however he is participating, there must be a high 
degree of reciprocal communication between him 
and the people he studies. This need represents an 
important difference between the methods of the 
social and the natural sciences. The chemist and 
physicist do not communicate personally with mole- 
cules or atoms, but communication for the social 
scientist depends to a large degree on his psycho- 
logical involvement. The field worker faces a special 
problem, however, for he must be both detached 
and involved. If this dual role is an inherent part 
of the anthropologist’s personality or self image 
(strengthened by his training), he plays it spon- 
taneously and easily. As Goffman (1961) puts the 
problem, the degree of tension felt by a field worker 
corresponds to the level of congruence between 
spontaneous and unspontaneous (playing the rules
of the game) roles. 

The notion that involvement connotes a lack of 
scientific objectivity is mistaken. It is only when 
an anthropologist is aware of his involvements that 
he can then detach himself and observe. Hidden or 
unconscious involvements (positive or negative) 
are dangerous, for the obvious reason that the in- 
vestigator cannot step outside of them. An obses- 
sive identification with either the underdog or the 
top social levels limits the field worker’s communi- 
cation and his objectivity. In a highly stratified so- 
ciety it is important for him to move easily within 
its different segments. 

In addition to a capacity for open involvements 
and for becoming detached from them, personal 
qualities such as kindness, patience, tact, endur- 
ance, and the ability to “take” both loneliness and 
ambiguity are helpful. Other idiosyncratic charac- 
teristics may be useful in one field situation and 
not in another. 

Despite its reputation, the so-called culture shock 
is not often one of the problems of a trained anthro- 
pologist. First, he arrives with a general knowledge 
of the people and their culture through having im- 
mersed himself in the literature. Second, his pene- 
tration into the culture is so gradual that by the 
time he achieves any understanding of it, he is apt 
to take it for granted. In fact, he has to be on his 
guard against this tendency. Culture shock is more
likely to be experienced when he returns to his own
modern urban society after an extended period in 
a tribal one. 

The theoretical orientation of the field worker 
is significant from the beginning. Historical recon- 
struction, functionalism, a structural or cultural 
approach, and psychological anthropology are 
among the major frames of reference. Each influ- 
ences the field worker’s choice of problems, the 
type of data he collects, the kind of clues he picks 
up, some of his techniques, his hypotheses, and his 
interpretations. The collecting of data and theoriz- 
ing are not necessarily separate processes. As the 
field worker participates, observes, and interviews, 
much of the time he is also thinking of hypotheses, 
of comparative data, and of relevant theories; it is 
also true, of course, that problems and hypotheses 
frequently arise out of empirical data in the field. 
Because of the personal nature of field work, it is 
imperative that while in the field the anthropologist 
accurately record empirical data separately from 
hypotheses and interpretations and distinguish the 
impressions from the definitive data. Thus, he 
should tell us the number of informants he used; 
give details of their age, sex, and social status; and 
provide precise descriptions of the methods used in 
securing data. Scientific standards demand as com- 
plete honesty as possible and data that may be neg- 
ative as well as positive in terms of hypotheses. It 
should be possible for a reader to make different 
interpretations from the data. One of the most rigorously
honest presentations is Cora DuBois’s study
of the Alorese (1944).


New trends in field work 


The above points are relatively constant in all
anthropological field work. Among the new trends
are an emphasis on specific problems rather than
on holistic studies; working in larger and more
complex societies; a greater concern with sampling;
the use of sociological surveys along with the
traditional participation, observation, and
interviewing; team work; and a frank recognition of
the fact that the field worker is a human instrument
and part of the situation studied.
Team work has obvious advantages in working on
complex problems and in large societies. One of
the earliest cross-disciplinary teams in the field was
that of the anthropologist Clyde Kluckhohn and the
physician Dorothea Leighton, who studied the
Navajo (1946). An experiment in regional research
was made by a team of 12 social scientists, all
anthropologists except one historian and one political
scientist, who worked on political development
in Uganda and Tanganyika under the direction of
Audrey Richards (1960), then the director of the
East African Institute of Social Research. The 
members of this team worked independently. A 
comparative study of child rearing in six cultures 
by six research teams, each consisting of a man 
and a woman worker, under the direction of John 
and Beatrice Whiting (1963), represents still an- 
other kind of team. A more extensive cross-disci- 
plinary approach was used in a study (Rogler & 
Hollingshead 1965) of schizophrenia in Puerto 
Rico conducted by a sociologist, psychiatrists, an 
anthropologist, and social workers. As yet there 
have been relatively few field teams initiated by 
anthropologists which cover a large number of dis- 
ciplines. 

Anthropologists in the past were not unaware 
of the problem of sampling. An adequate sample 
might be almost impossible in historical reconstruc- 
tion. But in small functioning communities, the 
whole population of the village usually appeared 
in the field worker’s genealogical charts, and it was 
relatively easy for him to establish the frequency 
of forms of behavior without any elaborate statisti- 
cal methods. Today, working in larger societies, the 
anthropologist often employs assistants to make a 
random survey at the beginning of the field study 
to establish patterns, or toward the end to sub- 
stantiate or negate qualitative aspects of the re- 
search. But getting a truly random sample may pre- 
sent difficulties if the investigator and his assistants 
have no access, or much less access, to one group.

Another new problem arises in working in those 
complex contemporary societies which have dual 
power structures representing different ideologies. 
To work with both is decidedly not easy. There is 
also the problem of centricity: most anthropologists 
find it more difficult to overcome ideological cen- 
tricity than ethnocentricity. Training enables an 
anthropologist to study cannibalism, witchcraft, 
and other tribal customs with relative objectivity. 
It is less easy to be objective on issues which threat- 
en strong political or other social commitments. 
On the other hand, a person who has no commitments
would not be able to understand those of others.


Field work in sociology 

As they work on problems in complex modern 
societies, anthropologists come closer to sociology. 
Field work has long been the practice of sociology, 
although its history and methodology have differed 
from those of anthropology. In contrast with the 
early anthropologists, who traveled far for compar- 
ative materials, the first field workers in sociology 
were primarily interested in the reform of their own 
society, in which the industrial revolution and the
rise of an urban proletariat had brought many ur- 
gent social problems. In England, Beatrice Potter 
(later Mrs. Sidney Webb) and Charles Booth and, 
in France, Le Play did field studies among the poor 
through involvement with them, direct observation, 
and interviewing. 

American sociologists followed in the European 
tradition of being critics of modern society. Initial 
empirical research included the Hull-House Maps 
and Papers (1895) and W. E. B. DuBois’s The 
Philadelphia Negro (1899). Field work among the 
urban poor was developed at the University of 
Chicago by such men as Louis Wirth, Robert E. 
Park, and their many able students. They studied 
conditions among ethnic and racial minorities and 
such groups as criminals, hoboes, and prostitutes. 
These people were foreign, perhaps with an exotic 
quality, to many of the middle-class American so- 
ciologists. They explored at home, while the anthro- 
pologists of the same period were studying exotic 
peoples in distant lands. 

The traditional distinction in subject matter is 
weakening. Members of both disciplines and of 
other social sciences increasingly study “primitive” 
preliterate and modern literate societies. Problems, 
too, become interchangeable; studies such as those 
of social structure, communities, social change, and 
interpersonal relations (e.g., those involved in so- 
cial mobility, social conditioning, role playing) are 
common to both disciplines. 

A few sociologists, such as Robert and Helen 
Lynd and William Foote Whyte, in their respec- 
tive eminent studies, Middletown (1929) and 
Street Corner Society (1943), have used anthropo- 
logical methods and techniques as models. But in 
general, the trend in sociology has been to emulate 
the natural sciences as far as possible and to limit 
field problems to those on which quantifiable data 
can be secured, primarily through the use of sur- 
veys. In 1959 Nathan Glazer wrote, “The sociologist 
today—whether his field of interest is the commu- 
nity, criminology, marriage and the family, world 
politics, social classes, housing—is a man who asks 
people questions and then statistically analyzes the
answers to them. If he does not ask the questions 
himself, he hires some one else to ask them; or, if 
not that, he analyzes the statistics gathered by 
those who have asked questions—census-takers, social
workers, and others” ([1959] 1964, pp. 43-44).
The beginning of sociology in England coincided 
with the development of statistics there, and the 
relationship between the two is, thus, no accident. 

Many sociological field studies lack personal participation
and detailed observation; formal questionnaires
can be a barrier to establishing relations
between the field worker and the people he studies.
According to John Bennett and Kurt Wolff (1955), 
the typical sociologist attempts to have the detach- 
ment and objectivity of the natural scientist and to 
view the people he studies as objects or, at best, 
“subjects” rather than as fellow human beings with 
whom he enters into some kind of personal rela- 
tions. 

There seems to be a new trend in which anthro- 
pology and sociology change places methodolog- 
ically, at least to some degree. Increasingly anthro- 
pologists try to eliminate or deny the humanism of 
their approach and to select those field problems 
which they think lend themselves to a rigorous 
scientific method. Some anthropologists try to emu- 
late the methods of the natural scientists, but the 
imitation, unfortunately, is often of the mechanis- 
tic notions of causality common to nineteenth-cen- 
tury science. These have now been replaced by 
principles of indeterminacy and by probability 
curves. It was a sociologist, Melville Dalton, who 
pointed out this time lag. He wrote, “Influenced by 
the dogmatism of nineteenth century science, re- 
search methodology in the social and psychological 
sciences is now more cocksure than in the increas- 
ingly humbler physical sciences” (Dalton 1959, 
p. 273). 


The field worker as an individual 


A quite new development, not yet strong enough 
to be called a trend, is the recognition that the field 
worker is himself an inherent part of the situation 
studied and that his personal as well as his scien- 
tific reactions are an important part of the research 
process. Field workers, of course, have long known 
that they are human instruments of research, al- 
though some have tried to be, or to present an im- 
age of, faceless robots. But there has been a reluc- 
tance to recognize the scientific significance of the 
field worker's personality and his all-too-human 
characteristics. Relatively little space has been
given in publications to his mistakes, to the trial- 
and-error nature of some of his procedures, to the 
role of chance, to the influence of his reactions on 
a strange people and on their culture, and other 
such important personal factors. With a few excep- 
tions, anthropologists have included only a few 
paragraphs or pages on these points in the intro- 
ductions to their monographs or have relegated 
them to private conversations. After a long period 
of emulating natural scientists, some sociologists 
now recognize the peculiarly human characteristics 
of their discipline. In Sociologists at Work (Hammond
1964) 13 eminent sociologists give candid
personal accounts of how their field research projects
evolved, of their frustrations as well as their
achievements in the field, of what actually happened
rather than of what should have happened.
Stranger and Friend: The Way of an Anthropologist 
(Powdermaker 1966) gives probably the frankest 
and most detailed account of one anthropologist’s 
field studies, of mistakes as well as successes, of 
relationships between the field worker and the 
peoples studied, and of how her personality and 
training related to the type of work done.

The field is a laboratory in which the role of the 
investigator is significant and relevant to the study 
of the people and their culture, and obviously the 
more candid and revealing the field worker is about 
his role, the more scientific is the report and the 
more helpful it is to other investigators. The recog- 
nition of the significance of the personal charac- 
teristics of the field worker to his research confirms 
the point that field work is an art as well as a science. 


HORTENSE POWDERMAKER 


[See also ETHNOGRAPHY; INTERVIEWING; OBSERVA- 
TION. ] 
The work of John Neville Figgis (1866-1919)
in political philosophy constitutes the fusing of 
two interests and two careers. Born in Brighton, 
England, and educated at Cambridge University, 
he developed and matured as a student of history 
and political thought under the influence of Mait- 
land. Influenced also by another distinguished 
Cambridge teacher, Mandell Creighton, he recog- 
nized a call to the religious life, which led him 
to the Church of England and eventually to a 
monastic order of that church, the Community of 
the Resurrection. He continued writing and occasional
lecturing on both religious and political-historical
topics until his death.

In his historical works on political theory, Figgis 
laid greatest emphasis, in the tradition of Gierke 
and Maitland, on the importance of man’s corporate 
life as the foundation of the state. His Churches 
in the Modern State (1913) is a popular exposi- 
tion of the nature of corporate life and an applica- 
tion of his theory to the question of the nature of 
a church and its relation to the state. It exhibits
a passionate concern for freedom in the face of
the growing power and authority of the state. 
From his scholarly studies of the history of po- 
litical thought, especially that of the Middle Ages 
and the early modern period, Figgis concluded that 
there are two ways of understanding the corpora- 
tion. He assumed that a corporation is “a society 
of men bound together for a permanent interest,” 
and he recognized as corporations such bodies as 
the family, the club, the union, the college, and 
the church. Bodies other than the state must be 
understood either as creatures of the state or as 
real persons with lives of their own and an origin 
independent of the state. From Figgis’ point of 
view, the former approach is wrong, and the 
latter is true to the facts of history and social life. 


The monistic theory 


Figgis traced the theory that the corporation is 
a creature of the state to the concept of absolute 
sovereignty, which in turn he traced to the ab- 
stract idea of unity. Modern monism (the theory 
of the unitary or all-embracing state) derives, he 
suggested, from the ancient conception of the city- 
state as an organic unity and from the view de- 
veloped in Roman law of the position and authority 
of the emperor as sovereign. These notions were 
imported into the Roman Catholic church in the 
medieval period and developed (with only a slight 
setback—the conciliar movement in the fifteenth 
century) into the nineteenth-century doctrine of 
the absolute power and infallibility of the pope. 
In the course of this “crescendo of Papal claims,” 
attempts were made by some popes—for example, 
Boniface VIII, 1294-1303, and John XXII, 1316-
1334—to claim secular power as well as spiritual
power and authority. With the rise of national 
states after the Renaissance, these attempts failed, 
but a political theory was developed by Machia- 
velli, Bodin, and Hobbes that claimed for the ruler 
in the secular realm the same authority that the 
popes had been able to establish for themselves in 
the church. According to this theory, as Figgis 
described it, the sovereign of the state, whether 
one man, a few men, or all men, is the sole source 
of social authority and social life; no other society 
(corporation) has a life of its own. Each society 
in the state exists, carries on its affairs, and ex- 
ercises its authority over its members with the 
permission, tacit or express, of the sovereign. All 
such societies are merely creatures of the state;
they have no innate life of their own and no orig- 
inal right or authority. 

This theory of corporations is rejected by Figgis 
on two main grounds: first, that it does not square
with the facts of history; and second, that the implications
of this doctrine are inimical to the self-development
and freedom of the individual. The
evidence to support the first argument was devel- 
oped by Gierke in his monumental Das deutsche 
Genossenschaftsrecht. Gierke had shown that there 
is another way to conceive of corporations and 
even a spirit of corporate life. Figgis accepted 
Gierke’s approach and developed evidence of his 
own from his analysis of the situation in his own 
day of social groups vis-à-vis the state. He was 
especially concerned with the problem of the rela- 
tion of churches to the state in a period of religious 
heterogeneity. Put simply, the question is, What 
authority does a modern state (for example, Eng- 
land in the late nineteenth or early twentieth cen- 
tury) have over the churches that have been 
organized within its borders? (The problem of 
England is especially complicated because it has 
both an established church and unestablished 
churches, which are both national and interna- 
tional.) The state clearly has the right to prevent 
or punish violations of law. But does not a church 
have the right to determine its membership condi- 
tions, its ritual, and even its purpose without state 
interference? The issue had been met in the courts 
where some members of a church sued to pre- 
vent church leadership supported by a majority 
of the membership from amalgamating with an- 
other church (1913, pp. 18-20). The results of 
these cases pointed sometimes to the monistic the- 
ory (when churches were prevented from chang- 
ing) and sometimes to the other theory (when 
churches were left to themselves). Similar cases 
arose and continue to arise in connection with 
trade unions, and the issue has not really been 
settled.

The second argument against the monistic the- 
ory, namely, that its implications for individual 
freedom are repugnant, hinges on the demonstra- 
tion that there is a growing trend, independent 
both of the development of one-man rule and of 
democratic rule, toward the destruction of smaller 
organizations either by their consolidation into
larger ones or by their absorption into the state.
Whoever seeks freedom for the individual must condemn
this development because it cuts off from
more and more individuals their sources of social
life and their support against the might of the
state. It does not matter that the state is one which
professes popular sovereignty. The life of man is a
social life, and if ultimately there is no society
other than the state, man becomes a creature of
the state. Figgis did not live to see the rise of modern
totalitarianism in the third and fourth decades
of the twentieth century, but he did foresee it.
Much evidence could be found in the later period 
to confirm his fears. 


The corporation as a real person 


A more adequate theory of corporate life, Figgis 
held, sees any corporation (especially such a vol- 
untary association as a church) as possessing an 
inherent life of its own, generated out of the spirit 
and will of its members. He recognized the affini- 
ties of his approach to that of Rousseau. A cor- 
poration is a living force with powers of self- 
development. It has a life greater than the mere 
sum of the individuals composing it and is not 
merely a matter of contract. Each such body has 
its own individuality, its own ethos. The existence 
of such bodies is a “social fact” that may or may 
not be recognized by the state. Failure to recognize 
it leads to a condition akin to that of individuals 
in slavery. Recognition of the real personality of 
corporations by the state leads to a conception of 
the state as a communitas communitatum. 

Groups that arise in society out of the efforts of 
individuals to realize themselves do not develop 
without genuine authority over their members, and 
this authority manifests itself in a government 
that has legislative, executive, and judicial powers. 
The authority of the group comes from the con- 
sent of its members. The group carries on activities 
that are public for its members, but private in 
relation to the state. But Figgis realized that this 
distinction between public and private does not 
satisfactorily describe life in modern societies. Instead
of seeing society as a collection of individuals
confronting an entity called the “state” and
being unified by it, he saw society as a complex of 
individuals, groups, and corporations presided over 
by a rather special sort of group called the “state.” 
Figgis was part of that pluralistic movement that 
saw the state as one group in society having dis- 
tinctive functions, but one which was not in all 
ways superior to every other group, either in its 
claims on the individual or in its benefits to him. 
The state is a strong power that is needed to pre- 
vent injustice between groups and to secure to 
each group its rights. It does not create the groups, 
but in pursuing its own goals it has to regulate 
them, limit their activities, and settle conflicts 
among them. However, the resolution of conflicts 
has tended to produce, Figgis believed, unwar- 
ranted interference in the internal life of groups. 
His doctrine, reminiscent of that of Mill, is that the 
state has the right to limit the activities of groups 
only insofar as they impinge on other groups or 
individuals and that the internal life of the group 
is inviolable. For Figgis, this is the meaning of the
doctrine of the real personality of corporations, and 
it seems to stand or fall on the adequacy of the 
distinction between the internal (private) and the 
external (public) activities of groups. 


Although Figgis never developed a systematic 
political theory in any detail, he offered hints as 
to the proper point of view of an adequate politi- 
cal theory, and from these hints his readers may 
gain some idea of the sort of theory he would 
defend. The recognition of the real personality of 
groups, the importance of “self-realization” for 
groups, and the characterization of the state as 
the communitas communitatum suggest that his 
theory would derive from Gierke and Maitland on 
the one hand and from the British Hegelians on 
the other. His concern for freedom speaks of the 
influence of Mill and Acton. Yet he never does 
come to grips with the question, What is a state? 
If corporations are real persons, it is self-contradictory
to say that the state is a communitas communitatum,
for if the state is taken to be a real person,
what happens to other groups which are internal 
to it? If, however, the other groups are seriously 
thought to be independent of the state, in what 
sense does it continue to be a communitas? The 
doctrine of the real personality of corporations 
seems to introduce rigidities of its own that inhibit 
the clear observation of the facts of social life. The 
subsequent development of pluralism (political, so- 
cial, and economic) carried further the “discredit- 
ing” of the state and provoked, in turn, a reaction 
in the direction of statism that Figgis would have 
deplored. 

Henry M. Magid


[For the historical context of Figgis’ work, see PLURALISM
and the biographies of ACTON; GIERKE; GREEN;
MAITLAND; MILL. For other relevant material see
SOVEREIGNTY; STATE; VOLUNTARY ASSOCIATIONS.]
Cinema, which by now should mean both motion
pictures for theaters and films for television, is 
simultaneously an art and an industry because its 
means of creation and its systems of distribution 
involve expensive technology. This has been true 
since the early nickelodeon days of the public view- 
ing parlor, when five cents illuminated a 50-foot- 
long scene. Today’s distribution of color television 
into living rooms at the rate of $105,000 for a 50- 
second commercial only inflates the problem in 
terms of the costs involved and the relative quality 
of the product. 

What effect this artistic, social, and economic 
phenomenon has on the cultural values of both 
creators and viewers is an international problem 
of the first magnitude, since cinema is a world-wide 
language that can be grasped by the illiterate as 
well as by the educated. Nothing less than the con- 
trol of men’s minds and emotions is at stake. Since 
aesthetics, in this case, has become linked both 
with economics and with political action respon- 
sive to the uses of visual communication, cinema is 
the major cultural force in the second half of the 
twentieth century. 

The relation between those who create the pic- 
ture and those who pay its costs is a conflict be- 
tween the front office and the sound stage as old 
as Michelangelo’s quarrels with the papacy, or as 
complex as Shakespeare’s efforts to gain the good 
graces of his patrons. What is unique to cinema is 
the degree of interdependence between artistry and 
finance. 

The question cannot be dismissed as a matter of 
preference. What is art to one may indeed be an
industry to another, and vice versa. The insepa- 
rability of the equation makes it difficult to evalu- 
ate. The union of film art and the picture industry 
is a marriage contracted in hell, lived on earth, and 
rewarded in heaven. 

Problems of film production. A poet needs only 
paper and pen to be in business; a novelist has the 
option of adding typewriter and ribbons to these 
tools; a painter can start work once he has brushes, 
paints, and canvas; and a sculptor can fashion al- 
most any durable material. A musician is not 
obliged to own or rent an orchestra to compose a
symphony. But a motion picture director requires
a fully equipped studio to compose a motion pic- 
ture. An impoverished painter could steal hair from 
a cat to make a brush, but a film director will find
that cameras, lenses, lights, developing tanks, and 
editing apparatus are relatively inexpensive com- 
pared with actors’ salaries, set construction, or 
shooting on location. 

Cinema is the only art form solely dependent on 
machinery. The electronic extension of motion pic- 
tures via television is more expensive and involves 
more machinery. Never before in man’s artistic 
history has the artist been so subordinate to the 
means which shape his ends; and, conversely, 
never before has the patron or sponsor or producer 
been so dependent on talent for the utilization of 
his machinery for production, distribution, and ex- 
hibition. A camera collecting dust is not a money- 
making gadget, nor is a script collecting dust a 
motion picture that can be projected. 

A writer-director in cinema is like an orchestral 
conductor; his script, or score, cannot be experi- 
enced without the collaboration of myriad different 
craftsmen employing different instruments. Ideally, 
they should be hired for their specific competence 
in dealing with a special challenge. This is a prob- 
lem of correct casting of craftsmen, but under strict 
union regulations of seniority the writer-director
is obliged to accept the run-of-the-mill worker, who
is not generally inclined to support artistic adventures.
Also, the number of employees allotted to an
assignment has been prescribed by union rules. 
There are qualitative considerations aside from the 
quantity of workers, assistants, and collaborators. 
A studio may employ specialists as diverse as archi- 
tects (to design sets) and experts in Zen Buddhism 
(to do “research”).

Since the basic concepts of a motion picture 
must be filtered through so many individuals before 
the picture reaches its final version, and since these 
concepts are primarily visual, the relation of the
number of individuals involved to the degree of
visual coherence in the picture’s imagery is obvious.
A cinematic law might be postulated:
The greater the number of collaborators in
the creation of a picture, the less coherent its imagery
will be.

Production problems begin with the “source,”
which is either a story or a body of factual material.
The source may be a novel, a play, a short story, or
an “original treatment”; it may even be a social essay,
a theme, or a report on actualities.
The author of the source will usually have his ideas
and emotions translated into script form by other
writers; the final shooting script may be prepared
by yet another batch of writers, including dialogue
specialists. Such has been the prevailing pattern 
in Hollywood's film and television studios; the same 
is true of Italy, France, Yugoslavia, Hungary, 
Czechoslovakia, Poland, the Soviet Union, Japan, 
Nationalist China, and the Chinese People’s Re- 
public. In England and Sweden the tendency has 
been to use an individual writer, as is the prefer- 
ence in independent production in the United 
States. In fact, the writer-director personality as a 
single force explains the rising quality throughout 
the world of the feature film. Such figures as Ing- 
mar Bergman, Federico Fellini, Michelangelo An- 
tonioni, Jean-Luc Godard, François Truffaut, David 
Lean, Tony Richardson, Stanley Kubrick, and Sid- 
ney Lumet are directors who also serve as their 
own writers, in close cooperation with a scenarist. 

Final decisions over script in terms of plot and 
story values, over casting of stars and other actors, 
and over the editing of the shots were made by 
producers or executive producers during the years 
of the mass production system. Thus a studio like 
Twentieth Century-Fox, Metro-Goldwyn-Mayer, or 
Warner Brothers used to program its merchandise, 
during the heyday of studio monopoly, by planning 
ten westerns, five musical comedies, a dozen social 
dramas (with love interest), half a dozen situation 
comedies, five war dramas, and a flock of Grade B 
thrillers. Depending on how these products sold in 
the market place, the program for the following 
year was adjusted. The producer system reigned. 
In the 1930s and 1940s the producer was like a lord 
commanding his manor, but he was also in compe- 
tition with a neighboring producer under company 
management whose power also rose or fell accord- 
ing to the success of his product. Executive pro- 
ducers were like dukes ruling over a collection of 
lordly producers. Such duchy strongholds were set 
up at M-G-M, for example, under the supreme au- 
thority of the king, Louis B. Mayer. Industry ruled 
over art (Crowther 1957; Ross 1952).

For the creative talent there was no choice; 
Faust was compelled to sell his soul to the devil. 
Throughout the 1930s novelists and playwrights 
from the East (whose fate is perhaps epitomized 
by that of F. Scott Fitzgerald) accepted front office 
control; there was no other, since Samuel Goldwyn 
ran a one-man M-G-M. 

Rise and fall of the independents. The cele- 
brated case of United States v. Paramount ended 
the reign of the studio chiefs when the Supreme 
Court held that a producer cannot be an exhibitor. 
This bill of divorcement, known as the Anti-Block 
Booking Decision, separated the ownership of 
studios from ownership of movie theaters (Conant 
1960). The principle behind the Supreme Court’s
decision was the Sherman Anti-Trust Act of some 
seven decades earlier, a principle never before ap- 
plied to the distribution of an American art form. 
The 1907 Patents Trust case had already broken the 
production monopoly held by the Edison interests, 
so that dozens of new producing companies were 
founded at that time. These were consolidated or 
eliminated throughout the first two decades of the 
century, climaxing in the monolithic studios of the 
1930s and 1940s. Following the Anti-Block Booking 
Decision, independent production boomed, mainly 
through stars and directors forming their own com- 
panies. It would naturally be assumed that the 
liberation of talented creators from the assembly 
line production system would permit more auda- 
cious pictures with fresh plots and deeply delineated 
characterizations. Such hopes have proven halluci- 
natory. Although the economic base shifted in the 
1950s, the cultural superstructure remained solidly 
attached to box office values. Men of talent had 
been conditioned by formula, and upon risking 
their own money and reputations, they played the 
new game in the safe style of the old one. 

In the 1960s no longer does the producer repre- 
sent the banker; he may be the same person. Or 
the actor may be his own director, and the writer 
his own producer. But the Muse Unchained can 
inspire only those who are capable of appreciating 
freedom and adventure. On the whole, the product 
of a company dominated by an actor or director 
cannot be distinguished from the run-of-the-mill 
product of a major manufacturer. The suspicion 
grew that Faust couldn’t deliver any more than the 
devil. To give the devil his due, it must be said that 
the banker began to suspect that the artist (actor, 
director, or writer-producer) was not interested so
much in creating a new art as in accumulating 
new money. 

Films and audiences in the 1960s. The artistic 
bankruptcy of Hollywood existed before the coming 
of television. There were, of course, a few good 
pictures made during the early 1950s, among which 
High Noon, Shane, On the Waterfront, and Marty 
were outstanding. But bad money makes more bad 
money—all, unfortunately, of an identical green. 
Major manufacturing companies took on serials 
for TV; the Saturday serial of silent days became 
nightly, national, and network. Artistic values have 
continued to suffer, while the major portion of 
many studios’ income has come from the sale or 
rental of previous features to television, the pro- 
duction of serials and features for television, and 
the rental of studio space and facilities to television. 

Hollywood in the mid-1960s continues at an ar-
tistic low level, with an accompanying hostility
toward serious pictures. Artistic initiative has 
shifted from Hollywood to abroad. The American 
motion picture audience has been fragmented, 
thanks to television. The fragmentation is quite 
similar to the division in other entertainments, such 
as the theater, books, and magazines. The poten- 
tial market for quality pictures is at an all-time 
high; for instance, there are over four thousand 
film societies at American universities, with an 
attendance of over 2.5 million. Nevertheless, the 
old answers to the problem of quality continue in 
force. The “blockbuster” produced by the major 
studio—one thinks of Ben Hur, Spartacus, Guns of 
Navarone, Cleopatra—has been an old answer. 
Independent production, which has given us such 
pictures as The Bridge on the River Kwai, Anatomy 
of a Murder, and Lawrence of Arabia, is still an 
old answer. 

What is new is that the fragmented audience 
may be as small as that of art houses or film socie- 
ties, or as large as that of national circuits. More- 
over, producers realize that there is less financial 
risk in a good low-budget film than in a good high- 
budget film. Lawrence of Arabia, for example, did 
not make the profit, in proportion to its costs, that 
was made by Tom Jones. In Europe, where govern- 
ments frequently underwrite a portion of produc- 
tion costs, financing is both less complex and more 
easily arranged; dual producerships involving part- 
ners from different European countries assure 
lower costs. Among the major film-producing coun- 
tries, only the United States does not in some meas- 
ure subsidize film production. Also, there is greater 
artistic freedom in Europe, where the film is rec- 
ognized as a director's medium. 

Ingmar Bergman, more than any foreign director, 
has demonstrated that artistic independence can 
collect, however modestly in his case, at box offices. 
Of American producer—directors, Stanley Kramer 
has been the most successful in proving that po- 
litical subjects can also collect; his The Defiant 
Ones, On the Beach, Judgment at Nuremberg, and 
Ship of Fools all dealt with unmistakably political 
themes, and all made money. It is possible that 
successes like these may slowly be encouraging the 
major producers to abandon their reliance on time- 
worn formulas. For instance, a “sleeper” like David 
and Lisa—a small-budget picture with virtually 
unknown actors that dealt with the theme of men- 
tal illness with unusual realism—may have influ- 
enced one of the more progressive major studios 
to distribute Love With a Proper Stranger, in which 
a young girl’s love affair was treated with an ab-
sence of conventional moralizing that was quite
new to the American screen. In this way even a
few examples of artistic boldness may have an 
impact on the entire industry. At any rate, by the 
late 1960s the foreign director, the native producer, 
and the major releaser were all influencing and in 
turn being influenced by a more enterprising pro- 
duction outlook. 

What was hopefully different about this outlook 
was a greater reliance on the individual. Stanley 
Kubrick was able to make Dr. Strangelove, the most 
independently audacious American film since Cit- 
izen Kane, because the distributor who paid for the 
production (Columbia) did not have script-approval 
rights. The result was a uniquely stylized and satiri- 
cal film that could not otherwise have been made 
and that in certain categories achieved box office 
records. Carl Foreman, who enjoyed script freedom 
for The Victors, did a controversial picture that was 
more successful financially than D. W. Griffith’s 
masterpiece Intolerance, which had to wait several 
decades for its proper recognition. It is clear that 
the fragmented movie audience of the 1960s can 
be reached if the proper means are employed. 

Of gods and bankers. If a signature, as John 
Houseman terms it, is now more possible on a film 
than before, then the artistic prospects appear good. 
In Europe, films are advertised through the name 
of the director as well as the stars. This emphasis 
on basic creative talent is beginning to receive rec- 
ognition in America, but only a few directors, such 
as George Stevens, John Huston, and Stanley 
Kramer, are widely known to the public. Stanley 
Kubrick, Sidney Lumet, and John Frankenheimer 
are relatively unfamiliar except to younger, serious 
audiences. The clearer the director's own contribu-
tion becomes, the more likely he is to be recognized
as a factor in box office receipts and the greater 
the support he is likely to gain from the bankers 
who finance motion pictures. It is not so crucially 
important that the mass of people can’t tell one 
[...] from another, except on wheels; the gentle-
men who own and control the apparatus of manu-
facture and distribution know who is most likely
to [...] profits on their loans. Ingmar Bergman
has [...] offers from every major American
[...] apparatus. That he was wanted makes
[...] for younger directors to enter doors. Fel-
lini’s financial success in Europe with La Dolce
Vita gave him the opportunity to be his own boss
after years of hocking his soul to Roman
[...]. The artistic and financial success of 8½
permitted him to exercise his freedom with wider
[...] in Giulietta degli Spiriti, but not, unfortu-
nately, with similar success. It is doubtful whether
one creative person is capable of having the
judgment to control objectively the writer as well
as the director in him. Both Wall Street and the 
studios should take note that the single gifted in- 
dividual is sufficient unto himself in any art except 
the film. Indeed, the varied talents of the writer— 
director—producer are inherently at odds with each 
other. Even so catholic a genius as Leonardo da 
Vinci would have found himself hard put to be the 
following (all in his own image): writer, director, 
producer, editor, promoter, manager of a seven- 
ring circus, handholder, advertiser, salesman, and 
psychoanalyst for actors. It is excessive to expect
so much, even from a cinematic god. 

The writer in him will tell the director how to 
shoot, but the director will prevent the editor in 
him from trimming shots before the saturation 
point. Conversely, the promoter in him will cast 
actors or actresses who have box office appeal but 
are not necessarily suitable for the characters they 
are to portray. When all these talents work well 
together in one body and one mind, we have the 
rarest of exceptions: a work of art that is a finan- 
cial success. Who can deny how different the sec- 
ond half of Lawrence of Arabia might have been 
had David Lean, a truly gifted director, been able 
to maintain an artistic coherence? Or—conversely 
—how much more gratifying Cleopatra could have 
been for the mass audience had an administrative 
intelligence controlled excesses of writing and 
direction. 

Casting of artistry by financial authority is as 
important for total achievement as the casting of 
financial acumen by the creator. Louis B. Mayer 
is as dead as Erich von Stroheim, the director he 
defeated in the cutting room when Greed was mu- 
tilated. History is always on the side of the creator, 
but this is of little consolation during periods of 
creative struggle. Stroheim, who never had the 
opportunity to cast bankers, would soon be a cine- 
matic god if he were alive in today’s market place. 

Nevertheless, living gods are rare in cinema. 
Orson Welles was once such a demon, Federico 
Fellini is today in Italy, Ingmar Bergman in 
Sweden, and Akira Kurosawa in Japan. Perhaps 
the greatest of them all is Luis Buñuel, an enig- 
matic genius who has been making and breaking 
rules for forty years. No American director can be 
said to enjoy this stature. Those who are their own 
producers—John Ford, Elia Kazan, Stanley Ku- 
brick, George Stevens, Billy Wilder, William Wyler, 
Fred Zinnemann—obviously produce the better 
American pictures. Should this trend expand 
through one fragmented audience into others, it 
could parallel the first golden age, the years of 
D. W. Griffith, Thomas H. Ince, Mack Sennett, 
Charles Chaplin, Douglas Fairbanks—when direc-
tors did their own casting of bankers. The question
of for whom a picture is intended is more im- 
portant now than previously, primarily because art 
and industry are wedded as never before. It is a 
different kind of liaison, a marriage made in 
heaven, lived in hell, and rewarded on earth. 

The television monopoly. In the nonfragmented 
audience of television viewers, the film writer— 
director lives in a serfdom unparalleled in the his- 
tory of creativity in America. Since television has 
become the center of every home and the heart of 
the advertising world, the financial prizes are 
tempting beyond imagination. Not even in the dec- 
ades when Hollywood's monetary rewards were 
reckoned as phenomenal has there been so much 
money paid out to so many people for so little 
talent. 

Following the exposure of the rigged quiz shows, 
the TV networks assumed programming control in 
1960. Between that year and 1966 network profits 
rose from about $21 million to more than $80 
million. The prime time between 7:30 P.M. and 
10:30 P.M. is controlled by the three networks, who 
produce the programs on film or tape, either di- 
rectly or through affiliates or associates, and sell 
the time and costs to the sponsor. Thus the net- 
works act as both producers and distributors, in 
violation of the Sherman Anti-Trust Act and in 
direct contravention of the United States v. Para- 
mount Supreme Court ruling. Although the attorney 
general of the United States could go to court and 
win a decree that would separate television pro- 
gram production (90 per cent of which is on film 
or tape and comes out of Hollywood) from tele- 
vision program exhibition, no action is taken. The 
advertisers remain in show business and get more 
powerful every year, though they are, in turn, de- 
pendent on the networks for time slots and pro- 
grams. The competition between advertisers for 
prime time strengthens the monopoly power of the 
networks. 

Hollywood film producers of programs for sale 
to the networks claim, and rightly so, that they 
cannot get their shows on television unless they 
invite a network to participate in (1) a portion of 
the copyright ownership, meaning a share of pres- 
ent and future profits; (2) domestic syndication 
rights, which are the major profit source in book- 
ing films into local television stations; (3) foreign 
syndication rights, which are increasingly profit- 
able and are cushions for future income during a 
producer's dry period or his old age. “Residuals” — 
the profit from replays—are highly lucrative in the
rebooking of such popular serials as Gunsmoke,
Bonanza, I Love Lucy, and dozens more. 

The box office standard of judgment for motion 
pictures has been replaced by the television rating 
systems, which estimate, with varying degrees of 
inaccuracy, the alleged millions viewing a given 
program at a given time. Although quality drama 
and public affairs documentaries may have audi- 
ences in the millions—larger than audiences in the 
past who viewed such programs in theaters or class- 
rooms—these films are unceremoniously scuttled 
in favor of more shows with mass appeal. The 
lowest common denominator ensures the adver- 
tiser of the widest market for his sales pitch. Thus 
Playhouse 90 and Matinée, which provided quality 
entertainment and a proving ground for fresh act- 
ing, writing, and directing talent, were replaced by 
soap operas and quiz shows. Protest letters went 
unheeded; economics wrote the ticket. 

The four major sources of sponsorship—the auto- 
mobile, cigarette, drug, and soap industries—are 
in competition among themselves for prime time, 
and so are the companies that make up each of 
these industries. The packager of filmed shows for 
television syndication knows in advance what kind 
of story his writers need write, what sort of flashy, 
attention-arresting direction his directors should 
deliver, and what name stars will attract ratings. 
What this attitude does to the quality of programs 
can be seen nightly. Newton Minow’s “vast waste- 
land”—the term he used as chairman of the Fed- 
eral Communications Commission (FCC) in de- 
scribing their world to the National Association of 
Broadcasters—has become in a few years an ex- 
panding Sahara with an occasional oasis on a 
Sunday afternoon.

Proposals for federal regulation. The cultural 
future of film in America for the mass audience 
lies in the hands of the FCC, which, however, is 
disinclined to use the powers given it by Congress 
to police the abuses of both stations and networks 
in their violations of frequency and length of com- 
mercials and in their failure to provide the statu- 
tory proportion of public service programming. 
Since a third of the Congress is personally inter- 
ested in commercial returns of local broadcasting 
stations and/or in stock ownership in broadcasting 
companies, there has been no effort to strengthen 
the spine of the FCC. The Johnson administration 
is known as the broadcaster’s ally in Washington. 

Nonetheless, the proposals of the FCC to control 
and foster competition in television program pro- 
duction and procurement of programs should be 
noted for the sake of their historical interest. The
proposed rules would prohibit network corporations
from (1) engaging in syndication in the United 
States or distributing independent programs for ex- 
hibition outside the United States; (2) acquiring 
syndication and foreign sales rights in programs 
produced by other persons and licensed directly to 
the network corporations for exhibition; (3) ac- 
quiring rights to share in the profits from syndica- 
tion and foreign sales of such programs. They 
would also require network corporations to divest 
themselves of distribution and profit-sharing rights 
in domestic syndication and overseas sales of which 
they are presently possessed.

To achieve these ends the proposed FCC rules 
would set a limit on undue concentration and would 
stimulate competition. A network could not offer 
a weekly evening program schedule in which more 
than 50 per cent of the time, or a total of 14 hours 
per week, whichever is greater, is occupied by pro- 
grams produced by the network or in which it has 
acquired the first-run license from an independent 
producer. This stricture is exclusive of newscasts 
and special news programs and sustaining pro- 
grams. The net result of the rule—if ever passed 
and rigorously administered—would be to make 
prime time available every evening for the exhibi- 
tion of some programs in which the network corpo- 
rations have no financial or proprietary interests. 
The market would then be broadened for the inde- 
pendent program producer and competition among 
such producers would be encouraged. The films
would then reflect the program judgments not only 
of the network corporations but also of a large 
number of competitive elements who wish to reach 
the American people through television. 

What this ruling would do for the television audi-
ence is what the Anti-Block Booking Decision did
for the motion picture audience, namely, encour- 
age the tastes of the multiple public. This would 
make for a more democratic atmosphere in a plu-
ralistic society.

The aesthetics of film. The closed market place
of [...] and ideas has limited experiments with
[...] content while indulging technical maneu-
vers that hook and hold the attention of the majority.
Television aesthetics, catering to Nielsen ratings,
has emphasized excessive movements of
the camera, excessive pacing to shots, and
[...] angles or compositions—all su-
perimposed on a superficial story line and on thin
characterizations. On the other hand, in the open
market place of international film production for
[...] the aesthetic advances are remarkable, as
in the color utilizations, for example, of An-
tonioni in Deserto Rosso and Fellini’s Giulietta degli
Spiriti. Alain Resnais, Jean-Luc Godard, and Fran- 
çois Truffaut fragment time and space in ways 
which enhance theme, plot, and characters. 

Cinema aesthetics is being furthered through the 
works of such gifted artists rather than through 
the theories of critics, many of which are unrelated 
to contemporary cinema. For example, Kracauer 
(1960) has revealed the realistic nature of photog- 
raphy as a phenomenon affecting credibility; Law- 
son (1964) has emphasized the audio-visual na- 
ture of cinema that makes it a new art form. In 
contemporary cinema it is clear, as never before, 
to what degree movements and light affect mean- 
ing. This approach comes closer to the sui generis 
possibilities of cinema, which set it apart as a lan- 
guage, a craft, and an art. Knowledge of these pos- 
sibilities is helpful in assisting the viewer to analyze 
the factors affecting his reactions and his judg- 
ment. With such a “grammar of cinema” at his 
fingertips, he is armed to withstand the hourly as- 
sault of TV programs and the clever techniques of 
popular film productions. This grammar, rooted in 
movements and light, finds that the frame, with 
its composition and inner action and frame move- 
ment; the shot, with its variety of frame move- 
ments in conjunction with edited movements be- 
tween shots; and the edit, with its various jugglings 
of time through the juxtaposition of shots, comprise 
what might be called the “seven faces of time” 
(Gessner 1965). 


Unless in the second half of the 1960s the fed- 
eral government of the United States moves to pro- 
tect its citizens in their free access to the best in 
cinema, the American people will continue to be 
denied full participation in the most dominant art 
of the twentieth century. In practical terms, such 
protection can be achieved through breaking the 
TV monopolies and by offering government subsi- 
dies for production, as is already the practice in 
Sweden, France, Italy, and the United Kingdom. 
The British Broadcasting Corporation (BBC), sub- 
sidized by government funds and a tax on receiv- 
ing sets, is a particularly good example for study, 
since it is wholly free in making creative judg- 
ments. Some of the more interesting films produced 
anywhere emerge under BBC sponsorship. In the 
United States, only the National Educational Televi- 
sion approximates, on a smaller scale, the BBC ap- 
proach, but not with the BBC’s imaginative audac- 
ity. Sweden taxes every seat in its movie theaters 
to support a cinema training program for talented 
youth and to subsidize experimental productions
not designed for box office success. These examples,
it is to be hoped, might influence the eventual crea- 
tion in America of a national film academy which 
would train and produce youthful talent, and of 
an audacious cinema in the tradition of the country 
where the art and industry were born. 


ROBERT GESSNER 


[See also COMMUNICATION, MASS; DRAMA.] 
FINANCIAL INTERMEDIARIES


Financial intermediaries issue (indirect) debt 
of their own to buy the (primary) debt of others. 
Their issues attract funds from alternative expend- 
itures by nonfinancial spending units on consump- 
tion, tangible investment, or primary debt. Their 
lending directs the flow of funds to expenditure by 
borrowers on consumption, tangible investment, or 
primary debt. They intermediate between the 
sources of funds that flow to them and the ulti- 
mate users of these funds. 

The intermediaries may be identified by their 
balance sheets, which show a high proportion of 
financial to tangible assets and of indirect debt to 
equity. Their income statements report high ratios 
of income from interest and dividends to total in- 
come and of expense for interest and dividends to 
total expense. 

There are bank (monetary) intermediaries and 
nonbank (nonmonetary) intermediaries. The for- 
mer are commercial banks and the central bank, 
together constituting the monetary system, which 
issues indirect debt, subject to unique regulations, 
in the form of money. The indirect debt which is
issued by nonbank intermediaries is not used as a 
means of payment. 


Both types of intermediary are to be distin- 
guished from other institutions that transmit funds 
from ultimate lenders to ultimate borrowers. These 
other institutions do not issue their own indirect 
debt in soliciting funds. They include security 
dealers, brokers, and exchanges. 

Types of intermediary. The principal nonbank 
financial intermediaries in the United States are 
the following: 


Depositary intermediaries
  Credit unions
  Mutual savings banks
  Savings and loan associations
Insurance and pension intermediaries
  Casualty and miscellaneous insurance companies
  Fire and marine insurance companies
  Fraternal insurance organizations
  Government retirement, pension, insurance, and
    social security funds
  Group health insurance
  Private life insurance companies
  Private noninsured pension funds
  Savings bank life insurance departments
Finance companies
  Factors
  Mortgage companies
  Personal finance companies
  Sales finance companies
Investment companies
  Closed-end companies
  Face-amount certificate companies
  Industrial loan companies
  Open-end companies
Agricultural credit organizations
  Federal land banks
  Livestock loan companies
  National farm loan associations
  Production credit associations
Government lending institutions
  Banks for cooperatives
  Federal Home Loan Banks
  Federal intermediate credit banks
  Federal National Mortgage Association
  Federal Savings and Loan Insurance Corporation


The list is not complete; it omits such domestic
institutions as the American Express Company,
the Export-Import Bank, small business invest-
ment corporations, and pawnbrokers, as well as
international intermediaries in which the United
States participates, including the International Mon-
etary Fund and the International Bank for Recon-
struction and Development. (Detailed classifica-
tions of intermediaries in the United States may be
found in the following sources: “A Quarterly Pre-
sentation of Flow of Funds . . .” 1959; Goldsmith
[...]; Goldsmith & Lipsey 1963, vol. 1,
pp. 27-30.)


Nonbank intermediaries may be classified in 
many ways. In terms of proprietorship, they may
be governmental or private, and the private organ- 
izations may be stock, unincorporated, mutual, or 
cooperative. They may be primary, dealing with 
the general public, or secondary, dealing with other 
financial institutions. Grouped according to assets, 
they may be lenders principally on real estate, for 
example, or agricultural products or consumer 
durables. Grouped according to liabilities, they may 
borrow mainly at short term or at long term, and 
their obligations may or may not carry insurance 
benefits. They pay their creditors interest, divi- 
dends, capital gains, or indemnities. They vary in 
regional distribution, rate of growth, cyclical sta- 
bility, degree of concentration, and tax status. Each 
of them reflects an opportunity for private enter- 
prise to profit from transmission of funds between 
lenders and borrowers or a governmental desire to 
supplement private financial arrangements. 

Growth in the financial system of the United 
States has consisted partly in proliferating types 
of intermediary. At the beginning of the nineteenth 
century, commercial banks and private insurance 
companies were available for the deposit and bor- 
rowing of funds. Savings banks and savings and 
loan associations were operating before mid-cen- 
tury, and by 1880 there were mortgage companies. 
The pace and quality of economic growth in the 
first two decades of the twentieth century were 
especially congenial to financial innovation: to this 
period one may credit the postal savings system, 
credit unions, finance companies, investment com- 
panies, and federal agencies for agricultural credit. 
Pension funds and various additional federal lend- 
ing agencies, especially in the area of mortgage 
finance, were generated by the circumstances of 
the 1920s and 1930s. International intermediaries 
have been the notable innovations of the 1940s and 
1950s (Goldsmith 1958, chapter 4; Kuznets 1961, 
pp. 304-305). It is one aspect of relative economic 
maturity that the pace of innovation in financial 
intermediaries has diminished. Existing interme- 
diaries introduce adaptations in service which seem 
to provide a sufficient flow of funds at full employ- 
ment and at an acceptable rate of growth in na- 
tional income. 

Dimensions of intermediation in the U.S. At 
the end of 1963, financial assets held by nonbank 
intermediaries amounted to one-fifth of all finan- 
cial assets in the national balance sheet—to about 
$510,000 million in an aggregate of $2,420,000 million.
In terms of their portfolios, they were three-fifths 
larger than the monetary system. The bulk of 
their assets, approximately $360,000 million, was 
financed by indirect debt to consumer households 
and nonprofit organizations. This debt amounted
to one-third of the financial assets in those sectors. 
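
The proportions stated above can be cross-checked with a little arithmetic. The following Python sketch is an illustration only: it takes the two dollar figures and the three fractions given in the paragraph and derives the totals they imply. The fractions are rounded in the text, so the implied totals indicate orders of magnitude and are not reported data; the variable names are ours.

```python
# Figures stated in the text, in millions of dollars (end of 1963).
nonbank_assets = 510_000          # financial assets of nonbank intermediaries
household_financed = 360_000      # portion financed by indirect debt to households

# Fractions stated in the text (rounded there, so results are approximate).
share_of_all_financial_assets = 1 / 5
excess_over_monetary_system = 3 / 5
share_of_household_assets = 1 / 3

# Totals implied by the stated fractions.
implied_total = nonbank_assets / share_of_all_financial_assets
implied_monetary_system = nonbank_assets / (1 + excess_over_monetary_system)
implied_household_assets = household_financed / share_of_household_assets

# Share of intermediary assets financed by households ("the bulk").
household_share = household_financed / nonbank_assets

print(f"implied national total:      {implied_total:,.0f} million")
print(f"implied monetary system:     {implied_monetary_system:,.0f} million")
print(f"implied household holdings:  {implied_household_assets:,.0f} million")
print(f"household-financed share:    {household_share:.1%}")
```

The last line makes the phrase "the bulk of their assets" concrete: on the stated figures, a little over two-thirds of intermediary assets were financed by households and nonprofit organizations.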

The distribution of financial assets among the 
types of nonbank intermediary is unequal. Inter- 
mediaries in the insurance and pension categories 
accounted for one-half of all intermediary assets 
at the end of 1963, savings and depositary institu- 
tions for one-third, and the remainder was widely 
dispersed among other categories. 

Nonbank intermediaries do not grow at a uni- 
form rate. The reason is that they sell indirect debt 
in specialized forms to different classes of savers, 
buy primary debt at different terms from different 
classes of investors, encounter dissimilar regula- 
tory restraints and stimuli, and experience diverse 
changes in technical conditions of producing the 
service of intermediation. Each responds uniquely 
to the phases of short and long cycles in real 
economic development; to changes in life expecta- 
tion and age distribution of the population; to 
variations in the price level, in the distribution of 
income between sectors, and in relative growth 
rates for various kinds of real capital. During a 
recent decade, 1948-1958, growth among inter- 
mediaries varied from 521 per cent for credit 
unions to 35 per cent for mutual savings banks 
(Goldsmith et al. 1963, vol. 2, pp. 114-115; see 
also Friend 1963, pp. 666-667; 1964, pp. 16-45). 
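
Ten-year growth figures of this kind are easier to compare when converted to compound annual rates. The Python sketch below assumes that "growth of 521 per cent" means a total increase over the decade (an ending level 6.21 times the starting level) and that 1948-1958 spans ten years. Both readings are assumptions, and the function name is hypothetical.

```python
def compound_annual_rate(total_growth_pct, years):
    """Convert a total percentage increase over `years` into the
    equivalent constant annual growth rate."""
    ending_ratio = 1.0 + total_growth_pct / 100.0
    return ending_ratio ** (1.0 / years) - 1.0

# The two extremes quoted in the text for 1948-1958.
credit_unions = compound_annual_rate(521, 10)
mutual_savings_banks = compound_annual_rate(35, 10)

print(f"credit unions:        {credit_unions:.1%} per year")
print(f"mutual savings banks: {mutual_savings_banks:.1%} per year")
```

Under these assumptions the two extremes correspond to roughly 20 per cent and 3 per cent a year.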

Theory of financial intermediation. Nonbank 
financial intermediaries participate in four mar-
kets. They are net buyers on markets for primary
securities; on markets for money balances—to be 
held as reserves of liquidity either voluntarily or 
by regulatory rule; and on markets for productive 
factors—labor in particular, but also capital goods 
and land. They are sellers on markets for their 
own issues of indirect debt—in such forms as
savings and loan shares, savings bank deposits, 
and pension claims. (See Patinkin 1956.) This 
last market will be discussed first. (We neglect 
here operations by intermediaries on foreign ex- 
change markets and the bearing of intermediation 
on international balances of payments.) 

Markets for nonbank indirect financial assets 
(NIFA). Demand for NIFA, issued by nonbank 
intermediaries as their own indirect debt, comes 
mainly from consumer households. Consumers 
demand NIFA as a component of their personal 
wealth along with such other components as 
money balances, primary securities, business 
equity, housing, and durable goods. NIFA are rela-
tively more important in the personal wealth of
consumers at medium income levels than of con-
sumers at extremes of the income distribution.


Consumers add to their portfolios of NIFA by 
saving, borrowing, and displacing funds from al- 
ternative assets. NIFA qualify as a superior, or 
luxury, good in the consumer budget. 

Economists have experimented with many forms 
of demand function for NIFA (Brown & Friend 
1964, pp. 125-128; Tobin & Brainard 1963). De- 
mand rises with the real rate of interest paid by 
intermediaries relative to rates of return on other 
assets. It rises with permanent disposable income, 
and apparently it responds to intermediaries’ ad- 
vertising. It is depressed by risks involved in 
holding NIFA and increased by insurance against 
these risks. These are typical arguments in demand 
functions for NIFA, but others are relevant for 
particular assets. A stock market boom intensifies 
demand for shares in investment funds. Age dis- 
tribution of consumers is significant to demand 
for insurance. Growth in union membership en- 
hances demand for uninsured pension fund claims. 

Demand for NIFA is confronted by conditions 
of supply—a supply function for individual inter- 
mediaries and for the industry. (Studies of the 
supply function include Hensley 1958; Trade As- 
sociation Monographs for the Commission on 
Money and Credit 1962.) The supply of NIFA 
offered by intermediaries varies inversely with the 
rate of interest on NIFA. At each such rate of in- 
terest, supply tends to rise as rates of interest on 
the primary securities that intermediaries hold rise. 
It is the spread between primary (lending) rates 
and NIFA (borrowing) rates that creates oppor- 
tunity for profit in intermediation. Since, in the 
long run, the primary rate tends to vary with 
the marginal productivity of tangible wealth, the 
supply of intermediation is responsive to oppor- 
tunities for real investment. 
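
The interaction of the demand and supply conditions described above can be sketched numerically. The Python example below is a minimal illustration, not a model taken from the studies cited: the linear functional forms, the coefficients, and the function names are all hypothetical. Demand rises with the NIFA rate relative to the return on other assets; supply falls with the NIFA rate and rises with the primary rate; the market-clearing NIFA rate is found by bisection.

```python
# Hypothetical linear forms; coefficients are illustrative, not estimates.

def nifa_demand(r_nifa, r_other=0.03, income=100.0):
    """Consumer demand for NIFA: rises with the NIFA rate relative to
    the return on other assets, and with income."""
    return 0.5 * income + 2000.0 * (r_nifa - r_other)

def nifa_supply(r_nifa, r_primary):
    """Supply of NIFA by intermediaries: falls as the NIFA (borrowing)
    rate rises and rises with the primary (lending) rate."""
    return 40.0 + 3000.0 * (r_primary - r_nifa)

def equilibrium_nifa_rate(r_primary, tol=1e-12):
    """Find the NIFA rate at which demand equals supply, by bisection
    on the interval between zero and the primary rate."""
    lo, hi = 0.0, r_primary
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        # Excess demand increases with the NIFA rate in this sketch.
        if nifa_demand(mid) - nifa_supply(mid, r_primary) > 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2.0

for r_primary in (0.06, 0.08):
    r_nifa = equilibrium_nifa_rate(r_primary)
    spread = r_primary - r_nifa
    volume = nifa_demand(r_nifa)
    print(f"primary rate {r_primary:.3f}: NIFA rate {r_nifa:.4f}, "
          f"spread {spread:.4f}, volume {volume:.1f}")
```

With these made-up coefficients, a higher primary rate raises both the equilibrium volume of NIFA and the spread earned by intermediaries, which is the direction of effect the text describes. Nothing about the magnitudes should be read as empirical.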

Because intermediation incurs costs for wages 
and depreciation, its supply is depressed by in- 
creases in wage rates and in prices of capital 
goods and is increased by technological advance 
that economizes labor and capital. Increases in 
risk and uncertainty of investment in primary se- 
curities, which typically are at long term, and of 
the much more liquid NIFA debt, which is typi- 
cally at short term, inhibit the supply of NIFA. 
Conditions of supply in intermediation are affected, 
perhaps more profoundly than in any other indus- 
try, by governmental intervention in the form of 
subsidy, special tax terms, and sundry regulatory 
devices. The net effect of governmental interven- 
tion in the United States has probably been to 
stimulate both supply and demand. The justifica-
tion may be that there are external benefits of
intermediation that wholly free markets would not
realize. 

There is a separate industry of firms supplying 
each variety of NIFA. These industries are imper- 
fectly competitive, and some of them satisfy the 
specifications of oligopoly. Governmental restric- 
tions on freedom of entry of new firms are ac- 
countable for some loss of competitive impulse. 
In addition, there is reason to suspect the exist- 
ence of internal economies of scale for the inter- 
mediary firm that are not compatible with free 
competition. The evident imperfections of competition
have led to numerous governmental restraints
on the structure of the intermediary industry
and its market behavior.

Taken in conjunction, the demand and supply
functions for nonmonetary intermediation deter- 
mine the volume of NIFA and their market rates 
of interest. These functions determine the spread 
between primary rates and NIFA rates and the 
gross profit to intermediation. The gross profit, 
adjusted principally for labor and capital costs, 
determines net profit, which, in relation to profit 
opportunities elsewhere, regulates the desired flow 
of equity funds into the industry. It is equity funds, 
in turn, which help to provide the factor of safety 
for NIFA that induces consumer households to 
invest in these assets at yields below primary rates
of interest.
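
The way the demand and supply functions jointly fix volume, rates, spread, and profit can be illustrated with a minimal numerical sketch in Python. It is not drawn from the studies cited here: the linear functional forms, the coefficient names (a, b, c, d), and the sample figures are all assumptions chosen only for illustration. Demand for NIFA is assumed to rise with the NIFA rate, and supply is assumed to rise with the spread between the primary rate and the NIFA rate.

```python
from dataclasses import dataclass


@dataclass
class Equilibrium:
    nifa_rate: float     # market rate paid on NIFA
    volume: float        # equilibrium stock of NIFA
    spread: float        # primary rate minus NIFA rate
    gross_profit: float  # spread earned on the equilibrium volume
    net_profit: float    # gross profit less labor and capital costs


def solve_nifa_market(primary_rate, a, b, c, d, operating_costs):
    """Solve a hypothetical linear market for NIFA.

    Assumed demand: a + b * nifa_rate                  (rises with the NIFA rate)
    Assumed supply: c + d * (primary_rate - nifa_rate) (rises with the spread)
    """
    if b + d <= 0:
        raise ValueError("b + d must be positive for a unique equilibrium")
    # Setting demand equal to supply and solving for the NIFA rate.
    nifa_rate = (c - a + d * primary_rate) / (b + d)
    volume = a + b * nifa_rate
    spread = primary_rate - nifa_rate
    gross_profit = spread * volume
    net_profit = gross_profit - operating_costs
    return Equilibrium(nifa_rate, volume, spread, gross_profit, net_profit)


if __name__ == "__main__":
    # Illustrative figures only; they describe no actual industry.
    base = solve_nifa_market(0.06, a=100.0, b=2000.0, c=50.0, d=3000.0,
                             operating_costs=2.0)
    higher = solve_nifa_market(0.07, a=100.0, b=2000.0, c=50.0, d=3000.0,
                               operating_costs=2.0)
    print(base)
    print(higher)
```

Under these assumed coefficients, a rise in the primary rate widens the spread and enlarges the equilibrium volume of NIFA, which is the direction of response described in the text. The net profit figure is the quantity that, compared with profit opportunities elsewhere, would govern the flow of equity funds into the industry.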

Markets for primary securities. Nonmonetary
intermediaries are net buyers of primary securities: 
they acquire securities with funds that flow to 
them on the market for NIFA and, in much smaller 
volume, sell primary securities of their own, mainly 
equities, to finance increases in their net worth. 
The latter may be dropped from consideration 
here with the generalization that they increase in 
response to governmental capital requirements, to 
gains in intermediaries’ net earnings, and to gen- 
eral advances in prices of common stocks. 

Historically the development of intermediaries
was a necessary condition for broad and active
markets in primary securities. Intermediation
requires an infrastructure of security exchanges,
dealers, and brokers. Reciprocally, efficient markets
for primary securities reduce the operating
costs, risks, and uncertainties of intermediaries.
(For illustrations see Basch 1964, chapter
[number illegible]; Nevin 1961, chapter 4.)

By supplying attractive media for savings and
markets for investors’ issues of primary
securities, nonbank intermediation facilitates
division of labor between saving and investment.
The result is that primary securities accumulate
in amounts that are relatively large in comparison
with national income. 

Aside from stimulating the organization of se- 
curity markets and growth in primary securities, 
intermediaries influence the allocation of savings 
among investment opportunities and, hence, the 
quality of primary securities. Successful inter- 
mediation depends heavily on arbitrage between 
opportunities to finance investment and to acquire 
primary securities. From the standpoint of society, 
such arbitrage promotes allocative efficiency in the 
saving-investment process. On markets for pri- 
mary securities, one manifestation of allocative 
efficiency is uniformity in interest rates and other 
terms of lending for similar securities. Inter- 
mediation works against fragmentation of security 
markets. 

By encouraging saving, and so accelerating 
growth of capital, intermediation tends to reduce 
interest rates on primary securities. To the extent 
that intermediation allocates savings efficiently, it 
results in a capital stock of high productivity and 
so tends to raise interest rates on primary securities.
The net effect of intermediation on interest
rates, in the long run, is probably to raise them. 

Efficient nonbank intermediation increases the 
short-run stability of rates of interest on primary 
securities. Each increase in primary rates first 
widens profit margins in intermediation and then 
induces increases in rates paid on NIFA. Insofar
as more attractive yields on NIFA stimulate saving
and divert the public’s demand from money balances,
the effect is to intensify the flow of funds
through nonbank intermediaries toward purchase 
of primary securities. Conversely, short-period 
declines in primary rates may shrink the flow 
of funds toward purchase of primary securities. 
If intermediation does temper short-run fluctua- 
tions in primary rates, the result is a reduction in 
market-risk premia on primary securities. (For an 
alternative view see Minsky 1964.) 

Markets for goods. Nonbank intermediation in- 
fluences aggregate propensities to consume, the 
quality of consumption, and the stability of consumption.
There is a dual effect on total consumption,
since NIFA attract savings, while consumer
credit, financed through intermediaries, facilitates 
consumption. Intermediation is important for the 
quality of consumption, partly because it changes 
relative costs to consumers of different classes of 
goods, partly also because it permits flexible adjust- 
ments in life-cycle patterns of consumption. Finally, 
intermediation is pertinent to temporal stability in 
consumption to the extent that changes in consumer
stocks of NIFA and in consumer debt, as
well as in yields and costs of NIFA and debt, lead
to acceleration or deferral of consumer spending
(Enthoven 1957-1964; [U.S.] Board of Governors
[date illegible]).

Influences of intermediation can be traced 
through markets for goods other than consump- 
tion goods. As the savings outlet that provides 
the alternative to financing investment from in- 
ternal sources, through direct issues of primary 
securities to savers, through the monetary system, 
and through government, nonbank intermediation 
participates in selecting investment opportunities 
and in shaping the structure of the capital stock. 
It is involved in allocating savings between private 
and governmental investment, between domestic 
and foreign uses. Furthermore, it is a factor in the 
short-run stability both of total investment and 
of components including housing and business in- 
ventory (Grebler & Maisel 1963). 

Measurements of national income originating in 
nonbank intermediation are ambiguous. It may be 
estimated that saving by these institutions on their 
own account is less than 1.5 per cent of national 
saving in the United States, that their tangible 
investment is about 0.25 per cent of national in- 
vestment, and that they employ about 1.5 per cent 
of the civilian labor force. Their impact on markets 
for goods and factors is mainly indirect, through 
the saving—investment decisions of other sectors. 

Market for money. There has been intensive 
debate regarding the effect of growth and innova- 
tion in NIFA, and of changes in their yields, on 
the demand for and the supply of money. The 
central issue has been the degree of substitutability 
between NIFA and money balances in the asset 
portfolios of consumer households especially, but 
also of business firms. 

If demand for money is negatively elastic to 
yields on NIFA, changes in these yields resulting 
from a monetary policy that raises (lowers) the 
rate of growth in the supply of money may raise 
(lower) growth in demand for money and so off- 
set the intended effects of monetary policy on 
markets for goods, factors, and primary securities. 
Since substitutability of NIFA for money is not 
perfect, interference from NIFA with monetary 
policy may presumably be overcome by sufficiently 
large and timely adjustments in the money supply. 
Alternatively, interference of NIFA with monetary 
policy might be overcome by bringing within the 
orbit of monetary control those nonbank inter- 
mediaries which issue NIFA most closely resem- 
bling money. 

If NIFA are near substitutes for money, traditional
monetary controls can also be obstructed by
responses of nonbank intermediaries. There are 
other implications. For example, if commercial 
banks must be restrained more tightly over long 
periods or subjected to more vigorous and volatile 
control for short periods because nonbank inter- 
mediaries weaken monetary discipline, there is 
discrimination against banks. The discrimination 
may be criticized on grounds of equity or because 
it undermines the capitalization and solvency of 
banks. If banks dispose of savings differently than 
do nonbank intermediaries, the discrimination may 
also be criticized for its allocative effects on the 
stock of capital. (The literature concerning effects 
of nonbank intermediation on monetary controls 
is very large. It includes Commission on Money 
and Credit 1961, pp. 78-81; Gurley & Shaw 1960, 
chapter 6; Johnson 1962; Great Britain, Committee 
on the Working of the Monetary System 1959, 
pp. 129-135.) 

Equilibrium of the market for money, at a rela- 
tively stable price level, appears to be an impor- 
tant prerequisite for development of nonbank 
intermediation and of NIFA, even with price-escalator
provisions. Inflation is a tax not only on
money balances but also on the real value of
NIFA, and it correspondingly diminishes
real demand for NIFA. Furthermore, inflation
often concentrates savings in sectors of the com- 
munity that prefer other dispositions of savings 
than NIFA. International data suggest that non- 
monetary intermediation lags where inflation is 
habitual and erratic. 

Integration between nations, in monetary and 
trade institutions and policies, has important im- 
plications for nonbank intermediation, with regard 
both to markets for NIFA and to intermediary port- 
folios of primary securities. For example, one can 
expect innovations in NIFA, such as Euro-curren- 
cies, and international arbitrage by intermediaries, 
such as investment funds, that will tend to unify 
markets for savings. 

Regulation. Nonbank intermediation is typi- 
cally subject to intensive governmental regulation 
and intervention. At the extreme, there is govern- 
ment ownership and management. Short of this 
limit, regulation and intervention include subsidy 
and other stimulants as well as various restraints 
on private enterprise in intermediation. 

There are seven common objectives of regula- 
tion: to increase propensities to save, to guide the 
allocation of savings to investment, to limit short- 
period instability of growth, to strengthen the 
solvency of intermediaries, to improve intermediaries’
operating efficiency, to correct the distribution
of income and wealth, and to stabilize the
balance of payments. (The last will not be discussed
here.)

The basic technique of accelerating savings 
through nonbank intermediaries is to intensify de- 
mand for NIFA, relative to consumption, at given 
levels of national income. Rates of interest may 
be raised on NIFA, innovations introduced, savers’ 
safety guaranteed. For these objectives, competi- 
tion may be intensified among intermediaries, tax 
concessions granted to them, deposits insured, and 
portfolio limits relaxed. 

Regulation of intermediaries is traditionally 
concerned with savings allocation. For the benefit 
of housing or, say, investment in agriculture and 
small business firms, regulation has provided in- 
centives for intermediaries that guide savings to 
desired ends. Programs to overcome regional lags
in development have included subsidized alloca- 
tions of loanable funds through intermediaries. 
Measures against restraint of competition among 
intermediaries are partly concerned with allocative 
efficiency. 

Nonbank intermediaries have contributed at 
times to short-period turbulence in the saving— 
investment process. Measures to correct their in- 
fluence toward overinvestment include minimal 
requirements for liquidity and net worth. Measures 
to correct their influence toward underinvestment 
include guarantee of their portfolios and provision 
of rediscount facilities. 

The solvency of intermediaries has been an objective
of high priority in regulation. The costs
of insolvency that have seemed to justify intensive
regulation include inequity to savers, misallocation
of savings, and contagious economic instability.
Measures to protect solvency include net-worth requirements,
supervision of operations, limits on
asset selection, and loan insurance. There is
an argument that regulatory preoccupation
with solvency has imperiled some other objectives,
such as allocative efficiency.

Operating efficiency is not often a primary objective
of regulation. Its purpose is to liberate real
resources for other uses, to reduce spreads between
rates on primary securities and rates on
NIFA, and to strengthen solvency. Economic operation
is essential for the comparatively primitive
forms of intermediation that are appropriate
to early stages of national economic development.
It has figured heavily in the growth of such institutions
in the United States as credit unions.

Equity considerations enter into some regulatory
techniques. The purpose may be merely to
correct inequitable incidence on some social classes
of gains and losses from intermediation, or it
may be to change the distribution of income and 
wealth. The first purpose is illustrated by regula- 
tion of insurance companies’ portfolios in the in- 
terest of beneficiaries, or of investment funds to 
limit conspiratorial gains of insiders. The second 
purpose is illustrated by various special credit 
arrangements for agriculture or war veterans or 
regions suffering a slow tempo of development. 

These objectives of regulation have given rise, 
especially in the United States, to a complex ap- 
paratus of federal and state regulatory techniques 
administered by governmental departments, com- 
missions, and institutions. Dissatisfaction with this 
apparatus has inspired intensive re-evaluation by 
both private and governmental bodies. The princi- 
pal occasions in the United States were during the 
decade before World War I, during the 1930s, and
in the early 1960s by the Commission on Money 
and Credit. There were other national investiga- 
tions in the course of the 1960s: in Japan the 
Committee on Financial System Research; in the 
United Kingdom, Committee on the Working of the 
Monetary System; in Canada, Royal Commission 
on Banking and Finance. 

These recent studies of regulations concerning 
both monetary and nonbank intermediaries had 
to do, in part, with irrationalities in regulatory 
agencies and techniques. They were also directed 
to goals of regulation and, for the first time, to 
possible benefits of applying a general theory and 
compatible techniques to regulation of various 
classes of intermediaries. There is no precedent 
for their attempts to define optimality in the fi- 
nancial superstructure of capitalism. They did not 
reach common conclusions on either principles or 
techniques of regulation. 

International differences. There is notable di- 
versity of nonbank intermediation between coun- 
tries. First, NIFA per capita vary widely (Kuznets 
1955; Goldsmith 1955). This is explained partly 
by differences in real income and wealth per 
capita, since income and wealth elasticities of 
demand for NIFA are high. It is explained, too, 
by differences in centralization of decisions to save 
and invest: economic centralism is not a fertile 
environment for nonbank intermediation. Again, 
NIFA appear to be high, relative to income, where 
price inflation has been modest and low where in- 
flation has been rapid, chronic, and erratic. Non- 
bank intermediaries are relatively strong where the 
monetary system does not practice broad diversi- 
fication of portfolios and indirect debt, where 
public debt is not finely differentiated, and where 
social security is not financed mainly from current 
government revenues. The evidence is that nonbank
intermediation has been retarded by social
instability and defects of legal structure. There are 
also imponderables of social framework, social mo- 
bility, and personal motivation that affect the pace 
of intermediation. Governmental assistance and 
foreign aid figure prominently in growth of NIFA. 
Finally, development in intermediation varies be- 
tween countries according to the accessibility and 
quality of complementary institutions. All of these 
and other factors relating to international differ- 
ences can be classified according to their effects 
on conditions of demand and supply for NIFA. 

The quality as well as the scale of nonbank in- 
termediation varies internationally. There is varia- 
tion in primary securities acquired by intermedi- 
aries, in NIFA, and in institutional structure. One 
observes rough correspondence of qualitative with 
quantitative development: such intermediaries as 
investment funds appear where real income per 
capita is high; small cooperatives for agricultural 
credit, where income is lower. Still, the quality of 
intermediation is not uniform between countries 
at comparable stages of economic development. 
Differences in composition of wealth and output 
are partly responsible, and the form of political 
organization is pertinent. The distribution of in- 
come by level of income is another factor, as is 
the distribution of population by age and degree 
of urbanization. Within the orbit of capitalism, 
the social choice between alternative methods of 
mobilizing and allocating savings deeply affects 
the environment of nonbank intermediation. One 
can trace in the quality of intermediation the
unique historical experience of each society that 
has given momentum to some intermediaries 
rather than others; the mores and traditions that 
condition savers for or against lotteries, perhaps, 
and for or against concentrations of financial 
power; the mobility of ownership in enterprise; 
and the geographic extent of the economic system. 
From differences in savers’ tastes to differences 
in technological conditions of intermediation and 
patterns of governmental intervention, there are 
numerous factors to explain the international di- 
versity in intermediation. One may infer that op- 
timal development of intermediation will have 
strikingly indigenous characteristics. 


E. S. SHAW 
[See also BANKING; BANKING, CENTRAL; MONEY.]
FINE ARTS


I. ART AND SOCIETY Francis Haskell
II. THE RECRUITMENT AND
SOCIALIZATION OF ARTISTS Mason Griff


I 
ART AND SOCIETY 


Societies of all kinds and of all periods have 
given birth to what can today be classified as art, 
but there exist no significant propositions as to 
the nature of the relationship between particular 
social systems and the kinds of art that develop 
under them. The problem was first seriously posed 
during the Enlightenment, when philosophers 
arguing in favor of differing political organizations 
suggested that desirable art would necessarily fol- 
low desirable government. But the view of Shaftes- 
bury and others that freedom was a prerequisite 
of great art was hardly borne out either by the 
previous or subsequent history of England, where 
architecture, sculpture, and painting, although
reaching a higher peak than they had for some 
centuries, were hardly superior to those engendered 
under despotic regimes in Italy, France, and cen- 
tral Europe. 

In more recent years the problem has been 
raised in more sophisticated terms: quality, it is 
admitted, may be capricious in its incidence, but 
style will necessarily be governed by fundamental 
social laws. E. H. Gombrich has demonstrated the 
fallacies that usually underlie such reasoning [see 
STYLE], but it will in any case be impossible to 
make any authoritative large-scale statements 
about the relationship of art to society until the 
very few studies of particular societies and their 
arts have been greatly increased in number, scope, 
and depth. History remains our only source of 
guidance. 


The artist in history 


Research into the status of artists at different 
periods, while of interest in itself, does not seem 
to throw much light on the nature or quality of 
the art produced, nor does it even give an indi- 
cation of the esteem in which art has been held. 
Moreover, for many societies and epochs the data 
are extremely scanty. 

Greece. In Greece between the sixth and third 
centuries B.C. a flowering of the arts so spectacular 
that it influenced all subsequent history appears 
to have been almost totally neglected by the in- 
numerable writers of genius who lived through 
it. However, outlines of the artists’ lives and a few 
anecdotes of their personalities must have been 
recorded, because later historians such as Pliny 
evidently drew on lost sources of this kind. But 
nowhere in classical Greek literature is there any- 
thing to suggest that individual artists were held 
in serious consideration, and by inference we can 
assume that they were considered as being little 
superior to craftsmen. This, however, certainly 
does not imply that the Greeks were indifferent to 
the masterpieces that were produced: indeed, 
Plutarch points out in a much-quoted phrase (in 
Pericles) that “no generous youth, from seeing the 
Zeus at Olympia or the Hera at Argos, longs to 
be Phidias or Polyclitus, for it does not of necessity 
follow that if the work delights you with grace, 
the one who wrought it is worthy of your esteem.” 

It is noteworthy that in all the discussions that 
took place during the golden age on the nature 
of inspiration, examples were never chosen from 
among sculptors or painters, and it is clear that 
although artists in classical times could be either 
slave or free, the mere association of their work 


with manual labor was enough to prevent their 
being highly esteemed. For this reason, which was 
to recur later in the Middle Ages, an architect 
stood a greater chance of earning renown than 
a painter or sculptor, for, when successful, he 
would be considered more as an entrepreneur 
than a manual laborer. However, as frequently 
occurs in the social history of the arts, sufficient 
skill could on occasion enable its possessor to 
overcome this indifference to the individual artist 
and defy even the most rigid conventions. There 
are, for instance, records indicating that as early 
as the last quarter of the sixth century B.C., “several
potters . . . were able to erect impressive
dedications on the Acropolis” (Cook 1960, p. 272), 
and it is also well known that at much the same 
time we begin to find frequent examples of signed 
work. 

During the Hellenistic period the social status of 
the artist certainly improved, and legends about 
the relations between Alexander the Great and 
Apelles, which are recorded by Pliny, indicate at 
the very least an ideal that was no longer held to 
be absurd—and incidentally set a highly important 
precedent for subsequent royal patronage. We are, 
however, still very largely ignorant not only about 
the status of the artist in antiquity but also about 
the nature of his training and mode of life gen- 
erally, although we do find hints of exhibitions, 
dealers, and other features of the artistic scene 
that later became commonplace. If we are entitled 
to assume, as seems probable, that the artist in 
classical antiquity was considered to be little more 
than a craftsman, it remains the more surprising 
that the development of sculpture and painting 
was so extraordinarily striking. For it is often held
that artistic change is encouraged by the notion 
of the artist’s individuality, whereas, almost by 
definition, the craftsman is employed exclusively 
to satisfy public demand. 

Rome. For Roman times, too, the evidence is 
scattered, fragmentary, and conflicting. It seems 
clear that to Cato and those who thought like him 
art was something essentially foreign—even de- 
grading—and this view must clearly have been 
reflected in their opinion of the artists themselves. 
On the other hand, archeological research and 
literary references illustrate obviously enough the 
admiration felt for Greek sculpture, and Pliny’s 
stories of the Greek artists must surely have suggested
to many that the superior ones among them
could achieve recognition far higher than that 
likely to have been granted to mere craftsmen. If, 
however, we risk the premise that some acknowl- 
edgment of the artist as an individual with a will 


of his own, who does not merely respond to the 
pressures of the market, is at least a likely pre- 
requisite of adventurous art, then the highly de- 
rivative nature of Roman sculpture (and also the 
extremely conservative tendencies of the ancient 
Egyptians) might imply that the position of the 
artist was a very lowly one. But the dangers of 
such a circular argument are too great to make 
it worth pursuing. Certainly the Romans never 
included the visual arts among the artes liberales, 
and this, together with the physical destruction 
and chaos that followed the collapse of the Western 
Empire, proved to be of decisive importance for 
the Middle Ages. 

Middle Ages. The state of our researches into 
the artist’s position in medieval times must rein- 
force our skepticism about any broad-based con- 
clusions to be drawn from antiquity. For here too 
the arts were almost completely ignored by serious 
writers. However, a considerable number of scat- 
tered documents—of the very kind which have not 
survived from ancient Greece or Rome—can still 
be traced, and these show us how complex the 
situation was. Their investigation has recently been 
undertaken by Andrew Martindale (1966). Since 
his researches constitute by far the most valuable 
contribution that has yet been made to the subject, 
they have been heavily drawn upon in the follow- 
ing discussion. 

Enough documentation certainly survives to 
prove what was only conjecture in any discussion 
of earlier periods: until the fifteenth century most 
artists were generally considered superior craftsmen.
In 1323, for instance, a Paris scholar listed
among “those craftsmen working with their hands” 
not only “the most ingenious makers of all sorts 
of image, whether contrived in sculpture or in 
painting or in relief” but also “the most cunning 
constructors of instruments of war” and the
“makers of bread.” Moreover, from tax returns and 
other sources we can see that artists, like other 
craftsmen, tended to live together in communities 
in particular parishes and that they made no distinction
between work that would now be con-
sidered mechanical, such as decorating saddles (in 
London the painters’ guild was a branch of the 
saddlers’), and painting devotional images. The 
artist was incorporated into a guild, and his train- 
ing followed the usual practice by which, as a youth 
of about 12, he would enter a master’s shop as 
an apprentice. There he would help out and fa- 
miliarize himself with technical processes, to 
emerge after some years as a journeyman, before 
finally becoming an independent master able to 
engage apprentices of his own. Thus Lorenzo 
Ghiberti, after winning in 1403 the competition
for the second set of bronze doors for the baptis- 
tery of San Giovanni in Florence (this was a means 
of attracting artists which was always to remain 
popular with committees), was in a position to 
employ as many as 21 apprentices a few years 
later. The guilds also served other purposes, some 
of which, such as the maintenance of a certain 
status and ritual dignity, have been carried on by 
the academies into our own day. In general, how- 
ever, their concern was essentially with more prac- 
tical matters, above all, to protect both the artist 
and his client from fraud. 

But a too-uniform pattern must not be read into 
the Middle Ages, and many of the striking de- 
velopments that occurred in Renaissance Italy 
were already present in embryo. Then, as later, 
the law of supply and demand operated powerfully 
in favor of change. It is true, as Martindale 
stresses, that a king might bring painters into his 
household for reasons that had little to do with 
their professional ability and treat them, once 
there, with scant respect or reward—he instances 
a “Jack of St. Albans,” who was required to dance 
on a table, and Giotto, whose gift of repartee seems 
to have been at least as highly prized as his art; 
yet it is hard to believe as he does that such 
appointments did nothing to raise the status of 
painting as an occupation (all our experience of 
psychology surely suggests the contrary). Martin- 
dale himself implies that the occasional institu- 
tion of the post of “town painter” (a practice 
adopted, above all, in Venice) may well have been 
inspired by imitation of such royal gestures. Com- 
petition between towns was also of the greatest 
importance. The terms of Giotto’s nomination in 
1334 as city architect of Florence (“architect”—
though he had made his name as a painter) will 
serve as an example: 


As in the whole world there is to be found none better 
qualified . . . than Master Giotto di Bondone, the 
painter of Florence, he shall therefore be named in his 
native city as Magnus Magister and publicly regarded 
as such, so that he may have occasion to abide here; 
for by his presence many can have the advantage of 
his wisdom and learning, and the city shall gain no 
small honour because of him. 


These terms show what lip service burghers were 
prepared to pay to the arts—and artists—for en- 
hancing their own reputations (and doing down 
a rival city). 

It is true that Italy in general and Florence in 
particular were exceptional in paying such honors, 
but the wording of the appointment reminds us 
that the rivalry between secular and religious communities,
towns and courts, and one city and
another could play as significant a part in elevat- 
ing the position of the artist as could the rivalries 
between individual patrons in later years. Cer- 
tainly the artists often thought of themselves as 
much more than mere craftsmen, as can be seen 
from the very many self-laudatory inscriptions that 
they put on their own works: we find examples of 
these as early as 1100. 

Renaissance. The Renaissance did, of course, 
bring great changes in the status of the artist— 
one could instance a fragment of a letter from 
Prince Federico Gonzaga, who was trying to get
hold of any example of Michelangelo's 


never-sufficiently-to-be-praised work . . . sculpture or 
painting, as he chooses, we do not mind which it is as 
long as it is by him. . . at least a drawing, if it is well 
done in charcoal. . . . And you must tell him that his 
work will be placed in a most honourable position, and 
that we shall feel eternal gratitude towards him, and 
that we shall never forget such a special favour, and 
will always be ready to do anything we possibly can to 
give him pleasure. (quoted in Luzio 1913, pp. 246- 
247) 


But it must always be remembered that Michel- 
angelo’s position was a wholly exceptional one 
and that many of the customs of the Middle Ages 
were carried on well after his death. However, the 
letter does point up the new role that had been 
assumed ever since the fifteenth century by secular 
patrons in Burgundy, Italy, and elsewhere. 

The early humanists who did so much to raise 
the value of art seem themselves to have been more 
interested in the remains of antiquity than in con- 
temporary painting or sculpture, but by the middle 
of the fifteenth century a number of rulers were 
aware of the possibilities of private patronage as 
distinguished from the traditional construction 
and decoration of public buildings. And toward the 
end of the century Lorenzo de’ Medici, whose role 
as an art lover has been much exaggerated, did 
anticipate the modern practice of advertising 
abroad the supremacy of his country’s artists in 
order to enhance its prestige. Other collectors all 
over Italy, and soon all over Europe, acquired con- 
temporary works of art, which were then evidently 
kept accessible to other artists and connoisseurs.
From these princely collections sprang most of the 
great national museums, following the example of 
Anna Maria Ludovica, the last of the Medici, who 
in 1743 left the accumulated family treasures to 
the city of Florence. Elsewhere, beheading or exile 
usually proved a more effective means of acquir- 
ing art treasures.


Nonetheless, appreciation of art, and even ad- 
miration for the artist, could go hand in hand with 
habits formed in earlier ages. Well into the seven- 
teenth century, for example, we come across 
patrons insisting on the use of specified colors in 
the canvases painted for them, and the nature of 
commissions generally shows how little individu- 
ality was allowed to the artist and how much he 
was still treated as a craftsman. It was usual for 
a patron when ordering a picture to pay for the 
stretcher and the priming, and frequently also the 
canvas and some of the more expensive colors. 
He would then indicate the dimensions, insist on 
a specific time limit for its completion, and make 
arrangements about the subject matter. As these 
specifications must have usually taken the form 
of spoken agreements, it is difficult for us to gauge 
how strictly they were enforced; yet the survival 
of preliminary drawings and, later, of painted oil 
sketches can occasionally give us an idea of any 
changes required. Most significant of all in this 
context is the method of payment. The price of 
the picture was decided in advance—according to 
a seventeenth-century Italian author, this could be 
done by finding out how long the work would take 
and estimating what the painter’s daily earnings 
should be “by comparison with the pay of a crafts- 
man engaged in similar work.” A proportion of 
this was paid at once, and further sums were paid 
at different stages in its progress, ending, on com- 
pletion, with the final settlement, to which would 
be added a bonus. This bonus was the only allow- 
ance made for the vagaries of independence. 
Further indications of the artist’s status (and per- 
haps of the very nature of artistic creativity) can 
be deduced from the fact that in certain cities, 
notably Venice, it was a custom until the end of 
the eighteenth century for artists to work together 
in families. All this should be borne in mind when 
the “individualism” of Renaissance artists is being 
assessed.

In Italy especially, artists were keenly preoccu- 
pied with their place in society throughout the 
fifteenth century, and innumerable arguments were 
produced to establish their claim to be considered 
wholly superior to craftsmen. Particular attention 
was concentrated on the “intellectual” as opposed 
to the “manual” aspects of their work. Leonardo, 
whose respect for antiquity led him to the errone- 
ous conclusion that painting had then been one 
of the liberal arts, only to be “driven out” at a 
later date, stressed above all the painter’s need 
to have a thorough knowledge of mathematics. 
It is through instances such as this that we can 
probably see most clearly how a study of social 
pressures on the artist can sometimes help us to 
understand the nature of his art. The Renaissance 
stressed imagination and science at the expense 
of the faithful portrayal of reality, and categories 
of subject matter were tentatively drawn up and 
later codified to put these ideas into effect, with 
“history painting” at the top of the list and “genre” 
at the bottom. 

The social status of artists was largely deter- 
mined by their choice of subject matter until well 
into the nineteenth century. At the same time, the 
guilds, which had once been of such service to the 
artist, were now considered degrading to him 
through their association with other crafts, and 
their dissolution was gradually achieved not so 
much by theoretical arguments as by the practice, 
adopted first by sovereigns and then by private 
citizens, of employing artists in defiance of all 
established traditions. But the process was a slow 
one, and into the seventeenth and even the eight- 
eenth century the guilds continued to play a signif- 
icant, if increasingly marginal, part in the life of 
artistic communities in Italy, France, and else- 
where; nor was the distinction between “arts” and 
“crafts” made final before the French Revolution. 


The rise of the modern artist 


The decay of the guilds has sometimes been held 
responsible not only for the “individualism” and 
enormous increase in the speed of artistic change 
that characterize the Renaissance but also for the 
emergence of the “neurotic” artist so much asso- 
ciated with the early and mid-sixteenth century— 
and indeed with later times. The force of this argu- 
ment would be much increased were it possible 
to conduct any serious research into the state of 
mind of artists in earlier times. What is certain, 
however, is that as the guilds decayed, new asso- 
ciations arose that took their place and assumed 
some of the same functions (Pevsner 1940). 

Academies. The Accademia dei Virtuosi al Pan- 
theon, founded in Rome in 1543, was concerned 
with the same sort of charitable works that had 
been carried out by medieval guilds. The Accade- 
mia del Disegno, established in Florence in 1563, 
had the grand duke of Tuscany and Michelangelo 
as its first copresidents (the juxtaposition itself is 
revealing). While it was intended primarily to pro- 
mote the newly won status of artists, one of its 
aims was teaching young men to maintain a high 
standard of art—but significantly, the instruction 
was more in the realm of general education than 
in the technical field, which had been the aim of 
the earlier guilds. 

Dealers. Other features of the present situation 
first began to emerge during the sixteenth and 
seventeenth centuries. The enormous reputation of 
Italian art attracted the attention of collectors else- 
where in Europe who, as they were unable to travel 
themselves, either had to summon particular artists 
to their courts or mansions or, when this was im- 
possible, to rely on dealers and international agents. 
Hitherto, such men, who had been operating ever 
since the Middle Ages, had included works of art 
among innumerable other commodities. But now 
they began to assume much greater importance, 
as can be seen by studying the career of Jacopo 
Strada (immortalized in a magnificent portrait by 
Titian), who supplied pictures and antiques to the 
German courts. 

Venice was the most commercialized of Italian 
cities, and it is there that we find in the person 
of Aretino the prototype for so many dubious fig- 
ures familiar since his day—the man who is at 
once artistic adviser to the great (he received a 
gold chain from François I), dealer, and collector 
(in the pursuit of which activity he tried to black- 
mail Michelangelo). In general, however, dealers 
in contemporary art played a relatively small role 
in Italy and confined their often extortionary at- 
tentions to young painters who had as yet had no 
chance of finding patrons directly or to those who 
suffered during the many financial crises that upset 
what was always a precarious economic situation. 
It was in the north, where both courtly patronage 
and contempt for trade were of much less impor- 
tance than in Italy, that art dealing first assumed 
really significant proportions. In sixteenth-century 
Antwerp and, much more so, in seventeenth-cen- 
tury Amsterdam artists often dealt with middlemen 
who sometimes—as is the modern practice—paid 
them a regular allowance in return for their entire 
output. This breakup in the direct relationship be- 
tween patron and artist and the consequent “ra- 
tionalization” of the market obviously played its 
part in encouraging that specialization of subject 
matter—flowers, genre, portraiture, etc.—for which 
Dutch artists were particularly renowned. 

Exhibitions. It is in the north too that we can 
find the origins of the modern picture exhibition. 
During the later Middle Ages these had been fre- 
quently associated with religious festivals at which 
artists and other craftsmen had shown their wares, 
but such occasions (which continued until a much 
later period) belong more to the history of com- 
merce than to the history of art. By the mid-six- 
teenth century, however, exhibitions were organized 
by the artists’ guilds in Antwerp, and some kinds of 
formal arrangements had come into being. Mean- 
while, another tradition of almost equal importance 
was slowly beginning to develop in Italy, whereby 
churches and their cloisters were decorated with 
pictures on particular saints’ days. At first these 
occasions had no commercial purpose whatsoever 
and were designed, for instance, by the Virtuosi al 
Pantheon, purely as an act of homage to a patron 
saint. By the early seventeenth century, however, 
artists had seen the possibilities for self-advance- 
ment that were inherent in the custom and were 
anxious to display their pictures on the various op- 
portunities available each year—even though there 
was as yet still no question of selling them. Nor 
were there any catalogues or other regular promo- 
tional procedures. 

The Paris salon. By far the most important step 
in the development of the modern exhibition was 
taken in Paris in 1667, when for the first time, 
what was intended to be an annual salon was 
organized for its members in the Louvre by the 
Académie Royale, the most authoritative of the 
early academies. Exhibitions took place thereafter 
at irregular intervals, although they were well es- 
tablished by the second quarter of the eighteenth 
century, and printed catalogues were sold. For the 
first time, the general public, as opposed to a 
highly restricted circle of the rich and powerful, 
had the opportunity to see current production 
without having to crane their necks in dark churches 
or wheedle their way into the palaces of the aristoc- 
racy. As an inevitable result the artist could appeal 
to a wider circle than ever before, the much-de- 
bated phenomenon of “bourgeois taste” began to 
make itself seriously felt, and criticism sprang into 
being. 

Art criticism. Criticism had hitherto been con- 
fined to historians and theorists, usually arguing 
from basic principles, and to amateur poets, who 
were likely to write laudatory sonnets about works 
they particularly admired. It had been bedeviled 
by the problem of whether a person who was not 
a practicing artist was in a position to make any 
useful comments, and in any case, it had only 
rarely dealt with new works. Yet critics could prove 
to be very influential. Often the critic, who was 
never in any sense a professional, served as ad- 
viser to the more important patrons. Thus G. P. 
Bellori, the most prominent critic of the seven- 
teenth century, whose idealizing theories pro- 
foundly affected his own and all future genera- 
tions, worked for Queen Christina of Sweden and 
for the popes; and Winckelmann, his even more 
important successor in the eighteenth century, was 
employed by Cardinal Albani. 

In any case, such great theorists, historians, and 
scholars belong to a wholly different class from 
the pamphleteers who in 1738 began—much to the 
justified indignation of the artists—to review the 
salons. Their tone was often low and scurrilous, 
but the popularity of their work can be gauged 
from the fact that there were 2 written criticisms 
in 1738, 10 in 1773, and 28 ten years later. While 
most of their articles are now forgotten, they are 
of importance for the historian of art because they 
anticipate later writers who reached the far larger 
and more ignorant publics of the nineteenth and 
twentieth centuries and because they include one 
genius, Diderot. 

Connoisseurs. One other extremely important 
phenomenon occurred during the period under con- 
sideration: the arrival on the scene of the connois- 
seur, the “man of taste,” often holding little politi- 
cal power and of relatively small means but deeply 
concerned with the arts. He was able to exert con- 
siderable influence, particularly on artists whose 
talents were not easily adaptable to official commis- 
sions likely to be offered by great princes or clerics. 
Such a man in seventeenth-century Rome was 
Cassiano dal Pozzo, who employed Poussin for 
many years, or Pierre Crozat (admittedly of infi- 
nitely greater wealth), who nourished the very un- 
usual genius of Watteau. Indeed, many of the most 
beautiful houses, pictures, and statues of the eight- 
eenth century were produced for such individuals, 
who, being far removed from the court or church, 
proved to be extremely rewarding patrons—and 
friends. 

By the end of the eighteenth century many of the 
institutions associated with today’s artistic scene 
were already in being: the private collector, the 
dealer, the exhibition, the critic, and even the mu- 
seum of modern art (in the form of state purchases, 
in France especially, of promising new works). 
Huge fortunes could be earned—Sir Joshua Rey- 
nolds left £100,000 on his death in 1792—and 
huge reputations gained (Reitlinger 1961). 

The ideology of genius. It cannot be denied 
that the artist’s situation has changed radically 
since the late eighteenth century. Once again the 
main cause must be attributed to a new conception 
of the artist’s status. The idea of wayward “genius” 
and “inspiration,” which had been confined by the 
ancients to poets and thinkers, was extended dur- 
ing the Renaissance to artists—at first rather tenta- 
tively and then with growing assurance. The tower- 
ing achievements of Michelangelo and his strange, 
austere character, notable for its terribilità, were, 
more than any theoretical writing, responsible for 
the acceptance of this concept. In the sixteenth 
century Benvenuto Cellini implied that artists were 
entitled to live above the law, and in the seven- 
teenth Salvator Rosa claimed to be unable to paint 
unless “carried away by the transports of enthusi- 
asm.” He refused to accept the traditional advance 
payment for commissioned pictures, claiming he 
could not know when a painting would be finished. 

By the seventeenth century it was a common- 
place that artists were likely to have difficult char- 
acters, at the very least, and that this might even 
be a prerequisite of their talent—this despite the 
strenuous efforts of nearly all the leading artists 
of the day to conform to established patterns of 
social behavior so as to raise their status. 

In the eighteenth century William Blake was 
only one among many who denied that art could be 
taught—an assumption underlying the many new 
academies that had been founded all over Europe 
to improve standards of design—and soon after- 
ward Goya claimed for the artist the right to look 
to his own fantasies rather than to the outer world 
for the subject matter of his art. It is, however, ex- 
ceedingly important to stress that scarcely one, if 
any, of the serious artists of the nineteenth cen- 
tury ever put into practice the vast claims that had 
been made for the rights of genius by their prede- 
cessors and encouraged by the German Sturm und 
Drang. 

Nonetheless, nearly all of them did maintain that 
where artist and public conflicted, it was the artist's 
moral duty to be true to himself rather than to his 
patrons. This was a stand that had hitherto been 
possible only for the very great, whose reputations 
were already assured and whose “difficult” works 
could therefore be taken on trust even when they 
were not fully understood. But this change of atti- 
tude occurred when, because of the increased birth 
rate and the urbanization that had been stimulated 
by the industrial revolution, a much bigger and 
presumably rather different public began to have 
the opportunities to see, and then to buy, modern 
art. Unfortunately, not even the minimum basic 
research into this change of patronage from a 
“cultivated aristocracy” to a “nouveau-riche, igno- 
rant bourgeoisie” has yet been undertaken. The 
general validity of this traditional view will merely 
have to be assumed. 

Art and politics. Two other important new 
developments took place at this time, about which 
we can be much more certain. First, there was the 
great politicization of life that occurred as a result 
of the French Revolution. Artists had occasionally 
played a part in the important issues of their day 
—Tilman Riemenschneider had suffered torture 
for his support of the Peasants’ Revolt of 1525, 
Lucas Cranach had strongly backed the Reforma- 
tion, Michelangelo had helped to defend Florence 
against the Medici in 1527, and so on—but never 
before had their styles, as opposed to their occa- 
sional subject matter, been associated with political 
views. In the eighteenth century, it is true, some 
writers—notably Diderot—had tried to identify cer- 
tain artists with certain social and political causes 
(again, as in the case of Jean Baptiste Greuze, 
usually on the basis of subject matter), but their 
lack of success had been total. And those who see 
in the classicism of David some sort of bourgeois 
opposition to court circles are merely projecting 
into the past what would never have been sus- 
pected—by him or his patrons—before 1789. None- 
theless, David’s career does mark a watershed. Ac- 
tively engaged in politics himself (and always, 
until his last years in exile, on the winning side, 
whichever it was), he initiated a great stylistic 
revolution that was inevitably confused with politi- 
cal revolution. Thereafter, art and politics became 
inextricably confused, so that in 1825 we can find 
the writer Louis Vitet exclaiming, “Le goût en 
France attend son 14 Juillet” (quoted in Grate 
1959, p. 17), although in fact he was calling for 
an end to Davidian classicism in the name of that 
painterly romanticism whose leading exponent was 
Delacroix, soon to become a convinced reactionary! 

Public galleries. The other important innova- 
tion was the widespread inauguration of public art 
galleries, above all, the Louvre in 1793 and in 1824 
the National Gallery in London. They kept on per- 
manent view collections of old masters, often 
dimmed by yellowing varnish, as a sort of standing 
reproach to any artistic change—this danger had 
been anticipated by Constable, one of the first to 
suffer in this way, and countless innovators could 
have echoed his fear that “the manufacturers are 
made the criterions of perfection, instead of na- 
ture.” This was particularly ironic because most 
public art galleries had been founded not only be- 
cause of the general belief of the Enlightenment 
that art could improve the quality of civilization 
and help to “soften manners,” but above all, be- 
cause it was hoped that galleries, like the acade- 
mies, would benefit living artists: the Louvre was 
at first opened only to artists on seven out of every 
ten days. 


The artist and the bourgeoisie 


A new conception of the artist, of “ignorant” 
public opinion, and of the politicization of all cul- 
ture (so that every new step, previously welcomed, 
was now looked upon as potentially subversive by 
a public that dreaded above all else a new social 
upheaval)—all these factors played a vital and 
often tragic role in the history of nineteenth-cen- 
tury art, especially as the century drew to a close. 
In almost every country in Europe, but particularly 
in France, an “official art,” sponsored by the prin- 
cipal dealers, the academies, and the richer collec- 
tors, confronted those painters, sculptors, and, 
later, architects whom today we look upon as the 
finest artists of their times. Controversy centered 
particularly (but not only) on the “lack of finish” 
characteristic of nearly all that vital art which was 
trying to break away from the formulas imposed 
by David and his school—formulas which were 
clung to with savage and paradoxical obstinacy by 
those very academies that he had wanted to abol- 
ish. There is ample evidence to show that lack of 
finish was equated with excessive ease and super- 
ficiality, and the now familiar charge, usually made 
in the presence of great and unfamiliar pictures, 
that “my little daughter aged ten can paint better 
than that” is first to be found early in the nine- 
teenth century. More damaging to adventurous 
artists than public scorn was the hostility of their 
conservative colleagues, for no mechanism for 
showing their work had yet replaced the salon or 
academy exhibitions. In France the organization of 
such exhibitions was until 1880 under the direct 
control of the government. 

In the earlier part of the century royal or other 
patrons were sometimes sufficiently powerful to 
ignore the pressures of the academy, and Delacroix 
received important public commissions long before 
he was grudgingly admitted to the Institut de 
France in 1857, after his eighth application. Later 
this dichotomy tended to disappear, and more and 
more artists found themselves excluded from the 
salons (and hence public attention) by restrictive 
juries. Unsuccessful efforts were made to organize 
independent exhibitions, and some artists, such as 
Courbet and Manet, followed an eighteenth-cen- 
tury precedent set by Greuze and others and showed 
their work privately. The Revolution of 1848 
brought a “free” exhibition, Napoleon III organized 
the Salon des Refusés in 1863, and a group of 
artists (later to be called the impressionists) 
showed their works together in 1874 and thereafter 
at irregular intervals until 1886. Above all, the year 
1884 saw the first Salon des Indépendants—a di- 
rect rival at last to the official exhibition—which 
did away with the jury system altogether. 

Meanwhile the salon itself was beginning to 
break down, at first under the sheer weight of num- 
bers (White & White 1965) and then, in 1890, 
because of a split due more to personal quarrels 
than to any conflict of principle. Gradually, inde- 
pendent dealers began to take over the function of 
sponsoring modern art, although how important 
official backing remained can be seen from the fact 
that in 1900, when he was already the most famous 
artist in France and possibly in the world, Auguste 
Rodin said that unless he was successful at the 
Great Exhibition of that year, he would have to 
enter the Institut, “parce que c'est à eux seuls que 
vont les grandes commandes.” 

Dealers as patrons. The dealers who now came 
to the fore were very different from any that had 
been seen hitherto. According to Durand-Ruel, the 
most conspicuous of them, a mature picture dealer 
should be at the same time an enlightened patron 
ready, if necessary, to sacrifice his immediate in- 
terests to his artistic convictions and a man who 
would rather fight against the speculators than as- 
sociate with their interests (Venturi 1939, vol. 1, 
p. 17). He himself is of additional interest because, 
while supporting the impressionists at a time when 
they were looked upon as dangerous revolutionaries 
—the potential instigators of a new Commune—he 
remained a staunch reactionary, bitterly hostile to 
the republican regime; and there is some reason to 
believe that he relished this double assault on the 
values of the triumphant tiers état. The eventual 
rewards that came to Durand-Ruel when the im- 
pressionists achieved popularity may have played 
their part in encouraging other dealers, such as 
Vollard and Kahnweiler, who, like their Dutch pred- 
ecessors in the seventeenth century, tended to mo- 
nopolize the output of their favorite artists but 
whose role in the development of modern paint- 
ing is analogous to that of Renaissance patrons— 
as can be seen from their portraits, like those of 
Pope Leo X or Philip II of Spain, painted by the 
greatest artists of their times. 

Along with a new type of dealer went a new 
type of critic, concerned not only with reviewing 
the annual salons but also with interpreting the 
“misunderstood artist” and explaining him to the 
public: witness the writings of Baudelaire on Dela- 
croix, Duranty and Duret on the impressionists, 
Fénéon on the neo-impressionists, and so on. 

The avant-garde. The concept of the avant- 
garde, that is, of an art that will by its very nature 
be in conflict with received opinion, had been aired 
in passing early in the nineteenth century, but only 
toward its end became the doctrine that has af- 
fected all subsequent thinking on the arts. Mani- 
festoes and apologists began to accompany every 
new movement, culminating in those of the Italian 
futurists in 1910, in which for the first time, artists 
and writers went out of their way to court public 
hostility: there could be no better indication of how 
far the situation had changed since 1874, when 
Degas had asked his friend Giuseppe de Nittis to 
exhibit with the impressionists, explaining that 
“puisque vous exposez au Salon, les gens mal 
documentés ne pourront pas dire que nous sommes 
l'exposition des refusés” (Nittis 1895, p. 237). 

Yet despite all this, the critics (and the public) 
of the end of the nineteenth century were already 
chastened by the ridicule that now covered those 
who had rejected first the romantics and then the 
realists. Opposition to new developments reached 
a final paroxysm of fury and then, in Western 
countries at least, tended to disappear following 
World War II, so that it is now not unusual for an 
art that appears to defy the traditional rules to be 
backed by conservative governments; for ever since 
the intensification of artistic rivalries induced by 
the international exhibitions of the nineteenth cen- 
tury, all governments have felt the need to encour- 
age artistic output. Similarly, museum directors, 
warned by the sad experiences of their predecessors 
who lost so many vital opportunities of enriching 
their collections with modern works at accessible 
prices, now pay increasing attention to contempo- 
rary developments. Especially in the United States, 
such directors have often been indirectly assisted 
by a national policy of taxation that assists patrons 
intending to bequeath their collections to public 
museums—a policy often deplored not only on 
grounds of social morality but also because it has 
curtailed the role of the museum as an arbiter of 
taste. 

All these developments, combined with the more 
cautious attitude of critics, have necessarily robbed 
the avant-garde of its original significance to the 
extent that works which are considered the most 
modern of their day are often rapidly acquired by 
the very people whom they were originally designed 
to challenge. Dissatisfaction with “academic art” 
and a recognition of past misunderstandings, as 
well as the actual nature of much twentieth-cen- 
tury painting and sculpture, have also led to an 
obsession with the act of creation itself (sometimes 
at the expense of the created work), so that uni- 
versities and other educational bodies have been 
anxious not only to acquire contemporary art but 
also to observe the artist at work by incorporating 
him within their communities—often with no spe- 
cific duties. The nature of artistic development is 
as yet too little understood for it to be clear whether 
or not this new relationship with society is a bene- 
ficial one. 

FRANCIS HASKELL 


[See also Creativity, article on SOCIAL ASPECTS; 
STYLE. A guide to other relevant material will be 
found under Art.] 


II 
THE RECRUITMENT AND SOCIALIZATION OF ARTISTS 


“Recruitment” is a term used to refer to the un- 
derlying processes for bringing new members into 
a group. Sociologists are especially interested in 
understanding those social forces that encourage 
individuals to join, or repel them from joining, an 
occupational or professional group and to embark 
on a particular career within that group. 

In many occupations the process of recruitment 
can be readily understood. For example, consider 
the military, from whom the word “recruitment” 
was originally borrowed. We know in clear terms 
the subjective and objective meaning of the pro- 
fessional soldier. We are aware of the symbols, 
such as money, glory, social advancement, travel, 
and adventure, which attract persons to become 
soldiers. We are equally aware of the symbols that 
distract, discourage, and repel the individual from 
this profession—fear, lack of sympathy with the 
cause, discipline, death, separation from loved ones 
and from one’s community. 

By contrast, there is a great vagueness concern- 
ing the recruitment of the artist. It is very difficult, 
first of all, to define the term “artist”; any attempt 
at an essential, or even an operational, definition of 
the artist—an attempt in which many generations 
of aesthetes have failed—is bound to founder (see 
Gabor 1963, p. 148, who makes the same point for 
the definition of “art”). The reasons for this become 
apparent as soon as several crucial questions are 
asked. For example, who is an artist, and why is 
he one? Does it depend on the amount of time 
spent working as an artist? Then how does one 
include individuals like Charles Ives, an American 
composer whose entire working career was spent 
in an insurance company? 

Does working as an artist qualify one to be in- 
cluded in this category? Then we must exclude in- 
dividuals of high artistic potential whose creative 
urge is latent or shows itself in activities other than 
art. Can we rely on contemporary recognition by 
institutions within the art world, such as galleries, 
museums, or private collectors? But to do this 
would exclude practically all the impressionist 
painters, not to mention Rembrandt at the time 
that he created some of his most important master- 
pieces. In view of these difficulties, we must rely 
on connotative and denotative examples in order 
to define our term “professional artist,” while keep- 
ing in mind the shortcomings of this method. 

When speaking of a professional artist, I think 
of Pablo Picasso, Marc Chagall, Alberto Giacometti, 
Ernest Hemingway, William Faulkner, Anton 
Webern, and Arnold Schönberg, to name a few 
outstanding figures from the first half of the twen- 
tieth century. Such men devote their time and 
psychic energy to creative endeavors or would do 
so if circumstances permitted. In other words, their 
primary work is or would be directed toward ex- 
ploring the “fundamental categories” of human ex- 
istence, the attempt to exalt or denigrate authority, 
“to explore or explain the universe, to understand 
the meaning of events, to enter into contact with 
the sacred or to commit sacrilege, to affirm the 
principles of morality and justice and to deny them, 
to encounter the unknown, . . . to stir the senses 
by the control of and response to words, sounds, 
shapes, and colors” (Shils 1960a, p. 290). 

Control of the arts. In some countries and dur- 
ing some eras there has been fairly tight control of 
the arts, both in the number of recruits and in the 
course of their careers. For example, during the 
guild era in Europe no disciple could change mas- 
ters unless his first teacher agreed to break his 
contract, and all disciples had to remain with the 
master for three years (Tomasini 1953, pp. 135, 
140). Furthermore, there was an attempt to es- 
tablish mastership as a hereditary privilege (Dobb 
1947, pp. 116-117). Willetts writes that during 
the existence of the Sung Academy in China (12th 
century) “its appointments seem to have been sine- 
cures with promotions through four ranks.” The 
French Académie Royale de Peinture et de Sculp- 
ture had a similar elaborate hierarchy. In both 
academies “admission and promotion were by com- 
petitive examination. In both, the candidate was 
required to submit his personal version of a set 
theme, after gaining official approval of his pro- 
posed treatment by means of a preliminary sketch” 
(Willetts 1958, p. 518). 

In twentieth-century Europe and the United 
States the arts are characterized by a relative ab- 
sence of centralized occupational control, as com- 
pared, for example, with the academies mentioned 
above or with the medical profession in the United 
States, with its institutional self-awareness, its 
standards of competence and discipline, and its 
relatively stabilized recruitment. Moreover, in these 
countries the demand for the visual arts—and for 
other arts as well—is rapidly and vastly expanding. 
This combination of an expanding market and an 
absence of tight controls suggests that the world of 
art—fine and commercial—must recruit in a rather 
generous, if not excessive, fashion. Unlike certain 
occupations, the world of art need not underrecruit 
for the purpose of controlling a valuable skill, with 
its accompanying monetary rewards, prestige, and 
honor. 

Much of the rather “open” recruitment in art is 
reflected in various gentle practices and policies of 
many art schools in the United States. Probably 
few applicants are refused admittance. Students 
occasionally drop out of school of their own voli-
tion but are rarely expelled as from medical or
engineering schools. There are few or no crucial
tests or other hurdles that a student must pass to 
remain in school. Grades seem mainly to yield en- 
couragement or prestige and seem not to be utilized 
to restrict enrollment or to force the repeating of 
courses. Anyone who has the time and money can 
go to any number of commercial and fine arts 
schools. (These data on art schools, as well as 
the following section on the students’ family back- 
ground and public school experience, are based on 
Strauss 1955a; see also Griff 1960; 1964a; 1964b.) 


Early socialization of the artist 


Because recruitment into the art world is neither 
tightly limited nor carefully controlled, we should 
not be led to assume that artists somehow drift into 
art. There exists a whole social paraphernalia for 
getting persons committed to their artistic identi- 
ties; and the fact that the machinery is not usually 
visible to the person himself does not, of course, 
make it any less real. 

The chief mechanism for pumping a flow of 
talent into art today is the public-school system— 
aided by the art schools and often abetted unknow- 
ingly and unwillingly by the student's family. This 
is true, among other reasons, because nowadays 
artists do not appear to come from very high or 
very low social classes. An artistic career is not 
generally initiated by an education at prep school 
or an elite college, nor do persons of low economic 
standing usually become artists by emulating 
models found in their communities (they are far 
more likely to emulate athletes). Parents in various 
strata introduce their offspring to art, stressing
humanist, hobbyist, and other values; but not many 
parents consider the visual arts a propitious locale 
on which to fight the battle for class and occupa- 
tional success. As a consequence, few parents di- 
rectly influence their children to enter the field of 
art. Moreover, attendance at galleries and museums, 
with or without parents, plays no discernible part 
in recruitment; it is, in fact, a rare art student who 
reports having any art “in his background.” Nor do 
students entering art school usually know artists of 
any kind except their public-school teachers; nor 
are they especially acquainted with art history or 
with the biographies of famous artists. 

Public-school experiences. The public-school art 
teachers begin to exert their influence quite early 
in the career of the artist, generally in grammar 
school. Impoverished or misguided though their 
teaching may be, they may introduce the youngster 
to the satisfactions and delights of drawing and 
painting. These teachers serve to keep interest in 
art alive throughout the school years by bestowing
approval upon the child, singling him out for spe-
cial honors, placing his work in public view, or as- 
signing him honorific tasks, such as decorating the 
blackboards. Some students mention that as early 
as kindergarten, teachers singled them out for 
praise and isolated them from their classmates so 
that they could concentrate on their art. 

In high school the child who has been recognized 
for his artistic virtuosity continues to take art 
courses and often has the opportunity to major in 
art. Art teachers may begin to suggest that he go 
on to art school—a step that otherwise might not 
occur to some—and may procure information and 
even scholarships for their protégés. The high- 
school milieu affords additional prestige, for the 
child may win a school prize, or even a national 
one, or receive acclaim for decorating a stage set, 
drawing for the school paper, and other such ac- 
tivities. 

Saturday morning art classes. Complementing 
the role of the high school in the recruitment proc- 
ess are the typical Saturday morning classes spon- 
sored by the community art museum. These special 
classes, which are to be found in Europe as well as 
in the United States, play a crucial role in encour- 
aging an art career. For school children whose art 
instruction is limited (especially in the private or 
parochial schools, where art instruction may be 
absent), they provide the first exposure to advanced 
courses and qualified teachers. The Saturday 
classes provide students not only with material 
facilities, such as oils, brushes, and models, but 
also with professional instruction and the encour- 
agement offered by the milieu of a museum. Also, 
for both children and adolescents, this may be their 
first (and perhaps only) opportunity to come into 
contact with peers having similar dispositions 
toward art. 

Another important effect of the Saturday morn- 
ing classes is that they single out the artistic per- 
son: that is, they signify to those in his social 
milieu, as well as to himself, that he has been so- 
cially recognized as having an unusual ability, ab- 
sent in others, which has certain rewards. In addi- 
tion, these classes stimulate the nascent attributes 
of independence and freedom from external re- 
straint that are associated with artists, because they 
separate the individual from his social milieu per- 
haps for the first time in his life. This may accel- 
erate for these children the normal personality 
development from unilateral to autonomous think- 
ing, studied by Piaget (1923). Finally, the experi- 
ence of the Saturday classes may reinforce a sense 
of security for some children; in the childhood and 
adolescent world of comparisons, they can brag
about their painting ability, which may, indeed,
be the only outstanding skill they have. 


Parental attitudes 


The portrait that emerges from a study of the 
childhood and adolescent social experiences of 
the individual who possesses artistic skills reveals 
the congruence of encouragement and reward, of 
aspiration and fulfillment. He learns that his contri- 
bution is socially meaningful and constructive, and 
at the same time he is encouraged to exercise freely 
that which he most basically feels is himself. Ex- 
cept for the normal tensions of childhood and ado- 
lescence all the factors favoring the retention and 
nourishment of an artist’s career are present. 

Discontinuities in this pattern become apparent 
when the person declares publicly that he wishes 
to prepare for a career in art. The parents of the 
would-be artist make two very strenuous objections 
to his desire. The first is that the painter cannot 
hope to support himself solely from the sale of his 
paintings and that this will make it impossible for 
him to attain many of the symbols of success that 
families cherish. The second objection is directed
at the bohemian stereotype of the artist, which the
family wishes to avoid because it violates the pro-
fessed mores of our culture. (The fact that artists
often seem to become bohemians is, of course,
closely related to their financial problems.)

When the student’s intention to become an ar- 
tist is discovered, there is a reversal in the attitude 
of his parents toward him. Thus, a crisis is engen- 
dered, which becomes aggravated during the period 
beginning with the announcement of his intentions 
and ending with the termination of his formal edu-
cation. The entire family makes a determined ef-
fort to dissuade him. They remove his paintings
from the walls where they were prominently dis- 
played. Artistic achievements and the large remu- 
nerations for paintings received by artists become 
topics to be avoided. 

It is at this point that the germ of self-estrange-
ment (commonly referred to as alienation) is first
implanted. Going contrary to the wishes of one’s
parents causes one’s conscience to suffer. It suffers
because of the strong affective relationship between 
parents and child, because of intensive cultural 
imperatives emanating from the mores and rein- 
forced by religious institutions—particularly the 
dogma concerning filial duties—and because of 
the acute sensitivity and imagination characteris- 
tic of the artist qua artist. While the exertion of the 
individual toward independence is normal during 
this period, this breaking away may have added 
significance for the career of the artist. Rank, for
example, in discussing the creation of individuality,
said:


The gradual freeing of the individual from dependence 
by a self-creative development of personality replaces
the one-sided . . . dependence on the mother. . . . The 
person in the third and highest level of development, 
the creative type, such as the artist or the philosopher, 
creates a world for himself which he can accept with- 
out wanting to force it on others; and he accepts him- 
self. Such a person creates his own inner ideals, which 
he affirms as his own commandments. At the same 
time he can live in the world without falling into con- 
tinual conflict with it. (1932; quoted in Mullahy 
1948, pp. 177, 183-184) 


There appears to be an inconsistency in parental 
attitudes toward the child who chooses the artistic 
career. It has been stated that the young artist is 
encouraged and rewarded and that his early suc- 
cess in art is a source of gratification for his par- 
ents as well as for him. However, the parents’ later 
disapproval of his career choice becomes under- 
standable if the difference between school success 
and occupational success is considered. School suc- 
cess is confined largely to the yardstick of the re- 
port card (“My daughter gets all A’s in art”), and 
there is also recognition in the form of prizes, arti- 
cles in the local newspaper, or acknowledgment on 
graduation programs. However, when a child grad- 
uates from high school new standards and criteria 
of judgment are applied, which are associated with 
symbols of financial success and social prestige. 
Art is not an avenue leading to the attainment of 
these symbols, and parents know this. Moreover, 
these reactions reflect some significant attitudes
toward the fine artist and toward art in contempo- 
rary culture. 

Cultural variations. Contemporary industrial 
culture stresses conformity, respectability, ration-
ality, practicality, and security. These are a few of
the essential values incorporated in the cultural
complex that Max Weber called “rational bourgeois
capitalism.” Art and the life of the artist are antin-
omies of this. Fine art is nonutilitarian. Moreover,
the artist tends to be nonconformist and to violate 
many of the behavioral patterns prescribed by so- 
ciety. His choice of career opposes the success 
theme so strongly emphasized and so pervasive in 
contemporary culture. The parents of artists may 
themselves have failed to achieve success, and they, 
with all good intentions of loving parents, want 
their children to avoid the disappointments that 
they have experienced. Also, to parents who value 
success for their children as an affirmation and 
confirmation of their worthiness as parents, there 
is little comfort in the notion that the young artist
may achieve recognition posthumously. Yet, this is
frequently all the young artist can offer as justifi- 
cation for the sacrifices he proposes. 

Parental opposition to a career in art is neither 
recent nor geographically confined; nor, for that 
matter, is parental encouragement. The way in 
which a son is forced to struggle against his parents 
because he suffers from an internal ebullition of 
artistic desire, which they try to suppress, has be- 
come a classic ritual (Fry 1929, p. 6 in the 1958 
edition). Yet, instances of the encouragement of 
artistic careers have also been notable (Haskell 
1963, pp. 20-21; Tomasini 1953). The experience 
of the contemporary composer Gian-Carlo Menotti 
provides an example: 


How well I remember as a child watching the pro- 
found sorrow of all Milan as the funeral of Puccini 
passed through the streets. It was a loss for each of us 
as well as for our country. No wonder that a young 
Italian boy's wanting to be a composer could only be 
a source of satisfaction and pride to his family and 
friends. But the family’s regard for music was only a 
reflection of the general public’s esteem in which the 
composer was held in Italy. (1953, pp. 42-43) 


The explanation for this variation in attitudes is 
open to debate and awaits further research. Two
hypotheses, however, immediately come to mind. 
One is suggested by Menotti: 


It is my contention that the average American has 
little or no respect for the creative artist and is apt to 
consider him as an almost useless member of the 
community. The average American father still views 
with dismay the fact that one of his sons may choose 
to become a composer, writer, or painter. He will con- 
sider any such pursuit a sign of “softness,” an un- 
manly and, I venture to say, un-American choice. 

I must add in all frankness that this hostility toward 
the arts is not uncommon in Europe within a certain 
class of society. But it exists only in a very small per- 
centage of the population, mostly among the nouveaux 
riches and the very orthodox members of the aristoc- 
racy who still feel that it is more noble to patronize 
than to create. Moreover, even in this latter small 
moribund class, artistic activity is at least looked upon 
as an essential element of gracious living rather than 
the adornment of uneventful Sunday afternoons. 
(1953, pp. 39-40) 


An alternative hypothesis is suggested by an 
artist turned popular writer: “Art needs a proper 
climate. The average Frenchman is no more artis- 
tic than the average American. . . . But the French 
climate is good for art, because in France an artist 
isn’t expected to earn as much as a stockbroker. 
He is justified in his existence even if he is just 
a little artist. He doesn’t have to be a Picasso. He
counts as a necessary human factor although he
hasn’t reached the very top” (King 1958, p. 8).


Preparation for a career in art


After graduating from high school the neophyte 
artist enrolls in an art school or art institute, where 
he takes a basic course in the fundamentals of 
drawing, painting, and illustration. There he is 
thrown together with a large group of individuals 
having a variety of talents and backgrounds. For 
many, the first year is a joy. They work and learn 
(perhaps for the first time) in a milieu that is per- 
meated with art. The school may be in a museum, 
or at least very near one, and the students are so- 
cially united by their common interest in art. 

Specialization ensues during the subsequent 
years. In the second year, the students are divided 
into fine arts, commercial or applied arts, and art 
education sections. Further specialization takes 
place during the third and fourth years, but the 
most important specialty, in terms of the recruit- 
ment process, is fine arts. 

The fine arts student searches for a teacher com-
patible with his temperament both as an artist and as
an individual. Some seek and readily accept tech- 
nical instruction; others expect to get instruction 
only upon request; still others are hostile to any 
instruction, asseverating that it is a violation of 
artistic mores for an instructor even to touch their 
canvases. 

Competition for grades is discouraged because of 
the inherent difficulty in judging art; both talented 
and untalented students, as well as instructors, 
mention this. Usually, the only institutionally 
fostered competition that is meaningful is the 
rivalry for the traveling fellowships, which takes 
place during the senior year. Both students and 
instructors have suggested that some students de- 
liberately alter their natural style during the fellow- 
ship competition in order to cater to the known 
tastes of the judges. However, in general, we can 
say that the art school does not constitute a dis- 
ruptive force, in terms of the unity of the self, as 
other types of schools frequently do. In other words, 
the intensive competition that tempts or forces stu- 
dents to engage in subterfuge or parroting their 
mentors in order to receive passing or superior 
grades is not present in the art school. What 
emerges in the art school is a consistency between 
self and fulfillment. Thus, the self-estrangement 
that for the average person begins with formal 
schooling and continues indefinitely does not, for 
the most part, affect the art student. 

Vocational choices. As the student nears the 
end of his academic training he becomes anxious 
over what he will do when he graduates. Some may 
succumb to the prodding of their parents (or fian-
cées or wives) and take a few commercial art
courses before they graduate. For the same reasons, 
others may take courses in art education or may 
return for a year after they graduate to take courses 
in art education. Still others pass through four 
years of training and delay their decisions until 
they graduate. A very few will win traveling fel- 
lowships lasting for a year or two and thereby post- 
pone their decisions temporarily. A few, both those 
who have completed their fellowships and those 
who did not win one, will find galleries that will 
give them small subsidies. Eventually, however, 
each must find a source of steady income. Some 
will find work in the post office or in stores and will 
paint in the evenings or on weekends. A prolonged 
siege of this work is disheartening, especially when 
the individual realizes that he may have to do this 
for many years—that is, until he is recognized— 
or that he may have to work at whatever he is 
doing for the rest of his life. In the end, most return 
to art in secondary ways—some as art educators 
and many as commercial artists. 

Whichever alternative is chosen, it will be a 
secondary choice and one that implies taking a job 
which the person feels is not his basic calling. It 
is at this point in the recruitment process of the 
artist that the state commonly referred to as aliena- 
tion begins to be seriously experienced. As men- 
tioned above, the first signs of alienation appear in 
connection with the students’ conflict with their 
parents, as they finish high school, but in art school 
this parental opposition is usually overcome or dis- 
regarded. However, there is this important differ- 
ence between finishing formal art school and finish- 
ing high school. While parents will strenuously 
object to their child’s going to art school, they will, 
in most cases, continue to support their child be- 
cause the mores demand it. After graduation, how- 
ever, there are no moral imperatives necessitating 
continued support and certainly none that would 
obligate the parents to support the child indefi- 
nitely. It must also be pointed out that it is at this 
point in the student's life that the philosophies to 
which he has been exposed in his intellectual quest 
as artist—no matter how ingenuous they may ap- 
pear—begin to fructify. 

These circumstances raise a number of impor- 
tant questions concerning the artist. One of the 
most important of these is, What happens to one’s 
artistic identity when one enters various types of 
employment? Griff (1960; 1964b) has sought the 
answer to this question in a study of commercial 
(i.e., advertising) artists. Commercial artists were
selected for study because it was believed that these
people had chosen the extreme alternative among
the possibilities available to the artist. It was felt
that the role of the commercial artist differs in so 
many fundamental ways from that of the fine 
artist that a study of it could serve as a paradigm 
for what happens in the case of less extreme voca- 
tional choices. 


Art and advertising 


There are few institutions whose basic values
are more opposed to one another in all fundamen-
tal respects than are those of art and advertising.
Their vocabularies, their traditions, their standards
and rules, beliefs and symbols, criteria for the se-
lection of subject matter and problems, modes of
presentation, and canons for the assessment of excel-
lence are all antithetical.

The traditions of art seem by their very nature to 
entail a measure of tension with the traditions of 
advertising and its close allies, commerce and in- 
dustry. The very intensity and concentration of 
commitment to these value orientations reflect the 
distance that separates the two. Art, like genuine 
religion, continues to be vitally concerned with the 
sacred or the ultimate ground of thought and ex- 
perience and the aspiration to enter into intimate 
contact with it. Thus, art involves the search for 
truth, for the principles embedded in events and 
actions, or for the establishment of a relationship 
between the self and the essential, whether the re- 
lationship be cognitive, appreciative, or expressive 
(Shils 1960b). 

In contrast, the traditions of advertising empha- 
size the selection of the mundane as subject matter; 
its presentation in terms of immediate percep- 
tion as well as by the use of standardization, classi- 
fication, and clichés; the necessity for certainty of 
cognition and direction; the deliberate attempts to
manipulate through the use of visual stimuli based
upon emulation, fear, and sex; and the assessment
of excellence in terms of attention value, consump-
tion, and fidelity (called brand loyalty) to the prod-
uct of one’s client.

In addition, there are innumerable technical dif- 
ferences between advertising and the fine arts, 
based upon the problems of graphic reproduction. 
The commercial artist must consider the different 
qualities of papers and printing inks and the effect 
these will have on the final version of his work. He 
must consider that the painting as seen in the orig- 
inal and then on the printed page will be different, 
because the flat surface of the paper eliminates the 
third dimension together with the brush strokes 
and canvas pores. Since it is necessary to make the 
product and accompanying message stand out 
(known technically as highlighting), a sharp linear
style is used; a firm principle of advertising art is
recognition through a silhouette. 

The differences in compositional techniques are 
also readily apparent. For example, in many auto- 
mobile ads there appears to be an illusion of deep 
space, but upon careful analysis it can be seen that 
the eye does not move from the front plane to deep 
space; compare this to a painting by a Dutch mas- 
ter, where the eye moves into the interior from the 
front of the room and then into the distance 
through an open door or window. In the advertise- 
ment the automobile covers the horizon, extends 
along most of the composition, and blocks the eye 
from easy recession into the background. Even the 
space in the background is indeterminate and has 
the function of forcing the nearer objects toward 
the eye, thus forcing the viewer to concentrate on 
the advertised product (Parker 1937, pp. 119-122). 

Role identification and artistic values. Com- 
mercial artists range in type from those who remain 
symbolically attached to the role of the fine artist 
to those who reject this role completely. Between 
these two extremes lies a third group that identifies 
with both roles (see Griff 1960, pp. 219-241, for 
an extended account of the three roles mentioned 
here). 

The traditional role. Some artists work as com- 
mercial artists, but subjectively identify themselves 
as fine artists “temporarily” engaged in the com- 
mercial field. In other words, they identify with 
what they regard as the artist’s traditional role; 
their rationalization for working in the commercial 
field is that the normative standards of contempo- 
rary society preclude any economically feasible al- 
ternative. Underlying this assumption is the belief 
that society will give lip service to, but in reality 
will not support, the artist’s primary identity—that 
of the fine artist as he emerged at the end of the 
nineteenth century. In supporting their premise 
they cite numerous examples, both past and pres- 
ent, of other individuals who have tried, without 
success, to live exclusively from their paintings. In 
addition, there is the actual attempt by many of 
them to live from the sale of their paintings. Find- 
ing this to be an impossibility, they may have 
turned to commercial art. Also, there is the further 
realization that other aspects of the reward system 
of society are absent (i.e., prestige, praise, or ad-
miration of others for their dedication to their
ideals). Instead, in many cases, negative sanctions 
have been applied against them in the form of 
derision and questions concerning the practicality 
of their labors, as well as their “sanity.” 

The commercial role. Artists who assume the
“commercial role” see art (both fine and commer-
cial) as a utilitarian product, and they conceive
of themselves as instruments for the transforma- 
tion of verbal symbols (dictated by a client) into 
visual ones. Consequently, their ideological orien- 
tation is directed toward pleasing and satisfying 
their clients. Conceiving their roles in this manner 
the commercial-role artists refrain from interject- 
ing or altering in any way the intent of their clients 
unless specifically directed to do so. Thus, they ac- 
cept a norm that is not only upheld by their occu- 
pational group but also sanctioned by the larger 
society: The customer is always right. In carrying 
out this belief, they define their roles as having 
been successfully fulfilled when the requirements 
of their clients have been met as parsimoniously as 
possible. This means the creation of illustrations 
in the quickest and cheapest fashion. 

In contrast to the attitudes held by the tradi- 
tional-role artists, the commercial-role artists reject 
the notion that they are artists working in the com- 
mercial field because of extenuating circumstances. 
On the contrary, they believe that the traditional 
role is a nineteenth-century anachronism and 
therefore should be discarded by the contemporary 
artist. 

The compromise role. The “compromise role” 
is a mixture of both the traditional and the com- 
mercial roles. Like the commercial-role artists, 
those who assume the compromise role believe that 
they are instruments of the clients; however, they 
conceive of themselves as active, rather than as 
passive, agents. In carrying out this conception of 
themselves, they translate the demands of the 
client but at the same time attempt to persuade 
him to accept innovations, specifically the inter- 
jection of fine arts symbols into their illustrations. 
Thus, many feel that they are involved in a cru- 
sade for better art. They believe that by raising the 
standards of their clients’ art, they are at the same 
time raising the level of taste of the public. 

The resemblance between the compromise role 
and the traditional role lies in the fact that both 
are concerned with fine arts. However, the artists 
who choose the compromise role do not consider 
the commercial field onerous or harmful to fine 
arts or believe that their status should be independ- 
ent of the secular realm. They believe, as do the 
commercial-role artists, that their legitimate posi- 
tion in society lies in the commercial field. They 
reflect to a great extent the Bauhaus definition of 
the artist. This definition stresses the importance 
of the artist's working in a society rather than being 
isolated from it. The definition also stresses that 
the artist, while working in a society, should exert 
all his technical, moral, and aesthetic skills to in-
fluence the products of mass production (Moholy-
Nagy 1929).


The study of the recruitment of the artist is part 
of a broader field called the sociology of art. This 
field is still in its infancy, and much fundamental 
work must be done before its outlines, contribu- 
tions, and limitations will be known and under- 
stood. Griff's study (1964a) of art students in 
Chicago is a step in that direction; further research 
will confirm, invalidate, or suggest revisions of it. 
Meanwhile, what is needed now are similar data 
from the other fine arts and from those areas that 
are emerging as new fine arts, such as photography 
and the film. In this connection, important research 
could be accomplished by studying how an area 
emerges as a fine art. Other important questions 
and areas of needed research are: What are the 
career patterns of successful and unsuccessful 
artists? What are the social conditions responsible 
for the fame of an artist, the capriciousness of 
fame, and immortality; the social conditions hin-
dering or encouraging creative endeavors; the rela-
tionship between contemporary patrons and styles
of art; the social and psychological conditions im- 
peding artists or aiding them to break with tradi- 
tional and acceptable styles of art, thus innovating 
new styles; and the social roles of critics and pa- 
trons? Once research on these questions has been 
undertaken, it will be possible to clarify the more 
significant and elusive questions, such as whether 
the artist is a reflection of his society, or whether 
art is a reflection of the age.

MASON GRIFF 


[Directly related are the entries CREATIVITY and PRO-
FESSIONS. Other relevant material may be found
in ARCHITECTURE; CRAFTS; FASHION; LITERATURE;
STYLE; and in the biographies of LUKÁCS and RANK.]


FIRM, THEORY OF THE


The theory of the firm is that branch of economic 
theory which deals with the determination of the 
most important economic variables associated with 
the individual business unit, such as price, output, 
and growth. There are no readily defined bound- 
aries for the theory, although it is usually distin- 
guished from the theory of production, which deals 
with the selection of inputs and techniques of pro- 
duction by the firm; from programming, which ex- 
plains how optimal techniques of production may 
be discovered; and from the study of business en- 
terprise and the corporation, which have a more 
institutional focus [see CORPORATION; PRODUCTION;
PROGRAMMING]. The theory of the firm is closely 
associated with the concept of the industry; the 
industry is composed, roughly, of those firms pro- 
ducing similar products. The relationship between 
the firm and the industry will be described in some 
detail later in this article. 

The theory of the firm has a considerable his-
tory in economic thought, going back at least as
far as Cournot (1838) and appearing implicitly 
a good deal earlier. The development of consumer 
theory and the basing of economics upon a theory 
of individual choice obviously called for an analo- 
gous development in the treatment of units of pro- 
duction, and after 1900 more and more attention 
was directed to the theory of the firm. 

There are many possible approaches to the theory 
of the firm. These approaches may be categorized 
in various ways, the most fruitful of which seem 
to result from the division between: (a) static and 
dynamic approaches; (b) those theories that de- 
rive their results from the assumption of profit 
maximization and those that do not; and (c) de- 
terministic and probabilistic theories. 

Static and dynamic approaches. By a static 
theory of the firm we mean a theory that describes 
the characteristics of a firm in a position of equi- 
librium. We will deal first with the case in which 
this equilibrium arises when the firm is maximizing 
its profits. 

The first part of the static theory to be well devel- 
oped was the theory of the price and output policy 
of a profit-maximizing monopolist. A monopolist 
is defined as the only supplier of a particular “com- 
modity.” He is therefore confronted by a market 
demand curve giving a total revenue (R) that var- 
ies with output: 


(1) R = f(x),


where x is the quantity produced and sold. (Inven- 
tories are seldom allowed for in the simple theory.) 
Total costs (C), which include normal profits, are 
also affected by output. Normal profits are defined 
as those only just sufficient to keep the entrepre- 
neur and his firm in the given industry: 


(2) C = g(x).


Profit is maximized when R − C is maximized,
when


(3) f'(x) − g'(x) = 0, f''(x) − g''(x) < 0,


i.e., output is set at a profit maximizing level when
marginal cost equals marginal revenue and mar- 
ginal cost is rising faster (or falling less quickly) 
than marginal revenue. 
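
The marginal conditions can be checked numerically. The following is a minimal sketch in Python, assuming a linear demand curve and a quadratic cost function; these functional forms, the parameter values, and all function names are illustrative assumptions and are not drawn from the theory itself.

```python
# Illustrative only: the linear demand curve and the quadratic cost
# function are assumptions chosen for this sketch.

def revenue(x, a=100.0, b=1.0):
    """Total revenue R = f(x) for an assumed demand curve p = a - b*x."""
    return (a - b * x) * x


def cost(x, fixed=50.0, c=10.0, d=0.5):
    """Total cost C = g(x), assumed quadratic so that marginal cost rises."""
    return fixed + c * x + d * x * x


def profit(x):
    """Profit R - C at output x."""
    return revenue(x) - cost(x)


def derivative(fn, x, h=1e-4):
    """Central-difference approximation to fn'(x)."""
    return (fn(x + h) - fn(x - h)) / (2 * h)


def second_derivative(fn, x, h=1e-3):
    """Central-difference approximation to fn''(x)."""
    return (fn(x + h) - 2 * fn(x) + fn(x - h)) / (h * h)


def profit_maximizing_output(lo=0.0, hi=100.0, tol=1e-9):
    """Bisect on marginal profit, which is decreasing for these functions."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if derivative(profit, mid) > 0:
            lo = mid  # marginal revenue still exceeds marginal cost
        else:
            hi = mid
    return (lo + hi) / 2


x_star = profit_maximizing_output()
marginal_revenue = derivative(revenue, x_star)
marginal_cost = derivative(cost, x_star)
# Second-order condition: f''(x) - g''(x) should be negative.
second_order = second_derivative(revenue, x_star) - second_derivative(cost, x_star)
print(x_star, marginal_revenue, marginal_cost, second_order)
```

With these assumed figures marginal revenue is 100 − 2x and marginal cost is 10 + x, so the two are equal at an output of 30, where marginal revenue is falling and marginal cost is rising.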

Perhaps one of the most striking things about 
this solution is that it makes no assertion about the 
level of profits. In fact, it is often assumed that the 
monopolist must make in the normal way profits 
that are in some sense “supernormal.” 

It was the achievement of Pigou (1928) and 
others to see that some of the results obtained 
above are in fact of wider applicability and hold
for the static theory of the firm in perfect compe-
tition as well as in monopoly. The chief differences
between the two cases arise from the fact that in 
perfect competition price is treated as fixed, so that 
marginal revenue equals price. Thus 

(4) R = Px, f'(x) = P,

where P is the price of the product as seen by the
firm. It follows that

(5) g'(x) = P, g''(x) > 0.

This is to say that in perfect competition marginal
cost equals price and that marginal cost is rising 
at the point of equilibrium output. Since in perfect 
competition entry is free and all factors of produc- 
tion are freely available at given prices, it is also 
deduced that average cost is equal to price in equi- 
librium and that average cost is at its lowest at the 
point of equilibrium output. 
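
As a worked illustration of these conditions, suppose (purely as an assumption for the example) that every firm has the total cost function g(x) = F + cx + dx², with F > 0 and d > 0, where F is a fixed cost that includes normal profits. Then:

```latex
\begin{align*}
\text{average cost:}\quad & \frac{g(x)}{x} = \frac{F}{x} + c + dx, \\
\text{minimum of average cost:}\quad & -\frac{F}{x^{2}} + d = 0
  \;\Longrightarrow\; x^{*} = \sqrt{F/d}, \\
\text{marginal cost at } x^{*}:\quad & g'(x^{*}) = c + 2d\,x^{*} = c + 2\sqrt{Fd}, \\
\text{average cost at } x^{*}:\quad & \frac{g(x^{*})}{x^{*}}
  = \sqrt{Fd} + c + \sqrt{Fd} = c + 2\sqrt{Fd}.
\end{align*}
```

Marginal cost and minimum average cost coincide at x*, so a long-run price of P = c + 2√(Fd) satisfies both conditions at once, and g''(x) = 2d > 0 confirms that marginal cost is rising. With the hypothetical values F = 50, c = 10, and d = 0.5, for instance, x* = 10 and P = 20.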

The above results are very important in welfare 
economics and form a part of the basis for assert- 
ing the optimality of perfectly competitive systems. 
[See WELFARE ECONOMICS.] But this fact has led
to much confusion in the theory of the firm. It has 
led some writers to the paradoxical view that the 
theory of the firm is merely a part of welfare eco- 
nomics and that no evidence of the actual behavior 
of firms is relevant to the theory. Others have been 
unwilling to accept any evidence that tends to cast 
doubt on the universality of perfect competition in 
the real world. Both of these pitfalls should be care-
fully avoided.

It was pre-eminently Chamberlin’s contribution 
(1933) to the static theory of the firm to notice 
that some degree of monopoly power is consistent 
with free entry and that the equality of average 
costs and prices is consistent with monopolistic 
equilibrium. (This point was noted also by Robin- 
son [1933], but she was, on the whole, more con- 
cerned with considering certain special aspects of 
monopoly theory.) Chamberlin centered the bulk 
of his attention on achieving the marriage of mo- 
nopoly and competition, and he proceeded in an 
elaborate diagrammatic exposition to show how it 
could be accomplished in certain hitherto unex- 
plored situations. 

The Chamberlinian relationship which is best 
known is summarized in Figure 1. Here all firms 
are assumed to have identical cost curves and iden- 
tical demand curves. CC’ is the average cost curve 
of any one of the firms. DD’ is the demand curve 
confronting any one of the firms, after allowance 
is made for exit and entry of firms and for the as-
sumption that all firms charge identical prices; dd’
is the demand curve that would face the firm if
it could alter its prices without action or reaction 
by its competitors. In equilibrium the number of 
firms must adjust until all firms make only normal 
profits, and each firm still remaining in the indus- 
try believes that it is making maximum profits. 
This will occur at point T in Figure 1 if each firm 
believes its rivals will ignore its price policy. 

Chamberlin’s argument for the universal appli- 
cability of his models seems, in retrospect, some- 
what unconvincing. Even if the economic world is 
in the main some blend of monopolistic and com- 
petitive elements, there is no reason to think that 
it is the very simplified blend that Chamberlin has 
assumed. If the world is more complex than Cham-
berlin has surmised, it is at least possible that
other models, and perhaps simpler models, may be 
more useful predictors of economic behavior. The 
merits of any consistent model are not to be de- 
termined so much by argument as by performance. 

An alternative to Chamberlin’s approach has 
emerged from the development of the theory of 
oligopoly. Indeed, some writers seem to claim that 
oligopoly is in fact the norm in nonperfectly com- 
petitive markets. Unfortunately there are many 
competing oligopoly theories; they are not always 
presented in testable form; those that are testable 
in principle would often be difficult to distinguish 
empirically; and little testing has so far been at- 
tempted. 

Figure 1. Equilibrium under monopolistic competition

The dynamic theory of the firm has, on the other
hand, received only limited treatment. The out-
standing contribution to the subject is probably
contained in Hicks (1939). Hicks’s treatment is, 
however, confined to a discussion of intertemporal 
equilibrium. The question of the path of adjustment 
to a new equilibrium position, or the extent to 
which the costs of the firm depend upon the pro- 
duction experience of the firm, has been largely 
ignored until recent years (Clower 1959; Arrow 
1962). 

Profit maximization. Next we turn to the con- 
troversy over the role of profit maximization in the 
theory of the firm. The assumption of profit max- 
imization has proved a very useful starting point 
for investigations in many areas closely connected 
with the theory of the firm, particularly in the 
theory of investment and replacement. But some 
economists have been suspicious of the profit max- 
imization assumption and have preferred to substi- 
tute something else. This substitute has usually 
taken one of two forms: utility maximization or 
behavioral relations. The former is generally used 
in static equilibrium systems, the latter are usually 
not. An example of utility maximization is pro- 
vided by the work of Scitovsky (1943). He shows 
in effect that if entrepreneurs maximize utility 
rather than profit, if utility is influenced by effort, 
and if effort is related to the size of output, then 
the traditional deduction that average costs are at
a minimum in the equilibrium of perfect competition
will be falsified.

An alternative utility maximization hypothesis 
asserts that utility is affected by the level of ex- 
pected profits and by the variance of these profits. 
If variance decreases with output, the entrepreneur 
might, for example, decide to operate at outputs 
above minimum cost output, even under conditions 
of perfect competition. 

One group of writers, of whom the best known 
is Papandreou (1952), has tended to argue that a 
large number of considerations will enter the utility 
function of the entrepreneur, so that propositions 
derived from profit maximizing assumptions are 
bound to be misleading. This approach is no doubt 
highly plausible, but the advantage of plausibility 
is purchased at the expense of any meaningful hy- 
pothesis. Putting it another way, while we increase 
the immediate acceptability of the theory of the 
firm by increasing the number of variables that the 
firm is said to consider in maximizing utility, we 
achieve this by reducing the number of testable
results of the theory.

Some writers have felt that the deduction of the 
behavior of the firm from static equilibrium models 
is beyond the present powers of the economist and 
have instead directed their attention to the formu-
lation of behavioral hypotheses about the firm. Such
hypotheses may or may not be consistent with the 
profit maximization hypothesis. One of the leading 
species of this genus is the so-called “full-cost 
pricing principle.” Roughly, this asserts that prices 
of manufactured products will be set at a level that 
will yield a “normal” profit to a firm operating with 
modern equipment at a high level of capacity utili- 
zation. A great deal of effort has been expended in 
trying to show that such behavior is consistent with 
profit maximization. However, the behavioral hy- 
pothesis itself, which was first suggested by inter- 
views with businessmen, cannot be said to have 
been subjected to rigorous statistical testing. 

Somewhat similar remarks can be made about 
many of the alternative behavior hypotheses that 
have been offered for consideration. It has, for 
example, been suggested that firms aim at maxi- 
mizing their share of the market rather than their 
profits. It has also been suggested that firms “satis-
fice,” that is, that they seek to increase profits only
if profits fall below a certain “acceptable” level. In 
both these cases it can be said that discussion has 
been aimed more at finding justifications for these 
hypotheses than at testing their validity by obser- 
vation. A similar difficulty has plagued efforts to 
develop a behavioral theory of the growth of the 
firm. 

The theories discussed above have been con- 
cerned with explaining only the main features 
of the behavior of firms. Recently, however, effort 
has been devoted to the development of models that 
could predict the detailed conduct of business units 
in particular aspects of their work. Thus the de- 
tailed pricing policy of a particular department of 
a department store may be studied and a formula 
found that will enable us to predict this behavior. 
Much of this work has been stimulated by the sug- 
gestions of H. A. Simon (1959), and it remains 
until now largely descriptive in character. Little 
effort has been directed toward rationalizing the 
observed behavior in terms of profit maximization 
or utility maximization. It may indeed be surmised 
that simple explanations of detailed behavior pat- 
terns would be difficult in models excluding uncer- 
tainty and decision costs. 

Probabilistic theories. Another line of develop- 
ment in the theory of the firm involves a twofold 
departure from the neoclassical formulation. First, 
the notion of static equilibrium of the individual 
firm is abandoned. Second, the focus changes from 
deterministic theories of the behavior of a single 
firm to probabilistic rules about the behavior of 
large groups of firms. This approach is also behav-
ioral in orientation. While there may be a recogni-
tion of the desire for profit maximization, its effect
is in many individual cases swamped by a myriad 
of other forces. As a result, the impact of any 
change in parameters at best changes the prob- 
abilities that govern the conduct of the firm. 

The originator of this approach to the theory of 
the firm appears to have been Alfred Marshall. In 
the second edition of his Principles of Economics 
(1890) Marshall introduced the notion of the “rep- 
resentative firm” to explain the determination of 
long run supply price. The representative firm is 
not any particular firm; rather it is a descriptive 
fiction by means of which Marshall attempted to 
amalgamate a dynamic probabilistic theory of the 
firm with an essentially static and deterministic 
theory of price determination for the industry. 

Marshall’s attempt to harness the static with the 
dynamic approaches was vigorously rejected by 
most of his successors. Foremost among the critics 
was Robbins (1928), who objected correctly but 
irrelevantly that Marshall’s implied theory of the 
firm was inconsistent with static equilibrium as- 
sumptions. 

Marshall’s approach fell into disuse for nearly 
25 years but was re-established by P. K. Newman 
and J. N. Wolfe (1961). They showed that the 
Marshallian doctrine could be interpreted in terms 
of the stationary state of a probabilistic (Markov) 
process. [See MARKOV CHAINS.] It was shown that 
if the probabilistic laws governing the rise and de- 
cline of firms were affected by the price of the prod- 
uct, then a particular size distribution of firms and 
consequently a particular output would emerge 
from each price. It will be seen that this formula- 
tion allows by means of a single process the simul- 
taneous determination of price, output, and the size 
distribution of firms. 

More concretely, we may imagine the probability 
that a firm will change its size in any period as 
being governed by a “transition matrix,” or matrix 
of transition probabilities, A. This takes the follow-
ing form:

          1      2     ...    n
    1   a_11   a_12   ...   a_1n
    2   a_21   a_22   ...   a_2n
    .    .      .            .
    n   a_n1   a_n2   ...   a_nn

where a_12, for example, is the probability that a
firm in the first size-class at a particular period will
be in the second size-class one period later. (The
figures on the left-hand margin and across the top
of the matrix are of course the various size-
classes.)


If the matrix A obeys certain so-called Markov- 
ian assumptions, its continued application to a 
large group of firms will ultimately produce a 
definite size distribution that is independent of 
their initial distribution. And if the entries in the 
transition matrix are not fixed but are different 
for each price prevailing, there will be a different 
size distribution of firms for each particular equi- 
librium price. [See MARKOV CHAINS.] 
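
The two properties just described can be illustrated with a short Python sketch. Everything specific in it is a hypothetical assumption made for the example: the three size-classes, the numerical transition probabilities, and the rule by which a higher price raises the probability of growth. It shows only the mechanics of a regular Markov chain, not the construction used in the literature cited.

```python
# Hypothetical illustration: three size-classes with invented transition
# probabilities that depend on the price of the product.

def transition_matrix(price):
    """Return a row-stochastic matrix A(price).

    Entry [i][j] is the probability that a firm in size-class i moves to
    size-class j in one period. A higher price is assumed to raise the
    probability of moving up one class.
    """
    up = min(0.1 + 0.05 * price, 0.6)  # assumed probability of growing
    down = 0.3                         # assumed probability of shrinking
    return [
        [1 - up, up, 0.0],
        [down, 1 - up - down, up],
        [0.0, down, 1 - down],
    ]


def step(dist, matrix):
    """Apply the transition matrix once to a size distribution."""
    n = len(dist)
    return [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]


def stationary_distribution(matrix, start, tol=1e-12, max_iter=100_000):
    """Apply the matrix repeatedly until the distribution stops changing."""
    dist = list(start)
    for _ in range(max_iter):
        nxt = step(dist, matrix)
        if max(abs(a - b) for a, b in zip(nxt, dist)) < tol:
            return nxt
        dist = nxt
    raise RuntimeError("no convergence; the chain may not be regular")


# Same price, different initial distributions: the limit is the same.
low_price = transition_matrix(2.0)
from_small = stationary_distribution(low_price, [1.0, 0.0, 0.0])
from_large = stationary_distribution(low_price, [0.0, 0.0, 1.0])

# A different price gives a different limiting size distribution.
high_price = transition_matrix(6.0)
at_high_price = stationary_distribution(high_price, [1.0, 0.0, 0.0])

print(from_small, from_large, at_high_price)
```

In this sketch the limiting distribution at the lower price puts most firms in the smallest class, while at the higher price most firms are in the largest class, whatever distribution the industry starts from.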

This sketch of the probabilistic theory of the firm 
is necessarily incomplete. It will, however, be seen 
that this theory has the merit of dealing with the 
firm more nearly in the context in which it appears 
in real life than does a static theory. Nevertheless, 
the theory remains highly abstract, and its practical 
usefulness has yet to be demonstrated. 

It will be apparent that the theory of the firm 
is moving increasingly in the direction of empirical 
science—that is, toward the formulation and test- 
ing of hypotheses about how firms actually behave. 
This is true whether the hypothesis being put for- 
ward is cast in deterministic or probabilistic form. 
It is at the same time less tied to the assumption 
of profit maximization. But profit maximization 
has not lost its usefulness, however frequently it 
may seem to be contradicted as a descriptive hy- 
pothesis. Profit maximization will always be the 
aim of at least some firms during at least some 
portion of their history. And so we would expect
that profit maximization would continue to play 
an important role in the normative aspect of the 
theory of the firm. 

The firm and the industry. The notion of the 
industry goes far back in economic thought; it 
probably derived intact from the market place. By 
the time of Marshall’s Principles of Economics 
(1890) it had come to occupy an important place 
in economic theory. The industry was there defined 
as a group of firms producing an identical product. 
This definition is highly ambiguous, but to some 
it seemed to imply a homogeneity of output that 
was most easily related to a state of perfect com- 
petition. Chamberlin’s assumption of identical cost 
and demand curves for all firms enabled him to 
work with “groups” that behaved rather like Mar- 
shallian industries. But the abandonment of per- 
fect competition created difficulties for the concept 
of the industry, once differences in cost and de- 
mand conditions between firms were admitted. In 
the first place, firms might be selling apparently 
identical products at different prices. Moreover, as 
Robinson (1933) pointed out, a shift of demand 
in a perfectly competitive industry would (in the 
absence of external economies at any rate) have 
an unambiguous effect upon price and output,
while in an imperfectly competitive “industry” it
might not. The latter seemed a particularly dam- 
aging point, since one of the main motives for the 
development of the theory of the firm seems to 
have been to obtain greater certainty about the 
shape of the industry supply curve. Finally, Triffin 
(1940) argued that general equilibrium under 
conditions of nonperfect competition admitted no 
clear conceptual division between industries. Per- 
haps partly as a result of these arguments there has 
been a tendency for theoretically oriented econo- 
mists to avoid using the concept of the industry 
and to some extent to neglect the study of the 
determinants of supply. 

From a pragmatic point of view these develop- 
ments seem ill judged. Ever since Plato and 
Aristotle, philosophers have been vexed by the 
problems of finding useful categories under which 
the large numbers of slightly differing objects 
we see about us can be grouped. Any such group- 
ing is almost always an abstraction, and the 
important question is not whether it is a truly 
homogeneous grouping but whether it is a useful 
grouping. On the basis of the experience of stu- 
dents of economic institutions, the concept of the 
industry appears a highly useful abstraction for 
many purposes, although not for all. The spread 
of quantitative techniques seems likely to reinforce 
that conclusion, since statistics are usually avail- 
able only on an industry basis. Indeed, from the 
statistical point of view the problem of defining
the industry appears to arise from asking the wrong 
questions. The question should be whether some 
characteristics of a group of firms (e.g., the mean 
output of the firms or the mean price charged by 
them for their products) change in a statistically 
significant way as the result of changes in some
specified variable. It would be difficult in an un-
controlled situation to test for the effects of a very
large number of variables. The preponderating 
influence of a small number of strategic variables 
is an assumption that must be satisfied if testing
is to give useful results. These strategic variables
may be those predicted by standard theory or they 
may not, but this question is one of fact and not 
of theory. 

The conclusion seems to be that the concept of 
the industry may well remain of great practical use 
whatever theory of the firm is adopted. This was 
one of the essential messages of Marshall’s “repre-
sentative firm.” In that construct the “industry” is
preserved, although firms are neither in equili-
brium nor perhaps even operating under conditions
of perfect competition. In this respect at least, 
Marshallian notions appear to have had a more
“modern” empirical flavor than most of the later
work on the theory of firm and industry. [See
MARKETS AND INDUSTRIES.]

External economies. A good deal of attention 
has been given to the effect of changes in the total 
output of the industry upon the costs of the indi- 
vidual firms in the industry. These effects are 
termed “external economies” when the costs of 
production of the firms in the industry are lowered 
by an increase in the output of the industry. Since 
it is commonly argued that all firms have identical 
minimum costs in the long run equilibrium of 
perfect competition, all firms will in this case be 
affected equally. 

It is usual to distinguish between pecuniary 
external economies, which arise through reduced 
factor prices, and technological external econo- 
mies, which arise through improved organization 
of industry. The latter are most significant from 
the welfare point of view, but much doubt has 
been expressed about the frequency of their occur- 
rence in a perfectly competitive milieu. However, 
for political or other reasons it may be impossible 
to enforce perfect competition, and the external 
economies associated with monopoly may be of 
great practical importance in some lines. 

Scitovsky (1954) has suggested that a further 
source of external economies would arise if differ- 
ent stages of production were expanded (or con- 
tracted) in an uncoordinated way. Such external 
economies are essentially temporary in character. 
However, where monopolistic conditions are pres- 
ent, this type of external economy may lead to the 
permanent neglect of otherwise profitable invest- 
ment opportunities. 

It would appear that the estimation of the actual 
size of external economies in particular industries 
would be extremely useful. The task is made diffi- 
cult, however, by the absence of data, by the un- 
even pace of technical advance, and by the great 
variation among the production processes per- 
formed by firms that are perforce lumped together 
in the same “industries.” [See EXTERNAL ECONO- 
MIES AND DISECONOMIES.] 

J. N. WOLFE


[See also COMPETITION; MONOPOLY; OLIGOPOLY. Other 
relevant material may be found in DECISION MAK-
ING, article on ECONOMIC ASPECTS.]

FISCAL POLICY


I. OVERVIEW 
II. WARTIME FISCAL POLICY

I
OVERVIEW 


Fiscal policy is an aspect of public finance, of 
making and financing government expenditures. 
It is distinguished from other aspects of public
finance in being concerned with decisions about
certain “over-all” variables—such as total expendi- 
tures, total revenues, and total surplus or deficit— 
in terms of their “over-all” effects—such as their 
effects on national income, total employment, and 
the general level of prices. 

The management of their total revenues and 
expenditures and of the relation between them has 
become one of the principal instruments by which 
governments seek to achieve a high level of eco- 
nomic activity and general price stability. This 
effort encounters many problems—including the 
compatibility of these two objectives with each 
other and with other goals, the uncertainty of the 
size and timing of the necessary actions, and the 
difficulty of making and carrying out decisions in 
a large and political organization. Nevertheless, 
there is widespread confidence that the fiscal in- 
strument is sufficiently powerful, and its use suf- 
ficiently understood, to make a substantial con- 
tribution to successful economic performance. 

The distinction between fiscal policy and the 
other aspects of public finance that deal with par- 
ticular expenditures and taxes and their particular 
consequences is an abstraction from the complexity 
of the real world. In fact, decisions about the “over- 
all” variables are made up of decisions about par- 
ticulars. Also, any decision that has “over-all” ef- 
fects will also have particular effects on particular 
individuals, industries, or sectors of the economy. 

While the boundary between fiscal policy and 
other aspects of public finance is necessarily arbi- 
trary, the concept of fiscal policy is useful for analy- 
sis and for policy making. Every particular expendi- 
ture, tax, and debt issue may have a different 
over-all effect from every other. Yet for certain pur-
poses it may be both convenient and safe to regard
some large categories of expenditures, taxes, and 
debt issues, or even their totals, as single variables. 
In fact, it may be impossible to do otherwise, since 
existing knowledge is too crude to permit distinc- 
tion among all possible variables. 

It is necessary to distinguish fiscal policy not 
only from other aspects of public finance but also 
from monetary policy, which also consists of over- 
all measures usually evaluated in terms of over-all 
effects—on national income, total employment, or 
the price level, for example. This distinction can 
be drawn in various ways, depending upon the 
definition given to monetary policy. The distinction 
and connection are clearest if monetary policy is 
defined as policy with respect to the quantity of 
money. Fiscal policy can then be defined as policy 
with respect to total government sources and uses 
of funds and their composition. Certain sets of 
actions are a mixture of both monetary and fiscal
policy—such as an increase of government ex-
penditures financed by an increase in government
borrowing, which in turn causes or is permitted to 
cause an increase in the money supply. Even in the 
mixed cases it is possible to distinguish between the 
monetary and fiscal aspects and to consider what 
effects flow from each. Its relation to monetary 
policy is one of the central problems of fiscal policy; 
this will be discussed below. 

Pre-Keynesian theory. Fiscal policy began to 
assume a leading role in economic thinking, eco- 
nomic controversy, and economic policy only in the 
1930s. But as long as there have been governments 
there has been fiscal policy, and a substantial body 
of doctrine about it existed long before the 1930s. 
Although we shall be concerned here mainly with 
“modern,” i.e., post-1930, fiscal policy, a few words 
should be said about earlier ideas and practice. 
Modern thinking about fiscal policy, after first 
abandoning much of prior doctrine, has since rein- 
corporated a great deal of it. 

Neither fiscal theory nor fiscal practice before 
the 1930s was primarily concerned with maintain- 
ing high employment and stabilizing the rate of 
growth of total national output. Economic thinking 
and policy in general were much less dominated by 
the high employment problem than they later came 
to be and were more focused on problems of long-
run growth, efficiency in the use of employed re-
sources, and equity in the distribution of income.
It was believed that in the long run the economy 
would tend to produce at a rate determined by
“real” factors—the supply of labor and capital and
the state of technology. Prices and wage rates 
would tend to adjust to changes in total money ex- 
penditures for goods and services so as to leave real 
output unchanged. 

This view still left the possibility that variations
in the general level of prices would have unwanted
distributional effects in the long run and would 
cause unemployment in the short run. This was 
commonly regarded as a monetary problem, which 
could be handled by appropriate management of 
the quantity of money. At the same time, certainly 
by the 1920s, the possible contribution of budget 
policy, particularly variation of public works ex-
penditures, to short-run economic stabilization, was
widely but not universally recognized. However, a 
well-developed theory of how budget policy worked 
was lacking. There was uncertainty and disagree-
ment about whether budget policy was an inde-
pendent instrument of economic stabilization or
only a particular way of carrying out monetary
policy which might nevertheless be helpful in spe-
cial circumstances. Moreover, the small size of gov-
ernment budgets relative to national incomes
severely limited the possibility of a major contribu-
tion of budget policy to economic stabilization.

Before the depression, attention in economic 
thinking about fiscal policy focused on its conse- 
quences for the distribution of the total output 
among uses, rather than on its consequences for 
the level of output in the short run. The main con- 
sequences of fiscal policy were considered to be its 
effects on (a) the division of output between con- 
sumption and investment and (b) the division of 
output between public and private use. Standards 
of good fiscal policy were largely derived from the 
desired objectives for these two divisions of the 
national output. (There was, of course, a vast 
literature on aspects of public finance other than 
“fiscal policy” which was concerned with other 
consequences.)

A major theme was the connection among budg- 
et balancing, saving, investment, and economic 
growth. If government expenditures were larger 
than tax receipts, the deficit would have to be 
financed by borrowing. The government's borrow- 
ing would absorb part of the nation’s current pri- 
vate saving, which would therefore not be available 
to finance private investment. Investment would be 
lower than if the budget had been balanced, and 
consequently the rate of economic growth would 
also be lower. 

This argument that total savings will be larger 
if there is a budget surplus than if there is a deficit 
is true only if the taxes used to achieve the surplus 
depress private savings by an amount less than the 
tax collections. Kinds of taxes are conceivable of 
which this would not be true. However, it is not 
necessary to adopt such taxes. Indeed, the same 
argument that calls for balancing the budget and 
running a surplus in order to promote economic 
growth also calls for avoiding kinds of taxation that 
depress private saving. 
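
The condition can be made explicit with a little notation, introduced here only for illustration: let S_p be private saving, T tax receipts, G government expenditures, and σ the amount by which private saving falls for each unit of additional tax.

```latex
\begin{align*}
S &= S_p + (T - G), \\
\Delta S &= \Delta S_p + \Delta T
          = -\sigma\,\Delta T + \Delta T
          = (1 - \sigma)\,\Delta T
\qquad (\Delta G = 0).
\end{align*}
```

Total saving therefore rises with the surplus only if σ is less than one. A tax increase of 10 with σ = 0.3 raises total saving by 7; with a hypothetical tax for which σ = 1.2, it lowers total saving by 2.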

The conclusion that government budgets should 
be balanced was also reached by consideration of 
the proper division of the national product between 
private and public uses. The requirement that gov- 
ernment expenditures should be financed by taxa- 
tion was considered necessary to prevent a politi- 
cally dangerous and economically wasteful excess of 
government spending. Ultimately the citizen might 
be thought of as buying services from the govern- 
ment and as obtaining the right amount when he 
acquires the amount whose cost he is willing to pay 
in taxes. 

It is not demonstrable a priori that the total 
amount of government expenditures that would be 
made if they had to be paid for in taxes is the “best” 
amount, or even better than the amount that would 
be made if they could be financed by borrowing.
Government expenditures have a real cost in pri-
vate expenditures that must be forgone, however
they are financed. The “discipline” argument for 
budget balancing is that citizens will not accurately 
appraise this real cost if they can borrow to pay it. 
Once this possibility of error is admitted, it be- 
comes a question of fact in what direction the errors 
run. If the citizens may underestimate costs which 
are financed by borrowing, so may they over- 
estimate costs financed by taxation and either over- 
estimate or underestimate the benefits of expendi- 
tures. 

As has been noted, consideration of the desirable 
division of the national product between consump- 
tion and investment and between public and pri- 
vate uses led economists before the 1930s to budget 
balancing as a basic principle of fiscal policy. But 
the common popular support for the budget-bal- 
ancing idea was probably not chiefly based on the 
economists’ argument. The idea that government 
budgets should be balanced has a direct and intui- 
tive appeal as a particular application of principles 
of conduct accepted as having moral and utilitarian 
validity in a much broader field. The “folk wisdom” 
basis of popular thinking about budgets probably 
explains the strong resistance of that thinking to 
the revolutionary change in economists’ views of 
the matter that began in the 1930s.

Post-Keynesian theory. Earlier thinking about 
fiscal policy was challenged in the 1930s at its 
root—namely, the proposition that fiscal policy 
does not affect total national output. Successful 
challenge to this proposition brought the level of 
total output and total employment to the forefront 
of the objectives of fiscal policy. It also radically 
altered thinking about policy to achieve the tra-
ditional objectives.

The revolution in thinking about fiscal policy can 
be dated from the publication of John Maynard 
Keynes's General Theory of Employment, Interest 
and Money in 1936. This work had its precursors 
and was subsequently explained, extended, re- 
fined, and in part controverted by others. But the 
General Theory was the turning point. All serious 
argument about fiscal policy since it was published, 
even the argument that completely denied the 
Keynesian conclusions, has been influenced by it. 

The first step in the new approach to fiscal policy 
was to loosen the link between the quantity of 
money and total money expenditures or total money 
income. If the ratio of total money income to the
supply of money was fixed (or if not fixed, at least 
determined by factors that fiscal policy could not 
influence), fiscal policy that did not affect the sup- 
ply of money would not affect total money income. 

The new theory provided a way for fiscal policy
to affect total money income, without a change in
the supply of money. This, of course, implied a 
change in the ratio of money income to money 
supply. For example, suppose the government in- 
creases its expenditures without equally increasing 
taxes and without increasing the supply of money.
The increase in government expenditures will ini- 
tially increase the incomes of individuals and busi- 
nesses. Since there has been no increase in the 
money supply, the ratio of their money holdings to 
their incomes has declined. Older theory would 
have emphasized reactions to a decline in this ratio 
which took the form of attempts of individuals and 
businesses to build up their money holdings by cut- 
ting their expenditures, which would in turn reduce 
total income. The newer theory emphasized two 
other possibilities. One was that money holdings 
were already “redundant,” so that there would be 
no felt need to increase money holdings as incomes 
increased. The other possibility was that indi- 
viduals and businesses would try to build up their 
money holdings by selling interest-bearing securi- 
ties (rather than by cutting their expenditures). 
This would reduce the prices of such securities, and 
raise the interest rates they yielded, until a point 
was reached at which people would no longer con- 
sider it worthwhile to sell securities at low prices, 
or give up interest, in order to get more money. 

In the early years of the “new fiscal theory” there 
was some disposition to carry the argument even 
further. That is, not only was the effectiveness of 
fiscal policy asserted, but the effectiveness of mone- 
tary policy was denied. If people were willing to 
absorb a change in their money holdings without 
any reaction, or if they reacted in a way that 
affected only interest rates, which did not in turn 
affect either private investment or saving, there 
would be no route by which a change in the money 
supply could affect total money income. 

Controversy between the pre-Keynesian view and 
the extreme post-Keynesian view raged for some 
years in the late 1930s. But in time a consensus 
emerged on an intermediate position. The ratio of 
total money income to the money supply is not 
fixed, but can be affected by changes in interest 
rates, which can be changed by fiscal policy; but 
the ratio is not infinitely variable, and a change in 
the money supply will affect money income, either 
through an effect on interest rates and investment, 
or directly. Under this formulation the national 
money income may be affected by fiscal policy, or 
by monetary policy, or by various combinations of 
the two. Wide disagreement remains, however, 
about the probable magnitudes of the effects of 
each kind of action.

The foregoing discussion has run entirely in
terms of the effect of fiscal action upon total money
income and money expenditures. It will be recalled 
that earlier thinking held that variations in money 
income and expenditures would not in any case 
affect real output and employment, except tempo- 
rarily, but would affect only prices and wage rates. 
The Keynesian analysis assumed that as long as 
employment was below some level, considered the 
full employment level, wage rates would be stable 
and variations in money income would be directly 
reflected in employment and real output. Some of 
Keynes's early followers tended to argue as if this 
assumption, made for purposes of analysis, were 
also descriptive of the real world. However, there 
has since been a general recognition that the actual 
situation is more complicated. There is no single 
point of “full employment” below which variations 
in money income affect only employment and out- 
put without affecting prices and wage rates and 
above which only prices and wage rates, but not 
employment and output, would be affected. At 
least over a considerable range of employment 
levels, variations in money income will affect both 
prices and output, although the price effect will 
presumably be larger and the output effect smaller, 
the higher the initial level of employment. This 
makes the choice of the “desired” behavior of money 
income difficult, but it still leaves the behavior of 
money income an important objective of fiscal 
policy. 

Sources and uses of funds. Discussion of the 
new fiscal policy initially concentrated on the effects 
of variations in government expenditures, taxes 
being given. Emphasis was placed on the stimula- 
tive effects of “deficit spending,” that is, of govern- 
ment spending not offset by tax revenues. It was 
shown that on certain assumptions the increase in 
the national income resulting from an increase in 
deficit spending would be a multiple of the increase 
in deficit spending. The increase in government 
spending initially causes private incomes to rise by 
an equal amount. Recipients of the additional pri- 
vate income will be induced to spend more, which 
will generate more income for others, who in turn 
will spend more, and so on in a “multiplier” process 
whose limit depends on the proportion of their 
additional income that people spend. 
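
The multiplier process just described can be sketched numerically. The following is only an illustration, assuming a constant proportion of additional income spent in every round (called `mpc` here, a name introduced for this sketch) and no other leakages; the text itself says only that the result holds "on certain assumptions."

```python
def multiplier_rounds(initial_spending, mpc, rounds):
    """Cumulative rise in income after a number of spending rounds.

    initial_spending: the increase in deficit spending (first-round income).
    mpc: assumed proportion of additional income that recipients spend.
    """
    total = 0.0
    injection = initial_spending
    for _ in range(rounds):
        total += injection   # income generated in this round
        injection *= mpc     # part of it is re-spent in the next round
    return total


def multiplier_limit(mpc):
    """Limit of the process per unit of initial spending: 1 / (1 - mpc)."""
    if not 0 <= mpc < 1:
        raise ValueError("mpc must lie in [0, 1) for the process to converge")
    return 1.0 / (1.0 - mpc)
```

With `mpc = 0.8`, two rounds of a 100-unit increase yield 180 units of income, and the limit of the process is five times the initial increase.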

It soon became clear that similar effects could
be expected from variations in taxes, expenditures 
being given. A reduction in taxes would increase, 
and an increase in taxes would reduce, private 
spending, which would affect private incomes, and
thereby affect private spending, and so on in the 
multiplier process. 

If an increase of government expenditure would
increase the national income by some multiple of
the increase, and an increase of taxes would reduce
the national income by some multiple of the tax 
increase, an equal increase of expenditures and 
taxes would affect the national income in one direc- 
tion or the other if the expenditure and tax mul- 
tipliers were unequal. There is no a priori reason 
to think that the multipliers are equal. In the 1940s 
and 1950s there was a great deal of discussion of 
the implications of equal changes of expenditures 
and revenues. Most of it revolved about the “bal- 
anced-budget multiplier theorem,” which showed 
that on certain assumptions the multiplier of an 
equal increase in expenditures and revenues will 
be 1; that is, the total increase in spending from 
an equal increase in government spending and taxes 
will equal the increase in government spending. 
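
One common set of simplifying assumptions under which the theorem holds can be sketched as follows. The assumptions here are not taken from the text: taxes are lump-sum, government expenditure adds to income dollar for dollar in the first round, and a single spending proportion `mpc` (a name introduced for this sketch) applies to everyone.

```python
def expenditure_multiplier(mpc):
    """Rise in income per dollar of added government expenditure."""
    return 1.0 / (1.0 - mpc)


def tax_multiplier(mpc):
    """Change in income per dollar of added taxes (negative).

    A tax increase cuts disposable income by one dollar, so first-round
    private spending falls by only mpc dollars.
    """
    return -mpc / (1.0 - mpc)


def balanced_budget_multiplier(mpc):
    """Net effect of an equal increase in expenditures and taxes."""
    return expenditure_multiplier(mpc) + tax_multiplier(mpc)
```

Under these assumptions the two multipliers differ by exactly 1 for any `mpc` between 0 and 1, which is the result stated above; with other assumptions the net multiplier need not be 1.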

The balanced-budget multiplier theorem demon- 
strated that the budget deficit or surplus could not 
logically be regarded as the sole source and meas- 
ure of the effects of the budget on total spending, 
and opened the way to a more general view of the 
possibilities of fiscal policy. 

The government may be conceived of as obtain- 
ing funds from a variety of sources and using funds 
in a variety of ways. The total sources and total 
uses are equal, if borrowing and the use of existing 
balances are considered sources of funds. Each of 
the sources and uses of funds has some effect on 
total national spending, but the effects of different 
sources and uses need not be equal per dollar. If 
every dollar used has an equal positive effect on 
total spending, and if every dollar obtained, includ- 
ing dollars borrowed, has an equal negative effect, 
then fiscal policy can have no net effect. 

The new fiscal policy first said that the effect of 
borrowing funds to finance a deficit was less per 
dollar than the effect of obtaining funds from taxes 
and also less, and of opposite sign, than the effect 
of spending funds. Therefore, an equal increase of 
government spending and borrowing, or an equal 
decrease of taxes and increase of borrowing, would 
have an effect on total spending. 

The balanced-budget multiplier discussion said 
that the effect of obtaining funds by taxing was 
different, per dollar, and of opposite sign from the 
effect of spending funds. Therefore an equal in- 
crease of taxes and government expenditures would 
have a net effect on total expenditures—an expan- 
sive effect if the expenditure multiplier was larger 
than the tax multiplier. 

Once we have gone this far, it is a logical next 
step to say that different government expenditures 
and different taxes have different effects per dollar. 
In this case a shift from one kind of government 
expenditure to another, or from one kind of tax to 
another, would have a net effect on total expendi-
ture. This net effect could be achieved without a
change in the size of the deficit or surplus or in the 
total size of the budget. 
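
The sources-and-uses view can be illustrated with a small calculation. The per-dollar effects below are purely hypothetical numbers chosen for the example, not estimates, and the item names are labels introduced here.

```python
def net_effect(changes, effect_per_dollar):
    """Net effect on total spending of a set of changes in sources and uses.

    changes: dollars added to (or removed from) each source or use.
    effect_per_dollar: assumed effect on total spending of one dollar of
        each item (positive for uses, negative for sources).
    """
    return sum(amount * effect_per_dollar[item]
               for item, amount in changes.items())


# Case 1: every dollar used has effect +1 and every dollar obtained has -1.
uniform = {"purchases": 1.0, "transfers": 1.0, "taxes": -1.0, "borrowing": -1.0}
# Case 2: hypothetical unequal effects per dollar.
unequal = {"purchases": 1.5, "transfers": 1.0, "taxes": -1.0, "borrowing": -0.2}

deficit_spending = {"purchases": 10.0, "borrowing": 10.0}    # uses = sources
composition_shift = {"purchases": 10.0, "transfers": -10.0}  # deficit unchanged
```

With uniform effects, `net_effect(deficit_spending, uniform)` is zero, so fiscal policy has no net effect. With the unequal effects, the same deficit spending is expansive, and even the pure shift in the composition of expenditure changes total spending although neither the deficit nor the size of the budget changes.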

Formally the possibility exists of obtaining any 
desired effect on total spending through any one of 
a large number of combinations of sources and uses 
of government funds. Any differences in the effect 
per dollar of any two sources or uses of funds 
would constitute a lever that could be used to affect 
total spending. Whether in fact the number of such 
levers is large depends on whether the differences 
in the effects of various sources and uses of funds 
are significant, whether the differences are known 
with some certainty, and whether the various sources 
and uses of funds can in practice be manipu- 
lated to serve some objective with respect to total 
spending. 

Little is reliably known about the differences in 
the effects of various fiscal actions upon total 
spending. There is fairly general agreement that 
an increase in spending or a decrease in taxes has 
a larger expansive effect, per dollar, than the con- 
tractive effect of an increase in government bor- 
rowing, although even this would be disputed with 
respect to some kinds of government borrowing. 
If this is true, then any combination of fiscal 
actions that includes expenditure increases and 
tax decreases, but no expenditure decreases or tax 
increases, will have an expansive effect. But if the 
combination includes tax and expenditure actions 
that separately have effects of different directions, 
the direction of the net effect would be uncertain. 

Modern theory opens up the possibility of a large 
number of combinations of fiscal actions that could 
affect total spending, some but not all of which 
would affect the surplus or deficit. We do not know 
even the direction of the effect of many of these 
combinations, including some that affect the sur- 
plus or deficit. Where there is considerable confi- 
dence about the direction of the effect, the magni- 
tude of the effect is still uncertain, to a degree that 
varies with the particular action. 

Theories concerning implementation. The new 
theory held out the possibility of achieving any 
desired level or rate of change of money national 
income. It said that given any objective for the 
behavior of money national income, and given all 
the factors other than fiscal policy that affect its 
behavior, there is some combination of government 
expenditure programs and tax rates that will achieve 
the objective. However, this proposition stops far
short of specifying what the proper fiscal policy is 
at any time. 

The implementation of fiscal policy must con- 
tend with several difficulties: 

(1) It is not certain what behavior of money
national income should be the objective of fiscal
policy. The money national income is an interme- 
diate objective, important because of its influence on 
employment, production, prices, economic growth, 
and the balance of payments. What the effect of a 
particular level and rate of change of money na- 
tional income will be on these aspects of the econ- 
omy is always impossible to tell precisely. Even if 
it were known what the effects would be, what 
would be the “best” effects would still have to be 
determined. 

(2) The fiscal policy necessary to achieve any 
given course of money national income will depend 
upon the other autonomous factors that affect the 
national income in the future period in which the 
fiscal decisions will operate. If these other factors 
—which may be summed up as private tendencies 
to invest and consume—are strong, a less expan- 
sive fiscal policy will be required than if they are 
weak. These other factors are variable, and predic- 
tion of them is subject to a wide margin of error. 

(3) The effects of various fiscal actions upon 
the money national income are known only very 
roughly. Presumably, for example, a one-point re- 
duction in the basic rate of the individual income 
tax will make the money national income higher 
than it would otherwise have been. But informed 
estimates of the magnitude and timing of the effects 
will vary widely, and this will also be true of other 
fiscal actions. 

(4) Even if the target for behavior of money 
national income, the autonomous factors affecting 
the national income, and the effects of all fiscal 
actions are precisely known, the proper fiscal policy 
is not uniquely determined. There will almost cer- 
tainly be more than one combination of tax and 
expenditure actions that would yield the target 
national income. A choice will have to be made 
among these combinations, and it will have to be 
made on criteria other than the effect on national 
income, since the combinations are alike in that 
respect.

(5) Since the behavior of money national in- 
come is affected to some, much-disputed, degree by 
monetary policy, the policy packages among which 
a choice must be made consist not only of various 
combinations of fiscal actions but also of various 
combinations of fiscal and monetary policy. 

(6) For all the foregoing reasons, the range 
and variety of fiscal policies that might reasonably 
be thought at any time to give a good combination 
of effects on employment, production, prices, etc., 
will be large, and it will not be possible objectively 
and certainly to select one policy as best. The 
selection of a policy to follow will be made in the 
political process, by people who are sensitive to
the political consequences of the selection. This
may introduce a bias into the selection, causing it 
to depart systematically from what is probably the 
best choice. 

This list of difficulties does not constitute an 
argument against fiscal policy or even against using 
fiscal policy to achieve desired effects on employ- 
ment, production, growth, prices, and the balance 
of payments. There will be a fiscal policy as long 
as there is government. This fiscal policy will have 
effects, and it is obviously desirable that it should 
have good rather than bad effects. But the difficul- 
ties listed here do suggest the problems that must 
be overcome to assure the choice of the fiscal policy 
that will have the best effects, or is most likely to 
have the best effects. 

General strategies of implementation. Two 
main lines of thinking about the strategy of im- 
plementing fiscal policy have emerged in the post- 
war period, although each has variants and the 
two lines tend to meet when each is elaborated 
with regard to the conditions of the real world. One 
approach is direct and activist, the other indirect 
and passive. 

The direct, activist approach might be called the 
“do your best” approach. It recognizes that the ap- 
propriate target for money national income, the 
future state of the economy upon which fiscal pol- 
icy will operate, and the future effects of various 
fiscal actions cannot be known with certainty. 
Nevertheless, in this view, the responsible authori- 
ties in government must make, and act upon, their 
best estimates of these factors. While there will be 
departures from the ideal result, these departures, 
it is believed, will be smaller than would be yielded 
by any alternative system. 

The indirect, passive approach would call for 
fiscal policy to adhere to some predetermined
objective rule or standard which would not require
forecasting short-run economic fluctuations. An 
effort was made to find such a rule which would, 
first, keep fiscal policy from being an independent 
destabilizing force and, second, insofar as consist- 
ent with the first objective, make fiscal policy a 
stabilizing force. This effort was more evident in
the United States than elsewhere, perhaps because 
the American political process did not leave budget 
policy to “experts,” and this generated more public
interest in specifying guides to budget policy.

The rule commonly suggested set as a standard 
a fixed relation between expenditure programs and 
tax rates such that revenues would exceed expendi- 
tures by some constant amount X (which might be 
positive, zero, or negative) when the national in-
come was at some standard level Y. Examples of
proposed budget rules may be found in the articles
by the Committee for Economic Development and
by Milton Friedman included in American Economic
Association, Readings in Fiscal Policy (1955).
Thus the actual surplus would be constant when 
the actual national income was at the standard. 
To this degree the budget would be neutral. If this 
rule were followed, the actual surplus would be 
below X when the actual national income was below 
the standard Y, and the surplus would be above X 
when the actual national income was above the 
standard Y. The farther the national income was 
below the standard, the less the government would 
subtract from the private income stream in taxes, 
and the more it would add in expenditures, so that 
the larger the government's support for total spend- 
ing and income would be. To this degree the budget 
would be stabilizing; it would resist variations in 
the actual national income relative to the standard. 
The standard level of national income would rise 
through time with the growth of potential national 
output. If actual national income did not rise as 
fast as potential, adherence to the rule would gen- 
erate a rising deficit (or falling surplus), which 
would tend to accelerate the growth of actual 
national income. 
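
The working of such a rule can be sketched with a simple linear budget. This is an illustration only: the proportional `tax_rate` and the `transfer_sensitivity` of expenditures to shortfalls of income are hypothetical parameters, and real budgets need not respond linearly.

```python
def surplus_under_rule(actual_income, standard_income, target_surplus,
                       tax_rate=0.2, transfer_sensitivity=0.05):
    """Actual surplus when programs and tax rates are fixed by the rule.

    Expenditure programs are set so that revenues exceed expenditures by
    target_surplus (X) when income equals standard_income (Y).
    """
    # Expenditure level implied by the rule at the standard income.
    base_expenditure = tax_rate * standard_income - target_surplus
    # Revenues move with actual income at unchanged tax rates.
    revenues = tax_rate * actual_income
    # Assumed: some expenditures rise as income falls short of the standard.
    expenditures = base_expenditure + transfer_sensitivity * (
        standard_income - actual_income)
    return revenues - expenditures
```

With Y = 1000 and X = 10, the surplus is 10 at the standard income, falls to a deficit of 15 when income is 900, and rises to 35 when income is 1100. In this sketch, what the text later calls the "full employment surplus" corresponds to evaluating the function at the standard income, so that it changes only when the programs or tax rates themselves are changed.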

The logic of the rule did not require that the 
standard national income should be equal to the 
potential or “full employment” national income, al- 
though it did require that the standard bear a 
certain constant relation to potential national in- 
come. However, in fact, the standard was usually 
prescribed as the full-employment national income, 
and the rule called for balancing the budget (or 
running some specified constant surplus or deficit) 
at full employment. Thus discussion of the rule 
focused attention on what the surplus or deficit 
would be at full employment, as distinguished from 
the actual surplus or deficit. The “full employment 
surplus” became a widely-used shorthand measure 
of the impact of fiscal policy. Whereas changes in 
the actual surplus or deficit result from changes 
in other economic conditions as well as from 
changes in budget policy, changes in the full employ- 
ment surplus result almost entirely from changes 
in budget policy. Use of the full employment sur- 
plus for analysis or prescription did not necessarily 
imply that the full employment surplus should be 
constant, and the concept came to be used in the 
1960s by many who did not accept the rule of con- 
stant full employment surplus. 

Supporters of the indirect, passive approach rec- 
ognized that in principle an active policy based on 
perfect foresight would yield superior stabilization 
results. However, perfect foresight was not possible, 
and actual decisions, if freed of all conventional 
rules, would not even be governed by the best pos-
sible economic forecasts. The political decisions
might be random and destabilizing, or might be 
biased in an inflationary, expenditure-increasing 
direction. Therefore, a less ambitious and more 
restrained policy would lead to better results. In 
fact, a high degree of stability (meaning steadiness 
of growth) could be expected from a passive fiscal 
policy, especially if combined with a stabilizing 
monetary policy. In some variants this stabilizing 
monetary policy is also considered to be governed 
by a rule, such as a constant rate of growth of the 
money supply; in other variants the monetary pol- 
icy is flexible and discretionary. 

Probably the basic criticism of the indirect ap- 
proach is that the establishment of a budget rule 
requires a forecast of what economic conditions 
will be on the average during the period when the 
rule is in force. A rule that might be highly infla- 
tionary in one set of conditions might be highly 
deflationary in another. Therefore the rule does not 
eliminate the need for forecasting but requires a 
more difficult, because longer-run, kind of forecast 
than a more active and flexible policy requires. 

The choice between these two approaches re- 
flects differences of opinion about the operation of 
the economic and political systems as well as some 
differences in evaluating the consequences of infla- 
tion and rising government expenditures. While 
these differences have not been resolved, experience 
and discussion have tended to narrow the difference 
between the two approaches. Advocates of what is
called here the direct approach do not want or 
expect a continuous adjustment of fiscal policy to 
actual or forecast economic conditions, but only an 
adjustment at intervals. They would also recognize 
that it may not be desirable to change fiscal policy 
to deal with economic changes that are small or 
quite uncertainly forecast. And they have had, at
least when in responsible positions, to accept as 
some restraint the popular sentiment favoring bal- 
anced budgets. 

At the same time, at least many supporters of 
a “rule” of fiscal policy would recognize that it may 
be necessary or desirable to change the rule from 
time to time. They would also accept the possible 
necessity at some time to depart temporarily from 
the rule. 

Once these points are reached, the differences 
between the two approaches come down to ques- 
tions about the frequency of changes in fiscal policy 
and the strength of the evidence required to justify 
a change. These are matters of gradation, on which
a continuous graduation of positions may be taken. 

The fiscal-monetary “mix.” If fiscal policy is
not the only instrument by which the government
can influence the money national income, it is not
a sufficient guide to fiscal policy to say that it must 
be so managed as to bring about the desired na- 
tional income. It can only be said that the available 
instruments should be used in combination to effect 
the desired result, but this leaves open the question 
of the combination in which the various instru- 
ments should be used. The main question of com- 
bining instruments concerns the combination of 
fiscal and monetary policy. 

Even though fiscal and monetary policy may both 
be capable, in general, of affecting the national in- 
come, there may be special circumstances in which 
they are not alternative instruments. For example, 
it has been thought that at the bottom of a deep 
depression, profit prospects may appear so bleak 
that no expansion of the money supply would stim- 
ulate an increase in investment or other private 
spending. In this case monetary policy would not 
be an alternative to fiscal policy for promoting re- 
covery. However, whether this is a realistic pos- 
sibility has been disputed, and there is no sugges- 
tion that such circumstances have emerged in the 
postwar world. 

Fiscal actions and monetary actions may differ 
in the speed with which they can be taken, have 
effect, and be modified or reversed if necessary. 
This may provide the basis for a division of labor 
between fiscal and monetary policy, the more flex- 
ible instrument being used in those circumstances 
in which a quick and possibly reversible effect is 
needed. In much of the earlier postwar writing it 
was assumed that monetary policy was the more 
flexible instrument and therefore should be relied 
upon for short-run adjustments of the impact of 
the combined fiscal-monetary package. However, 
later study suggested that the lag between monetary 
action and its effects might be long and variable. 
This raised the question whether monetary policy 
was in fact the more flexible and manageable ele- 
ment in the monetary-fiscal combination. Thus, 
the principle remains that in the division of labor 
between fiscal and monetary policy, the more flexi- 
ble instrument should be used for the more rapid 
and frequent adjustments, but which is the more 
flexible instrument is uncertain. 

If the desired money national income for any 
period can be achieved by one of several combina- 
tions of fiscal and monetary policy, the choice 
among these combinations requires the invocation 
of some additional objective. Two such objectives 
have seemed particularly relevant: economic growth 
and balance-of-payments equilibrium. 

On certain assumptions, the economy will grow 
more rapidly with a large budget surplus than with
a small budget surplus or deficit, if in each case
there is the appropriate monetary policy to attain 
the desired current rate of money national income. 
Unless there are fully offsetting effects, the larger 
surplus will mean larger total savings and larger 
total investment. Some offsets may be expected. 
Getting the larger surplus will require some com- 
bination of higher taxes, which may reduce private 
saving or have other growth-restraining effects, and 
lower public expenditures, some of which might 
have had growth-promoting effects. Also, the more 
expansive monetary policy required to accompany 
the larger surplus may depress private saving. 
Therefore, a larger-surplus policy cannot be said 
to be a faster-growth policy without further specifi- 
cation, but it is probably possible to design a larger- 
surplus policy that will be a faster-growth policy. 
Even this would not, of course, by itself make the 
larger surplus preferable. A larger surplus leads to 
faster growth only at some cost, in the form of 
reduced current consumption and possibly also in 
other forms; whether it is legitimate or wise for 
government to decide to pay that cost is a question 
that would still have to be answered. 

In the late 1950s and early 1960s attention was 
focused on the balance-of-payments implications 
of the combination of fiscal and monetary policy. 
This new interest was largely derived from concern 
with the position of the United States, which was 
running a persistent balance-of-payments deficit 
along with more unemployment and less inflation 
than most of the rest of the world. A country in 
such circumstances seeks both domestic expansion 
and contraction of its external deficit. It was sug- 
gested that these two objectives might be simultane- 
ously approached by a more expansive fiscal policy 
combined with a less expansive monetary policy. 
More generally, it could be shown that if domestic 
income and the balance of payments depend upon 
monetary and fiscal policy and on no other, and if 
these two policies affect no other objectives, the two 
objectives determine the best combination of the 
two policies. While these conditions may never be
fully met, they may be approximated for short
periods.
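
The general proposition can be illustrated with a linear sketch in which each objective is a weighted sum of the two policies. The linear form and every coefficient below are assumptions made for the example; in particular, monetary expansion is assumed to weigh more heavily on the balance of payments than fiscal expansion does.

```python
def policy_mix(income_target, balance_target,
               income_per_fiscal, income_per_monetary,
               balance_per_fiscal, balance_per_monetary):
    """Solve two linear equations for the fiscal and monetary settings.

    income  = income_per_fiscal  * fiscal + income_per_monetary  * monetary
    balance = balance_per_fiscal * fiscal + balance_per_monetary * monetary
    """
    det = (income_per_fiscal * balance_per_monetary
           - income_per_monetary * balance_per_fiscal)
    if det == 0:
        # Both policies affect the two objectives in the same proportion,
        # so the targets cannot fix a unique combination.
        raise ValueError("the two policies are not independent instruments")
    fiscal = (income_target * balance_per_monetary
              - income_per_monetary * balance_target) / det
    monetary = (income_per_fiscal * balance_target
                - income_target * balance_per_fiscal) / det
    return fiscal, monetary


# Hypothetical case: raise income by 10 and improve the balance by 2.
fiscal, monetary = policy_mix(10.0, 2.0, 1.0, 1.0, -0.2, -0.6)
```

With these illustrative numbers the solution is a fiscal expansion of about 20 units combined with a monetary contraction of about 10, the kind of mix suggested for a country seeking both domestic expansion and a smaller external deficit.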

Definition of the budget. Any fiscal policy 
which uses the budget position—the size of the 
deficit or surplus—as a guide to action requires a 
decision on what is to be included in the budget— 
how it is to be defined. As was indicated earlier, 
government fiscal operations can be described by 
a statement of the sources and uses of government 
funds, the sources and uses being necessarily equal. 
The creation of a budget that can have a surplus 
or a deficit requires the selection of some but not 
all of these sources and uses of funds for inclusion
in the budget. Which items should be included de- 
pends on the policy purpose to be served by the 
over-all budget figures. Aside from this purpose, 
there is no “true” budget. 

The appropriate definition of the budget depends 
on whether the main purpose of budget policy is 
(a) to affect the money national income, (b) to 
exercise “discipline” over government spending, by 
requiring that expenditure be matched by taxation, 
or (c) to affect national saving by controlling the 
relation between government revenues and the gov- 
ernment expenditures that are not for investment. 
Different emphases on these three objectives lead 
to different answers to the questions that arise 
about the definition of the budget. These questions 
relate chiefly to four kinds of transactions: 

(1) Trust account transactions in which the 
government accumulates a fund from a particular 
class of individuals, out of which payments are 
made, simultaneously or subsequently, for specified 
purposes. Social insurance funds are a leading 
example. 

(2) Loan transactions in which the government 
at some times lends money to private borrowers or 
to other governments or purchases their debt, and 
at other times receives repayment or sells the 
debt. 

(3) Capital expenditures for the acquisition by 
government of assets (including loans) that yield 
a subsequent return, whether in money or in non- 
monetary benefits. 

(4) Transactions in which there is a consider- 
able lapse of time between the accrual of a liability 
to or from the government and the corresponding 
cash payment. 

Different treatments of these kinds of transac- 
tions are illustrated by the three “budgets,” or state- 
ments of government receipts and payments, com- 
monly used in the United States: 

The administrative budget, which excludes trust 
account transactions, is almost entirely on a cash 
basis, and makes no distinction between loan or 
capital transactions and current transactions; 

The cash-consolidated budget, which includes 
trust account transactions and is entirely on a cash 
basis; 

The government sector of the national income 
accounts, which also includes trust account trans- 
actions, but excludes loan transactions and reflects 
corporate-profits-tax liability accruals rather than 
cash payments. 

None of these three financial accounts used in 
the United States distinguishes between capital and 
current transactions. However, many other governments
use capital budgets which do make this
distinction. 
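
The way in which the recorded surplus or deficit varies with the
definition adopted may be illustrated by a short computational sketch.
The transactions and amounts below are hypothetical, and the inclusion
rules are a simplified reading of the three United States statements
described above: differences of timing between accrual and cash payment
are ignored, and the cash-consolidated budget is assumed to include loan
transactions, which the text implies but does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transaction:
    label: str
    amount: float        # receipts positive, payments negative
    trust: bool = False  # passes through a trust account
    loan: bool = False   # a loan made or repaid


# Simplified inclusion rules (an assumption-laden reading of the text).
BUDGETS = {
    "administrative": {"trust": False, "loans": True},
    "cash_consolidated": {"trust": True, "loans": True},
    "national_income_accounts": {"trust": True, "loans": False},
}


def recorded_surplus(transactions, budget):
    """Sum only the transactions that the named budget definition includes."""
    rule = BUDGETS[budget]
    total = 0.0
    for t in transactions:
        if t.trust and not rule["trust"]:
            continue
        if t.loan and not rule["loans"]:
            continue
        total += t.amount
    return total


# Hypothetical figures, chosen only to make the arithmetic visible.
SAMPLE = [
    Transaction("general taxes", 100.0),
    Transaction("purchases of goods and services", -95.0),
    Transaction("social insurance contributions", 20.0, trust=True),
    Transaction("social insurance benefits", -15.0, trust=True),
    Transaction("loan to a private borrower", -8.0, loan=True),
]
```

With these illustrative figures the same set of transactions shows a
deficit of 3 in the administrative budget, a surplus of 2 in the
cash-consolidated budget, and a surplus of 10 in the national income
accounts.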

Since the main object sought in postwar thinking 
about fiscal policy has been a certain effect, pre- 
sumably stabilizing, upon money national income, 
the search for definitions of the budget has focused 
on that which would be most revealing of the effects 
of fiscal policy on money national income. What is 
desired is the most comprehensive definition of the 
budget consistent with the requirement that all 
fiscal policies yielding an equal surplus in the 
budget so defined have, under similar general eco- 
nomic conditions, equal effects on money national 
income. This test can be literally met only if every 
expenditure has the same effect, per dollar, as 
every other expenditure and the same effect, with 
sign reversed, as every receipt. This is to say that 
the test cannot be literally met by any real budget. 
The test does suggest the desirability of excluding 
from the budget transactions that are markedly dif- 
ferent in their effect per dollar from the transac- 
tions included and that vary substantially in magni- 
tude relative to the transactions included. However, 
if important items are excluded in order to make 
the budget more homogeneous and thus to give the 
size of the surplus or deficit a more stable meaning, 
problems arise in establishing criteria and mechanisms
for controlling the excluded items. In practice,
some compromise between homogeneity and
comprehensiveness in defining the budget must be 
found. 
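
The reason the test cannot be literally met may be seen in a small
numerical sketch. The per-dollar effects used here are hypothetical
coefficients introduced only for illustration; nothing in the argument
depends on their particular values.

```python
def budget_surplus(expenditures, receipts):
    """Total receipts minus total expenditures."""
    spent = sum(amount for amount, _ in expenditures)
    received = sum(amount for amount, _ in receipts)
    return received - spent


def income_effect(expenditures, receipts):
    """Net effect on money national income.

    Each expenditure adds, and each receipt subtracts, its amount
    multiplied by its assumed effect per dollar.
    """
    added = sum(amount * effect for amount, effect in expenditures)
    removed = sum(amount * effect for amount, effect in receipts)
    return added - removed


# Each item is (dollars, assumed effect per dollar).
uniform = dict(expenditures=[(100.0, 1.0)], receipts=[(100.0, 1.0)])
mixed = dict(expenditures=[(100.0, 1.0)], receipts=[(100.0, 0.5)])
```

Both budgets are balanced, yet in this illustration the second raises
money national income by 50, because its receipts are assumed to have
half the effect per dollar of its expenditures.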

It seems clear that if the budget is intended to 
reveal the effects of fiscal transactions on money 
national income, there is no good reason for ex- 
cluding trust account transactions from the budget. 
Many, and probably most, of the receipts and ex- 
penditures made by trust accounts are quite similar 
in their aggregate economic effects to many of the 
receipts and expenditures included in the regular 
budget. A more difficult problem is the treatment of 
loan transactions. If the government makes a loan 
to a private business, a local government, or a for- 
eign government, is this more like a government 
grant or purchase, in which case it should be 
treated as an expenditure, or more like government 
purchase (repayment) of its own securities, in 
which case it should not be treated as an expendi- 
ture? Probably the answer depends upon the terms 
and circumstances of particular kinds of loans. 

If the main objective of budget policy is to 
stabilize, or otherwise influence, total money ex- 
penditures, there is no good reason to distinguish 
between the current and capital expenditures of 
government. Capital spending, at least when it is
for purchase of real goods and services, is likely to
have as great a direct and indirect effect on total
spending in the economy as does current spending.
The amount of taxation needed for economic stabilization
does not depend on whether the government’s
expenditures are for current or capital
purposes. 

Emphasis on the “disciplinary” function of a 
budget rule may provide a rationale for excluding 
the trust accounts from the budget. The purpose of 
requiring that the budget be balanced may be to 
assure that government expenditures are not made 
unless taxpayers have demonstrated their willing- 
ness to pay taxes for them. However, this purpose 
does not require the inclusion in the budget of 
transactions that have their own disciplinary safe- 
guards. If the government undertakes a particular 
expenditure program to which specific revenues are 
assigned, with a definite plan and commitment over 
some period, the disciplinary function of budget 
balancing does not require inclusion of these trans- 
actions in the budget. The cogency of this argument 
for excluding a trust fund from the budget depends 
upon whether the trust fund actually conforms to 
this description, which is sometimes, but not always,
the case.

A similar case on “disciplinary” grounds can be 
made for excluding from the budget capital trans- 
actions where the productivity of government capi- 
tal investments can be objectively measured and 
compared with the cost of capital. This would pro- 
vide a test of the desirability of public investments 
that would be superior to the test of the willingness 
of the community to pay taxes for them. Where the 
investment yields a product that is sold in the 
market—as is the case with a government-owned 
railroad—this objective test of productivity may be 
available. But there are many kinds of government 
investment—education being one example—where 
the product is difficult to measure or even to define. 
To remove the requirement that such expenditures 
be financed by taxation may leave no adequate lim- 
its to their expansion. 

One case for use of a capital budget may be 
found in the desire to separate the government 
decision about the total amount of saving in the 
economy from the decision about the total amount 
of government investment. The total saving of the 
economy is equal to private saving plus the excess 
of taxes over current, noninvestment expenditure 
of government. The government, or the community 
through the government, may decide to make this 
total larger or smaller by deciding on a larger or 
smaller excess of taxes over current expenditures. 
(This assumes that private savings do not increase 
or decrease by an amount equal to the change in
the excess of taxes over current expenditures.) The
government may decide to run a large current sur- 
plus in order to generate large total savings and 
thus permit large total investment and a high rate 
of growth. This decision by itself logically implies 
nothing about the proportions in which the invest- 
ments would be private and government. On the 
other hand, the government may decide on a larger 
or smaller amount of public investment. This by 
itself implies nothing about what total saving 
should be. The decision to have a large amount of 
public investment may reflect a judgment that 
many public investments are more productive (or 
otherwise desirable) than private investments and 
should be made instead of them—not in addition 
to them. The existence of large opportunities for 
productive investment may be a reason for wanting 
high savings and a large excess of taxes over cur- 
rent expenditure. But this would be equally true 
whether the opportunities were for public or private 
investment. 
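
The accounting behind this argument may be set out in a brief sketch.
The figures are hypothetical, and the sketch assumes, as the paragraph
does implicitly, that total investment is limited to total saving and
that private saving does not move to offset the government's surplus.

```python
def total_saving(private_saving, taxes, current_expenditure):
    """Private saving plus the government's current surplus."""
    return private_saving + (taxes - current_expenditure)


def private_investment(saving, public_investment):
    """Investment left to the private sector once public investment is set.

    Assumes total investment equals total saving.
    """
    return saving - public_investment


# Hypothetical figures.
saving = total_saving(private_saving=80.0, taxes=100.0, current_expenditure=90.0)
```

Raising public investment from 10 to 30 leaves total saving at 90 and
changes only the division of investment between public and private uses;
raising taxes from 100 to 110 raises total saving to 100 whatever that
division may be.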

If a capital budget is used, the government 
would decide, by deciding on the size of its current 
surplus, how much it should contribute to total
saving. It would decide this in terms of the value 
the community places on economic growth and the 
supply of productive investment opportunities, pub- 
lic and private. The government would also decide 
on the desirability of particular public investments 
in terms of their productivity, taking into account 
the cost of capital. Proponents of the capital budget 
maintain that this would result in better decisions 
about both the government's contribution to saving 
and the government’s investment than does a 
budget which lumps together current and capital 
expenditure. 

A different kind of concern with the effect of 
budget policy on total saving may justify the ex- 
clusion of trust accounts from the budget. It may 
be desired to balance the budget so that all private 
savings will be available to finance private investment,
either on the grounds that growth will be
promoted thereby or that “consumers’ sovereignty” 
requires that the amount of private investment 
should be determined by private saving. If individ- 
ual citizens voluntarily make payments into a 
government-managed fund from which they will 
subsequently receive benefits, these savings should, 
on this principle, not be available to finance gov- 
ernment expenditures, as would happen if they 
were lumped into the budget. The force of this 
argument depends upon the degree to which the 
trust account in fact has the character of a private 
voluntary savings institution. 

The idea of the capital budget does not necessarily
imply that the current budget should be balanced
by taxation; the rule may be a surplus or
deficit in the current budget. However, much dis- 
cussion of the idea has assumed that the issue was 
between balancing the current budget and balanc- 
ing the total budget. Since net government capital 
expenditures are likely to be positive, balancing 
the current budget will be more expansionary than 
balancing the total budget. This has won some 
support for the capital budget idea from those 
who believe there is need for an expansionary fiscal 
policy but who do not want to violate community 
sentiment in favor of balancing “some” budget. 
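
The comparison of the two balancing rules reduces to simple arithmetic,
sketched below with hypothetical figures. The rule names are labels
introduced for this illustration.

```python
def taxes_required(current_expenditure, net_capital_expenditure, rule):
    """Taxes needed to satisfy a balancing rule (hypothetical rule labels)."""
    if rule == "balance_current":
        return current_expenditure
    if rule == "balance_total":
        return current_expenditure + net_capital_expenditure
    raise ValueError(f"unknown rule: {rule}")


def total_budget_deficit(current_expenditure, net_capital_expenditure, taxes):
    """Deficit in the total budget, counting current and capital spending."""
    return current_expenditure + net_capital_expenditure - taxes
```

With current expenditure of 90 and net capital expenditure of 15,
balancing the current budget calls for taxes of 90 and leaves a deficit
of 15 in the total budget; balancing the total budget calls for taxes of
105. The same expenditure is thus accompanied by lower taxes under the
first rule.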

Budget flexibility. Fiscal action taken at any 
time has its effects on some future time. The ap- 
propriateness of the action is its appropriateness to 
the future time at which its effects occur. Fiscal 
action therefore requires or implies a forecast. The 
reliability of the forecast will be greater the nearer 
the future period to which it relates. For this reason 
attention has been directed to increasing the flex- 
ibility of fiscal policy, so that it will have its effects 
more quickly and the time for which forecasts have 
to be made will be shortened. Increased flexibility 
has been sought along three main lines, sometimes 
called built-in flexibility, formula flexibility, and
discretionary flexibility. 

Built-in flexibility. The stabilizing effect of the 
variation in government tax receipts or liabilities 
and unemployment compensation payments that 
automatically accompanies variation in the national 
income has already been noted. Recognition of the 
value of this automatic response of the budget has 
naturally led to the question whether this response 
could be strengthened. However, a basic difficulty 
arises in considering possible ways to strengthen 
the built-in stabilizing response. The extent to 
which tax receipts vary with a change in the na- 
tional income depends upon the tax rates applied 
to individual and corporate incomes at the margin 
—the proportion of an increase in an individual's 
or corporation’s income that is taken in taxes. If 
these marginal rates are high, the proportion of a 
change in national income that is absorbed in a 
change in tax receipts will be large, and this will 
have a stabilizing effect on the economy. However, 
if these rates are high, taxes will take a large pro- 
portion of the additional income that an individual 
or corporation may earn by productive activity. At 
least beyond some point this will weaken the in- 
centive to productive activity and will adversely 
affect economic efficiency and growth. Thus the 
stabilizing value of high marginal tax rates can be 
obtained only at the cost of other important economic
objectives.
In general, the marginal rates of tax, and also
the rates of unemployment compensation payments, 
have been so closely determined by a combination 
of other factors economically and politically more 
important as to leave no room for adjusting them 
to increase “built-in” stabilization. 
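
The conflict described above can be put in arithmetic form. The sketch
assumes a single marginal rate applying to all income, which is a
deliberate simplification, and the figures are hypothetical.

```python
def built_in_response(income_change, marginal_rate):
    """Return (change in tax receipts, share of extra income retained).

    A single economy-wide marginal rate is assumed for illustration.
    """
    receipts_change = marginal_rate * income_change
    retained_share = 1.0 - marginal_rate
    return receipts_change, retained_share
```

A fall of 10 in national income reduces tax receipts by 2.5 at a
marginal rate of 25 per cent and by 5 at a rate of 50 per cent; the
share of additional earnings retained by the earner falls
correspondingly from 75 to 50 per cent.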

To get around this difficulty there have been pro- 
posals intended to increase the automatic respon- 
siveness of tax collections or unemployment com- 
pensation to changes in the over-all state of the 
economy without increasing their responsiveness to 
the condition or activity of any individual. The 
general principle may be illustrated by the follow- 
ing hypothetical scheme: Weekly withholding of 
income tax from each taxpayer will be reduced by 
$2.00 for every percentage point by which the latest 
published seasonally adjusted figure for the rate 
of unemployment exceeds 4 per cent (and annual 
liability will be correspondingly reduced). Such a 
scheme would greatly increase the variability of 
tax collections in response to variations in the rate 
of unemployment without increasing the amount 
of tax taken out of an additional dollar of income 
earned by any individual. 
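
The hypothetical scheme may be written out as a short sketch. The
threshold of 4 per cent and the figure of $2.00 are taken from the text;
the flat rate of withholding, the treatment of fractions of a percentage
point as proportional, and the function names are assumptions made for
illustration.

```python
THRESHOLD_PCT = 4.0       # unemployment rate above which relief begins (from the text)
DOLLARS_PER_POINT = 2.00  # weekly reduction per percentage point (from the text)


def withholding_reduction(unemployment_pct):
    """Weekly reduction in withholding per taxpayer.

    Fractions of a percentage point are treated proportionally (an assumption).
    """
    excess_points = max(0.0, unemployment_pct - THRESHOLD_PCT)
    return DOLLARS_PER_POINT * excess_points


def weekly_withholding(weekly_income, unemployment_pct, flat_rate=0.20):
    """Weekly withholding under a hypothetical flat rate, less the reduction."""
    return flat_rate * weekly_income - withholding_reduction(unemployment_pct)
```

At an unemployment rate of 6.5 per cent the weekly reduction is $5.00
per taxpayer, while the additional withholding on an extra dollar of
income remains at the assumed rate whatever the rate of unemployment.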

Formula flexibility. Consideration of proposals 
for formula flexibility has not proceeded beyond an 
elementary and formal stage. One difficulty is that
while, once the formula is adopted, reliance on 
long-term forecasting is eliminated, the formula 
itself implies a long-run forecast of the way in 
which fiscal policy should respond to changes in the 
economy. Formula flexibility buys speed of response 
in exchange for discretion in deciding what the 
response should be. This may be a worthwhile 
exchange, but that has not been satisfactorily dem- 
onstrated. Possibly a more serious difficulty is polit- 
ical. Governments may be willing to take active 
fiscal measures when the need is clear and present 
but unwilling to commit themselves in advance to 
the automatic occurrence of such measures.

Discretionary flexibility. Efforts to increase the 
flexibility of the budget have been largely concentrated
on discretionary measures. Preparations are
sought which, while leaving the government in all 
cases free to act or not, will enable the government 
to decide more quickly and will speed up the effects 
of the decisions taken. The classic case of such 
preparations is the “shelf of public works.” Ideally
this is a reserve of government construction proj- 
ects for which legislative authorizations and appro- 
priations have been made, plans drawn, and sites 
obtained, so that work can be started quickly when 
the Executive decides to do so. Interest in the shelf 
of public works has diminished, at least in the
United States, since the end of World War II. Federal
nonmilitary public works activities have not
been large, and the volume of projects that seem 
desirable enough to authorize but not urgent enough 
to begin immediately has been small relative to the 
size of the economy and its potential fluctuations.
The volume of these that could quickly reach a 
high rate of operation and, if necessary, be stopped 
without great loss is even smaller. Also, it has ap- 
peared that the need for expansive action might 
arise when construction is running at a high rate, 
so that increasing public works would not be 
appropriate. 

The revenue side of the budget seems to hold 
out greater possibility for quick and powerful fiscal 
action. Federal revenues in the United States are 
about 20 per cent of the gross national product. 
Changes in tax rates that are administratively fea- 
sible could increase or decrease the available after- 
tax incomes of individuals or businesses by large 
amounts in a short time. The period required for 
legislative enactment of a tax change may be long, 
however, and there have been a number of pro- 
posals for shortening this period. These involve 
advance legislative authorization to the president 
to make a specified tax cut, possibly subject to 
Congressional veto within a limited period, or an 
agreement by Congress to act upon a presidential 
recommendation within a limited period. In fact, 
even without such arrangements the legislative 
process would allow quick Congressional action 
on a tax change if a strong majority of Congress 
were agreed on the need for it. 


Deliberate and systematic effort to use fiscal 
policy to keep the economy operating close to its 
potential while avoiding inflation and serving other 
objectives became standard practice in most Western
industrial countries after World War II. The
results of that experience were varied and interpre- 
tation of the results even more varied. But the 
experience did not contradict the idea that fiscal 
policy, at least when accompanied by appropriate 
monetary policy, is a powerful instrument for affect- 
ing economic performance in basically free econ- 
omies. This instrument by itself may be unable to 
reconcile inconsistencies where they arise between
some objectives, such as full employment and 
price stability or domestic stabilization and inter- 
national equilibrium. Even aside from this, knowl- 
edge and institutions are inadequate for manage- 
ment of the fiscal instrument to achieve its ideal 
effectiveness. Yet there is little doubt that the results
of freeing fiscal policy from older inhibitions
have on the whole been beneficial and that further
opportunities for improved use of the fiscal instru- 
ment, based on experience and study, remain. 


HERBERT STEIN 


[See also DEBT, PUBLIC; MONETARY POLICY; MONEY; 
PUBLIC EXPENDITURES; TAXATION.]

II
WARTIME FISCAL POLICY


The objectives and methods of peacetime fiscal 
policy are not invalidated by the onset of a major 
war. However, the overriding goal of winning the 
war modifies both the relative emphasis placed on 
the various objectives and the mixture of proce- 
dures adopted to achieve them. 

The experience of the United States in World 
War II will serve to illustrate the points involved.
The magnitude of that war and its grave threat to 
the nation demanded the greatest feasible effort. 
The public, with minor exceptions, strongly sup- 
ported the war. On balance there could be no re- 
course to the resources of other countries; instead, 
the United States supplied large amounts of maté- 
riel to its allies. However, the war was considered 
a temporary interlude in the national life, as is 
evidenced by the speed and completeness of de- 
mobilization at its conclusion. 

The magnitude of the war required policies de- 
signed to achieve maximum engagement and utili- 
zation of human and material resources: a sub- 
stantial proportion of the resources previously 
employed in producing civilian goods had to be 
shifted quickly to the military effort. Patriotic fer- 
vor was such that people were willing to sustain 
exactions, toil, inconvenience, and hardship not 
acceptable at other times, but only if they believed 
these burdens were being fairly shared by everyone. 
In the absence of both the opportunity and the 
reason to borrow abroad, all borrowing had to come 
from the same public that paid the taxes and bore 
the other burdens of the war, although not neces- 
sarily in the same proportions. Because of the ex- 
pectation that the war would be a temporary con- 
dition, people were willing to postpone the purchase 
of many durable and semidurable goods, thus free- 
ing resources for the war effort. The same expecta- 
tion also strengthened public resistance to efforts 
by particular groups to use the wartime emergency 
as an occasion for permanently altering the social 
and economic structure of the economy.

These wartime conditions had several implications
for fiscal policy. The use of a draft to select
persons for military service was an obvious neces- 
sity, but its superior equity over “buying” recruits 
also gave it a superficial attractiveness as a method 
of recruiting industrial and other supporting efforts. 
It was not, however, a practical method for eliciting 
the intensive, tedious, continued efforts required in 
war production. Moreover, such use of the draft 
and concomitant military command system would 
have threatened the postwar economy with the 
perpetuation of a totalitarian economic system. 

Accordingly, it was deemed imperative to rely as 
much as possible on governmental expenditures 
and fiscal incentives to stimulate maximum utiliza- 
tion of human and material resources, and on tax- 
ation and borrowing to support the resulting finan- 
cial costs of the war. However, certain direct 
controls were required to reinforce and comple- 
ment the fiscal measures. To have diverted re- 
sources from civilian to military use through fiscal 
measures alone would have been a slow, incom- 
plete, and very costly process; priorities and direct 
allocations were an effective addition, which, al- 
though sometimes drastic, were justified in the 
public mind by the war emergency. Furthermore, 
it was clear that, despite heavy taxation, persons 
with large incomes and accumulated wealth could 
virtually monopolize the purchase of scarce, necessary
consumer goods, while persons with small incomes
suffered intense hardship. Rationing, although
administratively difficult and sometimes
arbitrary, could assure broad distribution of the
supply. For reasons of economy and morale, price
and wage controls were needed to prevent spiraling
inflation, gross inequities, and permanent shifts in
the economic positions of different persons and
groups.

While fiscal measures alone cannot be expected 
to achieve the objectives, or cope with the economic 
problems, of a major war, they should be made to 
do as much as possible, thereby reducing the need 
to employ measures disruptive of the market econ- 
omy. The major objectives of an effective wartime 
fiscal program are fairness of burden distribution, 
stimulation of military and supporting production, 
avoidance of inflation, and salutary consequences 
for the postwar economy. In practice these goals 
must be pursued through political and administra- 
tive processes in which policy and action are deter- 
mined by what people believe, whether or not it is 
in accord with reality. 

Taxation versus borrowing. The major alternatives
for financing governmental expenditures are
taxation, borrowing of private savings, and expansion
of the money supply. The last usually takes
the form of loans from commercial and central 
banks rather than direct money issue. Because of 
its inflationary effects, it is the least desirable alter- 
native. No case has yet been noted of a major war 
financed by taxation alone. During World War II
the U.S. federal government collected in taxes 45
per cent of its expenditures, a somewhat higher
percentage than during World War I or the Civil
War and a somewhat lower percentage than was 
collected by Great Britain and Canada in World 
War I. 

The relative shares of war costs to be paid from 
taxation and from borrowing have been determined 
by various factors. A traditional belief that still 
seems to be influential is that through borrowing, 
a country can shift much of the cost of the war to 
“future generations” in the postwar years. This 
belief has no validity for a country relying on in- 
ternal resources. While a relatively small part of 
the real economic burden of the war can in some 
sense be shifted to postwar years, the amount thus 
postponable cannot be increased by financing the 
war through borrowing instead of taxing. What can 
be changed by this means is the distribution of the 
cost among persons. Wartime borrowing places 
financial burdens during the war on lenders, who 
after the end of the war are repaid out of taxes, 
which in turn are paid by the lenders and non- 
lenders alike. [See DEBT, PUBLIC.] 

The desirability of the redistribution resulting 
from wartime borrowing can be measured by ap- 
plying the criteria listed above. With respect to 
fairness of burden distribution, there are conflict- 
ing considerations. Financing through taxation 
avoids the injustice of levying a double burden on 
members of the armed forces: once on the battlefield
and again through taxation to reimburse
wartime lenders. However, to pay the total cost of 
a major war through currently collected taxes 
would require imposing a heavier burden on low- 
income groups than would be necessary if part of 
the cost were met by accumulating debts to be 
serviced after the war out of collections from pro- 
gressive taxes. 

With respect to production incentives, lending to 
the government has a less adverse effect than pay- 
ing taxes does, since the patriotic efforts of workers, 
farmers, and businessmen are strengthened by the 
assurance of a tangible reward in the form of current
assets and postwar spending power.

Taxation is distinctly superior to wartime borrowing
as a means of minimizing inflation both
during and after the war. To the citizen’s mind his
tax payments are gone forever, while what he loans 
to the government creates for him an asset that 
psychologically encourages his continued expendi- 
ture. To minimize this effect, the government seeks 
through shortages, rationing, and patriotic appeals 
to maximize private savings and to tap them 
through the sale of war bonds and other securities. 
The inadequacy of voluntary savings commonly 
necessitates borrowing from banks as well, with 
consequent expansion of the money supply. If the 
economy moves into war from a low level of out- 
put, as was the case in the United States in 
1940-1941, some growth in the money supply is 
needed to finance expanded production and any 
price increases that may be required to stimulate 
and transfer resource utilization. Also, the chances 
for a prosperous postwar economy are improved by 
sufficient wartime borrowing from the public and 
from banks to provide the reserves of liquid assets 
needed to support demand during the postwar tran- 
sition period. However, such borrowing through- 
out the war greatly exceeded these needs and cre- 
ated an excessive volume of liquid assets. Although 
the resulting inflationary pressures were largely 
suppressed during the war, the liquid assets con- 
tributed importantly to the inflationary spending of 
the postwar period. 

All these considerations may be variously eval- 
uated and balanced in choosing between more taxes 
or more borrowing. The conclusion almost certainly 
will be that more should be paid through taxation 
than will be politically feasible, because the psy- 
chological limits to taxation are undoubtedly much 
below the economic limits set by adverse effects 
on production incentives. The psychological limits 
appear to be determined more by the rapidity and 
extent of rate increases than by the previous level 
of the rates. In the United States the political and 
psychological limits undoubtedly were closely ap- 
proached by 1943, when Treasury Department ef- 
forts to secure substantial tax increases met with 
great political resistance and a minimum of public 
support.

Use of income taxes. The major sources of expanded
revenue to finance World War II were taxes
on net income, particularly the personal income
tax but also the corporation income tax and excess
profits tax. The characteristics of progressive income
taxation are especially needed in time of war.
The progressive income tax is highly flexible in that 
rates and exemptions can be readily changed, 
yields at given rates increase more than proportionately
to the rise in national income, and with
current collection, rate increases can be quickly
reflected in government revenues. The income tax
is the best adapted of all taxes for adjusting the 
tax load in accordance with the social consensus 
on what is equitable. Taxes on net income do not 
drive businesses into bankruptcy and are less likely 
than most other taxes to be shifted forward in 
higher prices with consequent wage and price esca- 
lation. However, despite its merits, the income tax 
can be overused; very high income tax rates may 
impair incentives to produce and unquestionably 
stimulate the search for ways to avoid or evade the 
tax. [See TAXATION.]

Use of sales taxes. An exception to the heavy 
use of the income tax by nations fighting World 
War II was in the Soviet Union, where sales taxation
is part of the apparatus of socialist income
distribution. In other countries the use made of 
sales taxes and other gross transactions taxes dif- 
fered greatly, but the rates generally were not 
greatly increased during the war, perhaps because 
of the regressive incidence of the taxes and their 
tendency to upset wage stabilization programs. In 
the United States the pre-emption of the sales tax 
field by the states was a factor, as was a political 
reluctance to introduce a general sales tax that 
might well remain as a permanent element of the 
postwar system. In Great Britain a heavy “pur- 
chases tax” on sales at the wholesale level was im- 
posed, at rates ranging up to 100 per cent. The ob- 
jective was not only to raise revenue but also to 
discourage the purchase of scarce, less necessary 
goods. A very modest use of this principle was 
made in the United States in the form of moderate 
excise taxes on scarce durable goods, with the un- 
intended sequel that they were treated as purely 
revenue taxes and their repeal was delayed long 
after the shortages had turned to surpluses. 

Special taxation. A major question regarding 
the concept of fairness in war taxation is to what 
extent higher rates or special taxes should be im- 
posed on increases of income realized because of 
the war. The corporate excess profits tax, which 
has become an expected element in wartime tax 
structures, is the principal illustration of taxation 
used for this purpose. It has the twofold objective 
of raising revenue where psychologically and 
economically it will presumably hurt the least 
and of preventing businesses from reaping uncon- 
scionable profits from the common emergency. In 
practice, the excess profits tax has proved to be 
extremely difficult to administer in harmony with 
its objectives; in a complex business society truly 
excessive profits are hard to define with precision
and even harder to measure. Yet having in the tax
system an “excess profits tax,” even a weak or arbi- 
trary one, seems to be essential to promote morale 
and make more acceptable the wartime controls 
that limit the incomes of workers and farmers. 

The principle of special taxation on increases of 
income was not extended to individuals, despite 
considerable support for such action, no doubt be- 
cause both the formulation and administration 
would have been even more difficult than for cor- 
porate business. The tax would also have damaged 
the war effort by impairing the incentives of war 
workers, whose new-found prosperity was indeed 
the target of the proposals. 

Substantial resistance appeared to any tax 
changes that would permanently modify the rela- 
tive economic positions of taxpaying groups and 
might therefore be considered reforms. Perhaps the 
clearest illustration is the estate tax, which was in- 
creased only moderately. The reasoning of the Con- 
gress seemed to be that if higher wartime rates 
continued into the postwar years, the result would 
be a major shift in the distribution of wealth and 
income; while strictly as wartime levies the rate 
increases would arbitrarily fall only on the estates 
of persons who died during the war. 

New methods of raising revenue did not prove 
popular with the wartime congresses. An effort by 
the Treasury Department late in the 1942 session 
to secure a progressive tax on personal expendi- 
tures was summarily rejected. The idea of dis- 
tinguishing part of the income tax as a refundable, 
compulsory loan, proposed by J. M. Keynes in 
1940, was strongly supported by some economists 
in the United States administration, and it was 
incorporated to a small degree in the “victory tax” 
on income—a tax designed to hit lower incomes, 
with war workers again the main target. The com- 
pulsory loan feature was in effect repealed before 
collection. The compulsory loan was, however, 
used in Great Britain and, for a short time, in 
Canada. The experience can scarcely be considered 
a success (Heller 1951). Keynes's objective in 
proposing compulsory lending was to raise the 
psychological limits of wartime taxation, but ap- 
parently the taxpayers rarely distinguished between 
the loan part and the tax part of the payment. 

External resources. Countries in a position to 
supplement their domestic resources with resources 
from other countries have used various methods 
to accomplish the transfer. For the most part, 
intergovernmental loans were used during World 
War I, but the repayment mechanism broke down
after the war; the United States, which was the 
principal creditor, imposed such heavy protective 
tariffs as to make repayment in goods virtually impossible.
In World War II the use of intergovernmental
loans was minimized, and large use was
made of the Lend-Lease program, which in effect 
was a method of common burden sharing with no 
postwar indebtedness liability. At the opening of 
World War I the British nationalized foreign securities
held by citizens and, as the situation
required, either sold them for foreign exchange 
or used them as collateral for loans. The British 
also incurred large foreign exchange debts in 
Commonwealth countries, some of which were 
not liquidated until long after the war. The Ger- 
mans made substantial drafts on the resources 
of countries that they occupied during the war 
(Lindholm 1947). 

“In-between” periods. Turning from major wars 
to periods marked by minor war or heavy cold war 
defense budgets, we find that some of the charac- 
teristics discussed above are absent. The economic 
and financial requirements during such periods 
impose burdens that large and growing economies 
may absorb with little difficulty—except for poten- 
tially greater inflationary pressures than would be 
present in time of real peace. There is little sense 
of national emergency or outpouring of patriotic 
fervor and willingness to sacrifice. There is gradual 
although reluctant recognition that the problem 
may continue indefinitely rather than be quickly 
ended. It may prove to be significant that thus far 
the experience of the United States with modern 
fiscal policy has not been during periods of sub- 
stantially unimpaired peace but during periods of 
major war, reconversion from war, minor war, 
and the “gray area” between war and peace. 

In a situation falling between peace and war the 
appropriate fiscal policies tend to be similar to 
those of a peacetime period, although of course 
much depends on the size and rate of growth of 
defense and military expenditures. At times, even 
the need for further fiscal stimulus may emerge. 
In an “in-between” period the use of control 
measures to supplement fiscal policy is usually 
resisted, and if they are imposed, their administra- 
tion is less successful in achieving compliance than 
in a time of major war. 

Fiscal measures must be carefully oriented. The 
policy toward borrowing should depend on current 
conditions, not on projections into a doubtful 
“postwar” period. The tax system should be de- 
veloped for the long run, not for a short-run 
emergency. An excess profits tax, for example, 
would be less appropriate in an “in-between” period 
than in time of major war, as well as more diffi- 
cult to design and administer, since there is no 
valid base period against which to measure the
excessiveness of profits. 

In short, a period between peace and major 
war is an unusually difficult one in which to op- 
erate fiscal policy. The economy may develop too 
much inflationary pressure for successful contain- 
ment through fiscal and monetary policy alone, 
while the absence of patriotic fervor and sense of 
urgency prevents wartime controls from being 
either acceptable or readily administrable. It is 
a period in which quasi controls, “voluntary” re- 
straints, and other halfway measures, unsatisfac- 
tory as they are for all concerned, may represent 
the only feasible supplement to fiscal and monetary 
policies. 

ROY BLOUGH


FISHER, IRVING


Irving Fisher (1867-1947), American economist,
had originally intended to become a mathematician.
He studied at Yale with the eminent
mathematician J. Willard Gibbs but was drawn
toward economics under the influence of William 
Graham Sumner. As Fisher himself reported, he 
was fascinated by Sumner. He was also deeply in- 
fluenced by the astronomer Simon Newcomb, who 
had published a number of economic texts, includ- 
ing the remarkable Principles of Political Economy. 

From 1892 to 1895 Fisher taught mathematics 
at Yale, with an interval in 1893 and 1894, when 
he visited Europe, spending time in Berlin and 
Paris. After 1895 he transferred from the mathe- 
matics to the economics department, remaining at 
Yale until his retirement in 1935. 

A prolific writer, gifted in the most varied sub- 
jects, active as a mathematician, statistician, de- 
mographer, economist, businessman, reformer, and 
teacher, Fisher left behind him some thirty books 
and hundreds of papers and theoretical studies, as 
well as many popular articles dealing with a wide 
spectrum of subjects, ranging from economic 
theory to public health. 

Fisher made a fortune from his invention of a 
visible card index file system, which he marketed 
in 1910. The corporation he founded merged with 
others in 1926 to form Remington Rand, Inc., of 
which he was a director until his death. He was 
also a founder and director of several agencies for 
economic analysis and forecasting, and a director 
of a considerable number of other companies. Part 
of Fisher's fortune was lost in the 1929 stock- 
market crash. 

He was connected with a large number of asso- 
ciations and interested in many public causes. He 
was concerned with the problems of world peace 
and campaigned for six months throughout the 
United States in favor of U.S. participation in the 
League of Nations. He published two books (1923; 
1924) and many articles on the subject. He was 
an ardent proponent of prohibition (see 1926; 
1928a). His objectivity in selecting statistical data 
for these books has been questioned by some of his 
opponents. 

Fisher also devoted a good deal of his time to 
campaigning for improved hygienic, sanitary, and 
eugenic practices. These crusades were an outcome
of his own severe attack of tuberculosis in 1898, 
which interrupted his work for four years and gave 
him an interest in problems of health. In 1913, 
together with H. A. Ley and ex-President Taft, he 
founded the Life Extension Institute, with a view 
to generating public awareness of the contribution 
that sound living and periodic medical examinations 
can make to good health. He published How to Live 
(see Fisher et al. 1915), which has run into 90 
editions and sold more than 400,000 copies in the 
United States; it has also been translated into ten
languages. None of Fisher’s economic writings was 
as successful. 

His deep concern with problems of eugenics 
arose from his belief that civilization could be saved 
only if the trends of physiological decadence of the 
superior elements and excessive reproduction of 
the inferior were reversed. Fisher served as presi- 
dent of the Eugenics Research Association, the 
American Eugenics Society, the Life Extension In- 
stitute, and the Vitality Records Office. 

Like Walras, and like Pareto in the earlier part 
of his life, Fisher was always actively concerned 
with public policy, trying to modify the existing 
situation and to protect the organization of the 
economic system. He saw clearly that economic 
difficulties are very often monetary difficulties, and 
he constantly recommended active intervention by 
the public authorities in monetary affairs. He was 
an apostle of managed currency and of stabiliza- 
tion of the purchasing power of money. In his book 
on stable money he related that between 1912 and 
1934 he prepared no less than 99 speeches, 37 let- 
ters to newspapers, 161 special articles, 9 submis- 
sions to appropriate governmental bodies, 12 cir- 
culars, and 13 books, a total of 331 documents 
intended to disseminate his ideas on monetary 
stabilization. Fisher was founder and president of 
the Stable Money League, a body whose aim was to
make propaganda in favor of the “compensated”
dollar.


Contributions to economics 


The very lucidity of Fisher's thought may have 
led superficial minds to undervalue its true worth. 
Since no effort is necessary to comprehend his 
meaning, there is a tendency to underestimate the 
complexity and, in many instances, the originality 
of his thinking. In contrast to Marx and Keynes, he 
could develop his ideas fully, specify them, and so 
strip them of their obscurities and contradictions 
that the formulas which emerged were extraordinarily
plain and clear. Whatever the difficulty of
the subject, Fisher excelled at distinguishing the 
theoretical from the practical, at using only per- 
fectly defined concepts, at identifying problems, 
treating each in a concise, clear paragraph, and at 
relegating to appendixes elements that were acces- 
sory to the main theme. His essential contribution 
lay, first, in his reduction of the copious accumu- 
lation of inconsistent notions in earlier writings to 
a contradiction-free synthesis that made full use of 
their valid elements and, second, in his lucid pres- 
entation of this synthesis. 

The remarkable characteristic of Fisher’s work
is that it contains no basic error. Taken as a whole,
and aside from a few minor errors of detail, it 
offers only valid ideas. His work is characterized by 
the ability to clarify, whether analytically or syn- 
thetically, rather than by the power of creative 
imagination. This is where Fisher's true originality 
is to be found. 

Use of mathematics. Fisher must be considered 
one of those who laid the foundations of modern 
economics, particularly of econometrics. He con- 
tributed more than any other scholar to the intro- 
duction into economics of scientific methodology 
and mathematical thinking, and he played an es- 
sential role in the development of specific concepts 
and theories which lie at the base of today’s eco- 
nomics. 

Even as a student, Fisher perceived the immense 
potential that a scientific approach offered in the 
field of economics, and he became one of the most 
brilliant pioneers in the cause of mathematical eco- 
nomics. He believed that sooner or later every sci- 
ence tends to become mathematical and that the 
social sciences are therefore only more backward 
than astronomy, physics, and mathematics. He was 
convinced that all those who, like Gibbs, system- 
atically relied on mathematics as a working tool 
would find their reward in the form of important 
discoveries. But he was also aware of the need to 
limit the use of mathematics. He was an economist; 
while mathematics was one of his tools, he never 
fell into the trap of mere formalism, characteristic 
of so much of the work done today. His work, like 
that of Pareto, teems with judicious comments on 
the scope and nature of theories, on the power and 
limits of the application of mathematical tech- 
niques, and on scientific method in general. 

In 1929 two young mathematical economists, 
Charles Roos and Ragnar Frisch, proposed to Fisher 
that he join them in founding a society aimed at 
disseminating mathematical thinking in economics. 
He welcomed the suggestion with enthusiasm. The 
international Econometric Society was founded the 
next year, with Fisher as its first president. By 
now this society has several thousand members, 
including many of the most distinguished economists.
[See ECONOMETRICS.] This may serve to indicate
how much the atmosphere has changed since
the young Fisher noted, in 1891, that mathematical 
economics was still almost as much on the de- 
fensive as it was during the 1870s and 1880s, when 
Jevons and Walras were pleading for it so ardently, 
and so vainly. 

Theory of value and prices. In the Mathematical
Investigations in the Theory of Value and
Prices (1892), a work of his youth, Fisher’s aim
was to present a general mathematical model of
the determination of value and prices. He claimed 
to have specified the equations of general economic 
equilibrium for the case of independent goods 
(chapter 4, sec. 10), although the only mathemat- 
ical economist whose work he had consulted was 
Jevons. With commendable honesty he recognized 
the priority of Walras’s Éléments d'économie po- 
litique pure (1874) as far as the equations of the 
general equilibrium are concerned and likewise the 
priority of Edgeworth’s Mathematical Psychics 
(1881) as regards the concept of utility surfaces. 
It appears that, although only a student, Fisher had 
independently developed a theory of general eco- 
nomic equilibrium that was identical to part of 
Walras’s and included the concept of the indiffer- 
ence surface, one of the fundamental bases of 
modern economic theory [see the biography of
WALRAS].

Given the existence of these earlier formula- 
tions, the truly new elements in the Investigations 
were its clarity of presentation, its illustration of 
the general theory of equilibrium by a mechanical 
model in the case of independent goods, and its 
generalization of equilibrium theory to the case in 
which the utility of each good depends on the 
quantities of other goods consumed. A comparison 
of Fisher’s text with Walras’s will show how much 
more clear and concise Fisher’s exposition is. 
Fisher, however, apparently did not perceive the 
distinction between ordinal and cardinal utility; it 
was Pareto who emphasized the concept of ordinal 
utility and used it systematically. Although the 
Investigations does have a valuable appendix on 
utility and the history of the mathematical method 
in economics, on balance, the assessment made by 
Ragnar Frisch (see Schumpeter 1948), that the 
Investigations is a work of monumental impor- 
tance, seems exaggerated and unjustified. 

Capital and income. Fisher’s study of capital 
and income (1906) was intended to place the fun- 
damental concepts in this field on a rational and 
rigorous basis and to develop the theorems flowing 
from these concepts. According to Fisher, his aim 
was to supply the long-missing link between the 
ideas and habits that govern business management 
and the theories of abstract economics. 

This aim was satisfactorily realized. The Na- 
ture of Capital and Income contains the theoret- 
ical foundations of accounting science, both at the 
enterprise level and for the economy as a whole, 
as well as of actuarial science. Moreover, it presents
these fundamentals in the framework of a
general economic theory. Thus, Fisher prepared a
rigorous foundation for the subsequent work on
national income and wealth. 

With this book Fisher apparently became the first 
economist to develop a theory of capital (including 
human capital) on an actuarial and accounting 
basis; both disciplines are essential parts of any 
economic calculus. His theoretical constructions is- 
sued from a concrete examination of accountancy 
and actuarial operations; this is the only valid ap- 
proach, and many economists (even including 
Keynes, in the Treatise on Money as well as in the 
General Theory) have erred by postulating a priori 
relationships between economic aggregates. 

Also in The Nature of Capital and Income Fisher 
presented a systematization of the two concepts of 
capital and income, both rigorously defined. He 
showed very clearly how these two concepts are 
linked through the rate of interest. Capital consists 
of a stock of goods; income is a flow of services. 
The value of capital is given by the present value 
of the future flow of income from it. The direction 
of the causal relation is not from capital to income 
but from income to capital; it is not from the pres- 
ent to the future but from the future to the present. 
In other words, the value of capital is the discounted 
value of expected income. [See CAPITAL.] 
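
The valuation rule just stated can be illustrated numerically. What follows is a minimal sketch, not Fisher's own notation: the function name `present_value`, the use of annual discrete discounting, and the income figures are all assumptions introduced for illustration.

```python
def present_value(incomes, rate):
    """Discounted value of expected income received at the end of years 1, 2, ..."""
    return sum(income / (1.0 + rate) ** year
               for year, income in enumerate(incomes, start=1))


# Hypothetical asset expected to yield 100 a year for three years.
expected_income = [100.0, 100.0, 100.0]

value_at_5_percent = present_value(expected_income, 0.05)
value_at_10_percent = present_value(expected_income, 0.10)

print(round(value_at_5_percent, 2))
print(round(value_at_10_percent, 2))
```

Only the expected income and the rate of interest enter the calculation; what the asset cost in the past appears nowhere, and a higher rate of interest lowers the value of the same expected income.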

He saw with unprecedented clarity that the eco- 
nomic present is no more than the capitalization of 
the future and that therefore the economic present 
is only a synthetic projection of the anticipated 
future. Fisher demonstrated convincingly that in 
economics only the future counts, and that past 
costs have no direct relevance to value. In point of 
fact, his research resulted in a rigorous definition 
of the bases on which it is possible to ground a 
valid theory of interest. 

The only notable omission in this work is Fisher’s 
failure to give the mathematical relations between 
capital and income in continuous notation. Such a 
formulation, which could well have found a place 
in an appendix, would have shown even more 
clearly the link between capital and income (see 
Allais 1965a). 

The analysis of capital and income in this book 
is so satisfactory that it is hardly possible to go 
beyond it, and it retains all its relevance today. 
Curiously enough, The Nature of Capital and In- 
come—which is Fisher's masterpiece, in my opinion 
—was not duly appreciated, and Pareto and 
Schumpeter are among the few economists who 
have recognized its value. 

Monetary theory. In The Purchasing Power of 
Money (1911) Fisher completely recast the theory 
of money, giving a full demonstration of the principles
that determine the purchasing power of
money in the formal framework of the equation of
exchange 


(1) MV + M’V’ = PQ 


and applying these principles to the study of his- 
torical changes in purchasing power. 

It is impossible, without doing grave injustice to 
the author, to analyze or even summarize this book, 
which is powerfully original in its close association 
of theory and econometric analysis with factual 
data. For Fisher, the purchasing power of money 
(or its reciprocal, the general price level) depends 
wholly on five well-defined factors: (1) the stock 
of money in circulation, M; (2) its velocity of cir- 
culation, V; (3) the volume of deposits, M’; (4) 
their velocity of circulation, V’; and (5) the over-all 
volume of transactions. 
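
As a rough numerical illustration, equation (1) can be solved for the price level, P = (MV + M’V’)/Q, taking Q to be the over-all volume of transactions. The sketch below assumes this reading; the function name `price_level` and every figure are hypothetical and are not drawn from Fisher's estimates.

```python
def price_level(m, v, m_prime, v_prime, q):
    """Price level implied by the equation of exchange MV + M'V' = PQ."""
    return (m * v + m_prime * v_prime) / q


# Hypothetical figures: currency M, deposits M', their velocities, transactions Q.
base = price_level(m=10.0, v=20.0, m_prime=50.0, v_prime=40.0, q=1100.0)

# Double both currency and deposits, holding velocities and transactions fixed.
doubled = price_level(m=20.0, v=20.0, m_prime=100.0, v_prime=40.0, q=1100.0)

print(base, doubled)
```

The price level doubles here only because the velocities and the volume of transactions are held constant; if any of them moves with the stock of money, the proportionality disappears.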

In his preface Fisher stated that, fundamentally, 
his ambition was only to renovate and amplify the 
old “quantity theory” of money. He claimed that if 
the previously presented version is modified ap- 
propriately, it must be accepted as a basically cor- 
rect theory. 

Although Fisher is all too widely believed to have 
been an intransigent quantity theorist, he specified 
in many places the limiting conditions on the 
quantity theory's validity. A few quotations should 
dissipate any doubt as to his real position:


. . . the theory is correct in the sense that the level
of prices varies directly with the quantity of money in
circulation, provided the velocity of circulation of that
money and the volume of trade which it is obliged to
perform are not changed. . . . ([1911] 1920, p. 14)

The strictly proportional effect on prices of an increase
in M is only the normal or ultimate effect after transition
periods are over. The proposition that prices vary
with money holds true only in comparing two imaginary
periods for each of which prices are stationary
or are moving alike upward or downward and at the
same rate. . . . (ibid., p. 159)

Therefore the “quantity theory” will not hold true 
strictly and absolutely during transition periods. . . . 
(ibid., p. 161) 


There is no doubt that the weak point in Fisher’s 
theory is his inability to free himself from the tram- 
mels of the ceteris paribus assumption, which 
Bishop Berkeley had introduced in the eighteenth 
century. In his attempt to show the correctness of 
the quantity theory, Fisher shed fresh light on a 
significant number of questions, but in a sense he 
wanted to prove too much and was led into giving 
insufficient weight to short-term transitional phe- 
nomena. 

Long before The Purchasing Power of Money
first appeared in 1911, Walras had attempted to
escape from this blind alley by defining the concept
of “real wanted cash balances.” The Cambridge
school, including Pigou and Keynes, did no more 
than take up Walras’s idea, but nobody was able 
to develop an operational formulation of the theory 
of money, and the coefficients that were defined 
could not be rendered determinate. An operational 
formulation has only been developed in the recent 
past, for example, by the present author (Allais 
1965b). The equation of exchange MV = PQ had 
already been used by Newcomb in his Principles of 
Political Economy, and indeed it was to Newcomb 
that Fisher dedicated his book on money. 

From a monetary viewpoint, the main contribu- 
tions of Fisher’s work seem to be a notable clarifi- 
cation of the significance of the formal framework 
of the equation of exchange; the definition of sta- 
tistical methods for estimating the different param- 
eters of this equation, in particular the velocities 
of circulation of notes and coin and of deposit 
money, and for estimating the over-all volume of 
transactions; the estimates themselves; and finally, 
a definitive clarification of the influence of demand 
deposits on prices and of the need to include them 
in the definition of the supply of money. 

Although Fisher was responsible for remarkable 
progress in monetary theory, he was not exempt 
from errors of judgment. Thus, chapters 11 and 12 
are devoted to the verification of the equations of 
exchange. But an equation of this kind, which is 
the definition of the velocity of circulation, cannot 
be verified; all that the statistics can demonstrate 
is the compatibility of the different estimates of the 
velocity of circulation. 

Whatever its limitations, The Purchasing Power 
of Money clarified much that had been confused. 
Fisher did not use the fertile Walrasian concept of 
desired cash balance explicitly, nor did he attain 
the fruitful Keynesian concept of liquidity prefer- 
ence. And it is true that he did not reach a satis- 
factory synthesis of the theory of money with price 
theory (although taking his work as a whole, he 
came nearer to such a synthesis than did Keynes). 
Nevertheless, he produced analyses of monetary 
questions—in particular of the velocity of circula- 
tion and of the equation of exchange—that in 
many respects are definitive. [See MONEY, articles
on QUANTITY THEORY and VELOCITY OF CIRCULATION.]

Theory of interest. For Fisher, the rate of interest
is governed by the balance between the supply
of capital, as determined by the psychology of
savers, and the demand for capital, as determined
by the possibilities of, and the outlook for, investment.

As with some of his other works, the major con- 
tribution of The Theory of Interest (1930a) is not 
so much the presentation of major new ideas as the
clarity and rigor of exposition of an extremely com- 
plex subject. The fundamental theses of the book 
had already been developed before Fisher by John 
Rae (1834) and, above all, by Böhm-Bawerk
(1884-1912). Fisher, indeed, underlined his debt
in the dedication of his book: “To the memory of 
John Rae and of Eugen Böhm-Bawerk, who laid 
the foundations upon which I have endeavored to 
build.” 

At the outset, Fisher made the distinction be- 
tween nominal interest and real interest. The major 
part of the book (chapters 4-18) is devoted to the
theory of the determination of the real rate of in- 
terest. Then follows a special chapter, 19, on the 
relation of interest to money and prices. Fisher's 
theory of the real rate of interest is a synthesis of 
psychological theories, such as the theory of abstinence,
and physical theories, such as the theory
of productivity. The objectivity with which he ac- 
complished this synthesis enabled him to give due 
weight to the significance of each of the different 
aspects. While the central themes of the book are 
all to be found in greater or lesser measure in 
Böhm-Bawerk’s work, the clarity of Fisher’s presentation
and the rigor of his analysis are incomparably
greater.

Throughout his analysis, Fisher quite correctly 
distinguished two problems, namely, how the in- 
terest rate is determined and why it is always posi- 
tive. He contended correctly, and in contrast to the 
approach taken by many earlier writers, that the 
determination problem should be the one studied 
first. He stressed, again quite correctly (chapter 
8, secs. 4, 5), that from a psychological or technical 
viewpoint there is nothing in the nature of men or
things that should lead one to expect that the rate
of interest, expressed in terms of whatever good
is chosen as unit, will be positive rather than negative.
Long before Keynes, Fisher showed clearly
that the rate of interest in terms of a given good
cannot become negative if the good can be stocked
without significant expense, a condition that is met
by money (chapter 2, sec. 3; chapter 11, sec. 9).
In his discussion of the case of the interest on unredeemable
bonds (chapter 13, sec. 10), he also
pointed out the impossibility of a zero or negative
interest rate. But since he does not appear to have
seen that land rents, like unredeemable bonds, are
in practice a kind of perpetual income, he failed
to realize that in a social organization based on
private land ownership the rate of interest cannot
be zero or negative (see Allais 1947, vol. 2,
pp. 479-499).

Fisher's theory of interest, like that of Böhm-Bawerk,
is a capitalistic theory springing from an
examination of the nature of capital. In a sense,
it is the opposite pole to a theory such as that of 
Keynes, which is essentially a monetary theory of 
the rate of interest. (The present author has dis- 
cussed this point: Allais 1947, vol. 1, p. 27.) 
Neither Fisher nor Keynes succeeded in approach- 
ing the indispensable synthesis of the two points of 
view. In discussing the repercussions of the theory 
of interest on monetary problems, Fisher showed 
conclusively that when prices are at peak levels, 
interest rates are high, not because the price level 
is high but because it has risen; and that when 
prices are low, interest levels are low, not because 
the price level is low but because it has declined 
(chapter 19, sec. 10). 
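
The dependence of interest on the movement of prices rather than on their level can be sketched with the relation commonly associated with Fisher's distinction between nominal and real interest, (1 + nominal) = (1 + real)(1 + expected price change). This is an illustrative assumption about how to formalize the point, not a reproduction of chapter 19; the function name and all figures are hypothetical.

```python
def nominal_rate(real_rate, expected_price_change):
    """Nominal rate implied by a real rate and an expected yearly price change."""
    return (1.0 + real_rate) * (1.0 + expected_price_change) - 1.0


REAL_RATE = 0.03  # assumed real rate of interest

# Prices may stand at any level; only their expected rate of change enters.
stable_prices = nominal_rate(REAL_RATE, 0.00)
rising_prices = nominal_rate(REAL_RATE, 0.05)
falling_prices = nominal_rate(REAL_RATE, -0.02)

print(round(stable_prices, 4), round(rising_prices, 4), round(falling_prices, 4))
```

The level of prices does not appear as an argument at all: with the same real rate, the nominal rate is higher when prices are rising and lower when they are falling.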

Fisher's Theory of Interest is not entirely free 
of error. For example, in the discussion of the 
optimum date for tree felling (chapter 7, sec. 6), 
Fisher stated that the date is given by the equality 
of the marginal growth rate of the forest and the 
rate of interest. This result is wrong and leads to 
an overestimation of the optimum length of the 
interval between two fellings—for no account is 
taken of the fact that the earlier the trees are 
felled, the earlier will it be possible to repeat the 
cycle. The quantity to be maximized is not the 
present value of a single felling, but the present 
value of all the income from successive fellings. 
Elsewhere, Fisher appears to have considered that 
the reason for the difference between long-term 
and short-term interest rates reflects the nearness 
or distance of the redemption date (chapter 13, 
sec. 11). He failed to observe what Keynes saw 
clearly, namely, that short-term loans have the 
advantage of being relatively more liquid and are 
therefore remunerated at a lower rate. Finally, 
Fisher stated that the productivity of nature is a 
factor tending to support the interest rate (chapter
8, sec. 6). Counterexamples showing that this
point of view is untenable can be found (see, 
among others, Allais 1947, vol. 2, p. 721). 
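
The tree-felling criticism made above can be checked numerically. The sketch below compares the felling age that maximizes the present value of a single felling with the age that maximizes the present value of an unending sequence of fellings. The timber-value curve, the interest rate, and the function names are assumptions chosen only for illustration; planting costs are ignored.

```python
import math

RATE = 0.05  # assumed continuous rate of interest per year


def timber_value(age):
    """Hypothetical value of the standing timber at a given age."""
    return age ** 2


def pv_single_felling(age):
    """Present value of one felling at `age`, discounted continuously."""
    return timber_value(age) * math.exp(-RATE * age)


def pv_repeated_fellings(age):
    """Present value of fellings at age, 2*age, 3*age, ... (geometric series)."""
    discount = math.exp(-RATE * age)
    return timber_value(age) * discount / (1.0 - discount)


ages = range(1, 101)  # candidate felling ages, in whole years
best_single = max(ages, key=pv_single_felling)
best_repeated = max(ages, key=pv_repeated_fellings)

print(best_single, best_repeated)
```

With these assumed figures the single-felling criterion, which equates the marginal growth rate with the rate of interest, gives a felling age of 40 years, while the repeated-felling criterion gives about 32 years. The first rule therefore overstates the optimum interval, as the text argues.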

Nevertheless, in an over-all view allowing for 
these minor criticisms, Fisher’s analysis of interest 
represents a successful attack on one of the most 
difficult problems of economic theory, one that 
such men as Walras, Pareto, and Marshall had 
not fully grasped and for whose first deep analysis 
the credit goes to Böhm-Bawerk. 

While it is true that Fisher did not completely 
resolve the issues of the basis of interest and the 
relation between the physical productivity and the 
value productivity of capital, he was responsible 
for a remarkable growth of understanding in this 
field. His analyses, particularly of the propensities 
to save and invest and of the interdependence of 
the rate of interest with other components of the 
economic system, prepared the way for his followers.
[See INTEREST.]

Monetary policy. Three of Fisher's works are 
devoted to projects for monetary reform: Stabiliz- 
ing the Dollar (1920), Stamp Scrip (1933b), and 
100% Money (1935). It is not unfair to say that 
throughout his life he was absorbed, even obsessed, 
by concern for the maintenance of a stable pur- 
chasing power of money—in other words, the 
avoidance of both inflation and deflation. 

Stabilization of purchasing power. Fisher's first 
book on monetary policy (1920) contains a plan 
for a reform of the gold standard, intended to 
stabilize purchasing power. Its principle, which 
was stated as early as 1911 in The Purchasing 
Power of Money (chapter 13, sec. 5), is very 
simple. If prices in terms of gold rise by one per 
cent, the official price of gold should be lowered 
by one per cent in order to maintain the purchas- 
ing power of the dollar; conversely, if prices in 
terms of gold decline by one per cent, the official 
price of gold should be raised by one per cent. 
In this system, the increase (or decrease) of the 
official price of gold must apply to all currencies 
if the principle of fixed exchange parities is to be 
respected. But a single country can also apply the 
system, as long as it is prepared to accept varia- 
tions in its currency exchange rate in line with 
the price fixed for gold in terms of national cur- 
rency. Stabilizing the Dollar also contains a sys- 
tematic bibliography on the stabilization of the 
purchasing power of money, in which Fisher cited 
many authors (e.g., Rooke 1824; Newcomb 1879;
Marshall 1887) whose ideas corresponded closely 
to his own. 
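
The adjustment rule described above can be traced through a small example. This is only a sketch under stated assumptions: the dollar price of goods is taken to be the gold price of goods multiplied by the official dollar price of gold, and the function names and starting figures are hypothetical.

```python
def adjust_official_gold_price(gold_price, percent_change_in_gold_prices):
    """Move the official price of gold opposite to the change in gold prices of goods."""
    return gold_price * (1.0 - percent_change_in_gold_prices / 100.0)


def dollar_price_index(index_in_gold, gold_price):
    """Dollar price of goods = gold price of goods x dollar price of gold."""
    return index_in_gold * gold_price


official_gold_price = 20.0  # hypothetical dollars per unit of gold
index_in_gold = 5.0         # hypothetical price of goods in terms of gold
index_before = dollar_price_index(index_in_gold, official_gold_price)

# Prices in terms of gold rise by one per cent; the rule lowers the gold price by one per cent.
index_in_gold_after = index_in_gold * 1.01
new_gold_price = adjust_official_gold_price(official_gold_price, 1.0)
index_after = dollar_price_index(index_in_gold_after, new_gold_price)

print(index_before, round(index_after, 4))
```

The dollar index moves from 100 to about 99.99. The one-for-one rule is therefore a close approximation for small changes; an exact offset would divide the official price by 1.01 rather than reduce it by one per cent.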

Despite Fisher’s assertions to the contrary (ap- 
pendix 2, 1p) the cogency of his propositions is 
based on the quantity principle, which is valid 
only over the long term. Although his plan there- 
fore applies only to long-term price movements, 
it would have been entirely viable and effective 
during the nineteenth century. Certainly, if the 
system had been in operation in that period, the 
long-run increases and declines in the price level, 
which actually occurred and whose drawbacks are 
evident, could have been avoided. It is also certain 
that it would have enabled full internal and ex- 
ternal currency convertibility to be maintained 
after World War I. One might even claim that the
introduction of this system would have been a 
necessary condition for the maintenance of con- 
vertibility. Indeed, as Fisher himself stressed, the 
link with gold is not strictly indispensable, and 
under a system of paper money, price stability can 
be underpinned by appropriate limitation of the
volume of newly issued means of payment. He
himself stated that the operation of his plan would 
be facilitated rather than hindered by the internal 
demonetization of gold, the sole essential feature 
of his scheme being the full convertibility of paper 
money into gold ingots (chapter 4, sec. 6). These 
ideas still have some potential value. In point of 
fact, they had been applied during the Middle Ages 
to prevent nominal prices from declining, and the 
fixed price of gold, introduced in the nineteenth 
century, was in a way a regression of practice. 

Stamped money plan. The objective of Fisher’s 
stamped money plan of 1932 and 1933 was to 
furnish an efficient method of combating the 
hoarding of money, which has extremely injurious 
effects in a period of depression, the more so as 
the rate of hoarding tends to accelerate. Here 
again, the original idea was not Fisher's own but 
was first suggested by Silvio Gesell in his book 
The Natural Economic Order (1906-1911, part 2) 
as a means of lowering the rate of interest. 

In the system advocated by Gesell, notes in cir- 
culation would retain their value only if validated 
each month by a stamp sold through the post office 
network. The price of the stamp could be fixed in 
the light of circumstances; Gesell proposed a figure 
equivalent to a depreciation of the order of 5 per 
cent per annum. Gesell’s aim was not so much to 
combat depression as to lower the rate of interest 
toward zero in order to suppress all unearned in- 
come in a market economy. 

In Booms and Depressions Fisher took up the 
idea with his customary clarity, pointing out that 
it constitutes a method of discouraging hoarding. 
He wrote: 


Let one hundred of these dollars be given to each citi- 
zen. . . . This “gift” would be to all of us from all of 
us (and so no gift at all). . . . After all the 12 stamp
spaces have been filled, the dollar could be redeemed 
either by another of the same kind or by an ordinary 
dollar, at the option of the government. If the stamped 
dollar, renewed, runs for nine years (108 months), 
the funds for this redemption will have already been 
provided to the government by the public. . . . This 
strange-appearing plan will not seem so strange if we 
think of it as a loan to the public from the govern- 
ment, to be repaid in monthly installments of one per 
cent. (1932, pp. 227-229) 
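
The arithmetic behind the quoted passage can be checked with a short sketch. It assumes, as the phrase "monthly installments of one per cent" suggests, that each monthly stamp costs one cent on a one-dollar note; the constant and function names are hypothetical.

```python
STAMP_CENTS_PER_MONTH = 1   # assumed: one per cent of a one-dollar note
FACE_VALUE_CENTS = 100      # one dollar


def stamp_revenue_cents(months):
    """Cents paid in stamps on a single dollar note after `months` months."""
    return STAMP_CENTS_PER_MONTH * months


def months_to_cover_face_value():
    """Smallest number of months whose stamps pay for the note itself."""
    months = 0
    while stamp_revenue_cents(months) < FACE_VALUE_CENTS:
        months += 1
    return months


revenue_after_nine_years = stamp_revenue_cents(108)  # nine years of stamps
break_even_months = months_to_cover_face_value()
```

Under this assumption the stamps sold over 108 months bring in slightly more than the face value of the note, which is why the funds for redemption would already have been provided by the public.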


Gesell’s suggestion was taken up by Keynes also, 
who pointed out that it would permit the level of 
interest rates to be lowered (1936, chapter 23, 
sec. 6). 

There is no doubt that a scheme of this kind 
is a valid antidepression weapon, provided it is 
extended also to cover bank deposits, which is
feasible. Fisher believed that in normal times there
would be no need to have recourse to this system 
but that if the need for it became evident, it could 
be introduced with beneficial effect. It has been 
objected that anyone wishing to hoard would not 
be prevented from hoarding land, precious stones, 
or precious metals, but from a monetary viewpoint 
this is not a valid objection: what is at issue is 
how to avoid generalized overproduction by render- 
ing it undesirable to hoard money. 

The proposals for monetary reform by Gesell, 
Keynes, and Fisher have not been understood. The
idea of stamped money has been derided as an 
economic absurdity. Yet the only objection that 
may validly be raised is that in normal circum- 
stances, i.e., when monetary policy is implemented 
reasonably, hoarding is moderate and the stamping 
of money is unnecessary. So far as the monetary 
aspect is concerned, the policy of validation of 
money through stamps is recommended by Fisher 
only in case of need. As for Keynes’s notion that 
lower interest rates would increase the real na- 
tional income, it can be shown that losses are 
wholly negligible by comparison with the capitalis- 
tic optimum situation with an interest rate of a 
few per cent (see Allais 1962). 

Today the danger is not deflation but inflation, 
and the current interest in the stamped money 
plan is quite limited, if it exists at all. But from a 
theoretical point of view, this system—as well as 
that of the stable dollar—has many features that 
merit reflection. 

The aim of Stamp Scrip, which was published 
in 1933, a few months after Booms and Depressions,
was to describe the stamped-money experiments
made in the towns of Schwanenkirchen (Bavaria)
and Wörgl (Austria) and in a score of American
cities in 1932, to assess the merits of these experi- 
ments, and to reply to certain objections that had 
been put forward. In Fisher's own words, the book 
was prepared “in a few days of fast and furious
work”; the speed with which it was written had,
to say the least, unfortunate consequences for its
quality.

Hundred per cent reserve. Fisher's aim in 100% 
Money was to show that economic fluctuations 
can be largely eliminated if demand deposits are 
totally backed by a corresponding amount of cash, 
thus depriving the banking system of its right— 
more or less erratically exercised—to create 
money; and to show that a system of this kind 
would actually “stop the irresponsible creation
and destruction of circulating medium by our 
thousands of commercial banks which now act 
like so many private mints” (1935, p. xi).

The fact is that the degree of potential instability
of a banking system becomes greater as the
coverage ratio departs farther from unity. Credit 
is generally the process in which a banker makes 
a loan of money he does not possess to a client 
who nevertheless considers the money as available. 
The system can continue to operate as long as de- 
positors as a whole have sufficient confidence in 
its stability. But once confidence is shaken, the 
banks are unable to honor demands for with- 
drawals. In its basic conception this system is 
irrational, and its only justifications are its histori- 
cal acceptance and the savings of gold that it 
permitted in the nineteenth century, when the 
price of gold was fixed—the possible consequence 
of this fixity under a gold standard being long-term 
decreases of the price level. Fisher's 100% Money 
sets forth a valid plan to rationalize the monetary 
system. Some of the aims of the plan have been 
implemented since 1934, when the Federal De- 
posit Insurance Corporation was founded: since 
then a bank failure caused by depositors withdraw- 
ing funds from the banking system has become a 
practical impossibility. However, the present system 
has the drawback that the management of the 
over-all money supply is less easy than it would 
have been under a “100 per cent money” system. 
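
The link between the coverage ratio and potential instability can be made concrete with a textbook simplification. The sketch below is not drawn from Fisher's book: it assumes that banks hold cash equal to a fixed fraction of demand deposits and lend out the rest, and the function names are hypothetical.

```python
def deposits_supported(cash, coverage_ratio):
    """Maximum demand deposits that a given cash base can support."""
    if not 0.0 < coverage_ratio <= 1.0:
        raise ValueError("coverage_ratio must lie in (0, 1]")
    return cash / coverage_ratio


def uncovered_share(coverage_ratio):
    """Share of deposits that could not be paid out in cash at once."""
    if not 0.0 < coverage_ratio <= 1.0:
        raise ValueError("coverage_ratio must lie in (0, 1]")
    return 1.0 - coverage_ratio


# Illustrative figures: 100 units of cash under two regimes.
full_reserve = deposits_supported(100.0, 1.0)   # "100 per cent money"
fractional = deposits_supported(100.0, 0.2)     # 20 per cent coverage
```

With full coverage the deposits equal the cash behind them and nothing is uncovered; as the ratio falls, the same cash supports more deposits and a larger share of them depends on depositors' confidence.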

Again, the original idea was not Fisher's, as he 
acknowledged in his preface. He himself gave the 
credit for the suggestion to a group of economists 
from the University of Chicago, including, among 
others, Henry C. Simons, Aaron Director, Frank H. 
Knight, Henry Schultz, Paul H. Douglas, and A. G. 
Hart. In November 1933 this group circulated an 
unsigned 26-page mimeographed paper entitled 
“Banking and Currency Reform” (see Walker 1935; 
Hart 1935). However, the credit for the original 
idea should not go to the Chicago school. Much 
earlier, in 1898, Walras was stressing the unstable 
nature of the system of issue of banknotes and was 
proposing a 100 per cent coverage ratio (1898, 
pp. 348, 365, 374, 375 of the 1936 French edi- 
tion). Ludwig von Mises adopted much the same 
position in 1928 in his book Geldwertstabilisierung 
und Konjunkturpolitik. 

A system of 100 per cent money is perfectly 
feasible providing that banking activity is split into 
two clearly separated branches, deposits and lend- 
ing. Depositors would pay charges to cover the cost 
of managing their accounts, and banks would 
make loans with funds they had borrowed for that 
purpose. Introducing the system would involve 
certain transitional problems, but they could easily 
be overcome, and their resolution would give governments
a much greater mastery of monetary
policy than could be acquired in any other way.
The idea of this reform has generally been aban- 
doned today; its only proponents are a few econ- 
omists such as Milton Friedman in the United 
States (1959) and Maurice Allais in France 
(1947). 

Other works on monetary policy. Three other 
books written for the lay public should be men- 
tioned among Fisher's published works on monetary 
reform. These are The Money Illusion (1928b), 
Booms and Depressions (1932), and Inflation? 
(1933a). The first and last of these are, by com- 
parison with his other works on the subject, of 
limited interest. 

Booms and Depressions is of more importance. 
It has three parts, dealing with theories, facts, and 
remedies, respectively. Nine main factors are con- 
sidered: overindebtedness, volume of currency, 
price level, net worth, profit, production, psycho- 
logical factors, currency turnover, and rates of 
interest. Most stress is placed on the factor of over- 
indebtedness. 

When the 1929 slump occurred, Fisher was over 
60 years old; yet he tackled the very difficult theory 
of economic fluctuations. He finally reached the 
conclusion—which contains a large element of 
truth—that business cycles are due on the one 
hand to the existence in the banking system of 
uncovered demand deposits and on the other to 
the opportunities for hoarding offered by the circu- 
lating monetary media. These views led him to 
recommend regulation of the demand for and 
supply of money through steadily depreciating cir- 
culating money and 100 per cent coverage of de- 
mand deposits as a cure for business cycle ills (see 
1933b; 1935).

Appraisal of monetary policy. Nowhere in his 
writings did Fisher really take up the central prob- 
lem of variations in the demand for money with the 
level of economic activity, and despite the reserva- 
tions he himself made, his approach was too nar- 
rowly oriented toward quantity theory. As Schump- 
eter aptly put it, “the scholar was misled by the 
crusader.” His propaganda in favor of the compensated
dollar resulted in his misinterpretation of the
price stability that ruled up to 1929 and, corre- 
spondingly, his total unawareness of the gravity of 
the situation. He was one of the most optimistic 
supporters of the doctrine of the “new economic
era.” Although he had done quite well from his 
investments in the 1920s, he had to absorb large 
losses in the 1929 crisis. But as soon as he assessed 
its scope properly, he became an indefatigable proponent
of the various plans for the restoration of
prosperity that have been summarized above. Together
with George Frederick Warren, he persuaded
President Roosevelt to devalue the dollar in order
to stimulate a rise in the price level in the United 
States. 


Contributions to statistics 


Fisher's work on money and prices, in particular 
his propositions for stabilization of the dollar, led 
him to considerable advances in two branches of 
statistical science: price indices and distributed
lags. 

Index numbers. The aim of Fisher's book The 
Making of Index Numbers (1922) is to identify 
the characteristics of the best feasible index of 
prices for use in measuring changes in the purchas- 
ing power of money. This book, in which he tried to 
systematize and rationalize index number theory 
by defining a certain number of criteria, is in fact 
an extension of chapter 10 of his Purchasing Power 
of Money and of the appendix to that book. His 
research into the qualities of a satisfactory price 
index was a by-product of his general analysis of 
the equation of exchange. 

Whereas Fisher's approach in The Purchasing
Power of Money was deductive, in The Making of 
Index Numbers it was inductive and empirical: he 
compared the results of using different formulas 
on the same historical data. He used two principal 
criteria of evaluation, the “time reversal test” and 
the “factor reversal test,” and recommended use of 
the “ideal” index, the geometric mean of the 
Paasche and Laspeyres indices.
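
A minimal sketch of the "ideal" index and the two reversal tests follows. The price and quantity figures are hypothetical, and the function names are illustrative rather than Fisher's own notation.

```python
from math import sqrt


def laspeyres(p0, p1, q0):
    """Price index weighted by base-period quantities."""
    return sum(p * q for p, q in zip(p1, q0)) / sum(p * q for p, q in zip(p0, q0))


def paasche(p0, p1, q1):
    """Price index weighted by current-period quantities."""
    return sum(p * q for p, q in zip(p1, q1)) / sum(p * q for p, q in zip(p0, q1))


def fisher_ideal(p0, p1, q0, q1):
    """Geometric mean of the Laspeyres and Paasche indices."""
    return sqrt(laspeyres(p0, p1, q0) * paasche(p0, p1, q1))


# Hypothetical data for two goods in a base period (0) and a later period (1).
p0, q0 = [1.0, 2.0], [10.0, 5.0]
p1, q1 = [1.5, 2.2], [8.0, 6.0]

forward = fisher_ideal(p0, p1, q0, q1)         # price index from 0 to 1
backward = fisher_ideal(p1, p0, q1, q0)        # price index from 1 to 0
quantity_index = fisher_ideal(q0, q1, p0, p1)  # roles of p and q exchanged
value_ratio = (sum(p * q for p, q in zip(p1, q1))
               / sum(p * q for p, q in zip(p0, q0)))
```

The time reversal test requires that `forward * backward` equal 1, and the factor reversal test requires that `forward * quantity_index` equal `value_ratio`; the ideal index satisfies both, whereas the Laspeyres and Paasche indices taken separately in general do not.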

There can be no doubt that Fisher’s study, which 
was the most extensive at that time in the field of 
index numbers, was a fruitful springboard for 
much of the progress made subsequently [see INDEX 
NUMBERS]. 

Distributed lags. Fisher was the first to en- 
visage a systematic dependence of the present on 
the past in economics, and thus he opened up a 
whole new area. The existence of systematic effects 
explains why it has been possible successfully to 
analyze economic and geophysical time series using 
autoregressive equations 


(1)  $b_0 y_t + b_1 y_{t-1} + \cdots + b_p y_{t-p} + \cdots = \epsilon_t,$

which when inverted can be written formally as

(2)  $y_t = a_0 \epsilon_t + a_1 \epsilon_{t-1} + a_2 \epsilon_{t-2} + \cdots + a_p \epsilon_{t-p} + \cdots,$


in which $y_t$ is the cumulative effect of earlier actions,
$\epsilon_{t-p}$, weighted by coefficients $a_p$, which decline
with distance in time. The accepted English term
for this formulation, “distributed lags,” was coined
by Fisher. This term is intended to convey that
each $\epsilon_t$ acts with a certain delay, so that lags of
different length must be taken into account when
studying the influence of the past.
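
The inversion of an autoregressive equation into distributed-lag form can be sketched numerically. The example assumes a finite autoregression with coefficients b_0, ..., b_p and obtains the lag weights a_k by matching coefficients; the function name and the first-order example are illustrative and do not come from the article.

```python
def invert_autoregression(b, n_terms):
    """Return the first n_terms lag weights a_0, a_1, ...

    If b[0]*y_t + b[1]*y_{t-1} + ... + b[p]*y_{t-p} = e_t, then
    y_t = a_0*e_t + a_1*e_{t-1} + ..., where the weights satisfy
    sum_j b[j]*a[k-j] = 1 for k = 0 and 0 for k > 0.
    """
    if b[0] == 0:
        raise ValueError("b[0] must be nonzero")
    a = []
    for k in range(n_terms):
        acc = 1.0 if k == 0 else 0.0
        # Subtract the contribution of weights that are already known.
        for j in range(1, min(k, len(b) - 1) + 1):
            acc -= b[j] * a[k - j]
        a.append(acc / b[0])
    return a


# Illustrative first-order case: y_t - 0.5*y_{t-1} = e_t.
weights = invert_autoregression([1.0, -0.5], 5)
```

In this first-order case the weights halve with each additional period of lag, which illustrates coefficients that decline with distance in time.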

In his study “Our Unstable Dollar and the So- 
called Business Cycle” (1925) Fisher proposed a 
formulation of the type 


$$y(t) = y_0 + \int a(\theta)\, p(t - \theta)\, d\theta,$$


in which the weights $a(\theta)$ are distributed lognormally,
for study of the interdependence of the level
of economic activity $y(t)$ and past values $p(t - \theta)$
of the general level of prices.

Later, in his Theory of Interest (chapter 19, 
sec. 6), he used weights $a_p$, which declined linearly
with time, to study the relationship between the 
rate of interest and earlier rates of increase of the 
price level. 
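
A discrete sketch of a distributed lag with linearly declining weights is given below. The window length, the normalization of the weights to sum to one, and the function names are assumptions made for illustration; they are not Fisher's published figures.

```python
def linear_weights(n):
    """Weights proportional to n, n-1, ..., 1, normalized to sum to one."""
    raw = [n - k for k in range(n)]
    total = sum(raw)
    return [w / total for w in raw]


def distributed_lag(series, n):
    """Weighted average of the current and the n-1 preceding values.

    The most recent value receives the largest weight; the result starts
    at the first date for which n observations are available.
    """
    weights = linear_weights(n)
    return [
        sum(weights[k] * series[t - k] for k in range(n))
        for t in range(n - 1, len(series))
    ]


# Hypothetical rates of price change over five periods, three-period window.
price_changes = [0.02, 0.03, 0.01, 0.04, 0.02]
smoothed = distributed_lag(price_changes, 3)
```

Each value of `smoothed` would then be compared with the interest rate of the same date, so that the rate is related to a weighted history of price changes rather than to the latest change alone.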

The line of approach initiated by Fisher was 
later to prove particularly fertile in econometric 
thought. [See DISTRIBUTED LAGS.]


Influence 


Unlike Adam Smith, John Stuart Mill, and Alfred 
Marshall, Fisher wrote no systematic treatise. The 
reason for this is doubtless that Fisher was preoccu- 
pied above all with research and that his manifold 
practical activities took much time. In Schumpeter’s 
well-chosen words, Fisher’s works “are the pillars 
and arches of a temple that was never built. They 
belong to an imposing structure that the architect 
never presented as a tectonic unit” ([1948] 1960,
p. 237). But these foundations are solid.

Fisher was not an eminent philosopher, like
Cournot, and he did not have a universal mind, 
like Pareto; he did not share Walras’s preoccupa- 
tion with social problems or try to study the philos- 
ophy of the social and economic organization of the 
time, as did Keynes and Schumpeter. But the fact 
remains that he made major contributions to the 
fundamental problems of capital, interest, and 
money. 

As Schumpeter observed in his remarkable bio- 
graphical article, Fisher, unlike Marx, Marshall, 
and Keynes, did not found a school. He had many 
pupils, but few disciples. In his crusades he joined 
forces with many other groups and individuals, but 
he remained almost alone in his scientific work. 
Perhaps it was the derision aroused by his crusading
activity that led to his isolation; one of his
critics has written: “His career was marked by 
neglect at its inception and ridicule at its close” 
(Lekachman 1959, p. 293). 

For many years, Fisher’s influence was nonethe- 
less considerable. He was president of the American 
Statistical Association, of the international Econometric
Society, of the National Institute of the
Social Sciences, and of the American Association 
for Labor Legislation. The appearance of a book 
by Fisher was invariably an event, and it would be 
reviewed widely, though the reception might be 
favorable or hostile. In the case of both Stamp 
Scrip and 100% Money, the reaction among 
Fisher's fellow economists and in official circles 
was unfavorable. In fact, Fisher was often consid- 
ered prone to draw too rapid conclusions from ab- 
stract conceptions and therefore inclined to sug- 
gest measures whose chances of success were 
correspondingly uncertain. 

Although there has never been a Fisher school 
in the sense that there has been a Keynesian school, 
Fisher's influence on a great number of young econ- 
omists was nevertheless profound (see Sasuly 
1947). As a disciple of Fisher, the author of the 
present article is among those who have recently 
produced systematizations, generalizations, or extensions
of Fisher's theoretical and econometric
work on capital, income, interest, and money (see 
Allais 1943; 1947; 1954; 1965a; 1965b). 


Assessment 


The opinion of Fisher's work has not generally 
been as high as its merit warrants, especially in the 
United States, where its value has been grossly 
underestimated. 

To a remarkable extent Fisher combined within 
himself the eminently Anglo-Saxon preoccupation 
with facts and practical action and the essentially 
Latin quality of clarity of conception and exposi- 
tion. He was at one and the same time theorist and 
practitioner, having the characteristics, therefore, 
of a great engineer. 

Fisher had, above all, an extraordinary feeling 
for things concrete, and the whole body of his work 
is permeated by an unceasing search for numerical 
applications. For him no theory was of use unless 
it led to applied work and the quantitative analysis 
of factual data. Fisher always maintained close 
contact with businessmen and tried to familiarize 
himself with their reactions and preoccupations, 
so as to analyze them and compare them with the 
theoretical models of economic science. He was 
deeply interested in practical action. Economic 
science, as he saw it, was not merely pure philo- 
sophical speculation; it should be used, as engineer- 
ing is, to achieve practical ends. 

Fisher's normative approach tended to lower the 
esteem in which he might otherwise have been 
held. This unfortunate fate he shares with Marx, 
whose remarkable sociological accomplishments 
have been partially discredited by his political attitude,
and with Walras, whose normative propositions
have been considered by many as unsophisticated
or even downright infantile.

Men who are accepted as geniuses have been 
known to spout nonsense on certain issues. As 
Pareto wrote: “It is somewhat hard to believe, 
although it is no more than the truth, that the great 
Newton wrote a book which proved that the prophe- 
cies of the Apocalypse had come to pass.” No one 
would conclude from this that Newton’s Mechanics 
is not a first-level achievement of the human spirit. 

Fisher marks a decisive stage in the history of 
economic science. He was the first economist to 
combine profound theory and authoritative obser- 
vation. He contributed powerfully to the construc- 
tion of theoretical mathematical models aimed at 
the explanation of reality, and at the same time, 
whether in working out his assumptions or inter- 
preting his results, he never lost his extraordinary 
preoccupation with reality, which he observed and 
analyzed with a refined sense of the concrete. 

Simultaneously a theorist and a practitioner, 
Fisher combined to the highest degree two sup- 
posedly incompatible characteristics: an esprit de 
géométrie and an esprit de raffinement; these are,
despite Pascal's opinion, but two sides of one and 
the same medal—intelligence. Because of these 
qualities, Fisher ranks with the greatest contrib- 
utors to economic science. Like physics, economics 
calls for the study of abstract constructions at the 
same time that it requires the observation and 
analysis of facts. When Fisher died in 1947, the 
present author wrote (Allais 1947) that it was be- 
cause of the combination of these two qualities that 
Fisher had to be given a place in the hall of fame 
of modern economics. A year later Schumpeter described
him as America’s greatest scientific economist.
The future will certainly confirm this judgment.
Fisher opened up a new horizon. Others will
go beyond the point he reached: they have done so 
already, for it is easier to progress when the way 
is posted with signs. 


MAURICE ALLAIS 


[For the historical context of Fisher's work, see the 
biographies of BÖHM-BAWERK; NEWCOMB; PARETO;
RAE; SUMNER; and WALRAS.]

Ronald Aylmer Fisher (1890-1962) achieved
world-wide recognition during his lifetime as a 
statistician and geneticist. He continued the work 
begun in England by Karl Pearson at the begin- 
ning of the twentieth century, but he developed it 
in new directions. Others also contributed to the 
tremendous surge in the development of statistical 
techniques and their application in biology; but
these two men, by their energetic research and example,
in turn held the distinction of dominating
the statistical scene for a generation. 

Fisher was born in East Finchley, near London. 
Apart from a twin brother who did not live long, he 
was the youngest of seven children. His father, 
George Fisher, was an auctioneer; no particular 
scientific ability is evident in the achievements of 
his relatives, except perhaps those of an uncle who, 
like Ronald Fisher, was a wrangler in the mathe- 
matical tripos at Cambridge. Fisher attended school 
at Stanmore Park and then went to Harrow, where 
he was encouraged in his mathematical interests 
and won a scholarship to Gonville and Caius Col- 
lege, Cambridge. His leanings in mathematics fol- 
lowed the English tradition in natural philosophy, 
and his university student years, from 1909 to 
1913, culminated in his receiving first a distinction 
in optics for his degree papers in 1912 and then a 
studentship in physics during his postgraduate 
year. He had, however, already noticed from study- 
ing Karl Pearson’s Mathematical Contributions to 
the Theory of Evolution that natural philosophy 
need not stop with the physical sciences. 

After leaving Cambridge, Fisher spent a short 
time with the Mercantile and General Investment 
Company. When World War I broke out in 1914,
his very bad myopia prevented him from joining 
the army, and he taught mathematics and physics 
for four years at various English public schools. In 
1917 he married Ruth Eileen Guinness, who was to
bear him eight children. 

Fisher did not really begin his full-time statistical 
and biological career until 1919, when he became 
statistician at Rothamsted Experimental Station,
an agricultural research institute in Harpenden, 
Hertfordshire. His earlier years had, however, been 
a valuable gestation period. In a short paper pub- 
lished in 1912 while he was still at Cambridge, he 
had already proposed the method of maximum like- 
lihood for fitting frequency curves. Two more solid 
papers established his permanent reputation for re- 
search. The first was his remarkable paper on the 
sampling distribution of the correlation coefficient, 
published in Karl Pearson’s journal Biometrika in 
1915, in which his geometrical powers of reason- 
ing were first fully displayed. The second, published 
in 1918, examined the correlation between rela- 
tives on the basis of Mendelian inheritance and 
exhibited his ability to resolve crucial problems of 
statistical genetics. Fisher received from Karl Pearson
an offer of a post at University College at the
same time that he received the Rothamsted offer, 
but he wisely chose Rothamsted, largely because of 
the much greater scope and independence of this 
new statistical post but also, perhaps, because his
contacts with Pearson had not been particularly
promising. Pearson was apt to bulldoze his way 
into research problems without worrying unduly 
about territorial rights. Having been previously 
stymied by the correlation distribution problem, he 
took over Fisher’s solution with enthusiasm but 
without the further close consultation that profes- 
sional etiquette would seem to require. Fisher was 
aggrieved by this treatment, and it may well have 
been the start of the long and bitter feud that de- 
veloped over the years. Fisher had reason to criti- 
cize much of Karl Pearson’s work, but the personal 
animosity that developed between them was some- 
thing more than a substantive disagreement. As 
late as 1950, when a selection of Fisher's best sta- 
tistical papers was published, the omission of the 
1915 Biometrika paper was a silent reminder of 
Fisher's feelings. 

The period at Rothamsted, from 1919 to 1933, 
was the most brilliant and productive of Fisher's 
career. The institute, with its teams of biologists 
and congenial research atmosphere, was precisely 
the environment Fisher needed. His own wide 
range of biological interests enabled him to under- 
stand his colleagues’ problems and to discuss their 
statistical aspects constructively with them. His
statistical activities were represented by the publication
of his best-known book, Statistical Methods
for Research Workers (1925a), which has been 
published in 13 English editions and also translated 
into several foreign languages. Fisher's varied ac- 
complishments included doing much of his own 
computing and initiating many of his own genetical 
experiments on poultry, snails, and mice, although 
it is for his creative theoretical ability that he will 
be remembered. By 1929 he had been elected a 
fellow of the Royal Society of London. He pub- 
lished his classic on population genetics, The 
Genetical Theory of Natural Selection (1930a), in 
which he did much to reconcile Darwin's theory 
of evolution by natural selection with Mendel’s 
genetical principles, which were unknown to Dar- 
win. In the later chapters of this work he discussed 
the theory, first suggested by Francis Galton, of the 
evolution of a genetic association between infer- 
tility and ability. This could result from marriages 
between those successful because of high innate 
ability and those successful because of social ad- 
vantages due to relatively infertile parents’ having 
concentrated their material resources on one or two 
children. His energetic views included proposals for 
family allowances proportional both to size of 
family and to size of income, which would offset 
the penalty imposed on children by parental 
fertility. 

Fisher's genetic and eugenic interests were soon
to be reflected in his move to London in 1933, when
he was appointed Galton professor at University 
College as successor to Karl Pearson. This move, 
however, no doubt fanned the flames of their feud. 
When Pearson retired, the college isolated the 
teaching of statistics in a new department of sta- 
tistics, under his son Egon; the Galton professor 
was left with eugenics and biometry. In spite of 
Egon Pearson’s greater tolerance and appreciation 
of Fisher's new statistical techniques, which emphasized
precise methods of analysis in small
samples, Fisher felt frustrated, as he indicated at 
the time in a letter to W. S. Gosset. Moreover, Jerzy 
Neyman, who held a post in the statistics depart- 
ment from 1935 to 1938, incurred Fisher's wrath 
by publishing work that Fisher regarded as un- 
necessary or misguided; their proximity in the same 
building at University College exacerbated Fisher's 
sense of injury. The recurrence of feuds of this 
kind was by now beginning to be as much a manifestation
of Fisher's own temperament as of his
antagonists’. His wide interests and strong person- 
ality made him a charming and lively companion 
when he chose to be and a generous colleague to 
those who were in sympathy with his work, as 
many have testified. But his emotions as well as his 
intellect were too bound up in his work for him to 
tolerate criticism, to which he replied in vigorous 
and sometimes quite unfair terms. Apart from such 
lapses from objectivity, Fisher proceeded to con- 
solidate his scientific reputation both by the de- 
velopment of the study of genetics (especially 
human genetics) in his department and by the con- 
tinued publication of statistical works, such as The 
Design of Experiments (1935a), Statistical Tables 
for Biological, Agricultural, and Medical Research 
(with F. Yates, in 1938), and further original 
papers in his departmental journal (the Annals of 
Eugenics) and elsewhere.

The third main phase of Fisher’s scientific career 
was his appointment to the Arthur Balfour chair of 
genetics at Cambridge, from 1943 until his retirement
in 1957. During this time he wrote two more
books—The Theory of Inbreeding, in 1949, and 
Statistical Methods and Scientific Inference, in 
1956—and also edited the collection of his papers 
published in 1950 (see 1920-1945); but most of 
his important work was already under way. Honors 
continued to accumulate, including three medals
from the Royal Society (a royal medal in 1938, a
Darwin medal in 1948, and a Copley medal in
1955) and a knighthood in 1952. Shortly before he retired,
he became master of his Cambridge college.
He was an honorary member of the American Academy
of Arts and Sciences, a foreign associate of the
National Academy of Sciences, a member of the Pontifical
Academy of Sciences, and a foreign member
of the Royal Danish Academy of Sciences and Let- 
ters and of the Royal Swedish Academy of Sciences. 

After retirement he visited the division of mathe- 
matical statistics of the Commonwealth Scientific 
and Industrial Research Organisation in Adelaide, 
Australia, where he was a research fellow at the 
time of his death. 


Contributions to statistics and genetics 


Statistics. To turn in somewhat more detail to 
Fisher's original work, a formal listing of his con- 
tributions to statistics item by item might result in 
an underemphasis of the strength of their impact, 
which was due to their simultaneous variety and 
depth. Moreover, a formal listing is unsatisfactory 
since the intimate relation of Fisher's contributions 
to the practical problems arising from his profes- 
sional environment sometimes meant that their 
academic presentation was incomplete or late, or 
both. Work of great value, such as the technique 
of analysis of variance, received inadequate dis- 
cussion in Statistical Methods for Research Work- 
ers because it had hardly reached any degree of 
finality when this book was published in 1925, and 
it was still rather cluttered with ideas of intraclass 
correlation; apparently, Fisher never bothered to 
redraft the discussion for the later editions. 

Excluding Fisher’s mathematical work in genet- 
ics, it is nevertheless convenient to try to list his 
chief contributions to statistical theory under two 
main headings: (1) fundamental work in statistical
inference, and (2) statistical methodology and
technique. The first group would include his important
work on statistical estimation, mainly represented
by two papers, one published in the Philosophical
Transactions of the Royal Society (1922),
and the other published in the Proceedings of the 
Cambridge Philosophical Society (1925b). Before 
writing these papers, Fisher had already been much 
concerned with precise inference in small samples 
for familiar quantities, such as the correlation co- 
efficient, chi-square, etc., and had produced a 
steady flow of papers on their sampling distribu- 
tions, of which his 1915 paper on the correlation 
coefficient is the best known. He was very careful 
to distinguish between an unknown population 
parameter and its sample estimate. When the 
sampling distribution of the estimate was available 
in numerical form, a test of the significance of any 
hypothetical value of the parameter became pos- 
sible. These precise tests of significance (for ex-
ample, of an apparent correlation r in the sample
on the “null hypothesis” of no real correlation,
ρ = 0) were particularly valuable at the time be-
cause of the tendency among biologists and other
research workers not to bother with them. Fisher's
own emphasis on them was, however, rather incon- 
sistent with his subsequent attack on the Neyman- 
Pearson theory of testing hypotheses, especially 
since an unthinking use of these significance tests 
by some workers, as in the failure to recognize that 
a nonsignificant result does not imply the truth of 
the hypothesis tested (e.g., ρ = 0), caused some
reaction against their use later on. 
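
A test of this kind can be illustrated with a short Python sketch. It approximates the sampling distribution of r under the null hypothesis by simulation, which is a substitute for Fisher's exact derivation and not his method. The function names, the sample size of 10, and the observed value 0.632 are assumptions chosen for illustration, and independent normal observations are assumed.

```python
import random

def correlation(xs, ys):
    """Sample product-moment correlation coefficient r."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def null_tail_probability(r_observed, n, trials=20000, seed=2):
    """Proportion of simulated samples of size n, drawn with true
    correlation zero, whose |r| is at least |r_observed|."""
    rng = random.Random(seed)
    count = 0
    for _ in range(trials):
        xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
        ys = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if abs(correlation(xs, ys)) >= abs(r_observed):
            count += 1
    return count / trials

# Hypothetical observed correlation in a sample of 10 pairs.
tail = null_tail_probability(r_observed=0.632, n=10)
print(f"two-sided tail probability under the null hypothesis: {tail:.3f}")
```

For samples of 10 the value 0.632 lies close to the conventional 5 per cent point, so the printed proportion should be near 0.05. A large tail probability would show only that the data are compatible with ρ = 0, not that ρ = 0 is true.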

It is evident that Fisher had begun to think about 
his general theory of estimation before 1922; apart 
from his advocacy of maximum likelihood in 1912, 
the notion of sufficiency had also arisen in the 
special case of the root-mean-square deviation as
an estimate of the true standard deviation σ in the
case of a normal, or Gaussian, sample. Nevertheless, 
the general theory was first systematically devel- 
oped in the two papers cited, and it included a dis- 
cussion of the concept of consistency and a heu- 
ristic derivation of the asymptotic properties of 
maximum likelihood estimates in large samples 
[see ESTIMATION, article on POINT ESTIMATION]. 
It also included a crystallization of the concept of 
information on a parameter θ in the formula


    I = E\left\{ \left( \frac{\partial L}{\partial \theta} \right)^{2} \right\},


where L = log p(S | θ) is the logarithm of the prob-
ability of the sample S when the parameter has true
value θ. The importance of this work lay in (1)
examining the actual sampling properties of maxi- 
mum-likelihood estimates, particularly in large 
samples (the method of maximum likelihood goes 
back quite a long way, at least as it is analogous 
to maximum a posteriori probability by Bayes’ 
inverse-probability theorem on the assumption of a 
uniform a priori distribution); and (2) emphasiz- 
ing that a sample provides, in some appropriate 
statistical sense, a definite amount of information 
on a parameter. Fisher’s concept of information, 
preceding Shannon’s, which was introduced in 
quite a different context (see Shannon 1948), was 
especially appropriate for large samples because of 
the possible ordering of normally distributed esti- 
mates in terms of their variances (squared standard 
deviations). [See INFORMATION THEORY.] Fisher
justified his concept more generally in the use of 
small samples by thinking in terms of many such 
samples; it is curious that he missed the exact 
inequality relating the information function and 
the variance of an unbiased estimate known as the 
Cramér-Rao inequality. In any case, however, the
arbitrariness of the variance and unbiasedness re- 
mains; and in small samples Fisher introduced the 
general notion of a sufficient statistic, which, by
rendering conditional distributions of any other
sample quantities independent of the unknown 
parameter, exhausted the “information” in the 
sample [see SUFFICIENCY]. 
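
The link between information and the large-sample variance of a maximum-likelihood estimate can be sketched numerically. The Python example below is an illustration under stated assumptions, not part of Fisher's papers: it takes a single Bernoulli trial with success probability θ, computes the expected squared derivative of the log probability exactly, and compares the reciprocal of the sample information with the simulated variance of the sample proportion. The function names, the value θ = 0.3, and the sample size of 200 are hypothetical choices.

```python
import random

def bernoulli_score(x, theta):
    """Derivative of log p(x | theta) with respect to theta for one trial."""
    return x / theta - (1 - x) / (1 - theta)

def fisher_information_bernoulli(theta):
    """Exact expected squared score for one Bernoulli observation."""
    # The expectation runs over x = 0 and x = 1 with their probabilities.
    return sum(p * bernoulli_score(x, theta) ** 2
               for x, p in ((0, 1 - theta), (1, theta)))

def mle_variance_by_simulation(theta, n, trials, seed=0):
    """Empirical variance of the maximum-likelihood estimate,
    which for this model is the sample proportion."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        successes = sum(1 for _ in range(n) if rng.random() < theta)
        estimates.append(successes / n)
    mean = sum(estimates) / trials
    return sum((e - mean) ** 2 for e in estimates) / (trials - 1)

theta, n = 0.3, 200
info_per_obs = fisher_information_bernoulli(theta)  # 1 / (theta * (1 - theta))
bound = 1.0 / (n * info_per_obs)                    # reciprocal of sample information
simulated = mle_variance_by_simulation(theta, n, trials=5000)
print(f"information per observation: {info_per_obs:.4f}")
print(f"1 / (n I) = {bound:.6f}; simulated variance = {simulated:.6f}")
```

In this model the sample proportion is unbiased and its variance equals the reciprocal of the sample information exactly, so the two printed figures should agree up to simulation error.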

The next remarkable contribution in this gen- 
eral area came with Fisher's brief paper entitled 
“Inverse Probability” (1930b). Fisher had always 
been derisory of the estimates and inferences re- 
sulting from the Bayes inverse-probability approach. 
He felt that a unique and more objective system 
of inferences should be possible in fields where sta- 
tistical probabilities operate and noted that an 
exact sampling distribution involving a sample 
quantity or statistic T and an unknown parameter 
θ leads (under appropriate regularity and monoto-
nicity properties) to the feasibility of assigning
what he termed a “fiducial interval,” with a known
fiducial probability that the parameter θ is con-
tained in the interval. The interpretation accepted 
at the time, and implied by Fisher’s own wording 
in this paper, was that this interval, which is neces- 
sarily a function of the statistic T, is in conse- 
quence a random interval and that fiducial proba- 
bility is a statistical probability with the usual 
frequency connotation. Referring to the case of a 
true correlation coefficient ρ, he said, “We know that
if we take a number of samples of 4, from the
same or different populations, and for each calcu-
late the fiducial 5 per cent value for ρ, then in
5 per cent of cases, the true value of ρ will be less
than the value we found” ([1930b] 1950, pp. 22, 
535). With some restrictions (for example, to suf- 
ficient statistics) it was thus apparently identical 
with the theory of confidence intervals developed 
about the same time by Neyman (1937). [See
ESTIMATION, article on CONFIDENCE INTERVALS AND
REGIONS; FIDUCIAL INFERENCE.]
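
The frequency interpretation described above can be checked by simulation. The following Python sketch is an assumption-laden illustration: in place of the correlation coefficient it uses the simpler case of a normal mean with known standard deviation, and the function names and numerical settings are hypothetical. For each sample it computes a 5 per cent limit and counts how often the true mean falls below it.

```python
import random
from statistics import NormalDist

def five_per_cent_limit(sample, sigma, level=0.05):
    """Limit that the true mean should fall below in a proportion
    `level` of repeated samples (normal data, known sigma)."""
    z = NormalDist().inv_cdf(1 - level)
    mean = sum(sample) / len(sample)
    return mean - z * sigma / len(sample) ** 0.5

def proportion_below_limit(mu, sigma, n, trials, seed=1):
    """Repeat the sampling and record how often mu is below the limit."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        if mu < five_per_cent_limit(sample, sigma):
            hits += 1
    return hits / trials

rate = proportion_below_limit(mu=10.0, sigma=2.0, n=4, trials=20000)
print(f"true mean fell below the limit in {rate:.2%} of samples")
```

The printed proportion should be close to 5 per cent whatever true mean is chosen, which is the sense in which the statement holds "from the same or different populations."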

On inductive inference questions, Fisher often 
did not make it clear what he was claiming; but it 
should be stressed that regardless of what his in- 
terpretation was or of its relevance to the problem 
at hand, it still was formulated in terms of an 
assumed statistical framework. Nevertheless, its 
formal bypassing of Bayes’ theorem was a masterly 
stroke which received attention outside statistical 
circles (cf. Eddington’s remark, “We can never be 
sure of particular inferences; therefore we should 
aim at a system of inference that will give con- 
clusions of which in the long run not more than a 
stated proportion, say 1/q, will be wrong” [1935a] 
1960, p. 126). 

Later, Fisher attempted to extend fiducial theory 
to more than one parameter. His first paper 
(1935b) discussing this extension took as one ex- 
ample the problem of inferring the difference in
population means of two samples coming from
normal populations with different variances. The 
difficulty here (which Fisher may not have real- 
ized at the time, since he himself never examined 
in detail the logical relations of sufficient statistics 
in the case of more than one parameter) is that the
effect of the unknown variance ratio cannot be 
segregated in the absence of a “sufficient” quantity 
for it that does not involve unwanted parameters, 
such as the individual population means. In reject- 
ing a criticism along these lines by the present 
author (Bartlett 1936; for further details see, for 
example, Bartlett 1965), Fisher explicitly gave up 
the orthodox frequency interpretation for fiducial 
probability which he appeared to have assumed 
earlier. He and others attempted to formulate a 
theory for several parameters that would be both 
unique and self-consistent, but this has yet to be 
achieved in any generality, and to many this search 
is misguided in that it does not eliminate from 
fiducial theory the arbitrariness that Fisher had so 
strongly criticized in the Bayes approach. 

A rather different and somewhat more technical 
estimation problem that Fisher solved in 1928 is 
the derivation of sample statistics that are unbiased 
estimates of the corresponding population quan- 
tities and of the sampling moments of these sta- 
tistics. The population quantities are the cumu- 
lants or semi-invariants first introduced by Thiele 
(1903), and Fisher’s combinatorial rules for ob- 
taining the appropriate sample statistics and their 
own cumulants constituted a striking example of 
Fisher's intuitive mathematical powers. Another 
paper published in 1934 is worth noting as an 
original and independent contribution to the theory 
of games developed about the same time by von 
Neumann and Morgenstern. [See GAME THEORY, 
article on THEORETICAL ASPECTS.] 

Fisher’s work on the design of experiments is so 
important logically as well as practically that it may 
be regarded as one of his most fundamental con- 
tributions to the science of statistical inference. It 
is, however, convenient to consider it in the second 
general area of statistical methodology and tech- 
nique, in conjunction with analysis of variance. 
Fisher perceived the simultaneous simplicity and 
efficiency of balanced and orthogonal experimental 
designs in agriculture. Replication of the same 
treatment in different plots is essential if any sta- 
tistical assessment of error is to be made, and 
formally equal numbers of plots per treatment are 
desirable. However, simplification in the statistical 
analysis is illusory if the analysis is not valid. 
When observations are collected haphazardly, the 
most sensible assumptions about statistical varia-
bility have to be made. In controlled experiments
there is the opportunity for deliberately introduc- 
ing randomness into the design so that system- 
atic variation can be separated from purely random 
error. This is the first vital point Fisher made; the 
second naturally accompanied it. With the analysis 
geared to the design, all variation not attributable 
to the treatments does not have to inflate the error. 
With equal numbers of plots per treatment, each 
complete replication can be contained in a separate 
block, and only variability among plots in the same 
block is a source of error; variability between dif- 
ferent blocks can be automatically removed in the 
analysis as irrelevant. The third point arose from 
treatment combinations, such as different fertilizer 
ingredients. For example, if nitrogenous fertilizer 
(N) and phosphate (P) are to be tested, the recom- 
mended set of treatment combinations is 


Control (no fertilizer), N, P, NP, 


where NP denotes the treatment consisting of both 
the ingredients N and P (each in the same amount 
as when given alone). This design maintains sim- 
plicity and may improve efficiency, for if phosphate 
has no effect, or even if its effect is purely additive, 
the plots are balanced for nitrogen and doubled in 
number, and similarly for phosphate. Moreover, if 
both ingredients do not act additively, an inter- 
action term can be defined that measures the dif- 
ference in effect of N (or P) in the presence and 
absence of the other ingredient. Such a definition, 
although to some extent arbitrary, completes the 
specification of the treatment effects; and the whole 
technique of factorial experimentation typified by 
the above example is of the utmost importance both 
in principle and in practice. As Fisher put it: 


The modifications possible to any complicated appa- 
ratus, machine, or industrial process must always be 
considered as potentially interacting with one another, 
and must be judged by the probable effects of such 
interactions. If they have to be treated one at a time 
this is not because to do so is an ideal scientific pro- 
cedure, but because to test them simultaneously would 
sometimes be too troublesome, or too costly. In many 
instances . . . this impression is greatly exaggerated. 
(1935a, p. 97) 
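
The treatment effects in the two-factor example above can be written out as a short Python sketch. The yields are hypothetical plot means, and the function and variable names are assumptions introduced for illustration. The interaction is taken here as the plain difference between the effect of N with and without P; some texts halve this quantity.

```python
def factorial_effects(control, n, p, np_):
    """Main effects and interaction from the four treatment means
    of a 2 x 2 factorial (Control, N, P, NP)."""
    effect_n_without_p = n - control
    effect_n_with_p = np_ - p
    # Each main effect averages the effect over both levels of the other factor.
    main_n = (effect_n_without_p + effect_n_with_p) / 2
    main_p = ((p - control) + (np_ - n)) / 2
    # Interaction: change in the effect of N caused by the presence of P.
    interaction = effect_n_with_p - effect_n_without_p
    return {"N": main_n, "P": main_p, "NxP": interaction}

# Hypothetical mean yields for the four treatment combinations.
yields = {"control": 20.0, "N": 26.0, "P": 23.0, "NP": 33.0}
effects = factorial_effects(yields["control"], yields["N"],
                            yields["P"], yields["NP"])
print(effects)  # {'N': 8.0, 'P': 5.0, 'NxP': 4.0}
```

If the two ingredients acted additively, the interaction would be zero and all four plots would contribute to the estimate of each main effect.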


A further device that naturally arose in factorial 
designs was that of confounding, by which some 
of the higher-order interaction effects in designs 
with three or more factors are assumed to be un- 
important and are deliberately arranged to coin- 
cide in the analysis with particular block contrasts. 
This enables the number of plots per block to be 
smaller and the accuracy of the remaining treat- 
ment effects thereby to be increased.

To a large extent the practical value of these
experimental methods was not dependent on the 
statistical analysis, but the simplicity and clarity 
of the analysis greatly contributed to the world- 
wide popularity of these designs. This analysis was 
in principle classical least-squares theory, but the 
orthogonality of the design rendered the estimation 
problem trivial, and the concomitant assessment of 
error was systematized by the technique of “analysis 
of variance.” Basically, this technique is a break- 
down of the total sum of squares of the observa- 
tions into relevant additive parts containing any 
systematic terms ascribable to treatments, blocks, 
and so on. Once the technique was established and 
the appropriate tests of significance were available 
(on the assumption of normality and of homo- 
geneity of error variance) from Fisher's derivation 
and tabulation of the “variance-ratio” distribution, 
it could handle more complicated least-squares 
problems, such as more complex and even non- 
orthogonal experimental designs or linear and 
curvilinear regression problems. One useful ex- 
tension was the adjustment of observed experi- 
mental quantities, such as final agricultural yield, 
by some observed quantity measured prior to the 
application of treatments. This technique was re- 
ferred to as analysis of covariance, although this 
last term seems more appropriate for the simul- 
taneous analysis of two or more variables—that is, 
the technique of multivariate analysis. 
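
The breakdown of the total sum of squares can be sketched for the simplest case, a randomized block design with one plot per treatment in each block. The Python example below uses hypothetical yields, and its function and key names are assumptions made for illustration. It computes the block, treatment, and error sums of squares separately, so that their addition to the total can be seen.

```python
def randomized_block_anova(table):
    """table[b][t] is the yield of treatment t in block b."""
    blocks, treatments = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (blocks * treatments)
    block_means = [sum(row) / treatments for row in table]
    treat_means = [sum(row[t] for row in table) / blocks
                   for t in range(treatments)]
    ss_total = sum((y - grand) ** 2 for row in table for y in row)
    ss_blocks = treatments * sum((m - grand) ** 2 for m in block_means)
    ss_treat = blocks * sum((m - grand) ** 2 for m in treat_means)
    # Error: what remains of each plot after removing block and treatment means.
    ss_error = sum((table[b][t] - block_means[b] - treat_means[t] + grand) ** 2
                   for b in range(blocks) for t in range(treatments))
    df_treat = treatments - 1
    df_error = (blocks - 1) * (treatments - 1)
    variance_ratio = (ss_treat / df_treat) / (ss_error / df_error)
    return {"total": ss_total, "blocks": ss_blocks, "treatments": ss_treat,
            "error": ss_error, "variance_ratio": variance_ratio}

# Hypothetical yields: three blocks (rows) by four treatments (columns).
plots = [
    [20.0, 26.0, 23.0, 33.0],
    [18.0, 25.0, 22.0, 30.0],
    [24.0, 29.0, 27.0, 35.0],
]
anova = randomized_block_anova(plots)
for name, value in anova.items():
    print(f"{name:>14}: {value:.3f}")
```

With these figures the total of 286 splits into 51.5 for blocks, 232 for treatments, and 2.5 for error. The variance ratio would then be referred to the tabulated distribution, on the assumptions of normality and homogeneous error variance noted above.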

Fisher was active in the development of multi- 
variate analysis. Earlier workers had of course 
encountered multivariate problems in various con- 
texts, and Fisher had followed up his geometrical 
distribution of the correlation coefficient with a 
derivation of the distribution of the multiple corre- 
lation coefficient (1928b), again brilliantly using 
his geometrical approach. The problem exercising 
Harold Hotelling in the United States and P. C. 
Mahalanobis in India, as well as Fisher, was the 
efficient use of several correlated variables for dis- 
criminatory and regression problems. Fisher’s name 
is particularly associated with the concept of the 
discriminant function, some function of the varia- 
bles that will efficiently distinguish from the meas- 
urements of these variables for a single individual 
whether he came from one or another of two 
different populations. 
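
A linear discriminant function for two variables can be sketched as follows. This Python example assumes the usual linear form, in which the coefficients are obtained from the pooled within-group covariance matrix and the difference of the group means. The measurements, the function names, and the midpoint rule for the dividing score are hypothetical choices made for illustration.

```python
def mean_vector(rows):
    """Mean of each of the two variables."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(2)]

def pooled_covariance(group_a, group_b):
    """Pooled within-group covariance matrix for two variables."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for rows in (group_a, group_b):
        m = mean_vector(rows)
        for r in rows:
            d = [r[0] - m[0], r[1] - m[1]]
            for i in range(2):
                for j in range(2):
                    s[i][j] += d[i] * d[j]
    dof = len(group_a) + len(group_b) - 2
    return [[v / dof for v in row] for row in s]

def discriminant(group_a, group_b):
    """Coefficients w solving S w = mean_a - mean_b, and a dividing
    score taken at the midpoint between the two group means."""
    s = pooled_covariance(group_a, group_b)
    ma, mb = mean_vector(group_a), mean_vector(group_b)
    d = [ma[0] - mb[0], ma[1] - mb[1]]
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    # Explicit inverse of the 2 x 2 matrix S applied to d.
    w = [(s[1][1] * d[0] - s[0][1] * d[1]) / det,
         (s[0][0] * d[1] - s[1][0] * d[0]) / det]
    mid = [(ma[0] + mb[0]) / 2, (ma[1] + mb[1]) / 2]
    threshold = w[0] * mid[0] + w[1] * mid[1]
    return w, threshold

def classify(x, w, threshold):
    """Assign an individual to population A or B from its score."""
    score = w[0] * x[0] + w[1] * x[1]
    return "A" if score > threshold else "B"

# Hypothetical measurements of two variables on individuals from two populations.
group_a = [(5.1, 3.5), (4.9, 3.0), (5.4, 3.9), (5.0, 3.4)]
group_b = [(6.4, 3.2), (6.9, 3.1), (6.5, 2.8), (7.0, 3.2)]
w, threshold = discriminant(group_a, group_b)
print(classify((5.2, 3.6), w, threshold))
print(classify((6.8, 3.0), w, threshold))
```

The midpoint rule treats the two populations symmetrically; unequal prior frequencies or costs of misclassification would shift the dividing score.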

Contributions to genetics. Fisher's work in ge- 
netics was comparable in importance to his purely 
statistical contributions and equally reflected his
originality and independence of outlook. In the first
decade of the twentieth century, Mendelian genet-
ics was still a new subject, and its quantitative con-
sequences were not yet properly appreciated. It was
in dispute whether they were consistent with Dar-
win’s theory of evolution by natural selection or
even with the observed inheritance of metrical 
characters. Fisher took the second and lesser prob- 
lem first, and in his 1918 paper he gave a pene- 
trating theoretical analysis of correlation, breaking 
it down into nongenetic effects, additive gene ac- 
tion, and further complications, such as genic inter- 
action and dominance. He was thus able to demon- 
strate the consistency of Mendelian principles with 
the observed correlations between sibs or between 
parents and offspring. Then, in his book on the 
genetic theory of natural selection, he tackled the 
larger problem. He pointed out that the atomistic 
character of gene segregation (in contrast to Dar- 
win’s hypothetical “blending” theory of heredity) 
is essential for maintaining variability, which in 
turn is the basis of the process of natural selection. 
He clarified the theoretical role of mutations, show- 
ing that mutations provide a reservoir from which 
eventually only favorable ones can survive. He 
emphasized the possibility of the selection of modi- 
fier genes, for example, in rendering the action of 
many mutant genes recessive by modifying the 
heterozygote phenotypically toward the natural wild 
type. The relative importance of this theory of the 
evolution of dominance was queried, for example, 
by Sewall Wright, among others, but this did not 
prevent it from being a relevant thesis that stimu- 
lated further research. The effect of modifier genes 
was also shown to be important in the phenomenon 
of mimicry. 

At the Galton laboratory, Fisher’s work in human 
genetics included linkage studies and the initiation 
of serology research on the human blood groups. 
An exciting moment in the work on serology came 
when G. L. Taylor and R. R. Race studied the 
Rhesus blood groups. Fisher was able to predict, 
from the experimental results to date, the effective 
triple structure of the gene, and hence two more 
anti-sera and one more allele, which were soon suc- 
cessfully traced. Such predictions may be compared 
with Fisher’s earlier theoretical predictions in the 
theory of evolution, for example, on the evolution 
of dominance. They made possible the maintenance 
of a healthy link with experimental and observa- 
tional work, which was often initiated or encour- 
aged by Fisher himself. It was in this spirit that he 
collaborated for several years with E. B. Ford in 
sampling studies of natural populations. 

In retrospect, Fisher’s wholehearted immersion 
in his own research problems, fundamental and 
broad as these were, did cause him to ignore some 
important theoretical trends. His neglect of purely 
mathematical probability, which had been rigor- 
ously formulated by A. N. Kolmogorov (1933), 
seemed to extend to developments in the theory of
random or stochastic processes, although these
were very relevant to some of his own problems in 
evolutionary genetics. In England, A. G. McKen- 
drick had published some brilliant papers on sto- 
chastic processes in medicine, and G. U. Yule on 
the analysis of time series, but Fisher never ap- 
peared to appreciate this work; in particular, his 
own papers on the statistical analysis of data 
recorded in time sometimes showed a rather over- 
rigid adherence to classical and unduly narrow 
assumptions. In appraising Fisher’s work, one must 
consider, in addition to these general boundaries 
that demarcate it, his occasional specific errors 
and, more importantly, his temperamental bias in 
controversy. Fisher’s scientific achievements are, 
however, so varied and so penetrating that such 
lapses cannot dim their luster or reduce his ranking 
as one of the great scientists of this century. 


M. S. BARTLETT 


[For the historical context of Fisher’s work, see
STATISTICS, article on THE HISTORY OF STATISTICAL
METHOD; and the biographies of GALTON; GOSSET;
PEARSON; YULE. For discussion of the subsequent
development of Fisher’s ideas, see ESTIMATION;
EXPERIMENTAL DESIGN; FIDUCIAL INFERENCE;
HYPOTHESIS TESTING; LINEAR HYPOTHESES, article on
ANALYSIS OF VARIANCE; MULTIVARIATE ANALYSIS.]


Alice Cunningham Fletcher (1838-1923), Amer-
ican ethnologist, was born in Cuba during a tem- 
porary residence of her American parents on the 
island. She traveled widely in her early years and 
eventually settled in the Boston area, where she 
studied American archeology and ethnology at the 
Peabody Museum at Harvard University. It was 
out of intense concern for the welfare and rights
of the American Indian that she began her scien- 
tific studies of them. Although she was eventually 
to gain great and well-merited recognition as a 
scholar, the recommendations in behalf of Ameri- 
can Indians that she made in the name of anthro- 
pological authority suffered from an uncritical 
commitment to benevolent philosophies of the 
nineteenth century. The policy she advocated was 
based on the assumption that it was both inevitable 
and desirable for the Indians to be assimilated into 
white society and for their tribal culture to be 
rapidly destroyed. 

In 1870 she held a post with the Women’s Na- 
tional Indian Association as administrator of funds 
from which small loans were made to Indians so 
that they might buy their own lands and build 
homes. Then, in 1879, she met Thomas Henry 
Tibbles—frontiersman, minister, journalist, and 
ardent worker in the cause of Indian rights—and 
asked that she be allowed to take part in his work 
on the frontier. Tibbles found it difficult to take 
seriously the plans of the small, seemingly delicate 
woman, already 40 years old and a product of city 
life, but Alice Fletcher prevailed. On September 1, 
1881, she arrived in Omaha, Nebraska, to begin 
work on field studies that were to become standard 
references in North American ethnology. But many 
anthropologists familiar with Fletcher’s work are 
not aware of the fact that this field work permitted 
her to initiate the large-scale implementation of 
her well-intentioned but misguided notions and 
thus to affect the course of Indian affairs through- 
out the country up to the present day. 

In Omaha, she was met by Tibbles and Bright 
Eyes La Flesche, an educated Omaha Indian 
woman associated with Tibbles’ work and soon to 
become his wife. Tibbles’ autobiography, Buckskin 
and Blanket Days (1957), reflects both his admira-
tion for Alice Fletcher’s determination and exasper- 
ation at her cheerful obliviousness to what wiser 
minds had heretofore recognized as realities in 
dealing with Indians and Indian agents. He never 
identified her by name but referred to her only as 
“High Flyer,” the name bestowed on her by his 
Omaha helper, Wajapa. After introductions to 
members of the Omaha tribe, the two women, 
Tibbles, and Wajapa set off on a journey to 
the Winnebago reservation, the Ponca settlement
north of the Niobrara River (which was made up of
removed and demoralized remnants of the tribe
whose members had returned to their old haunts),
and finally to the Rosebud Sioux reservation in
South Dakota. Tibbles had not underestimated the 
hardships, bad weather, and inconveniences that
could be expected on the trip, but Alice Fletcher
withstood them all equably. 

She was impressed by Tibbles, Wajapa, and 
Bright Eyes and was grateful to them. Although they 
could converse in the several languages encoun- 
tered on this first journey and could communicate 
with equal ease with her, a lady newly arrived from 
Boston, it apparently never occurred to her that the 
two Indians were highly atypical and that for all 
of Tibbles’ vast knowledge of Indian life, his Indian 
policy derived entirely from the white Christian 
values he shared with her. 

Alice Fletcher had long since decided that what 
was best for the Indians was a Congressional act 
whereby every adult Omaha would be allotted 80 
acres, to remain tax-free and to be held in trust by 
the government for a period of 25 years. After that 
time those who were judged competent would be 
granted fee patents and control over their holdings. 
Reservation lands left over after the distribution of 
allotments were to be sold for the benefit of the 
tribe to finance development of their farms. It is 
doubtful that the Omaha understood the nature of 
Fletcher's solution for their problems; they only 
feared dispossession, as their neighbors the Ponca 
had experienced, and were in no position to ques- 
tion her narrow interpretation of their plea, “Make 
my home secure” (La Flesche 1923). 

Alice Fletcher had influential friends in Wash- 
ington, and the act was quickly pushed through 
legislative procedures and passed August 7, 1882. 
She was appointed special agent to oversee the 
surveying and allotment. In subsequent years she 
performed similar work among the Winnebago and 
Nez Percé, who, along with many other tribes, 
were allotted land under the Dawes Severalty Act 
of 1887, patterned after the Omaha Act. 

In 1910, after an absence of many years, she 
returned to the Omaha for a brief visit. She ob- 
served, “. . . the act has not been altogether evil 
nor has it been wholly good for the people” (Fletch- 
er & La Flesche 1911, p. 640). The Omaha were 
certainly far better off materially and were more 
cheerful than they had been when she first knew 
them, and it is small wonder that she viewed their
future with optimism. She saw the tendency to 
lease rather than work the land as nothing more 
than an unfortunate phase of adjustment to land 
ownership, although it was really an indication of 
a major trend. The few well-run farms that she 
thought would be models proved to be short-lived 
exceptions. Finally, those items which she noted as 
“quaint survivals of old customs under a new 
guise” were really instances of adaptations, of cul- 
tural persistence, and of a desperate clinging to
tribal identity that characterize Omaha life to the
present day. 

Margaret Mead spent the summer of 1930 with 
the Omaha (Mead 1932, passim). In keeping with 
her desire to conceal the identity of the group, 
which she called the “Antlers,” Mead did not men- 
tion Fletcher by name but paid respectful tribute 
in her statement that the traditional life and his- 
tory had been so completely recorded for the Ant- 
lers that she chose to work with this group because 
she would be able to devote her attention entirely 
to modern conditions. Mead did not disclose that 
the same scholar who had provided such excellent 
ethnographic data was also the “well-intentioned 
lady of missionary leanings” whose benevolent 
efforts resulted in the social and economic chaos 
Mead observed. But Mead thought that philan- 
thropic Americans of that time could not have 
been expected to understand that individual land- 
ownership by Indians would not solve the economic 
problems that the loss of the buffaloes had created. 
In any case, the inadequacy of the Omaha plan 
was demonstrated wherever allotment took place. 
Between the time that fee patents were issued and 
the beginning of the administration of John Collier 
as commissioner of Indian affairs in 1933, two-
thirds of allotted Indian lands had passed into 
white ownership (Fey & McNickle 1959, p. 78). 
Fletcher's “strong paper” on individual allotments 
was far weaker than the tribal treaty relating to 
undivided lands. 

In matters unrelated to economics her outlook 
was scientifically objective. Tibbles always disap- 
proved strongly of her interest in Indian dancing 
and singing and of her desire to observe and re- 
cord ceremonies: to him such matters were part 
of Indian “savagery” to be rigorously discouraged 
and repressed. Alice Fletcher, however, was im- 
pressed by the artistry, poetic sophistication, and 
scientific significance of these customs in contrast 
to the Indians’ material life, and she made im- 
portant pioneering studies of Indian music and 
ceremony. Her obviously sincere concern for Indian 
welfare and her untiring efforts to record cere- 
monial life accurately won the esteem of her in- 
formants. She collected highly secret data with 
relative ease and maintained generally excellent 
relations with both colleagues and informants by a 
combination of humor, patience, total unselfish- 
ness, and lack of vindictiveness when crossed. Her 
small stature and seeming frailness apparently dis- 
armed people who did not appreciate her strength 
and determination. During one field trip she suf- 
fered a siege of “inflammatory rheumatism,” which 
was to leave her crippled for the rest of her life. 
As she lay in pain, her Indian friends gathered
daily to cheer and console her with songs. As soon 
as she was able, and with an amazing display of 
musical aptitude and memory, she carefully tran- 
scribed a large number of the songs (Hough 1923, 
p. 255). 

Fletcher's theoretical interests were influenced 
by nineteenth-century evolutionary concepts but 
went beyond mere collection of facts or broad 
ascription of traits to given stages of development. 
Her investigations of Omaha religion carried her 
into comparative work among other Siouan speak- 
ers of the Plains, and as she suspected the exist- 
ence of yet older layers of cosmology and ritual 
from other sources, she turned to the Caddo as 
the likely carriers of more ancient forms. She had 
observed the Pawnee hako ceremony in the 1880s, 
and as her awareness of problems of diffusion in- 
creased, she returned as an experienced field 
worker to the Pawnee in the 1890s and recorded 
the ceremony (Fletcher 1904). 

Her scientific endeavors won rapid recognition 
in the anthropological profession. From 1883 on- 
ward, her name appears often in the Proceedings 
of the American Association for the Advancement 
of Science (AAAS) in connection with papers deal- 
ing with aspects of Plains culture. In 1896 she was 
vice president of the AAAS. In 1884/1885 she was 
in charge of Indian exhibits at the New Orleans 
industrial exposition. It pained her that Indians 
were represented by crude artifacts, and she wrote 
a letter, which was published and circulated among 
Indian groups, stressing the importance of educa- 
tion if Indian people were to take their rightful 
place in creating technological wonders like those 
of the white man (Fletcher 1885, pp. 1-4). 

The Chicago world’s fair of 1893 included a 
special congress of anthropology, and she took an 
active part in the proceedings along with the major 
anthropologists of her day, such as Frederic Ward 
Putnam, Franz Boas, and Zelia Nuttall. 

After 1890, when she made her home in Wash- 
ington, D.C., Alice Fletcher was a member of 
the Women’s Anthropological Society (WAS) and 
served as its president in 1895. The WAS was 
founded in 1885 by Matilda Stevenson, famous for 
her studies of Pueblo groups, and was disbanded 
in 1899, when the group was received as a whole 
into the heretofore all-male Anthropological Society 
of Washington. Alice Fletcher was elected presi- 
dent of that society in 1903 and of the American 
Folklore Society in 1906. She worked closely with 
Frederic Ward Putnam in his research on the Ser- 
pent Mound in Ohio, and in 1900 they engaged 
in a fund-raising campaign to buy the site as a 
historic landmark, turning the deed over to the
Ohio Archeological and Historical Society. 

By the time of her death, at the age of 85, Alice 
Fletcher had become something of a living legend in an-
thropology, active to the end in promoting scholarly 
enterprises and good works. 

NANCY OESTREICH LURIE


[Other relevant material may be found in INDIANS,
NORTH AMERICAN.]


Herbert John Fleure, British geographer and
anthropologist, has done much to further the view 
that the study of man and his societies should not 
be divorced from the study of their environments 
and that an evolutionary approach to the condition 
of man and his cultures in the various regions of 
the world is essential to the discipline of human 
geography. He vigorously championed this sub- 
ject at a time when it was not generally recognized 
as a university discipline in Britain. 

Fleure grew up in Guernsey. In 1897, when he 
was 20, a scholarship to the University College of 
Wales, Aberystwyth, enabled him to take courses 
in the natural sciences; he specialized in marine 
zoology. He spent the year 1903-1904 studying at 
the University of Zurich, where Rudolf Martin 
stimulated his interest in anthropology. The fol- 
lowing year he returned to Wales and received the 
degree of D.Sc. He then began to teach courses in
zoology, geology, and botany at Aberystwyth and in 
1907 was elected to a newly established lectureship 
in geography. His intellectual development was in- 
fluenced by Darwinism and by the works of such 
German earth scientists as Eduard Suess and Ferdi- 
nand von Richthofen. 

Fleure became professor of zoology in 1910 but 
continued to teach geography, and in 1917 his 
persistent advocacy of this subject was rewarded 
by his appointment to the endowed (Gregynog) 
chair of geography and anthropology. In the same 
year he became honorary secretary of the Geo- 
graphical Association and editor of its journal (now 
Geography), and through these agencies he worked 
strenuously to advance the cause of human geog- 
raphy in education. From 1930 until his retire- 
ment in 1944, Fleure was professor of geography 
in the University of Manchester. He has been presi- 
dent of three sections of the British Association for 
the Advancement of Science, and of many learned 
societies. In 1936 he was elected fellow of the 
Royal Society. 

Fleure quickly established his reputation as an 
anthropologist with an anthropometric survey of 
Wales conducted in 1916 (Fleure & James 1916). 
He first outlined a scheme of world regions, de- 
fined by the quality of life within them, in an 
article published in 1917. Suspicious of the con- 
cept of “natural regions,” he saw, long before pre- 
historic archeology and paleoecology had estab- 
lished the antiquity and extent of man’s alteration 
of his surroundings, that human societies can 
fashion their own environments and that “environ-
ment” is a term of cultural appraisal. In his Human
Geography in Western Europe (1918) he first pre- 
sented his conception of that discipline. The first 
of the ten volumes of The Corridors of Time, writ- 
ten in collaboration with Harold Peake, appeared in 
1927, and the last volume appeared in 1956. From 
the first volume (Apes and Men) to the last (Times 
and Places), this series presents Fleure’s mature 
views on the development of human societies in 
the major regions of the world since prehistoric 
times. A balanced judgment on such issues as dif- 
fusion and independent development springs from 
a broad training in the sciences of man. 

Fleure has stressed the significance of culture 
contacts in all periods of human history as leading 
to the questioning of routine, the cross-fertilization 
of ideas, objective thought, and the release of in- 
novative effort. Always guarded in his conclusions, 
Fleure has been chary of general laws in human 
geography, where the likelihood prevails that dif- 
ferent responses will arise in different regions at 
different times. 

In addition to the works mentioned, Fleure has 
contributed extensively to encyclopedias and also 
to journals of geography, anthropology, sociology, 
folklore, and archeology. 

E. ESTYN EVANS 


[See also GEOGRAPHY; REGION.] 


WORKS BY FLEURE 

1916 FLEURE, H. J.; and JAMES, T. C. Geographical Dis-
tribution of Anthropological Types in Wales. Journal
of the Royal Anthropological Institute of Great Britain
and Ireland 46:35-153.

(1917) 1919 Human Regions. Scottish Geographical 
Magazine 35:94-105. → A revision of the paper “Ré-
gions humaines” published in May 1917 in Annales
de géographie. 

1918 Human Geography in Western Europe: A Study in 
Appreciation. London: Williams & Norgate. 

1922 The Peoples of Europe. Oxford Univ. Press. 

1923 The Races of England and Wales: A Survey of Re- 
cent Research. London: Benn. 

(1927) 1929 The Races of Mankind. Garden City, N.Y.: 
Doubleday. 

1927-1956 PEAKE, HAROLD; and FLEURE, H. J. The Corri-
dors of Time. 10 vols. New Haven: Yale Univ. Press.
→ Volume 1: Apes and Men, 1927. Volume 2: Hun-
ters and Artists, 1927. Volume 3: Peasants and Pot-
ters, 1927. Volume 4: Priests and Kings, 1927. Vol-
ume 5: The Steppe and the Sown, 1928. Volume 6:
The Way of the Sea, 1929. Volume 7: Merchant Ven-
turers in Bronze, 1931. Volume 8: The Horse and the
Sword, 1933. Volume 9: The Law and the Prophets,
1936. Volume 10: Times and Places, 1956.

1951 A Natural History of Man in Britain: Conceived as 
a Study of Changing Relations Between Men and En-
vironments. London: Collins.




FLOURENS, PIERRE 


Pierre Flourens (1794-1867) was a French 
neurophysiologist who came from Languedoc. 
His intellectual aptitude became apparent to his 
family when he was still very young, and his edu- 
cation was accelerated. By the time he was 15 he 
was a student at the celebrated faculty of medicine 
at Montpellier; he received his medical degree 
before he was 20. 

Flourens was drawn to Paris, where he became 
a protégé of Georges Cuvier, who introduced him 
to the intellectual elite of the city. In 1828 he 
became deputy professor to Cuvier at the Collège
de France; in 1830, professor at the Musée d’His-
toire Naturelle; and in 1855, professor at the
Collège de France. He also became a member of
the Académie des Sciences in 1828 and was made 
perpetual secretary of that institution in 1833. In 
1840 Flourens was elected to the Académie Fran- 
çaise. 

While still a young experimenter, Pierre Flourens 
devised a rigorous technique for investigating the 
brain. Before him, researchers had plunged a tro- 
car into the brain through a trephined aperture 
that permitted no precise localization and that 
concealed pressures and hemorrhages from the 
investigator. Flourens carefully uncovered the brain 
and its meninges, avoiding damage to the blood 
vessels, and precisely excised the area to be studied. 

In 1822 Flourens began to study the effects of 
extirpation of successive layers of the cerebellum. 
He found that at an intermediate point of extirpa- 
tion a loss of stability and motor skill appears, as 
if the animal, although still able to walk, run, fly, 
or swim, has lost its “balance wheel.” This was the 
original discovery of the function of coordination. 

Flourens also studied the effects of eliminating 
one of the semicircular canals of the ear. The 
result was “an impetuous movement of the head,” 
which shifted violently in the spatial plane of the 
canal in question. This again was an original dis- 
covery, opening up the further study of the semi- 
circular canals as distinct from the cochlear ap- 
paratus. 

Flourens also experimented on the brain stem. 
In the lower portion, in the medulla oblongata, he 
precisely delimited the small point that is the
seat of the “vital node,” which if punctured causes 
instantaneous death by suddenly terminating respi- 
ration. In the upper portion a selective extirpation, 
anterior to the corpora quadrigemina, gave the 
first experimental indication of a relay in the cen- 
tral nervous apparatus of sight. 
Finally, he made experiments on the cerebral
hemispheres. This work was particularly difficult 
and time consuming. The terrain had been little 
explored by anatomists, and little was known of 
the connections and relays. Flourens’ work was 
divided into two series: in one he performed com- 
plete ablations of the cerebral hemispheres; in the 
other, limited ablations by regions, sometimes by 
progressive removal. 

The series of total ablations produced the follow- 
ing results: loss of intelligence and judgment; loss 
of all voluntary initiative; and relative diminution 
of instinctive activities, with retention of automatic 
motility and locomotion. 

The series of limited ablations produced losses 
of the same kind, greater or lesser according to the 
region and the individual case; but these losses 
were only partial and proved to be capable of atten- 
uation or compensation. 

Above all, Flourens reached the fundamental 
conclusion that the degree of disorder in any par- 
ticular faculty seemed to depend only on the quan- 
tity of cerebral tissue removed, regardless of its 
location. Generalizing from this fact, Flourens be- 
came opposed in principle to all attempts to estab- 
lish “cerebral localizations”; and his own doctrine 
of the “homogeneity” of the brain became the focus 
of a historic dispute. 

There is, to be sure, a case against Flourens, but 
it should be reviewed with detachment. In the first 
place, Flourens, like others at that time, failed to 
recognize the significance of certain paralytic ef- 
fects because he did not experiment on primates. 
Next, it should be noted that the first indisputable 
demonstration of cerebral localization did not occur 
until 1870, after Flourens had died. Last, Flourens’ 
conclusions with regard to the “homogeneity” of 
the brain and the theory that effects are related to 
the magnitude of the ablation were again based 
on experiments limited to lower mammals. In- 
deed, even so great an investigator as Lashley was 
similarly misled to conclude from his famous ex-
periments on rats that there are “mass-action”
effects. 

Lately, with the refinement of physiological con- 
ceptions, the problem of homogeneity has again 
been raised, and some researchers have concluded 
that in the neocortex the highest functions of
nervous integration go beyond “analytical” regional 
localization. These researchers explicitly evoke the 
memory of Flourens. 

AUGUSTE TOURNAY 
FOLKLORE


Folklore means folk learning; it comprehends 
all knowledge that is transmitted by word of mouth 
and all crafts and techniques that are learned by 
imitation or example, as well as the products of 
these crafts. Objects which are mass produced and 
knowledge which is acquired through books or 
formal education are a part of culture, which in- 
cludes the total body of learning, but they are not 
folklore. In nonliterate societies folklore is virtu- 
ally identical with culture, but in literate industrial- 
ized societies it is only a fragment of culture. 
Anthropologists and humanists have defined folk- 
lore differently, but their definitions are in funda- 
mental agreement in excluding all learning that is 
transmitted by writing. 

Folklore includes folk art, folk crafts, folk tools, 
folk costume, folk custom, folk belief, folk medi- 
cine, folk recipes, folk music, folk dance, folk 
games, folk gestures, and folk speech, as well as 
those verbal forms of expression which have been
called folk literature but which are better described
as verbal art. Verbal art, which includes such forms 
as folktales, legends, myths, proverbs, riddles, and 
poetry, has been the primary concern of folklorists 
from both the humanities and the social sciences 
since the beginnings of folklore as a field of study, 
and it is with this principal segment of folklore 
that this article is concerned. 

European interest in folklore goes back at least 
to the sixteenth century and the age of exploration, 
but the modern study of folklore is generally con- 
sidered to date from the early years of the nine- 
teenth century, when the Grimm brothers began 
collecting German folktales in the field. The term 
folklore was first introduced into English in 1846 
by William John Thoms, who urged that accounts
of “the manners, customs, observances, supersti- 
tions, ballads, proverbs, &c., of the olden time” be 
recorded in Britain for future study and for com- 
parison with the materials which were being re- 
corded in Germany by the Grimm brothers and 
other scholars. 


Forms of verbal art 


A variety of forms or genres of verbal art have 
been distinguished by folklorists, but neither the 
categories nor the terminology has been standard- 
ized. The following categories have proved useful. 

Prose narrative. Myths, legends, and folktales 
are three important kinds of prose narrative or 
“tale,” which is one of the most widespread forms 
of verbal art. The differences between myths, 
legends, and folktales are hotly disputed, but dis- 
tinctions similar to those made here are recognized 
in some nonliterate societies and have long been 
employed by students of European folklore. 

Myths. In the society in which they are told, 
myths are considered to be truthful accounts of 
what happened in the past. They are taught to be 
believed, and they can be cited as authority in 
answer to ignorance, doubt, or disbelief. Myths are 
the embodiment of dogma; they are usually sacred, 
and they are often associated with theology and 
ritual. Their characters are usually not human 
beings, but they often have human attributes. 
Myths account for the origin of the world, of man- 
kind, of death; for characteristics of birds and 
animals; or for features of the landscape. They 
may recount the activities of the deities, their vic- 
tories and defeats, their friendships and enmities, 
their love affairs, and their family relationships. 
They may “explain” details of ceremonial parapher- 
nalia or ritual, or why taboos must be observed. 

Legends. Like myths, legends are regarded as 
true by the narrator and his audience, but they are 
usually secular rather than sacred. Their principal
characters are human, and they concern a period 
less remote than that of myths. They tell of migra- 
tions, wars and victories, deeds of past chiefs and 
kings, and succession in ruling dynasties. They 
include local tales of buried treasure, ghosts, 
fairies, and saints. Legends correspond to Sagen in 
German and traditions populaires in French. 

Folktales. Prose narratives that are regarded 
as fiction are called folktales. They usually recount
the adventures of animals or humans, but ogres 
and even deities may appear in them. A variety of 
subtypes can be distinguished, including drolls or 
noodles, trickster tales, tall tales, dilemma tales, 
formulistic tales, and moral tales or fables. Folk- 
tales are known as Märchen in German and as 
contes populaires in French. They have been known 
as fairy tales in English, but this is inappropriate 
both because fairies seldom appear in folktales and 
because narratives about fairies are usually re- 
garded as true. Some folklorists use the term 
Märchen in English while employing “folktale” to 
include all three categories, but this is unnecessary 
when “prose narrative” better serves this purpose. 

The distinction here between truth and fiction 
refers only to the beliefs of those who tell and hear 
these tales, and not to our beliefs, to historical or 
scientific fact, or to any ultimate judgment of truth 
and falsehood. In diffusing from one society to 
another, a myth or legend may be accepted without 
being believed, thus becoming a folktale in the 
borrowing society. Occasionally the reverse may 
also happen. In a period of rapid cultural change 
an entire belief system and its mythology may be 
discredited. Even in cultural isolation there may 
be some skeptics who do not accept the traditional 
system of belief. Nevertheless, it is important to 
know what the majority in a society believes to be 
true at any given point of time, for people act upon 
what they believe to be true. 

Aphorisms. Proverbs, maxims, and similar 
terse, sententious sayings can be grouped together. 
Again usage varies, but in this article proverbs are 
distinguished from “proverbial phrases” or meta- 
phorical comparisons and from maxims or mottoes 
like “Honesty is the best policy,” which can be 
applied only in the literal sense. Proverbs have a 
deeper meaning, one which can be understood only 
through the analysis of the social situations to 
which they are appropriate. While prose narratives 
are world-wide in their distribution, proverbs are 
primarily an Old World genre, important through- 
out Europe, Asia, and Africa. Proverbs have been 
reported from the Americas and from Oceania, but 
not in large numbers and not with sufficient docu-
mentation to determine whether they are in fact
proverbs or some other form of aphorism. 

Riddles. Riddles differ from proverbs in that 
they require an answer, but they may also be con- 
cisely stated. African riddles are usually phrased 
as short declarative sentences, rather than as ques- 
tions, so that they resemble proverbs in form and 
so that initially it may be difficult for an outsider 
to recognize the implicit question to which he is 
expected to provide the correct answer. Riddles 
vary considerably in their form of statement; and 
many European riddles take the form of rhyming 
poems, as in the case of Humpty Dumpty. Riddles 
again are primarily an Old World genre, although 
some have been recorded in other parts of the 
world. 

Poetry, tongue-twisters, and verbal formulas. 
Poetry is widespread, at least in the form of song 
texts, but has received far less attention than the 
forms of verbal art mentioned thus far. Tongue- 
twisters have been recorded in various parts of the 
world but have been little studied. Incantations, 
invocations, passwords, greetings, and other verbal 
formulas have also been neglected by folklorists, 
although they appear in linguistic studies and in 
ethnographic descriptions of ritual and etiquette. 
Like tongue-twisters and some song texts, verbal 
formulas are often obscure in meaning and both 
difficult and almost pointless to translate. Compre- 
hension is often less important than correct recita- 
tion, and verbatim accuracy may be essential to 
their religious, magical, or social effectiveness. In 
view of this it is not surprising that these forms 
have received less attention than prose narratives, 
proverbs, and riddles, in all of which both com- 
prehension and communication are involved. 

These forms of verbal art are not watertight 
compartments, as the case of rhyming riddles sug- 
gests. Dilemma tales, which are widespread in 
Africa, leave the solution up to the audience. Often 
classified with riddles, they are clearly a borderline 
form; they seem usually to call for argumentation 
rather than a correct answer. Trickster tales may 
be either myths or folktales; and ballads are nar- 
ratives in songs. Some tales incorporate songs in 
the development of the plot, and others end by 
quoting a proverb to summarize a moral. The social 
significance of verbal art is stressed in the follow- 
ing section, but its aesthetic attributes are also im- 
portant. Their study provides a common meeting 
ground for the humanities and the social sciences 
that is rarely if ever equaled in other data on 
human behavior. Language imposes limitations on 
the artist in folklore as the medium of expression 
does in the graphic and plastic arts, music, or the 
dance. In the case of verbal art the medium of
expression is the spoken word, and verbal art is
subject to the limitations of the phonetics, gram- 
mar, and vocabulary of the language. For this 
reason linguists are best qualified to consider the 
question of style, which can be defined as what the 
artist is able to achieve within the limitations of his 
techniques and media. Linguists have, in fact, con- 
tributed several important discussions of this sub- 
ject. More often they have recorded prose narratives 
as a convenient means of collecting texts for lin- 
guistic analysis, with the result that many of the 
most accurately recorded and carefully transcribed 
collections of verbal art are to be found in lin- 
guistic studies. 


Some functions of verbal art 


Amusement is an obvious function of folktales, 
riddles, and tongue-twisters, but verbal art has 
other, more important functions. 

Education. In nonliterate societies verbal art 
plays a major and often explicitly recognized role 
in education. It is important to learn myths and 
legends because they contain information that is 
believed to be true. Proverbs are often character- 
ized as the distilled wisdom of past generations. 
Learning the values of a culture as they are ex- 
pressed in proverbs is similarly considered impor-
tant in its own right; without some command of
proverbs, individuals in many African societies 
cannot effectively fulfill their roles as adults. 
Whereas African proverbs are considered to be the 
province of adults, riddles are for children. Riddles 
teach the characteristics of plants, animals, and 
other things in nature, as well as some features of 
technology, material culture, and social structure. 
Even folktales that are regarded as fictitious are 
recognized by Africans as important in the educa- 
tion of children, because so many of them are 
moral tales. The importance of verbal art in edu- 
cation has been noted in nonliterate societies 
throughout the world. Verbal art provides a 
medium for the transmission of knowledge, values, 
and attitudes from one generation to another and 
thus contributes to the continuity of culture. 

Social control. Verbal art helps to maintain 
conformity to cultural values and accepted pat- 
terns of behavior. It is widely used to express social 
approval and disapproval, to apply social pressure, 
and to exercise social control. Proverbs, songs of 
ridicule, and even riddles and folktales may be 
used to criticize those who deviate from the ac- 
cepted norms and social conventions. On the other 
hand, proverbs, praise names, and praise songs 
give recognition and reward to those who conform. 
African proverbs are especially important in this 
respect. When action is contemplated that may
lead to social friction, open hostilities, or direct
punishment by society, proverbs can be used to 
express warning, defiance, or derision of a rival 
or enemy. They may also express advice, counsel, 
or warning to a friend. 

Because of the high regard in which they are 
held, and because they are considered especially 
appropriate to adult life, African proverbs are 
highly effective instruments of social control. Be- 
cause they express the morals and ethics of a so- 
ciety, they are convenient standards for appraising 
behavior in terms of the approved norms. And be- 
cause they are pungently, sententiously, and wit- 
tily stated, they are ideally suited for commenting 
on the behavior of others. 

Social authority. Verbal art serves to validate 
social institutions and religious rituals. Malinowski 
showed how, in the Trobriand Islands of the Pa- 
cific, myth provides a warrant and a charter for 
magic, ceremony, ritual, and social structure. 
Myths can be cited as authority on questions of 
religious belief and ritual procedure, and to justify 
rights to land, fishing grounds, social position, 
tribute, or political authority. This important func- 
tion can be seen in many societies, but it is not 
confined to myth. When dissatisfaction with or 
skepticism of an accepted pattern is expressed, or 
when doubts about it arise, there is usually a myth 
or legend to validate it; however, a moral folktale 
or a proverb may serve the same purpose. 

Sociopsychological function. Verbal art pro- 
vides a psychological release from the restrictions 
imposed on the individual by society. Ever since 
Greek myths were recorded in writing, it has been 
recognized that characters in myths do things that 
are regarded as shocking, sinful, and even criminal 
in daily life. Myths and folktales provide an oppor- 
tunity for people to talk about kinds of behavior 
that society prohibits them from indulging in, and 
about kinds of success that they can scarcely hope 
to achieve themselves. 

This can be illustrated in our own society by 
“dirty jokes” and, formerly, mother-in-law jokes. 
In the case of the latter, their popularity has de- 
clined as fewer young couples live with their in- 
laws and the family is increasingly fragmented. 
In American Indian societies that practice mother- 
in-law avoidance, the trickster violates his own 
mother-in-law. Tales of polygynous marriages fea- 
ture far more prominently among the monogamous 
Pueblo groups than in societies where polygyny is 
accepted. In West Africa the Ashanti and Daho- 
means recognize the value of being able to criti- 
cize and laugh at authority and other matters, both 
secular and sacred, in a way they cannot do 
normally, through folktales and songs. Even rid-
dles and tongue-twisters, although considered the
province of children, sometimes involve sexual ref- 
erences that are considered off-color by adults. 

The familiar theme of rags to riches, the wide- 
spread tale of the magic flight or of the seven- 
league boots, and tales of resurrection after death 
are also meaningful in terms of psychological re- 
lease. Viewed in this light, verbal art reveals man’s 
attempt to escape in fantasy from the restrictions 
of his geographic environment, from his own bio- 
logical limitations as a member of the human 
species, and from the repressions imposed upon 
him by society, whether these result from social 
and economic inequalities, from sexual taboos, or 
from the Ashanti taboo against laughing at a 
person afflicted with yaws. 

Cultural continuity. When the functions of 
education, control, authority, and release are 
viewed together, it can be seen that verbal art has 
the broader function of maintaining the continuity 
and stability of culture. It is used to inculcate cus- 
toms and ethical standards in the young, to reward 
the adult with praise when he conforms or to pun- 
ish him with criticism or ridicule when he deviates, 
to provide him with rationalizations when he ques- 
tions the institutions and conventions of his society, 
and at the same time to provide him with a com- 
pensatory escape from the hardships and injustices 
of everyday life. It operates to ensure cultural con- 
tinuity from generation to generation through its 
role in education, by emphasizing conformity to 
the accepted cultural norms, and through the vali- 
dation of social and religious institutions. By pro- 
viding a psychological escape from the institutions 
and norms which it sanctions and enforces and by 
providing discontented individuals an opportunity 
to talk about forbidden forms of behavior, rather 
than practicing them, verbal art preserves the es- 
tablished customs and institutions from direct 
attack and change. 

Political uses of verbal arts. Despite the fact 
that verbal art serves to continue and stabilize cul- 
ture, it has also been used for the purposes of 
political propaganda and social change. During the 
emergence of the independent nations of Africa, 
myths, legends, and song texts have been used to 
promote ethnic unity, regionalism, nationalism, 
anticolonialism, and pan-Africanism. 

The Yoruba creation myth provided the basis for 
the Society of the Children of Oduduwa (Egbe 
Omo Oduduwa), established for the purpose of 
uniting the people of the Western Region of Ni- 
geria, and Bakongo mythology has been used for 
political purposes by Joseph Kasavubu’s Abako 
party in the Congo. In Katanga a common myth 
has been cited by politicians to show why the 
Lunda, Luena, and Chokwe should unite in sup-
port of Tshombe and his Conakat party, or, con-
versely, why the Luena and Chokwe should join
in opposition to the Lunda. African praise songs 
have been adapted to honor new political leaders; 
songs of allusion have been used to criticize colo- 
nial officials and rival politicians; and the hymn 
“God Bless Africa,” first sung publicly in 1899, has 
been adopted as a closing anthem by the African 
National Congress and as the unofficial national 
anthem of several African countries, and it was 
sung at the 1958 Conference of Independent Afri- 
can States in Accra, Ghana. 

In the Soviet Union in 1936, the Communist 
party “discovered” the effectiveness of folklore as 
a political weapon. Scholars reversed earlier theo- 
ries that regarded folklore as a product of the 
upper classes filtering down to the lower classes, 
and claimed that the working people were the cre- 
ators of folklore. 

Folktales and songs have been used to advance 
the theme of the class struggle in the Soviet 
Union, in China, in Cuba, and elsewhere. The 
Nazis used Teutonic mythology to promote their 
idea of a master race. In earlier times Krylov used 
fables to needle the Russian aristocracy, and the 
first Chinese edition of Aesop's Fables was sup- 
pressed by officials who recognized their satire and 
suspected that they had been invented locally. 
Further study of the political uses of verbal art, 
which have only recently been recognized by folk- 
lorists, will undoubtedly reveal other instances of 
this kind. 

The recording of verbal art is a well-recognized 
field technique in linguistic and anthropological 
research, and it can be helpful in studying political 
attitudes. Verbal art provides useful materials for 
school curricula and important data for the study 
of law, values, psychology, and history in non-
literate societies. Writing and industrialization
have undermined its social significance far more 
in urban United States than in most literate socie- 
ties, but they have never destroyed verbal art or 
the other segments of folklore. 

WILLIAM BASCOM 


[Directly related are the entries COMMUNICATION; 
DRAMA; HISTORY, article on ETHNOHISTORY; MUSIC,
article on ETHNOMUSICOLOGY. Other relevant mate- 
rial may be found in CRAFTS; HISTORIOGRAPHY; 
MYTH AND SYMBOL; PRIMITIVE ART.] 
FOLLETT, MARY PARKER


Mary Parker Follett (1868-1933) was a social 
philosopher who attempted to apply the principles 
she deduced to practical problems in business, poli- 
tics, and social work. She was born a member of 
an old Quincy, Massachusetts, family and was 
educated at Thayer Academy, Radcliffe College,
and the Sorbonne. While still a student at Radcliffe
she spent the year 1890-1891 at Newnham Col- 
lege, Cambridge, England, where she formed many 
lifelong friendships. Her intellectual contribution 
to the social sciences is contained in two books, 
The New State (1918) and Creative Experience 
(1924), and a posthumous collection of papers, 
Dynamic Administration, published in 1942 (see 
1926-1930). She died in London. 

Among the earliest to see the importance of the 
group in understanding an individual and his 
thoughts, she rejected schemes which postulate a 
dualism between the individual and society, as well 
as most other forms of causal interaction between 
these two entities, in favor of the notion of inte- 
gration. 

The New State is a plea for a political order 
based on an interlocking hierarchy of membership 
groups, beginning at the neighborhood level. The 
unit of the community is the socialized individual, 
that is to say, the individual who, through associa- 
tional membership, has become a fully participat- 
ing member of society. This is the new individu- 
alism of the new state. In the ideal democracy, 
therefore, integration of the individual personality 
and the society is so complete that no conflict, 
either psychological or social, is conceivable. 
“Democracy does not register various opinions; it 
is an attempt to create unity” (1918, p. 209). 

In Creative Experience Follett discussed the pre- 
vention and elimination of the failure of integra- 
tion, that is, of social conflict. She was particularly 
interested in problems of social change and con- 
flict in small-scale social systems, such as factories, 
wage boards, regulating commissions, community 
centers, and neighborhoods. 

The concept of circular response, developed in 
Creative Experience and in later essays and lec- 
tures, is her principal contribution to the analysis 
of failures of integration. Circular response rests 
upon the theory that the unit of social analysis is 
the pattern of relations between actors, conceived 
as a single situation produced by a union of their 
interests (1924, p. 188). No response by an actor 
is wholly predictable for he must continually mod- 
ify his behavior to adjust to the expected responses 
of others, who constitute his environment. This 
modulation of both the activity and the sentiments 
of the actors in an environment constitutes circular 
response. As Follett put it: “The most fundamental 
thought about all this is that reaction is always 
reaction to a relating. . . . I never react to you but 
to you-plus-me; or to be more accurate, it is I-plus- 
you reacting to You-plus-me” (1924, p. 62). Some- 
what similar theories have been used by Foote and 
Cottrell (1955), but their chosen unit of analysis
is the episode or event rather than the single social
act.

Thus, the only good solution to social conflict is 
not compromise, not conquest, but integration. 
Integration in this context means the creation of a 
novel solution that penalizes no one and that becomes
the only sure base for progress toward an
ideal democracy. If integration is to be achieved,
various forms of coordination must be introduced 
as fundamental principles of organization: (1) di- 
rect contact between the responsible people who 
have to carry out policies, rather than hierarchical 
control; (2) early contact between these responsi- 
ble people, so that policy may be created by them, 
rather than later meetings that can only try to 
resolve differences between policies already evolved 
by isolated groups; (3) the reciprocal relating of 
all factors in a situation, that is, equal attention to 
all the variables in the social system. 

Coordination in these various forms is a continu- 
ing process, since in any complex social environ- 
ment there exist many points of creativity, and 
established policies can never be executed as de- 
signed but must constantly be reformed in conso- 
nance with basic goals. 

The sources of Follett’s ideas are to be found in 
the thinking of her own time: she borrowed much 
from Cooley’s concept of the primary group, which 
became her ideal social setting, and from G. H. 
Mead’s analysis of the “I” and the “me.” Her work 
also reflects the psychological approach to politics 
of such English political philosophers as Graham 
Wallas as well as the emphasis on purpose or will 
or thought in contemporary idealist political think- 
ing. Follett did not appreciate the role of institu- 
tional structures, bureaucracy, or force. She firmly 
rejected Durkheim's proposition that social facts 
may be conceived of as “things,” and her approach 
to the concept of the state was unsophisticated. She 
never mentioned the existence of legitimate power 
or the prevalence of legitimized and idealized 
peace that has its source in bloody conquest. 

Her early papers were in political analysis, but 
she later found that she could more conveniently 
study the operation of political influences in indus- 
trial systems. Her positions in the Department of 
Vocational Guidance of the Boston school system 
and on the Massachusetts Minimum Wage Board 
gave her access to basic data with which to test 
her generalizations. She never presented her data 
in the form of tables, preferring to communicate 
by generalization and illustrative story. 

Professional sociologists have tended to ignore 
Follett’s writings, although they contain many interesting
and sensible statements on the subjects
of primary group analysis and the creative process 
in society. Nor are there many references to her 
work in writings on industrial relations or political 
science. Curiously, the pluralistic concept of the 
state is occasionally attributed to her by political 
scientists (for example, de Grazia 1952), although 
she had been highly critical of that concept in The 
New State: her emphasis on integrative behavior 
is not compatible with the concept of the state as 
an amalgam of discrete political groups. 


JOHN MOGEY


[See also CONFLICT, article on POLITICAL ASPECTS;
GENERAL WILL; POLITICAL PROCESS; REPRESENTATION,
article on THEORY; and the biographies of
COOLEY; MEAD; WALLAS.]


I. WORLD PROBLEMS
II. CONSUMPTION PATTERNS


Nevin S. Scrimshaw 
Yehudi A. Cohen 


I 
WORLD PROBLEMS 


Low per capita food production and high rates 
of population growth in underdeveloped areas 
cause food shortages in many less developed coun- 
tries, particularly in tropical and semitropical 
regions. Even though 60 to 80 per cent of the 
people in these countries are engaged in farming, 
their productivity is so low that it does not meet 
the needs of the population. By contrast, in some 
industrialized countries less than 8 per cent of the
population is engaged in an agricultural industry
that produces vast surpluses. Although these sur- 
pluses help to meet the needs of many other parts 
of the world, malnutrition is widespread and per- 
sistent in the underdeveloped areas and is re- 
sponsible for much of the high mortality in these 
areas, whether by itself or in combination with 
infections of various types. 

Factors limiting adequate food production are 
primarily social and economic rather than physical. 
The lack of knowledge and the illiteracy of the 
rural population complicate attempts to increase 
food production as well as to control population 
in underdeveloped countries. Long-standing cus- 
toms, limited agricultural training activities, and 
inadequate storage and distribution facilities help 
to perpetuate low agricultural production in these 
areas. Lack of the tools of scientific agriculture 
and scarcity of money or credit for their purchase 
are major additional factors. Moreover, this in- 
adequacy of the food supply is part of a vicious 
circle that keeps productivity low: malnourished 
populations are more vulnerable to disease and less 
capable of sustained work than are well-nourished 
populations. 

Continued increases in food production can be 
anticipated in most of the less developed countries 
although in many areas they will not be large 
enough to maintain adequate per capita food sup- 
plies. The additional food necessary to give at 
least a subsistence ration to most persons is likely 
to continue to come from the food surpluses of the 
industrialized countries and to be augmented by 
the exploitation of new protein sources.

The over-all world food supply is failing to keep 
pace with population growth but is not yet limit- 
ing the world’s explosive population increase (Food 
and Agriculture Organization . . . 1966). 

Food and history. The replacement of exclu- 
sive dependence upon hunting, fishing, and gather- 
ing by the beginnings of agriculture was the first 
great step in human development. The rate and 
scope of social evolution have depended to a major 
extent on the development of more effective means 
of obtaining food. In primitive cultures, man’s 
struggle for food consumes most of his time, 
thought, and energy. The creation of a surplus of 
food over and above what is needed to live leads 
to successive refinements in the subdivision of 
labor, which in turn make possible social and 
technical advances and thus the production of 
even greater quantities of food. 

For millennia the food supply was a major 
factor limiting the growth of human populations 
and determining their density in any particular
area. It is only in the last century that improvements
in agricultural production have become
sufficiently widespread to remove food as the
limiting factor in most population growth. Combined
with improved control and prevention
of infectious and other diseases, this has resulted
in an exponential increase in numbers of people.

Pressure for good agricultural land for the pro- 
duction of food has been a major factor in the 
turbulent warfare of the historical record. Crop 
failures have resulted in population losses through 
death and emigration, which have impoverished 
and impeded the social and economic development 
of populations and even whole countries. For ex- 
ample, Ireland has never recovered from the high 
losses from starvation and emigration as a direct 
consequence of the potato famine of the 1840s; 
the political history of the United States and Canada 
has perhaps been equally changed by hundreds of 
thousands of Irish immigrants. Another example 
of the importance of food problems is found in the 
major political and economic consequences of the 
failure of agricultural production in the communist 
countries.


Contemporary problems 


Paradoxically, the principal nutrition problem 
of the industrialized countries today is one of over- 
eating, with a consequent increase in obesity, car- 
diovascular disease, diabetes, and hypertension. 
[See OBESITY.]

In contrast, nutritional difficulties in underde- 
veloped countries are similar to those experienced 
in the industrialized countries 50 to 100 years ago. 
Children under five years of age and, to a lesser 
extent, pregnant and nursing mothers are most 
affected.

Common deficiencies. The most common se- 
vere nutritional deficiencies are those of protein 
and vitamin A. The best indicator of the prevalence 
of protein deficiency (“kwashiorkor”) is the mor- 
tality rate for children one to five years of age, 
which is commonly 20 to 50 times higher in un- 
derdeveloped countries than in areas like western 
Europe, the United States, Australia, New Zealand, 
and Japan. Infant mortality rates (deaths of chil- 
dren under one year of age) are two to four times 
higher in most less developed countries than in 
those that are industrialized. Early weaning with 
improper substitutes and inadequate supplementa- 
tion of breast milk are important factors in high 
mortality rates for infants who survive the danger- 
ous first months of life. 

Vitamin A deficiency is also most common
among preschool children; it causes severe eye
lesions which often result in blindness. In Indo- 
nesia and other countries of southeast Asia, deaths 
from secondary infection are particularly common 
in children with vitamin A deficiency. This defi- 
ciency could be readily prevented by the green and 
yellow vegetables that are, or could be, widely 
available in most countries. 

Another common problem—marasmus, or par- 
tial starvation—is seen most often among infants 
who have been prematurely weaned and fed watery 
gruels that are deficient in both calories and pro- 
tein. A form of acute thiamine deficiency (infantile 
beriberi) is a cause of death among nursing infants
in some southeast Asian countries, where a polished
rice diet results in thiamine (vitamin B₁)
deficiency in mothers. Beriberi also still occurs 
among many adults in these areas. Pellagra, caused 
by inadequate niacin and tryptophan intake, is 
seen in those populations of Africa that subsist on 
maize as the principal staple, but it is seldom seen 
in the maize-eating populations of Latin America, 
where additional niacin is provided by both beans 
and coffee. 

Women experiencing repeated cycles of preg- 
nancy and lactation are likely to develop iron de- 
ficiency anemia, loss of bone calcium, and reduced 
lean body mass. Malnutrition in other adults is 
less common, except in times of famine, although 
individuals unable to obtain work, or too old or 
sick to work, may be seriously undernourished 
because they cannot afford to buy adequate food. 
Alcoholism is a common cause of malnutrition in 
both underdeveloped and industrialized countries 
because money is spent on alcohol rather than on 
proper food. 

The cost of malnutrition. The cost of malnu- 
trition to less developed areas is exceedingly high; 
it includes the waste of resources in rearing infants 
who die before they can become useful citizens 
and the reduced working capacity of malnourished 
adults. From a quarter to a third or more of the 
children die before they reach school age, largely 
from infections that would not be fatal to a well- 
nourished child or from clinical malnutrition pre- 
cipitated by a prior episode of acute infectious 
disease (Scrimshaw 1966). Nearly all children 
among the less privileged populations of under- 
developed countries show retarded growth and de- 
velopment at the time they reach school age; and 
although they are rarely seriously malnourished 
during school years, they do not make up for the 
deficit acquired during preschool years. Recent 
evidence suggests that the retardation in physical 
development in infancy is paralleled by impaired
mental development, which is probably permanent.
This means that the future development of a coun- 
try is compromised by serious malnutrition in 
young children. Moreover, attempts to provide 
adequate medical care are complicated and made 
more costly by both the greater amount and the 
longer duration of illness of both malnourished 
children and adults. 

An unfortunate aspect of malnutrition in adults 
is that the workers so adapt themselves to malnu- 
trition by reduced vitality and activity that they 
scarcely realize they are underfed. The final result 
is lethargy, lack of drive, and loss of initiative. For 
both industrial and farm workers it leads to ab- 
senteeism because of sickness and to higher acci- 
dent rates because of early fatigue. In general, the 
countries of the world with the lowest per capita 
consumption of food are also those where the effi- 
ciency of the workers is lowest. 


Reasons for malnutrition 


Although for the world as a whole the long-term 
trend of per capita food production has been up- 
ward, there has been a decline in per capita food 
production in communist Asia and in India, as well 
as in many parts of Latin America and southeast 
Asia (Food and Agriculture Organization of the 
United Nations 1963). If the high population 
growth rates in the less developed countries in- 
crease as projected, still more of these countries 
will soon be unable to maintain present levels of 
per capita food production. It is important to 
recognize that increased food needs will continue 
to arise not only from a growing population but 
also from greater per capita income. As people in 
the less developed countries improve their financial 
status, much of the increase is spent for food. In 
some countries where food production has not kept 
pace, this situation has led to a substantial rise in 
imports of food, and in others the effect of growth 
in income has been balanced by rising food prices. 
Either result has serious economic and political 
consequences. 

Using food shipments from regions of high per 
capita productivity to meet part of the food needs 
of the increased numbers of people in the less de- 
veloped countries is not a satisfactory solution, 
because it involves considerable problems of trans- 
porting and financing, which are further compli- 
cated by political issues. Ideally, adequate food pro- 
duction should be achieved in each area, unless 
foreign exchange is available for food purchases 
from cash crop or industrial exports. 

Physical waste. Unfortunately, the largest loss 
of food to insects and rodents occurs in those
countries with the greatest food shortages. For
example, in some of the grain storage bins of India, 
the loss to rodents alone is over 30 per cent. Con- 
servative official estimates of total such preventable 
food losses in India indicate that they are greater 
than the present large annual food deficit and con- 
stitute at least 25 per cent of the total grain pro- 
duction; if such losses were prevented, India would 
have food for export. 

In tropical regions of the food-short countries, 
mold causes additional spoilage. In the case of pea- 
nuts, an important source of protein in Africa and 
India, one type of mold (Aspergillus flavus) pro- 
duces a substance called “aflatoxin,” which even 
in trace amounts is highly carcinogenic for experi- 
mental animals. This mold also grows freely on 
cereal grains, sweet potatoes, and legumes, and 
even slightly moldy food is likely to be unfit for 
human consumption. Lack of refrigeration also 
results in spoilage of fruits and vegetables, which 
are a glut on the market when in season and un- 
available or prohibitively expensive the rest of the 
time. 

The great majority of the malnourished popula- 
tions of the world depends on either a cereal staple 
such as rice, wheat, or corn, or starchy roots such 
as manioc or potatoes. The former are more nutri- 
tious, but the latter produce more calories per acre, 
require less labor, and are somewhat easier to store 
without serious loss. 

Cultural waste. It is a mistake to conclude that 
the malnutrition of less developed areas is solely 
due to inadequate food production, lack of storage 
facilities, or even lack of purchasing power. Ig- 
norance of the nutritional needs of young children 
and lack of understanding of the relationship be- 
tween food and health are of equal or greater im- 
portance. The failure to give any food supplement 
to breast-fed infants before 9 to 12 months of age 
in many countries; the use of rice-water, barley- 
water, cornstarch or even sugar-water as a wean- 
ing food; and the variety of taboos surrounding 
the giving of eggs to young children are examples 
of common practices which lead to malnutrition 
and death of children under five years of age, even 
among families whose older children and adults 
do not suffer from malnutrition. In underdeveloped
countries, one often sees a critically malnourished 
child with a woman who is wearing new clothes 
and jewelry and who is totally unaware of the true 
reason her child is dying. 

The various erroneous beliefs surrounding the 
proper feeding of children with infections are often 
the most important factors in the chain of multiple 
causation leading to acute nutritional disease. For
example, in both south India and Central America
children with measles are given water in which a 
small amount of cereal and various local herbs 
has been cooked—a diet grossly deficient in pro- 
tein and calories. Such beliefs also cause the family 
to spend money for charms, ceremonies, and use- 
less proprietary medicines. 

In underdeveloped areas the seemingly irrational 
beliefs regarding food may be quite elaborate. Par- 
ticularly common is the practice of classifying 
foods and prohibiting certain combinations. In 
Peru, for instance, foods are “hot, cold, heavy, or 
light.” A complex of beliefs indicates when and to 
whom a given class of foods can be fed. Certain 
classes of food are considered appropriate or in- 
appropriate, without relation to their nutritive 
value, at times such as pregnancy or during one 
or another illness. Few societies recognize the true 
relationship of foods to health. 


Improving production and conservation 


The yield increases of the industrialized coun- 
tries are due to chemicals, mechanization, good 
seeds and animal breeds, and their effective use. 
Chemicals and machinery could achieve miracles 
in the less developed areas, but they require greater 
capital investment and know-how than are pres- 
ently available. Selective breeding of plants and 
animals could also be extremely effective: types 
of both have been developed which are much more 
productive and sometimes also more resistant to 
disease. For example, virus-resisting potatoes, rust- 
resistant wheat, hybrid corn and, most recently, 
high lysine corn and gossypol-free cottonseed are 
part of the “miracle” of food production in the in- 
dustrialized countries, along with greatly improved 
breeds of poultry, cattle, sheep, and swine. 

Some of these superior varieties can be intro- 
duced directly into less developed countries, but 
most require additional research for their adapta- 
tion to different environmental conditions. Agricul- 
tural research and its application through agricul- 
tural extension have been a major factor in high 
yields in the developed countries. Production of 
most improved varieties requires considerable 
knowledge of scientific agriculture as well as much 
initial capital investment, and the education of 
government officials is also an important factor. 
Poorly informed administrative and policy officials 
often block the efforts of extension workers and 
farmers by ill-considered taxes or import controls, 
unwise emphasis on cash crops, neglect of agri- 
cultural research, indifference or hostility to pro- 
ducer cooperatives, and failure to provide for rural 
credit.

Limiting physical waste. Solution of the food
problems of less developed areas will require ex- 
tensive improvements in the conservation and 
handling of food as well as in its production. Of 
first importance is reducing preventable food losses 
through the use of insecticides and rodenticides, 
construction of rat-proof and dry storage facilities, 
fumigation of grain, and improved distribution by 
using protective packaging materials and more 
rapid transportation. Controlled atmosphere and 
cold storage facilities are useful for some seasonal 
and highly perishable crops, and refrigeration is 
essential for the proper conservation of meat, poul- 
try, and fish. 

Smoking and salting are simple and effective 
means of preserving meat and fish for adult con- 
sumption, but in general they do not provide food 
that is suitable as a major protein source for infants
and young children. Sun-drying is widely used for 
many foods, but this has serious limitations in a 
hot, moist climate, especially when proper storage 
conditions are unavailable. Using artificial heat is 
more costly and involves the same problems of 
storage and packaging. Thermal processing or 
canning represents a big economic step because 
it requires sealable containers as well as higher 
temperatures. In industrialized countries spray- 
drying has been extensively employed for milk and 
a variety of other products that are fluid before 
processing. Obviously, the handling of frozen foods 
in tropical underdeveloped countries in the early 
stages of industrial development poses problems 
that preclude the widespread application of freez- 
ing as a method of food storage. Freeze-drying 
seems too expensive to apply to food staples, but 
the shelf life of certain types of food may be ex- 
tended by low doses of ionizing radiation. Chemical 
preservatives and other additives that have been 
important in the efficient utilization of the food 
supply in modern nations are also needed in under- 
developed countries. Examples are antioxidants to 
slow down the process by which fats become rancid 
and propionates to inhibit molds. 

New food sources. Nutritional requirements 
can be satisfied by an adequate source of calories 
and sufficient quantities of each of the essential 
amino acids, fatty acids, vitamins, and minerals 
in any utilizable form, whether natural or syn- 
thetic. There are many compelling reasons why 
most nutrients will continue to be supplied largely 
from plants and animals for a long time to come, 
but new and exotic sources of food are being 
explored. 

Oilseed meals, such as cottonseed flour, can 
serve as valuable new sources of proteins provided
that they are carefully processed. Furthermore,
much nutritional benefit can be derived from the 
enrichment of certain foods. For example, the 
quality of most protein-containing foods of vege- 
table origin can be improved by combining them 
with foods of complementary essential amino acid 
content or by adding the missing amino acids in 
synthetic form. Other examples include the en- 
richment of salt with potassium iodide for the 
prevention of goiter, a measure which has proved 
as practical and effective in Guatemala and Co- 
lombia as in the United States and Switzerland; 
the prevention of beriberi and pellagra by the addi- 
tion of B-complex vitamins to cereal products; the 
enrichment of margarine and skim milk with 
vitamins A and D so that they are nutritionally 
acceptable substitutes for butter and whole milk, 
respectively; and the addition of the amino acid
lysine to wheat and wheat flour. 

In a number of underdeveloped areas where 
milk cannot be depended upon to furnish needed 
protein, vegetable mixtures that were developed to 
serve this purpose are proving to be commercially 
successful and are bringing about important nutri- 
tional improvements. Furthermore, the world catch 
of marine fish has increased enormously in recent 
years, and processes have been developed for the 
production of a low-cost fish protein concentrate. 
In the meantime, fresh, canned, and smoked fish 
are compensating for the protein inadequacy of 
the diet in Japan and a number of other countries. 
Attempts to use algae, green leaves, and even grass 
as sources of protein have met with little success 
because of poor palatability and excessive costs, 
but this could be changed by technological developments.
Another possible protein source is food
yeast, which can be grown on several inexpensive
carbohydrates, such as molasses and the sulfite liquor
of the paper industry. Other microorganisms can
be produced by using petroleum by-products or 
natural gas as their source of energy. 

Synthetic foods. Palatable synthetic foods sim- 
ulating bacon, hamburger, ham, chicken, fish, and 
scallops have already been developed from soy 
protein isolate. Protein fibers are spun like nylon 
and reconstituted into textured foods that can be 
flavored in any desired manner. At present these 
are as expensive as the foods they simulate, but 
as the number of mouths to feed increases and 
costs of the process are brought down they are 
likely to prove useful. The nutritional needs of 
the body can already be reduced to chemically 
known substances that can be synthesized or extracted
from natural products. To meet the demands
of an enormously increased world population,
the eventual use of wholly synthetic foods
seems likely. Ironically, most of the efforts to 
achieve this are being stimulated by the nutritional 
requirements of man in space rather than by ter- 
restrial food problems. 

Social science research. Much of the malnutri- 
tion in the world today, particularly among young 
children, can be prevented by changes in feeding 
practices. Increased production and purchasing 
power will not alone automatically eliminate mal- 
nutrition; too often the additional money is spent 
on soft drinks, alcohol, or other consumer items 
that will not improve inadequate diets. Nutritional 
education must be a major part of efforts to elimi- 
nate malnutrition. 

There is much that the social scientist can con- 
tribute to solving world food problems. Persons re- 
sponsible for nutrition and health education need 
to understand the reasons for food practices and 
beliefs, yet these have too often been neglected by 
social scientists even when studying a culture in 
detail. Even less research has been done on the 
most appropriate and effective means of inculcating
food habits in various cultures (see Mead 1943;
1964).

An exception was the introduction in Guatemala 
of the low-cost protein-rich vegetable mixture, In- 
caparina, which benefited from preliminary field 
studies by anthropologists. The success of this 
product is attributable in large part to its introduc- 
tion as a variant on the traditional atole rather 
than as a new food product. It will not always be 
possible to use this technique, for new foods often 
will have no local counterpart. The guidance of 
social scientists who have made advance studies 
will be particularly needed for the introduction 
of products that have never before been used for 
human feeding, such as fish protein concentrate 
or bacterial protein. 

There is also need to study changes in food 
habits or attempts to impose new foods, for their 
impact on other aspects of culture. Gifts of un- 
familiar surplus foods may simply be unused or 
they can upset patterns of life that are oriented 
about indigenous staples. Authoritarian methods 
of forcing nutritional standards on a child or a 
community may endanger cooperation in other im- 
portant matters. 

A major obstacle to introducing agricultural 
techniques is the lack of local technological knowl- 
edge and competence. Agricultural extension 
agents and teachers often attempt to induce change 
blindly without understanding the existing beliefs 
and traditions that they are trying to alter. Similarly,
although technology is making rapid progress
in devising practical techniques for family planning,
motivating people to make use of them remains
a problem. In order to make such programs
more effective, there is an urgent need for studies 
of the ways in which the desired changes can be 
achieved. 


The agricultural revolution 


The world is now in the midst of a revolution 
in food production, conservation, and distribution 
that is as far-reaching in its effects on societies as 
the original development of agriculture; it is com- 
parable in significance to the industrial revolution, 
with which it is closely allied. Farm production in 
the industrialized countries has risen spectacularly 
within a few decades, simultaneously with a sharp 
reduction in the numbers of persons engaged in 
farming and the actual amount of land under cul- 
tivation. This has meant food for greatly increased 
numbers of people. Equally important, the workers 
no longer needed in agriculture are freed for the 
production of consumer goods and services to an 
extent beyond any previously seen in history. It is 
this freedom which has made possible modern eco- 
nomic and social development. 

With the need for farm labor decreasing, the 
cities of the industrialized countries have received 
the increase in population. Nevertheless, food pro- 
duction in these countries is rising despite the di- 
minishing farm population, thus providing enough 
food for the growing urban markets. In many of 
the less developed countries, population is outgrow- 
ing available land, and people from the rural areas 
are migrating to the cities in ever larger numbers. 
However, in most such countries farming remains 
largely at subsistence levels, and transportation 
and storage facilities are inadequate to meet the 
demands of the cities for food distribution. 

An efficient agriculture thus becomes a prime 
requisite of economic and social development. Sub- 
sistence agriculture adds nothing to the national 
economy; and until enough farm families con- 
tribute to meeting the food needs of urban families, 
the manpower and capital required for industrial- 
ization will not be available. Once food production 
rises sufficiently to provide a firm basis for eco- 
nomic and social development, the process tends 
to accelerate. Research results accumulate and are 
applied; capital is amassed and invested; machines 
are constructed and put to work. As the process 
continues, more and more people are available, 
educated, and trained for the complex tasks of 
a technically advanced society. Unless the health 
and intelligence of the people are such that they 
can make good use of modern knowledge and technology,
no amount of investment in material things
can ensure satisfactory social and economic de- 
velopment. A good food supply is needed to prevent 
excessive morbidity and mortality from disease as 
well as the poor learning and working capacity that 
result from malnutrition. At present, the growing 
per capita shortage of food in many countries will 
continue to thwart the ambitions and make a 
mockery of the goals of economic planners. 


NEVIN S. SCRIMSHAW 


[Other relevant material may be found in AGRICULTURE;
MORTALITY; POPULATION, article on POPULATION
GROWTH; TECHNICAL ASSISTANCE; and in the
biography of SAUER.]


II
CONSUMPTION PATTERNS 


The consumption of food, like other biologically 
supportive activities, is an aspect of cultural behavior.
In no society are people permitted to eat
everything, everywhere, with everyone, and in all 
situations. Instead, the consumption of food is 
governed by rules and usages which cut across each 
other at different levels of symbolization. These 
symbolizations define the social contexts and group- 
ings within which food—or a particular kind of 
food—is consumed, and prohibit or taboo the con- 
sumption of particular foods. An important di- 
mension of the social contexts regulating the con- 
sumption of food is the principle that patterns of 
consumption and distribution must be examined as 
one. For example, there are some societies in which 
it is believed that a person will die if he eats food 
which he has grown himself, while in others it is 
felt that a person may eat only food which he him- 
self has grown or acquired by purchase or other 
exchange. Between these two extremes are several 
other patterns. 

In all societies the distribution and consumption 
of food is an expression of a variety of social rela- 
tionships: those of social proximity and distance, 
religious-ritual fraternity and status, political super- 
ordination and subordination, bonds within and 
between families, and the like. The definition of 
food, its distribution, and its consumption always 
take place with reference to individuals as occu- 
pants of statuses and categories within institution- 
alized groupings. 

In other words, food is used symbolically to rep- 
resent only some social forms and personal feelings 
in a society, and these are usually among the im- 
portant forms and personal feelings in the group’s 
life. Thus, by noting the specific and bounded social 
contexts—clan, village, in-law relationships, friend- 
ship, and the like—within which food is employed 
symbolically, it is often possible to infer which are 
the important groupings and relationships within 
the society. For example, a taboo on eating the 
totemic animals associated with one’s clan is indicative
of the significance of clans in the organization
of the society’s institutions; it is not the taboo 
per se which suggests the importance of clan rela- 
tionships but the fact that some rule governing the 
consumption of food—in this case, forbidding it— 
is associated with the clan. Why clans usually pro- 
hibit their members from eating the animals associ- 
ated with these groups is a separate question, and 
one about which there is still considerable uncer- 
tainty. 

Among some Melanesian people the rule that 
a man must give part of the harvest of a crop 
to his sister, while his wife receives a similar pres- 
tation from her brother, provides a clue to the im- 
portance of certain matrilineal ties; in societies that 
have a caste system the rule that the members of 
different castes may not eat together indicates the 
importance of formal distance between castes as 
well as the caste organization itself. Correlatively, 
when food ceases to be employed as a vehicle for 
the expression of social sentiments within a group- 
ing, e.g., in a clan, or when proscriptions concern- 
ing the consumption of food are attacked, it can be 
assumed that significant changes are taking place 
in the socioeconomic structure of that society. 

The rules governing the distribution of food 
within a society reflect and reinforce prevailing 
ethical and moral orientations in that society. For 
example, when the government of the United States 
willingly distributes food supplies to poor people in 
other societies but not within American society 
itself, it appears that its dominant values im- 
plicitly tend to define poverty as an indication of 
moral failure, if not as sinful. Hence, the assump- 
tion often appears to be that if there were gratui- 
tous distributions of food to poor Americans, such 
action would be construed as reward or even ap- 
proval of such moral failure. But since it is char- 
acteristic that the criteria pertaining to the distribu- 
tion of food within a social system differ entirely 
from those relating to other groupings or societies, 
American society can make prestations of food to 
people in other societies while refusing (or being 
unable) to do so within its own social boundaries. 

Almost every society defines a few foods as ac- 
ceptable for consumption under some circumstances 
but wholly unacceptable in others. For example, 
foods which are associated with amusement and 
relaxation are usually considered inappropriate to 
ritual or ceremonial occasions. In pluralistic and 
stratified societies, most foods which are grown 
indigenously are eaten by people in all groups; 
however, in almost all such societies, there are a 
few foods and drinks which are not universal or 
which will be consumed by members of different 
groups in different contexts or situations. Thus, for 
example, the same alcoholic beverages in a plural- 
istic society will be drunk under different condi- 
tions and in entirely different places by members 
of different groups. Such definitions both symbolize 
and reinforce the consciousness of separateness 
and distance existing between bounded groups in 
pluralistic and stratified societies. 

For reasons which are not yet completely clear, 
the major transitional crises of the life cycle—the 
rites de passage—are marked in almost all societies 
by ritual or ceremonial distribution and consump- 
tion of food. One possible hypothesis to explain 
these nearly universal customs is that each of the 
three transitional crises (birth, marriage, death) 
initiates a significant alteration in socioeconomic 
relationships and reciprocities and that these are 
symbolically noted in displays, distributions, pres- 
tations, token exchanges, and consumption of food. 
An individual’s birth automatically establishes 
reciprocal rights and duties between him and others 
to whom he is related in a series of interlocking 
social networks; marriage in all societies is a tran- 
sitional ceremony which establishes bonds and rec- 
iprocities between the kin groups of the marrying 
couple; an individual’s death terminates, and re- 
quires readjustments in, reciprocal economic rela- 
tionships. 

In addition to these ritual celebrations through 
distribution and consumption of food, as well as 
other forms of wealth (e.g., bride wealth), a great 
many societies celebrate historical or ceremonial 
events according to calendrical systems. These re- 
current and fixed celebrations are usually governed 
by ritual consumptions of food, as in the American 
Thanksgiving feast or in the custom of many Amer- 
icans to celebrate their Independence Day with 
family picnics. Calendrically regulated religious 
events are similarly celebrated. 

One of the fundamental principles governing the 
organization of social relationships in all societies 
is the division of labor by sex; most societies also 
stipulate divisions of labor according to the addi- 
tional criteria of age, status or ranking, group mem- 
bership, and the like. Although the division of labor 
is most apparent in the production of food, it also 
plays a major role in the consumption of food. 
There is probably considerable, even systematic, 
overlap between rules governing the division of 
productive labor and those governing food con- 
sumption, of which kinship is undoubtedly of prime 
importance; thus far, little work has been done on 
this problem. A possible model for such an analysis 
would be a horticultural society in which the wife 
does most of the productive work, raising food 
which her husband then gives to his sister. 

The distribution and consumption of food within 
the nuclear family also symbolize institutionalized 
relationships. For example, among the Tallensi of 
west Africa there are many expressions of the sep- 
arateness of father and son and of the father’s 
absolute authority. One of these symbolizations is 
the rule that a son may not look into his father’s 
granary during the father’s lifetime. After the 
father’s death, his eldest surviving son is conducted 
ritually by his father’s brothers to peer into the de- 
ceased’s granary. This son thus acquires control 
over the distribution of the food stored in the gran- 
ary, and this prerogative, among others, symbolizes 
and supports his control over the patrilocal ex- 
tended family. 

To cite another illustration, where it is the as- 
signed task of the woman of a household to distrib- 
ute the food within the nuclear family, e.g., by serv- 
ing meals to members of the household, the manner 
in which she distributes the food may symbolize 
relationships to her husband and children at differ- 
ent stages of the family cycle (and to other mem- 
bers of the household, if present). Thus, status re- 
lationships within the family are sharply delineated 
by such rules as whether a woman eats with her 
husband or separately, when and whether a man 
eats with his children, etc. Among the Manus of 
the Admiralty Islands, near New Guinea, the es- 
trangement between husband and wife during the 
first few years of marriage, and the competitive 
relationship in which each stands in relation to the 
kin of the other, is symbolized in part by the shame 
they feel in eating together. The most competi- 
tive relationships among the Manus are between 
brothers-in-law and between sisters-in-law: these 
people may not eat together. 

The special significance of food in the symboli- 
zation of kinship relationships generally can be 
seen in the fact that behavior with respect to food 
in the nuclear family often varies with the degree 
of segregation or division of role relationships; the 
latter is in turn dependent on the degree to which 
the nuclear family is connected to a wider nexus of 
kinship networks. Thus, one of the symbolizations 
of the greatly reduced segregation of role relation- 
ships between husband and wife in modern com- 
plex societies—attendant on the weakening of 
wider kinship matrices—has been the tendency to 
minimize and blur the division of labor with respect 
to the distribution of food within the family. In the 
lower social strata of Western societies, for ex- 
ample, men do not cook or otherwise participate 
in the distribution of food within the family; in 
these strata there is usually a strong connection 
between the nuclear family and wider kinship 
groupings and a marked degree of segregation in 
role relationships between husband and wife. In 
higher social strata in some Western societies, on 
the other hand, there are situations in which men 
do cook and participate in the distribution of food 
within the nuclear family; these situations tend to 
arise as segregation in role relationships between 
spouses becomes weakened as part of the attenua- 
tion of wider kinship relationships. 

Most societies provide for separateness between 
nuclear families. One of the symbolizations of this 
separateness is in the confinement of the prepara- 
tion and consumption of food to the social bound- 
aries of the nuclear family. In most polygynous 
organizations, each wife has her own hearth or 
kitchen; she and her children eat separately, the 
husband either eating with each nuclear family 
serially or eating with his senior wife. While such 
arrangements are usually interpreted by anthropol- 
ogists as being designed to maintain peace among 
co-wives, these patterns of consumption must also 
be understood as representations of the social-struc- 
tural differentiations which most societies make 
between nuclear families. Among some Mexican 
Tarascan groups who possess a stem-family organ- 
ization, in which a son continues to live with his 
father after marrying, the son’s wife and her 
mother-in-law share a common kitchen, and the 
two nuclear families eat jointly. When dissension 
occurs, or in anticipation of the son’s plans to 
found an independent household, his wife will 
sometimes establish her own kitchen, and the two 
families will eat separately. Among the Cheyenne 
Indians of North America, where men were forbid- 
den to speak directly to their mothers-in-law, the 
men were required to provide all the meat by hunt- 
ing; this was cooked in the mother-in-law’s lodge, 
but then her daughters carried the food to their 
respective tepees for eating. The hunting and gath- 
ering Tiwi of Northern Australia are one people who 
do not separate the component nuclear families of 
polygynous households into consumption units. In- 
stead, each polygynous household cooks and eats as 
a unit. This is probably because each household 
head has many more wives, especially elderly ones, 
than he has children; hence, there are few com- 
ponent nuclear families in each household. 

There are four patterns which govern the distri- 
bution of food, and hence the consumption of food; 
three of these are patterns of sharing. Although 
they have a tendency to shade off into one another, 
they are easily identifiable and clearly illustrate the 
basic principle that it is a characteristic of social 
systems to symbolize social relationships by means 
of different patterns of distributing and consuming 
food. These four patterns are (1) recurrent ex- 
change and sharing of food; (2) mutual assistance 
and sharing in times of need; (3) narrowed and 
reluctant sharing; and (4) nonsharing. 

(1) Recurrent exchange and sharing. There are 
several variations on this pattern of sharing; all, 
however, are associated with and symbolize a com- 
bination of factors which tend to give rise to max- 
imal solidarity within the community: social affili- 
ation based on consanguineal kinship; physical 
proximity between households; the prescription or 
careful control of residence by, among other fac- 
tors, rules of residence; the conduct of interper- 
sonal communication in stable and consistently 
functioning primary or face-to-face groupings. In 
brief, the combination of highly integrated kin 
groups, physical proximity, and sedentary life appears
to yield strong feelings of social proximity. 
Recurrent exchange and sharing of food is one of 
the symbolic representations of this solidary state. 

Among most peoples characterized by this kind 
of social organization, there is a constant flow of 
gifts of food from household to household within 
the solidary grouping. Among other societies, whose 
cultures are suffused with religious ceremonial 
(e.g., the Hopi and Tallensi), the recurrent distri- 
bution of food, too, takes place in a religious cere- 
monial context. Among still others (e.g., the Kur- 
tatchi and the Papago), there are unelaborated but 
repeated donations, prestations, and exchanges of 
small token amounts of food among kinsmen. 

Regardless of the variations on the theme, these 
recurrent and repetitive exchanges of food almost 
always take place within the sociological boundaries 
of the solidary community. Also, such recurrent 
exchanges of food are in addition to ceremonialized 
or ritualized feasts. 

(2) Mutual assistance and sharing in times of 
need. The primary difference between type (1) 
and the social organization with which mutual as- 
sistance in times of need is associated is that in the 
latter the ties and alignments of kinship are not 
solidified into corporate kin groups. The bonds of 
kinship, qualitatively speaking, are equally strong 
in both social organizations, but in the present in- 
stance these ties are not elevated to the level of 
exclusive and solidary groupings. 

A degree of social and emotional solidarity arises 
in such a community from the strong and marked 
tendency to affiliate primarily with kinsmen, real 
or fictive. Usually the members of such a commu- 
nity are mostly kinsmen, and sometimes entirely 
kinsmen. An element of fissility, and a corresponding
degree of social distance, is introduced into 
such a social organization by a variety of factors. 
First, physical mobility and change in community 
membership are usually permitted, thus resulting 
in the simultaneous presence of kinsmen and non- 
kinsmen within the group. Where nonkinsmen join 
the community, they are sometimes addressed and 
treated as kinsmen; preference for consanguineal 
ties is the mainspring of such relationships. But of 
equal, if not greater, significance is the fact that 
kinsmen have the right to sever relationships with 
their group and either establish or join an entirely 
independent one. While extreme personalization of 
relationships is the rule in this social organization, 
social cohesiveness and proximity are proportion- 
ately weakened by movement away from kinsmen 
and by the intrusion of nonkinsmen into the com- 
munity. Among many peoples (e.g., the Andama- 
nese, Arunta, South African Bushmen, Plains In- 
dian tribes) the exigencies of the food-producing 
environment, rather than tensions and quarrels, 
appear to have necessitated such adjustments on 
the social-structural level. In either event, the con- 
sequences, in terms of fission and social distance, 
are similar. 

It is possible to observe a process of compromise 
in this social organization between forces which 
give rise to social proximity and those resulting in 
social distance. Any of these, when taken alone, is 
productive of either solidarity or fissility. For ex- 
ample, clanship among the Navajo or the peasants 
of Shantung province in pre-Republican China 
would, when taken separately, produce a high de- 
gree of social solidarity. Obversely, the isolation of 
the family—geographically, and hence emotionally, 
among the Navajo and almost entirely emotionally 
among Shantung peasants—gave rise to the emo- 
tional inbreeding of the family vis-a-vis the commu- 
nity. The simultaneity of factors making both for 
solidarity and for fissility produces a compromise 
between the two and places such peoples at this 
point along the continuum of social cohesiveness 
and solidarity rather than at the point of maximal 
or minimal solidarity. Mutual assistance in times of 
need is characteristic of such societies. 

In nomadic hunting and gathering societies hav- 
ing this type of social organization, mutual assist- 
ance and sharing in times of need often take the 
form of the immediate dispensation of meat to al- 
most all the households of the community as soon 
as an animal is captured or slain, especially to 
those who do not have meat. In sedentary societies, 
mutual assistance in times of need takes the form 
of aid rendered to kinsmen, real or fictive, when 
economic help is objectively required. There is, of 
course, mutual assistance in time of need in socie- 
ties which practice recurrent exchange and shar- 
ing; however, as the present instance indicates, 
there are many societies in which there is mutual 
assistance without the elaboration of recurrent ex- 
change. 

This is not meant to imply that there are no 
feasts and exchanges whatever in this social or- 
ganization; these are present, but they occur only 
on specific occasions, e.g., religious events and the 
life crises marked by rites de passage. Nor is the 
idea or concept of recurrent prestation unknown in 
such societies, but such exchange is almost always 
confined to gifts of goods, not food. 

(3) Narrowed and reluctant sharing. By this is 
meant that whatever assistance is rendered to peo- 
ple is given very reluctantly and grudgingly. This 
pattern of food distribution is ordinarily associated 
with fragmented social systems in which the iso- 
lated nuclear family unit is the significant unit of 
association; socially, geographically, and emotion- 
ally isolated from all other family units, it consti- 
tutes neither a society nor a community. Nor do 
the several family units constitute a society, struc- 
turally or functionally, when they come together. 

The primary characteristic of this social system 
is the presence of relatively great geographic dis- 
tances between households and families and, con- 
comitantly, clearly demarcated social boundaries. 
Physical space or distance in this social organiza- 
tion serves as a boundary-maintaining force only 
indirectly and incidentally; this is an important 
consideration, because even when the family units 
in such a social system unite for temporary amal- 
gamations, they continue to maintain their great 
social distance from each other. Although the ties 
of kinship in this social organization may be highly 
formalized, there are rarely any functioning kin 
groups outside the family, and kinship ties become 
increasingly diffuse as the physical distance be- 
tween kinsmen increases. 

One of the material symbolizations of this social 
atomism or fission is the pattern of narrowed and 
reluctant sharing, as among the Kaska and Ojibwa 
Indians. Even when individuals in these societies 
are willing to extend aid, they are willing to do so 
only to a very limited number of people. The cul- 
tures of such peoples generally contain the ideal 
prescription of generosity and mutual assistance 
in times of need, but this ideal is rarely, if ever, 
achieved; in actual behavior, generosity and assist- 
ance are extremely restricted—qualitatively and in 
the number of persons toward whom such behavior 
is manifest. Another way of stating this is that behavior
with respect to food in such societies approaches
parsimoniousness but never quite achieves 
it, reflecting the atomization of social relationships 
beyond the nuclear family. 

(4) Nonsharing. This is a general category; it 
has thus far not been possible to determine differ- 
ent types of nonsharing behavior with respect to 
food, as has been done in connection with pat- 
terns of sharing, because there are too few societies 
in the category of nonsharing from among which 
discrete types of nonsharing could be elicited. 

Here “nonsharing” means the absence in a cul- 
ture of enforceable prescriptions to share food with 
others, no matter how great their need; where non- 
sharing is found as the dominant pattern, the indi- 
viduated accumulation of wealth is an end in itself, 
rather than a means to cooperative or competitive 
generosity. That is, people in many societies, as in 
the first two categories described above, amass food 
or other forms of wealth in order to be able to dis- 
tribute it in socially prescribed forms of generosity, 
and there is often competition to be the most gen- 
erous. 

In societies characterized by nonsharing, there 
are rarely, if ever, any closed groups within the 
community in which membership, feeling of be- 
longing, and reciprocity are fixed and inalienable. 
Few, if any, institutionally significant statuses are 
ascribed; authority, group cohesiveness, interfamil- 
ial support, and mutual assistance are almost al- 
ways absent. Similarly, allegiances are expedient 
and frequently contractual. While considerations 
of kinship appear to play occasional roles in inter- 
personal associations, such considerations are de- 
cidedly secondary; instead, kinsmen and nonkins- 
men alike enter into the economically competitive 
struggle, and personal profit and success emerge 
as predominant values. Power resides among those 
who are monetarily powerful rather than among 
those persons whose social dominance derives from 
ascribed high status or who have succeeded in other 
spheres of competitive activity. 

Societies characterized by this system superfi- 
cially resemble societies in which narrowed and 
reluctant sharing is found, in that the effective 
socioeconomic unit appears to be the independent 
nuclear family. The resemblance, however, is superficial.
In societies in which nonsharing is the dominant
pattern, the social isolation of the family does 
not derive from physical distance between family 
units. Instead, the principal isolating factor appears 
to be the search for individually accumulated 
wealth. Examples of societies characterized by nonsharing
of food are the Alorese, the Marquesans, 
highland peasants in Jamaica, the Yakut of Siberia, 
and the Yurok Indians. 

The comparative ethnographic data reveal with 
marked clarity that no matter how highly elabo- 
rated a pattern of recurrent exchange and sharing 
might be, to say nothing of other patterns, in no 
society is an individual expected to give up every- 
thing he possesses. In no society is true “selfless- 
ness” an expected or imposed value to which the 
individual must adhere. For example, among the 
Arapesh of New Guinea, who have elaborated re- 
current exchange to an extreme and among whom 
a man does not eat food which he himself has 
grown, personal ownership and rights in property 
are clearly delineated with definite notions of 
“mine” and “yours.” The Kwoma, too, who believe 
that a man would die if he ate his own harvest, 
also specify individual ownership of the yield of 
the land, each person’s crop being put in a separate 
bin in a storehouse. The Copper Eskimo prescribe 
that even if only one seal is caught, it must be dis- 
tributed equally throughout the community. How- 
ever, it has been observed among them that during 
the winter, when each woman cooks indoors, she 
can hide the choicer portions of meat for her hus- 
band and herself until after visitors have left; in 
the summer, on the other hand, when most of the 
cooking is done out-of-doors and everyone can see 
what is in the pot, no concealment is possible. It 
can be suggested that the provision in all cultures 
for personal ownership and retention—no matter 
how great the pressures to share—is due, at least 
in part, to the need for material representations of 
individuation and personal separateness, i.e., self- 
awareness. As Hallowell noted in his paper “The 
Self and Its Behavioral Environment,” “It seems 
necessary to assume self-awareness as one of the 
prerequisite psychological conditions for the func- 
tioning of any human social order, no matter what 
linguistic and culture patterns prevail” ([1937–1954]
1955, p. 75). 

At the same time that the social forces which 
make for extreme degrees of social proximity max- 
imize predispositions to share food, factors making 
for social distance within a maximally solidary 
community will mitigate the predisposition to 
share. That is, it can be assumed that every society 
contains some forces, however few and small, 
which help to make for social distance. In a society 
in which extreme social proximity helps to produce 
a pattern of recurrent food sharing, those factors 
making for social distance in the same society will 
help to produce a proportionate reluctance to share. 

For example, high social status, such as chieftainship,
is determined in a variety of ways. It can 
be postulated that social distance is greater be- 
tween persons of high and low ascribed statuses 
than between persons of high and low achieved 
statuses. Assuming some degree of reluctance in 
most people to share food with others, even in so- 
cieties in which people appear strongly predisposed 
to share, it follows logically that reluctance to share 
would covary with ascribed high status even in 
societies in which there is strong social cohesive- 
ness and solidarity. In other words, it can be as- 
sumed that persons with ascribed high status in 
solidary communities will employ their social dis- 
tance from the “masses” or “commoners” as a 
mechanism which enables them to amass and 
retain more for themselves; this is not necessarily 
conscious on their part. For example, among the 
Manuans of Samoa, who have an elaborate hier- 
archy of chieftainship, a high income is one of the 
few feudal privileges. Although chiefly wealth is 
kept in circulation by the requirement that chiefs 
be generous to commoners, the chiefs nevertheless 
generally have more food than anyone else. Among 
the Dahomeans, the economic organization was 
usually characterized by a surplus which eventually 
brought about the concentration of wealth in the 
hands of the hereditary leisure classes. 


The evidence is conclusive that patterns in the 
consumption of food are almost always governed 
by cultural symbols and that the ways in which 
food is distributed and consumed reflect a society’s 
dominant modes of social relationships and group- 
ings, especially those pertaining to kinship ties. 

YEHUDI A. COHEN 


[See also DRINKING AND ALCOHOLISM; EXCHANGE AND 
DISPLAY; and the biography of MAUSS.] 


I. POLITICAL ASPECTS 
II. ECONOMIC ASPECTS 


I 
POLITICAL ASPECTS 


Pecuniary grants and other material donations 
by one government to another are among the re- 
current features of interstate relations. The two 
basic types of grants are tribute and subsidy. Trib- 
ute is typically a donation from a state that is 
somehow more vulnerable to a less vulnerable state 
in exchange for protection or immunity; subsidy 
originates typically with a state that is at least 
financially stronger, and the donor expects in re- 
turn a performance that serves his security strategy. 

Contemporary foreign aid may comprise elements
of tribute, when it secures immunity from 
verbal hostility and feelings of guilt, for instance. 
But its direct line of descent is from subsidies. The 
elementary political economy of subsidies is un- 
changing: the subsidy is an inducement for the 
recipient to behave in a way that would supplement 
the donor’s exertions or substitute for them, and 
do it at a cost that is less than the cost of alterna- 
tive courses of action, including war and greater 
national exertions in war. The basic assumption is 
clearest in aid for military effort. It holds that there 
is a definable unit of fighting power or defense 
power that can be had more cheaply from outside 
the donor’s military establishment. The assumption 
is vindicated whenever domestic armaments have 
reached one or more of the limits set by internal 
supply of material resources, conditions of social 
stability within a political order, and countervailing 
responses by third states. The same basic principle 
of diminishing returns of strictly internal efforts 
can be applied to economic welfare and other non- 
military components of national security and inter- 
national stability, and serves as the rationale of 
foreign aid. 


Objectives 

The ultimate objective of the donor is to convert 
superior economic power into a pattern of political 
alignments that would improve his regional or 
global position. The recipient expects foreign aid 
to generate new resources of power, welfare, and 
prestige. His principal aim may be to consolidate 
the regime’s authority with respect to opposition 
forces, to keep abreast of growing popular needs or 
of growth in other states, or to equip himself for 
effective or even offensive action with regard to 
other countries. Calculation of political advantage 
to be derived from aid is at its most pragmatic 
when diplomacy of maneuver and alignment pre- 
dominates in a relatively flexible and relaxed multi- 
polar situation; it may be temporarily subordinated 
to ideological determinants, or to the compulsions 
of force or the general interest when the global or 
regional balance of power is exposed to a hege- 
monial drive and polarized in consequence. Sub- 
sidies then become an emergency expedient to 
confirm lesser states in their readiness to resist the 
onslaught or to convert to resistance those states 
reluctant to subordinate their immediate concerns 
to the maintenance of the existing international 
system. 

The expanded foreign aid activities following 
World War II conformed, on the whole, to the 
fundamental principles of the statecraft of subsi- 
dies. Foreign aid was part of a policy of coalition 
building by the United States, the preponderant 
yet defensive power in the system; it became part 
of a coalition-frustrating policy in the hands of the 
Soviet Union in a later phase of the cold war. The 
peculiar character of the contest enlarged the 
meaning of the coalition to be built or frustrated 
and drew into the compass of foreign aid not only 
active, military allies; economic and military aid 
were used to attract the uncommitted, floating 
allegiance of “independents” as well as to consoli- 
date established commitments. The need to win the 
apparently decisive uncommitted states made both 
contestants edge away from damaging antecedents 
and associations. The more conservative United 
States repudiated the paternalistic and colonial ele- 
ments in its past and among its allies; the revolu- 
tionary Soviet Union disclaimed its militant tradition 
as an export commodity and disowned its overly 
expansionist Chinese ally whenever expedient for 
reassuring alarmed neutrals. Yet the need to retain 
the allegiance of the inner core of committed parti- 
sans has imposed a limit on profitable opportunism 
also in the international contest; it is a dubious 
electoral and diplomatic strategy to woo untested 
new friends in such a way as to alienate old 
comrades-in-arms. 

Difficulties in implementing the objectives. 
Three special circumstances have warped the im- 
plementation of the political economy of subsidies 
beyond the unavoidable measure of distortion in- 
troduced by hegemonial conflict. 

One was an overemphasis on the role of eco- 
nomic changes and processes in assuring political 
stability and on the role of domestic politics in 
unstabilizing foreign policy activities. In this re- 
spect, the effect of liberal predispositions was com- 
parable with the affirmations of Marxist ideology 
and was reinforced by the need of meeting the 
Soviet practice of internal political subversion and 
manipulation of economic malcontents. Growing 
demands for external assistance consequently ac- 
quired great authority, but the recipients all too 
often lacked the will and ability to use this assist- 
ance responsibly and effectively. 

Another deforming influence has been the asso- 
ciation of foreign aid with decolonization; it added 
psychological and propagandistic biases to those 
imparted by a deterministic ideology and an oppor- 
tunistic political strategy. Foreign aid came to be 
widely regarded as a duty for the advanced states 
and as a reparation for the uncritically asserted 
grievances of recipients. 

The consequent moral advantage of recipient 
countries was enhanced by a political advantage 
owing to the third distorting circumstance, the 
sharp increase in military-technological inhibitions 


on warlike settlements of conflicts between major 
powers. A demonstrable capacity to wage a major 
nuclear war has remained a fundamental condition 
of the efficacy of all political action by the prin- 
cipal powers, including dispensation of foreign aid; 
at the same time, however, the economic power 
and activities of the donor states have been largely 
deprived of their essential support in usable con- 
ventional military force. Thermonuclear stalemate 
and severance of economic power from the ultimate 
military sanction behind political control have com- 
bined to increase the ability of the more ambitious 
among the lesser states to act with impunity. Com- 
petitive economic growth has become the prime 
functional substitute for war as a test of superiority 
for the two competing superpowers and politico- 
economic systems. Consequently, foreign aid activi- 
ties, the principal device for extending the economic 
competition to third and lesser states, were tem- 
porarily overstrained. 

The distorting ideological, psychological, and 
military circumstances produced a disparity be- 
tween power and control or influence that was 
unfavorable to the donors. This disparity at first 
intensified the donors’ competition over the Afro-
Asian nations; only progressively and uncertainly 
has the disparity begun to foster a tacit under- 
standing between the superpowers, regarding the 
rules and limits of foreign aid competition, that 
may eventually match comparable understandings 
in the domain of the arms race. 

American and Soviet approaches. Undeniable 
differences of style and spirit have separated the 
so-called capitalist and the so-called socialist major 
donors. But both have been confounded by com- 
parable uncertainties regarding the objectives of 
foreign aid and the means or strategies for attain- 
ing the objectives. The immediate, short-term ob- 
jective of both the United States and the Soviet 
Union has been negative: to prevent forces favor- 
able to the competitor from becoming established 
or consolidated to the detriment of one’s allies and 
sympathizers in embattled lesser countries. The 
long-term, positive objective has been to promote a 
structure of world order that would reflect the in- 
ternal organization, or the domestic ideal, of the 
superpowers: the supremacy of one class or elite 
in one case, pluralism of competing forces within 
the limits of consensus on essentials in the other. 
The uneasy coexistence of short-term preventive 
objectives and long-term constructive objectives 
compelled both major donors repeatedly to decide 
between often uncongenial, immediate beneficiaries 
of foreign aid and ultimate, actually favored bene- 
ficiaries. In practice, this often meant that the 
Soviet Union aided or appeared to aid bourgeois- 
nationalists against local communists and that the 
United States helped dictatorial oligarchs, at least 
temporarily, against local liberals or democrats. 

The ambiguity of objectives has been reflected 
in uncertainties over strategies. The unresolved 
issue has been whether to choose a strategy keyed 
to individual countries or a strategy keyed to the 
international system as a whole. A one-country 
strategy of foreign aid as implemented by the 
United States would promote the capacity for politi- 
cal independence of individual recipients, on the 
assumption that the exercise of such independence 
would be consistent with American interests. As 
implemented by the Soviet Union, this strategy 
would aim at subverting the identity of a noncom- 
munist regime, on the assumption that formal 
political independence no less than formal political 
democracy is only a disguise for rule by elements 
hostile to the Soviet Union. In both cases, the stress 
is on the peculiar needs, conditions, and opportuni- 
ties in the target country, while the repercussions 
of aid to that country in comparable or related 
countries are downgraded. 

The one-system strategy would reverse the priori- 
ties, assigning high priority to the foreign policy 
behavior of recipients and its international reper- 
cussions. An American one-system strategy of 
longer range would promote behavior conducive to 
an elementary world order and, secondarily, to the 
progressive breakup of the Sino-Soviet bloc. The 
corresponding strategy of the Soviet Union would 
promote the progressive erosion of Western posi- 
tions in the less developed countries as a group, 
producing an irretrievable breakdown of the West- 
ern system rather than piecemeal subversion in one 
country after another. 

No clear choice between the two principal kinds 
of strategy has been made by either superpower, 
in part because of the untoward relationships be- 
tween domestic and external political behavior of 
many recipient regimes. The United States has had 
to cope with the problem of regimes that are inter- 
nationally pro-Western but internally antireform; 
the Soviet Union has had the problem of countries 
that are internationally neutralist and outwardly 
anti-West but domestically anticommunist. On the 
whole, doctrinaire supporters of foreign aid as a 
panacea in the West and of communism and local 
Communist parties as paramount concerns in the 
Sino-Soviet bloc have tended to favor the one- 
country approach, concentrating on immediate and 
locally promising conditions. Those primarily con- 
cerned with foreign policy have stressed the imme- 
diate needs of a coherent foreign aid policy and the 
long-range requirements of a propitiously evolving 
international system, in view of the unlimited 
stakes of the global conflict and the limited efficacy 
of foreign aid in any one country, if administered 
in separation from over-all political and military 
strategies. 


Types of aid 

However used, foreign aid has differed from its 
antecedents in the modern European state system. 
Chief among them are the more or less overt dynas- 
tic subsidies for military manpower, chiefly in the 
seventeenth and eighteenth centuries, and the more 
or less officially sponsored but privately subscribed 
loans in the nineteenth and early twentieth cen- 
turies. Throughout, the principles and practices of 
“foreign aid” were adjusted to the type of control 
over individuals, whether authoritarian or constitu- 
tional, and to the prevailing scope of governmental 
functions, whether they were mercantilist, laissez- 
faire, or welfare-statist. They were also adapted to 
the predominant ways of managing international 
political alignments and military conflicts—as rel- 
atively temporary or hopefully indefinite commit- 
ments, implemented by levies of manpower or by 
advance building of infrastructures such as rail- 
roads and air bases. 

However, political and technological changes 
have, on the whole, fostered a growing temporal 
and qualitative gap between grant and anticipated 
performance. The difference between economic 
grant and expected political performance was al- 
ways there; but even in the field of military aid 
and security the typical performance of the recip- 
ient came to differ radically from the donor’s when- 
ever concession of facilities replaced supplementary 
manpower as the principal return for the aid given. 
The balance between specific and immediate acts 
and slowly evolving general conditions, as the pur- 
poses of foreign aid, has tended to shift in favor 
of the conditions expected to generate policies con- 
genial to the donor even after the termination of 
the aid relationship. 

The growing gap between grant and performance 
has become increasingly hard to span for two inter- 
acting sets of reasons. One has had to do with the 
instability of governing elites and their frequent 
replacement by competing elites. Revolutionary 
break and repudiation of indebtedness, anticipated 
in the lot of French loans to the tsarist regime in 
Russia, has become the rule, albeit with exceptions. 
The other set of reasons has had to do with the 
diffusion of aid relationships as an apparently 
stable and permanent feature of international rela- 
tions. Aid from a richer or well-established state 


has come to constitute a well-nigh automatic coun- 
terpart to diplomatic recognition and practically 
its most valuable by-product. However, this recog- 
nition in kind has not been governed by norms 
comparable with those of recognition in law, re- 
garding effective internal political control, respon- 
sible international conduct, and nonidentification 
of material aid with moral approval. 

Acquisitive and creative aid. The distance be- 
tween grant and anticipated performance, whatever 
may be its long-range effect on foreign aid, helps 
distinguish between two types of aid in the post- 
World War II era through the maze of administra- 
tive and propagandistic nomenclature. One kind of 
aid may be described as acquisitive; it is closest to 
traditional subsidies in that it is extended in the 
anticipation of a relatively specific, tangible, or im- 
mediate advantage for the donor. The acquisition 
may range from a facility such as a military base 
or a base for political operations, through an alli- 
ance commitment (or repudiation of alliance com- 
mitment to the donor's adversary), to the control 
over large segments of a country that would other- 
wise be economically and politically nonviable. 
Alternatively, the donor can give aid in anticipation 
of a long-range economic and political development 
in a favored direction. Such aid may be called cre- 
ative or developmental. Acquisitive aid generally 
has large military components for internal or ex- 
ternal security; creative aid typically is economic, 
even though it may release indigenous resources 
for military outlays. Although a few aid relation- 
ships may be wholly acquisitive or creative in mo- 
tive and purpose, most are primarily one or the 
other. The volume of aid determines whether aid 
is merely token aid, designed to manifest the 
donor's interest in the country as an encourage- 
ment to friendly groups, or whether more important 
aid constitutes a down-payment, committing the 
donor to long-term contributions as long as the 
recipient goes on displaying willingness and capac- 
ity to administer a politically and economically 
promising development process. 

The two main types of aid, acquisitive and cre- 
ative, have a different bearing upon two classes of 
problems that are raised by foreign aid: the dis- 
tribution of aid and the control over aid and related 
policies. The problem of distribution has so far 
concerned chiefly the choice between military and 
economic aid and choice among alternative re- 
cipients. 


Distribution of aid 


Military versus economic. In the United States, 
allocation of resources between economic and mili- 


tary forms of assistance has been a major, if some- 
times distracting, issue. The faith in economic 
means, in the guise of liberalized trade and private 
investments, was marked in American foreign 
policy before, during, and after the period of plan- 
ning for peace during World War II. It received a 
confirmation from the successful economic recon- 
struction of western Europe under the European 
Recovery Program, which became the doubtfully 
valid model for subsequent programs of economic 
assistance elsewhere, following the disappointment 
of exaggerated expectations from technical-assist- 
ance programs short of large-scale aid for develop- 
ment. The military-political crises of the cold war 
produced countervailing stresses in favor of mili- 
tary assistance to countries and regimes threatened 
from within or without. The resulting debate over 
the proper ratio between economic and military aid 
was confused by ambiguous official categories of 
aid, such as “defense support.” The outcome of this 
debate was marked by the contrast between nomi- 
nal victories for advocates of economic aid and the 
less spectacular shifts in the actual ratio, due in 
large part to the continuing demand by many recip- 
ients for substantial military aid. 

The issue between economic and military aid has 
been largely a false one. As long as economic aid 
promotes development, it also fosters social mobili- 
zation and, thereby, some political instability. Not 
all governmental elites that administer outside as- 
sistance are anxious to be self-liquidating. To neu- 
tralize their increased vulnerability, they found 
that military aid for internal security was both 
necessary and legitimate. Moreover, military assist- 
ance can be a means to the goal of economic 
assistance by providing a developing country with 
forces not only for local defense but also for in- 
ternal development that a nationwide, properly 
controlled and used military establishment can gen- 
erate. The United States has, therefore, been justi- 
fied in granting military assistance not only for the 
immediate purposes of its containment and alliance 
policy. On its part, the Soviet Union has been led 
to disseminate military assistance to willing takers 
by the desire to break open the containment ring 
and secure immediate political gains in crucially 
important or vulnerable noncommitted countries. 

Alternative recipients—allies or neutrals? With 
regard to the distribution of aid among alternative 
recipients, both major donors have faced a related 
problem: whether to give preferential or exclusive 
assistance to their allies or to distribute aid widely 
among noncommitted countries as well. Only a 
minority power willing to remain so or prepared to 
rely on military conquest could probably afford the 
purism of aid only to allies. There is no other ready 
criterion. Different flaws vitiate such criteria as 
that of the need of potential recipients, their capac- 
ity to effect and absorb changes with outside assist- 
ance, or their ability and willingness to make their 
internal conduct or external policies conform to 
the interests and policies of the donor. 

A distribution following any one criterion or co- 
herent set of criteria becomes well-nigh impossible 
once there is an aid race between major donors. 
Such competition occurred between the United 
States and the Soviet Union and reached its peak 
in the late 1950s. The competition was aggravated 
by failures of perception. For one thing, the com- 
peting powers long ignored the similarity or iden- 
tity of many of their problems and frustrations 
in the domain of foreign aid. For another, both in- 
siders and outsiders tended to judge and compare 
the performance of the superpowers in not really 
comparable situations. The temporary beneficiary 
of the political and intellectual misapprehension 
was the Soviet Union, which gained easy, and ap- 
parently lasting, initial successes in the conflict- 
ridden Western orbit, displaying largesse toward 
noncommunist countries and reserving strict aid 
policies for communist countries. 

Multilateral versus bilateral aid. Multilateral 
administration and distribution of foreign aid 
through international agencies has been widely ad- 
vocated as a way out of the frustrations of bilateral 
methods. A change-over on a major scale has been 
inhibited, however, by the growing structural dis- 
equilibrium in the United Nations. The disequi- 
librium has favored numbers over the intrinsic 
weight of states and their contributions, and has 
been inimical to progressive expansion of the or- 
ganization’s functional scope in regard to the dis- 
pensing of aid and similar long-range activities. 
Moreover, most recipient countries have actually 
been reluctant to rely primarily on multilateral 
modes of allocation and control of economic 
assistance. 

A more practical way of dispersing donor-
recipient relations, and distributing aid in a quasi- 
multilateral manner, may occur by means of the 
multiplication of both Western and communist 
donors. A deliberately planned division of labor 
among ex-colonial powers has been preached in 
the West and practiced by the Soviet-bloc countries 
(excepting Communist China) in the late 1950s 
and early 1960s. The alternative way of dividing 
labor is a less concerted and even superficially com- 
petitive one, whenever one power acts in the place 
of and over the protests of another in an alienated 
country or part of the world. In the postwar period, 
the United States acted in countries bent upon re- 
pudiating both the presence and the presents of 
former colonial masters. As psychological decoloni- 
zation in the former European empires advanced, 
as experience with powers other than the former 
metropoles grew, and as the colonial revolt spread 
to Latin America, the process has tended to be 
reversed. West Germany has been developing her 
economic activities abroad; France and Britain have 
been recovering their positions of the preferred 
donors in parts of Asia and Africa and have even 
moved to act by means of trade or aid on behalf or 
in lieu of the United States in disaffected countries 
of Latin America. It is unclear at the time of writ- 
ing whether the competitive trade-and-aid activities 
of Communist China and the Soviet Union in Africa 
and Asia will ultimately prove to have been instru- 
ments of a break or of a more or less reluctant 
division of labor. In any event, the over-all result 
has been a larger total bulk of aid and its wider 
diffusion, if not a more rational distribution. 


Control of aid 


The other major problem is that of control. It 
concerns such issues as those of “strings” and 
“intervention,” and it has remained unresolved in 
either theory or practice, although regularization 
of foreign aid depends on a general acceptance of 
a few principles or propositions. Some form of con- 
trol over each other's behavior is the precondition 
of any cooperation between self-willed sovereign 
actors, not least in the sphere of foreign aid. Con- 
trol becomes reciprocal whenever lesser states ex- 
tract a greater leeway from stalemate among greater 
powers. Some strings are inevitably attached to 
grants by one state to another but are likely to be 
effective as well as legitimate only if they are 
adapted to suit particular types of aid and the cor- 
responding kinds of anticipated performance. And 
no effort can do away with at least indirect or un- 
intended intervention by the donor once a govern- 
ment accepts aid and thus admits its inability to 
cope with its tasks in a strictly internal manner. 
The donor's right to intervention generally stops 
short of forcible or dictatorial interference, but 
there are two exceptions. First, the assistance may 
be so massive that it creates responsibility for the 
recipient's existence, and circumstances could be 
so critical that they endanger the vital interests of 
both recipient and donor. Alternately, an invitation 
to intervene may be extended by the recipient, in- 
duced by past or anticipated assistance from the 
interventionist power and directed against a force 


other than clear majority will of the people con- 
cerned. 


The occasions for intervention would be reduced 
by a policy that may be described as one of remote 
control. Under such a policy, the donor seeks to act 
on policies of sensitive governing elites by way of 
precedent-creating action in comparable situations 
accessible to a more direct exercise of influence. 
Alternatives to remote control are direct control, 
over the internal or external activities of a pliant 
or trustful government, and indirect control, ob- 
tained by promoting congenial political groups likely 
to sway politics in the direction favored by the 
donor without his constant interference. The pre- 
viously noted one-country strategy overtaxes the 
capacity for relatively direct forms of controls, no- 
tably if the donor is unable to act upon reliable 
theoretical knowledge of relationships between eco- 
nomic aid and economic development and between 
economic development and political stability. The 
one-system strategy by contrast heightens the bear- 
ing of remote control by consistent policies in com- 
parable situations, creating a body of rules by way 
of discernible authoritative precedents; flexibility 
is subordinated to the rule-making function of pol- 
icy and reserved for diplomatic instruments with 
an immediate, predictable, and easily manipulable 
impact on men and events. 


Critique and prospects 


The problems besetting foreign aid indicate its 
shortcomings. The political future of foreign aid 
depends on its qualities as an instrument of current 
policies and as a foundation of a new system of 
international politics and of a new world order. 
There is a possible conflict between the two tests 
insofar as the traits that constitute the standing of 
foreign aid as an instrument of national policy 
tend to endear assistance to proponents of new 
forms of relations among nations. Foreign aid as 
an economic weapon too frequently tends to be 
separated from a complementary exercise of politi- 
cal influence and military power. Furthermore, aid 
is often rendered in the anticipation of counter- 
performance, the maturity of which is commonly 
all the more delayed and uncertain the more volu- 
minous is the assistance. Consequently, results are 
not forthcoming, cannot be convincingly related to 
the grant, or are reversible by contrary develop- 
ments before they are consolidated. The paucity of 
demonstrable positive returns encourages negative 
appraisals of foreign aid: as a waste or an expedi- 
ent without which things would be still worse. The 
difficulty in consolidating provisional results is frus- 
trating to the donor, even though it may be stabi- 
lizing internationally if it besets all the rival donors. 

The shortcomings of foreign aid as an instru- 


ment of foreign policy become critical when changes 
in the international system reduce the donors’ needs 
that commonly motivate aid. Among such changes 
are technological developments that lessen the de- 
pendence of the major powers on alliances and 
military facilities on foreign soil, and political 
events that dampen the intensity of competition 
over intermediate unaligned states. The task of 
controlling aid relationships from a position of ex- 
ploitable political weakness becomes less impossible 
for major donors, but also less pressing. Military-
political competition that has polarized the state 
system is apt to give way either to an outright mili- 
tary clash or to international politics of increasing 
diplomatic flexibility, marked by a shift of the 
focus of sought gains and feared losses from small, 
uncommitted states to relations and combinations 
among middle and great powers. Insofar as trends 
of this kind prevail, international agencies might 
get a chance to supersede bilateral foreign aid 
activities and routinize the allocation and admin- 
istration of aid for economic development through 
application of objective criteria of need and capac- 
ity; the agencies themselves might, however, first 
have to undergo a change in their structure or spirit, 
and conform more realistically to the balance of 
material needs and contributions of the aid-giving 
and the aid-receiving countries. Conversely, if for- 
eign aid were made multilateral, it might be ham- 
pered in a multipolar world by the diffusion of the 
tendency for each major economic power center to 
claim and obtain a preferential right to foreign aid 
activities in its regional orbit. 

As long as the rendering of foreign aid and its 
results remain in large part the function of more 
pressing and compelling political, military, and 
diplomatic strategies, foreign aid activities in them- 
selves are unlikely to constitute a basis of a new 
system of international politics. Such a system 
would implement a scientific theory of economic 
and political development, so far largely lacking, 
and would rest on the willingness of both donors 
and recipients to implement common interests in 
the expansion of individual and group welfare, of 
political choices, and of exchanges of all kinds. A 
sustained program of foreign aid free from sub- 
servience to political contests would provisionally 
take the place of free trade as the chief integrating 
element in a system of international relations 
governed by the distribution of developmental func- 
tions among unevenly endowed states. So con- 
ceived, the new system of international politics 
would supplement the older ideal of dividing the 
production and consumption of security among 
more or less favored states in a system of collective 
security; and it would supplant the diametrically 
opposed view of foreign aid as a perpetuation of 
subsidies by partly altered means in a still essen- 
tially mercantilist statecraft marked by contest over 
insufficiently expanding and expandable assets. 

Barring unpredictable upheavals, the ultimate 
test of foreign aid as a positive and enduring factor 
in but slowly evolving international politics is a 
long-range one. It concerns the fitness of foreign 
aid to contribute to a stable world order. To a lim- 
ited but real extent, international order and stabil- 
ity rest on domestic orders. Foreign aid makes a 
contribution in this area when it promotes coali- 
tions of political and functional elites, meaning 
especially the modernizers and the guardians of 
the modernization process in developing countries. 
To be both stable and stabilizing, the coalitions 
must be able to agree on the scope and rate of 
manageable change, to be implemented by the 
elites and absorbed by the masses, in a process of 
growth that is largely independent of intercessions, 
restraints on modernizers, and compensations to 
dispossessed groups from the outside. Only very 
few underdeveloped countries, not necessarily rep- 
resentative of the fundamental problem, have man- 
aged to initiate and perpetuate such a process that 
would be largely self-sustaining politically as well 
as economically. The liberal explanation that fail- 
ure to overcome underdevelopment in backward 
countries is a source of instability and war has 
been gaining ground over its Leninist counterpart, 
that the source is found in maladjustments due to 
economic overdevelopment in industrialized, cap- 
italist-imperialist countries. To the extent that 
these hypotheses are valid and foreign aid helps 
relieve critical economic maladjustments, not least 
by reducing the gap between unequally developed 
countries, foreign aid can promote international 
stability. 

In the international arena, foreign aid activities 
tend to promote stability when they provide rival 
powers with outlets for moderately competitive be- 
havior, reducing the incentive to violent forms of 
rivalry. Foreign aid activities can relieve interna- 
tional tensions when they are informed by a real- 
istic appraisal of possible gains for either side, and 
they may stabilize the international system both 
in the short and the long run insofar as contempo- 
rary major powers accept the fact that they are 
building up new economic and military powers that 
may eventually recast alignments to their disad- 
vantage. The generation of reserve power may 
reduce the pressures on governments to avert by 
force unfavorable developments that may be fore- 
shadowed by trends in the arms race or by rates 
of competitive economic growth. Such an anticipa- 
tion may have been partly responsible for some 
major wars in the past, including both world wars 
of the twentieth century. 

Two decades after the end of World War II, there 
has been a widespread desire to evaluate the inter- 
vening experience with different forms of large-scale 
governmental aid. A continuing retreat of foreign aid 
from the forefront of direct great-power competition 
would favor a new balance between acquisitive and 
creative aid. This might entail depoliticization of 
foreign aid in one respect, that of allowing consid- 
erable autonomy to objective needs of development, 
and its politicization in another respect, that of a 
progressive definition and acceptance of reciprocal 
rights and obligations on the part of recipients as 
well as donors. The new balance would reflect the 
growing need for the so-called revolution of rising 
expectations to come to terms with the less spec- 
tacular but no less significant decline of expecta- 
tions attached to foreign aid. Any new compromise 
between the demands of the revolution in the less 
developed countries and the consequences of the 
evolution in the industrial donor countries could 
not but alter the manifestation of a technique that 
can be shown to be a recurrent, but not continuous, 
and a useful, but not indispensable, feature of in- 
ternational statecraft. 

GEORGE LISKA 


[See also ALLIANCES; COLONIALISM; IMPERIALISM; 
MODERNIZATION; TECHNICAL ASSISTANCE. A guide 
to other relevant material will be found under 
INTERNATIONAL RELATIONS.] 
II 
ECONOMIC ASPECTS 


The international transfer of economic resources 
is the essence of foreign economic aid. The flow 
of public assistance consists of grants and loans 
in cash and kind, including technical assistance, 
from a donor government or international organ- 
ization to other governments or enterprises in 
recipient countries. The sources, forms, and pur- 
poses of financial and technical assistance are ex- 
tremely varied [see TECHNICAL ASSISTANCE]. Of 
primary importance, however, is the flow of resources
to poor countries in order to accelerate
their development. The purely economic function
of this aid should be distinguished from that of 
direct military assistance, which may also be re- 
ceived by a developing country; moreover, aid 
should properly be considered as only that part 
of the foreign capital inflow that normal market 
incentives do not provide. [See INTERNATIONAL 
MONETARY ECONOMICS, article on PRIVATE INTERNATIONAL
CAPITAL MOVEMENTS.]

Several aspects of foreign economic aid require 
particular emphasis: the historical evolution of aid 
programs; the determination of the objectives and 
magnitude of aid; the appraisal of the relative 
merits and demerits of different types of aid; the 
criteria of effective use of aid; and the burden of 
aid. 

Evolution of aid programs. Foreign assistance 
programs have evolved through a series of legis- 
lative measures and administrative procedures 
required at different times for various purposes. 
Before World War II economic aid came mainly
from metropolitan countries in support of over- 
seas colonies and dependencies, was on a purely 
bilateral basis, and involved relatively small 
amounts of financial grants. [See COLONIALISM, 
article on ECONOMIC ASPECTS.]

The United States, however, early supported a 
large-scale assistance program through loans from 
the Export-Import Bank of Washington; since 
1954 this bank has concentrated increasingly on 
capital assistance to less-developed countries. Aside 
from the lend-lease arrangements, which provided 
economic support to wartime allies, the first gen- 
eral foreign aid program supported by the United 
States came with the Economic Cooperation Act 
of 1948. This initiated the European Recovery 
Program (Marshall Aid Program), which during 
1948-1952 furnished loans and grants to 17 
western European countries that confronted prob- 
lems of reconstruction and balance of payments 
pressures. Although aid under this program was 
not directly related to the problems of newly de- 
veloping countries, it demonstrated the potential 
of international aid and was a prototype for subse- 
quent aid programs. 

In 1950 the U.S. Congress passed the Act for 
International Development, which incorporated a 
technical assistance program (“Point Four”). This 
was the first of a series of Foreign Assistance Acts. 
A variety of agencies have administered U.S. aid 
in the course of these successive programs. The 
United States has also provided, under Public Law 
480 and “Food for Peace” programs, for the sale 
of surplus agricultural commodities for local currencies.
In 1957 the Development Loan Fund was
created to administer long-term, low-interest loans
from the United States. The magnitude of American
foreign aid from 1941 to 1960 is summarized
by program in Table 1.

Table 1. United States government foreign aid by program: fiscal years 1941-1960 (a)
(converted to millions of U.S. dollars)

                                                     1941-1948 1949-1952 1953-1956      1957      1958      1959      1960     Total
Gross total, all programs (b)                           70,318    21,529    23,893     5,472     5,429     6,690     5,077   138,408
Investment in international financial institutions       3,385         -         -        35         -     1,375        80     4,875
Gross grants                                            57,167    18,959    20,924     4,142     3,963     3,860     3,654   112,669
  Lend-lease                                            48,672         -         -         -         -         -         -    48,672
  UNRRA and related aid (c)                              3,473        53         -         -         -         -         -     3,526
  Aid to occupied areas                                  3,591     2,791       270         1         1         2         2     6,658
  ERP, MSA, and other economic aid (d)                     514    11,778     6,622     1,502     1,244     1,369     1,314    24,343
  Farm surplus disposal (e)                                  -        49       653       281       365       312       314     1,974
  MDAP and MSA military aid (f)                              -     3,073    13,377     2,358     2,353     2,177     2,023    25,361
  Other grants (g)                                         914     1,214         2         -         -         -         -     2,130
Gross credits (b)                                        9,766     2,570     2,969     1,295     1,466     1,455     1,343    20,864
  Lend-lease settlement                                  2,957        56         -         6         -         -         -     3,019
  Export-Import Bank                                     2,570       880     1,547       237       760       708       399     7,101
  British loan                                           3,750         -         -         -         -         -         -     3,750
  ERP and MSA economic aid                                   -     1,532       512       106       232       240       315     2,937
  Farm surplus (h)                                           -         -       860       937       474       506       629     3,406
  Other credits                                            487       102        49         9       (i)       (i)       (i)       647

a. Fiscal years ending June 30; details may not add to totals because of rounding.
b. Includes net accumulation of foreign currency claims deriving from farm surplus disposal.
c. Includes post-UNRRA and Interim Aid.
d. ERP: European Recovery Program; MSA: Mutual Security Act; other aid includes contributions to UN agencies, technical assistance to Latin America, Inter-American and related highways, Trust Territory development, Libyan Special Purpose Fund, etc.
e. Donations of agricultural products plus dollar equivalent of foreign currency grants derived from farm surplus disposal.
f. MDAP: Mutual Defense Assistance Program.
g. Greek-Turkish, Chinese stabilization, and military aid; Philippine, Korean, and other Far East aid grants.
h. Dollar equivalent of loans made in foreign currencies derived from the farm surplus disposal programs, plus the net accumulation of such currencies.
i. Less than $500,000.

Sources: Foreign Grants and Credits by the United States Government,
1952, p. 81, appendix B; U.S. National Advisory
Council on International Monetary and Financial Problems;
Higgins 1962, table 3-1.

Although the United States has been the largest
donor, many other countries have also participated
in international economic assistance. Since 1954
the Soviet Union has entered into assistance agreements
with developing countries and has also
extended technical assistance on an increasing
scale. While a considerable part of aid from noncommunist
Western countries has consisted of
grants, the Soviet system has concentrated on long-term
“bulk” credits, under which agreements for
deliveries of goods on credit are negotiated. Although
there has not been much difference in the
length of loans and terms of repayment between
Soviet and Western agreements, a larger proportion
of Soviet aid has allowed for repayment in
local currency or local products, and Soviet loans
have had lower interest rates.

Table 2. Commitments by the centrally planned economies of bilateral economic aid to underdeveloped countries, 1954-1960
(millions of U.S. dollars)

                            Before 1958         1958         1959         1960        Total
Donor:
  China (mainland)          [illegible]  [illegible]  [illegible]  [illegible]          129
  Czechoslovakia            [illegible]  [illegible]           76  [illegible]          215
  East Germany                        7            -           22            -           29
  Poland                              -            -            6           30           36
  Rumania                             -  [illegible]  [illegible]            -           11
  U.S.S.R.                          712          345          782          804        2,643
  Yugoslavia                [illegible]  [illegible]  [illegible]  [illegible]          135
  Total, donor countries            822          460          953          963        3,198
Recipient:
  Africa                    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
  [illegible]               [illegible]  [illegible]  [illegible]          120          232
  [one or more rows illegible]
  Southeast Asia                    615           76          494          513        1,698
  Total, recipient countries        822          460          953          963        3,198

Source: UN, [remainder illegible].

Soviet technical assistance has usually been
supplied as part of a larger, integrated program
consisting of loans, development goods, and bi- 
lateral trade; in contrast, technical aid from 
Western countries has usually been connected to 
individual projects or to specific training objectives. 
Communist-bloc technical experts have also con- 
centrated on relatively few conspicuous projects 
that entail complex technology, while American 
technical assistance has emphasized the broader 
and more basic needs of agriculture, health, and 
education. 

As may be noted in Table 2, commitments 
(which were not in all instances implemented) 
made by the centrally planned economies to the 
underdeveloped countries amounted, by the end of 
1960, to approximately $3,200 million. Table 3 
compares the magnitude of aid from the Sino- 
Soviet bloc with that of the United States. 

Other countries have also sponsored bilateral aid 
programs—notably the United Kingdom, France, 
West Germany, Canada, and Japan. Table 4 sum- 
marizes the magnitude of bilateral aid during 
1958-1960.

Regional organizations have also been significant 
contributors to aid: the Colombo Plan for Coopera- 
tive Economic Development in South and South- 
east Asia, inaugurated in 1950; the Organization 
of American States, formed in 1951 and strengthened
by the Inter-American Development Bank in
1959, the Act of Bogotá of 1960, and the Charter 
of Punta del Este, which launched the “Alliance 
for Progress”; and the Organization for Economic 
Cooperation and Development (OECD), which in 
1961 succeeded the Organization for European 
Economic Cooperation (OEEC). 

Finally, the United Nations and a number of its 
specialized agencies play a prominent role in 
foreign aid: the International Bank for Recon- 
struction and Development (IBRD), International 
Finance Corporation (IFC), United Nations Special 
Fund (UNSF), International Development Asso- 
ciation (IDA), Technical Assistance Board (TAB), 
World Health Organization (WHO), Food and 
Agriculture Organization (FAO), and regional eco- 
nomic commissions. 

Utilizing the capital subscribed by member na- 
tions, the loans raised in the markets of member 
countries, and the return on its own operations, 
the IBRD has been the principal development- 
financing institution of the United Nations. The 
IDA has provided loans at very long term and with 
no or low interest, and it has had great latitude in 
its selection of projects for financing. This has 
enabled the IDA to meet the special needs of de- 
veloping countries that, for balance of payments 
reasons, were nearing the limit of their capacity
to borrow abroad for development projects on conventional
terms. The IFC has been instrumental in
assisting private industrial development by investing
without government guarantee in productive
private enterprises. During the year 1961/1962,
the IBRD and its two affiliates, the IFC and IDA,
made new commitments of more than $1,000
million. Although not itself a lending agency, the
UNSF has made greater investment feasible and
more effective by granting assistance for preinvestment
work in the form of surveys, training, and
applied research. Contributions to the United Nations
Expanded Program of Technical Assistance
amounted to over $41.5 million in 1961, compared
with less than $20 million a decade earlier.

Table 3. Comparison of economic credits and grants
extended to less-developed countries by
the Sino-Soviet bloc and the United States
(millions of U.S. dollars)

                     Sino-Soviet bloc    United States*    United States*
                     mid-1955 to         mid-1955 to       mid-1948 to
                     December 1961       December 1961     December 1961
Total                      4,371              9,449            12,727
Latin America                465              1,341             2,080
  Argentina                  104                360               524
  Brazil                       4                961             1,515
  Cuba                       357                 20                41
Middle East                1,077              1,896             2,723
  Cyprus                       1                 10                10
  Egypt                      615                394               479
  Iran                         6                411               672
  Iraq                       216                 13                21
  Syria                      178                 69                70
  Turkey                      17                980             1,452
  Yemen                       44                 19                19
Africa                       601                502               512
  Ethiopia                   114                115               125
  Ghana                      182                 35                35
  Guinea                     110                  4                 4
  Mali                        65                  2                 2
  Somalia                     62                 15                15
  Sudan                       22                 64                64
  Tunisia                     46                267               267
Asia                       2,112              4,881             5,973
  Afghanistan                204                138               184
  Burma                       93                 68                90
  Cambodia                    65                195               232
  Ceylon                      58                 77                77
  India                      963              2,726             3,221
  Indonesia                  641                308               541
  Nepal                       55                 40                44
  Pakistan                    33              1,329             1,584
Europe                       116                829             1,439
  Iceland                      5                 35                70
  Yugoslavia                 111                794             1,369

* U.S. figures include credits and grants from ICA obligations; DLF loans approved;
Export-Import Bank long-term loans authorized; and Public Law
480 funds earmarked for country use under Title I, authorizations under
Title II, and shipments under Title III.

Source: U.S. Congress, Committee on
Appropriations, p. 207.

Table 4. Bilateral aid to underdeveloped countries, 1958-1960
(millions of U.S. dollars) (a)

                           CALENDAR YEAR 1958               CALENDAR YEAR 1959               CALENDAR YEAR 1960
Contributing country     Grants     Loans Repayments     Grants     Loans Repayments     Grants       Loans Repayments
Australia (b)              37.0       1.2          -       34.4       0.6          -       39.2 [illegible]          -
Belgium (c)                 4.0         -          -       30.0         -          -       86.0           -          -
Canada                     47.3       9.5          -       60.3       1.3          -       48.1           -          -
Denmark                       -         -          -          -         -          -        0.7           -          -
France                    524.5     227.6       41.2      665.2     176.2       42.4      687.0        87.0       19.0
Germany (FRG)               5.2      74.4       16.0        6.6      46.1       17.5        6.2        77.5       16.9
Italy                      10.2         -         17        9.0      30.5        8.9        8.9        16.8       10.0
Japan                     177.4         -          -        4.0         -          -        2.5        78.6          -
Netherlands                20.8       1.7          -       21.7       2.3          -       28.2         3.6          -
New Zealand (d)             4.2       0.2          -        5.0       0.3          -        6.3           -          -
Norway                      1.2         -          -        1.0         -          -        0.9           -          -
Sweden                      0.8         -          -        0.9         -          -        1.0           -          -
Switzerland                 0.1         -          -        0.1         -          -          -           -          -
United Kingdom            143.0      67.0       20.0      151.0     137.0       25.0      165.0       176.0       22.0
United States           1,132.0     796.0      167.0    1,157.0     733.0      205.0    1,320.0       723.0      220.0
Yugoslavia                    -         -          -          -         -          -        0.3        26.0          -

a. Omits aid from Sino-Soviet bloc.
b. Fiscal years ending 30 June of years stated.
c. Advances to Ruanda-Urundi with no specific schedule of repayment are treated as grants.
d. Fiscal years ending 31 March of years stated.

Source: UN, Statistical Office 1961, table 157, p. 467.

The amount of capital assistance from the
United Nations has been small in comparison with
bilateral programs, but it has particular importance
because of its multilateral character. The same is
true for the technical assistance programs provided
by the United Nations itself and by several of its
specialized agencies. Table 5 indicates the magnitude
of multilateral aid, 1958-1960, while Table 6
shows the magnitude of public aid by source and
type, 1953/1954-1958/1959. In recent years the 
relative share of the flow of capital to underde- 
veloped countries has been approximately 90 per 
cent on a bilateral basis and only 10 per cent on 
a multilateral basis, with the IBRD, IFC, and IDA 
accounting for slightly over one-half of the multilateral
aid.

Objectives and magnitude of aid. Foreign aid 
is usually a component of a nation’s foreign policy; 
for this reason its objectives are diverse. The underlying
objectives may be mixed even for a particular
act of assistance by a single government
to another, since humanitarian motives, military
interests, and political purposes tend to fuse with
the purely economic objective.

Table 5. Multilateral aid to underdeveloped countries, 1958-1960
(millions of U.S. dollars)

                    CALENDAR YEAR 1958               CALENDAR YEAR 1959               CALENDAR YEAR 1960
Source*           Grants     Loans Repayments     Grants     Loans Repayments     Grants     Loans Repayments
EDF                    -         -          -        0.1         -          -        3.4         -          -
IBRD                   -     345.7       47.0          -     310.3       68.5          -     340.1       91.5
IFC                    -        49          -          -       7.8          -          -      13.6          -
UNICEF              18.3         -          -       20.4         -          -       18.3         -          -
UNRWA               31.8         -          -       34.1         -          -       34.7         -          -
UNSF                   -         -          -        0.3         -          -        2.0         -          -
UNEPTA              25.9         -          -       24.7         -          -       24.1         -          -
Other UNTA          10.8         -          -       11.3         -          -       13.5         -          -

* The following abbreviations have been introduced in the table: EDF, Development
Fund of the European Economic Community; IBRD, International Bank for Reconstruction
and Development; IFC, International Finance Corporation; UNICEF, United Nations
Children’s Fund; UNRWA, United Nations Relief and Works Agency
for Palestine Refugees in the Near East;
UNSF, United Nations Special Fund; UNEPTA, United Nations Expanded Programme of
Technical Assistance; and other UNTA, all programs of technical assistance administered by
the United Nations, the specialized agencies, and the International Atomic Energy Agency,
other than EPTA.

Source: UN, Statistical Office 1961, table 157, p. 467.

Table 6. Total governmental aid, multilateral and bilateral, by type, 1953/1954-1958/1959

                                                   1953/1954-1955/1956              1957/1958-1958/1959
                                                     ANNUAL AVERAGE                   ANNUAL AVERAGE
                                                Grants  Loans (net)   Total      Grants  Loans (net)   Total
Total aid in billions of U.S. dollars              1.5      0.5        2.0         2.2      1.1        3.3
  Bilateral aid                                    1.4      0.4        1.8         2.1      0.8        2.9
  Multilateral aid                                 0.1      0.1        0.2         0.1      0.3        0.4
Total aid as percentage distribution by type      74.4     25.6        100        66.0     34.0        100
  Bilateral aid                                   77.0     23.0        100        70.9     29.1        100
  Multilateral aid                                48.3     51.7        100        27.2     72.8        100

Source: UN, Secretary-General 1961, p. 48.

Solely in terms of the direct economic goal,
foreign aid is intended to augment the resources
of the recipient country so that its rate of development
may be raised. The largest proportion of the
flow of public funds has been devoted to investment
projects in industry, agriculture, mining, and
basic “overhead” facilities, such as power and transportation
[see CAPITAL, SOCIAL OVERHEAD]. To
achieve a higher rate of economic development,
however, a country must also experience social
development through investment in human resources,
introduction of new techniques of production,
and institution building. Technical assistance
programs fulfill some of these requirements by
improving health and education and supporting
rural community development. Even though social
projects are difficult to evaluate in terms of economic
criteria alone and hence are a questionable
objective of economic aid, financial and technical
assistance may nonetheless be necessary to initiate
social programs that will contribute to higher productivity.
An even more controversial objective of
foreign aid is stabilization assistance in the form
of balance of payments, or general purpose, loans
to a country that has current external liabilities
in excess of its international reserves or to a country
that cannot meet its current liabilities out of
current foreign exchange earnings without restricting
essential imports.

Clarity on the objectives of foreign aid is the
necessary first step in determining how much aid
is needed by a recipient country. But even when
the objectives are clearly defined, it is still extremely
difficult to calculate the magnitude and
duration of external assistance required to attain
these objectives. Nonetheless, such calculations are
commonly attempted, especially in relation to the
recipient country’s development plan. Although
the statistical exactitude of this type of calculation
is illusory, its logic is of interest, and improvement 
of the concepts and methodology underlying these 
quantitative estimates may result from more in- 
tensive consideration of the problems involved. 

The usual type of calculation proceeds as fol- 
lows: A target is set for the desired increase in 
per capita real income over a given time period. 
On the basis of estimated annual population 
growth and the economy’s marginal capital-output
ratio (the ratio between the value of additional 
capital and the value of additional annual output 
produced), it is concluded that a certain percent- 
age of national income must be saved and invested 
to attain the per capita income target. If domestic 
saving is less than this amount and cannot be 
immediately raised by domestic policy measures, 
the balance must be sought from external sources. 
To the extent that private foreign investment does 
not fill the gap, public funds from abroad are 
required. 
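
The calculation just described can be sketched in a few lines. This is only an illustrative reading of the procedure, not a method taken from any aid agency: the function name, its parameters, and the sample figures are all assumptions, and every quantity is expressed as a share of national income.

```python
def required_aid_share(per_capita_growth, population_growth,
                       capital_output_ratio, domestic_saving_rate,
                       private_inflow_share=0.0):
    """Public aid needed, as a share of national income (hypothetical helper)."""
    # Total income must grow fast enough to cover population growth
    # as well as the per capita target.
    income_growth = (1 + per_capita_growth) * (1 + population_growth) - 1
    # Marginal capital-output ratio: units of capital per unit of extra annual output.
    required_investment = capital_output_ratio * income_growth
    # Whatever domestic saving does not cover must come from abroad.
    external_gap = required_investment - domestic_saving_rate
    # Private foreign investment fills part of the gap; public funds fill the rest.
    return max(0.0, external_gap - private_inflow_share)


# Assumed figures: 2% per capita target, 2.5% population growth,
# capital-output ratio of 3, 9% domestic saving, 1% private inflow.
share = required_aid_share(0.02, 0.025, 3.0, 0.09, 0.01)
print(f"public aid required: {share:.2%} of national income")
```

With these assumed inputs the required investment is about 13.65 per cent of income, so public aid of roughly 3.65 per cent of income would be called for. The result is highly sensitive to the capital-output ratio, which is the weakness the following paragraph takes up.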

This exercise is, however, a highly oversimplified 
way to calculate aid requirements. Capital-output
ratios have not yet been measured accurately 
enough to justify their use in deriving quantitative 
conclusions. Moreover, exclusive concentration 
merely on the relationships between a capital-output
ratio, a savings ratio, and population growth is
a much too mechanical and narrow interpretation, 
for it misses the inner complexity and subtleties of 
the development process. Other factors besides in- 
vestment are strategic in accelerating development, 
and the full implications of external aid cannot be 
appreciated without a more comprehensive analysis 
of the role of foreign capital. In the broadest con- 
text the analysis should consider how the foreign 
aid relates to a greater national effort to increase 
the rate of development, and it should examine the 
differential impact of various forms of capital as- 
sistance in terms of their costs and benefits. 

Although the calculation of foreign aid require- 
ments is difficult, the underlying rationale for such 
aid is clear. It is not possible to relate any one
source of funds to a specific type of use, but the
net capital imports can offset, or “finance,” the dif- 
ference between the value of products used and the 
value of products produced domestically; or the 
difference between net investment and net domes- 
tic savings; or the value of imports and factor pay- 
ments abroad minus the value of exports and factor 
receipts. All these differences constitute a shortfall 
in real resources that results when the claims of the 
development plan exceed the available domestic 
resources. This gap in real resources is in turn 
reflected in a foreign exchange gap. To the extent 
that the use of foreign exchange reserves or an in- 
flow of private capital does not fill the gap, public 
capital assistance from abroad will be necessary to 
give command over the additional resources. The 
size of a development plan is therefore constrained 
by the amount of foreign economic aid that is forth- 
coming. 
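
That the three differences named above measure the same shortfall follows from ordinary national income accounting. The notation below is an assumption introduced only for this sketch and does not appear in the original: Y is national output (including net factor income), C consumption, I net investment, S domestic savings, X exports plus factor receipts, M imports plus factor payments, and F net capital imports.

```latex
\begin{align*}
Y &= C + I + (X - M)      && \text{output equals its uses} \\
S &= Y - C                && \text{definition of domestic savings} \\
(C + I) - Y &= M - X      && \text{products used minus products produced} \\
I - S &= (C + I) - Y      && \text{substituting } S = Y - C \\
F &= (C + I) - Y \;=\; I - S \;=\; M - X
\end{align*}
```

Under these definitions the resource gap, the savings gap, and the foreign exchange gap are a single magnitude viewed from three sides, which is why no one source of funds can be matched to a specific use.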

While the dependence on external aid may have 
to be large at the outset of the development pro- 
gram, most plans aim for a progressive reduction 
in aid and the eventual realization of a self-financ- 
ing plan. To achieve this the plan relies on a high 
marginal rate of domestic saving. It is expected 
that the proportion that can be saved out of an 
increase in output will be much higher than the 
average savings ratio at the plan’s beginning. The 
proportion of national income invested and fi- 
nanced by domestic savings would then increase, 
and the ratio of net foreign capital imports to addi- 
tional investment would decline. The reliance on 
foreign aid will also diminish if, as development 
proceeds, the composition of investment alters 
toward projects with a lower import content, the 
production of import-substitutes increases, export 
revenue expands, and total domestic output grows 
without inflationary financing. 
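
The mechanism by which a high marginal rate of saving makes a plan self-financing can be illustrated with a small simulation. This is a sketch under stated assumptions: the function `aid_path` and all default figures are hypothetical, the growth target and capital-output ratio are held constant, and changes in import content or export revenue are ignored.

```python
def aid_path(years, income=100.0, avg_saving=0.08, marginal_saving=0.25,
             capital_output_ratio=3.0, growth=0.05):
    """Return [(year, aid as a share of income)] for a constant growth target."""
    saving = avg_saving * income           # domestic saving at the plan's start
    path = []
    for year in range(years):
        # Investment needed this year to deliver the targeted growth.
        investment = capital_output_ratio * growth * income
        aid = max(0.0, investment - saving)
        path.append((year, aid / income))
        # Out of each increase in output, the marginal rate is saved.
        new_income = income * (1 + growth)
        saving += marginal_saving * (new_income - income)
        income = new_income
    return path


path = aid_path(15)
for year, share in path:
    print(year, f"{share:.3f}")
```

Because the marginal saving rate (25 per cent here) exceeds both the initial average rate (8 per cent) and the required investment share (15 per cent), the average saving ratio climbs each year and the aid share falls steadily from 7 per cent of income to zero. If the assumed marginal rate were below the required investment share, the aid requirement would never disappear.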

Forms of aid. The significance of foreign aid 
depends not only on its magnitude but also on the 
terms on which it is made available. The various 
forms of financial assistance can be differentiated 
by their terms—whether “hard” loans, “soft” loans, 
or grants; by the parties they involve—whether 
government-to-government or government-to-enter- 
prise and whether furnished by national or multi- 
lateral agencies; and by their conditions—whether 
for specific projects or for a general development 
program. 

Hard loans carry a commercial rate of interest, 
have a relatively short amortization period, and re- 
quire repayment in foreign currency. Various types 
of soft loans are possible: very long-term loans re- 
payable in foreign currency at a low rate of interest 
or without interest; loans repayable in local cur- 


rency; long-term loans at low interest with a long 
grace period for payment of principal and interest. 
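
The practical difference between hard and soft terms shows up in the annual debt service. The following sketch compares level annual payments on two hypothetical loans; the function name, the interest rates, and the maturities are assumptions chosen for illustration, and interest is assumed to accrue (unpaid) during any grace period.

```python
def annual_service(principal, rate, years, grace=0):
    """Level annual payment over the years remaining after the grace period."""
    # Assumption: interest is capitalized while no payments are being made.
    balance = principal * (1 + rate) ** grace
    n = years - grace
    if rate == 0:
        return balance / n                 # interest-free loan: straight-line repayment
    # Standard annuity formula for a level payment over n years.
    return balance * rate / (1 - (1 + rate) ** -n)


hard = annual_service(100.0, 0.06, 10)            # commercial rate, short maturity
soft = annual_service(100.0, 0.01, 40, grace=10)  # low rate, long maturity, grace period
print(f"hard loan: {hard:.2f} per year; soft loan: {soft:.2f} per year")
```

On these assumed terms the hard loan costs well over three times as much per year to service as the soft loan of the same face value, although the soft loan is serviced for far longer.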

Since they require neither servicing nor repay- 
ment, grants rather than loans are sometimes pro- 
posed for projects involving “social” development. 
It is thought that loans are appropriate for self- 
liquidating “economic” projects, where the criterion 
can be ability to service the loan, but that grants 
should be made for such nonself-liquidating proj- 
ects as education, health, and community develop- 
ment, where the criterion is simply “need.” Inter- 
national grants do have the merit of adding no 
burden of interest and amortization charges upon 
the recipient country, and they are a clear manifes- 
tation of the donor's willingness to offer a unilateral 
income transfer. 

There is, however, little justification in differen- 
tiating between “social” and “economic” capital, in- 
sofar as both can contribute to an expansion of 
total output, which is the basic consideration in 
determining the recipient country’s capacity to serv- 
ice foreign obligations. If in the interest of assuring 
effective use of grants, the granters attempt to ex- 
ercise more control over the recipient country than 
would be the case for loans, then grants may, of 
course, be resented. And while it may be politically 
inexpedient for a donor nation to offer grants, there 
may be less political opposition to loans—even with 
generous low-interest and long-term provisions. 

Although the greater part of all economic aid has 
been on a bilateral basis, several arguments are 
advanced in favor of providing more aid through 
multilateral channels: better coordination of assist- 
ance activities can be achieved; the aid is more ac- 
ceptable to recipient nations; the practice of dis- 
crimination among recipients is more difficult, and 
the aid program is less susceptible to political in- 
fluences; tied loans (requiring expenditure in the 
lending country) are avoided; and more appropriate
technical assistance may be obtained by drawing 
upon the experience of all nations. 

In some areas and for some activities interna- 
tional auspices may not be feasible. And few would 
insist that all types of aid be administered by in- 
ternational agencies. Short of complete multilat- 
eralism, however, there is still considerable scope 
and need for the multilateral coordination of bilat- 
eral programs—multilateral supervision of the 
technical, financial, and commodity aid supplied 
by individual countries.

Another major issue is whether aid should be 
given only for a specific project or extended for 
general purposes too. Advocates of the project ap- 
proach claim that only by financing soundly con- 
ceived projects will foreign aid contribute to an 


increase in productive capital. The IBRD, for in- 
stance, has tended to give prime consideration to 
the foreign exchange requirements of specific proj- 
ects, emphasizing the economic soundness of the 
projects and their direct contribution to the pro- 
ductivity of the borrowing country. Moreover, it 
is claimed that the review of specific projects has 
some educational value and allows the donor to 
act more readily as a critic of the details of the 
recipient's development plan. 

It has become increasingly clear that the effec- 
tiveness of aid depends not merely on the produc- 
tivity of specific projects but more generally on the 
complementarity of different projects and the ap- 
propriateness of the developing country’s allocation 
of total investment expenditures from both domes- 
tic and foreign sources. Similarly, the narrow view 
that aid should be confined to the foreign exchange 
component of specific projects misses the funda- 
mental principle that the receipt of aid allows the 
country to acquire command over resources in gen- 
eral, not simply to increase command of foreign 
resources in particular. Insofar as changes occur 
in the general program of development, it is im- 
possible to identify exactly the inflow of capital 
with a specific project; the inflow contributes to the 
whole program. The effectiveness of aid depends 
upon the country’s allocation of its general re- 
sources. The same consideration is relevant for 
determining the country’s capacity to service for- 
eign debt, as noted below. 

Use of aid. Even if aid is provided in substan- 
tial amounts and on favorable terms, there remains 
the ultimate problem of achieving its most effective 
use. External assistance should add to—not sub- 
stitute for—the country’s own developmental ef- 
forts. If capital assistance is to result in a higher 
rate of investment, it must be prevented from sim- 
ply substituting for domestic sources of financing 
investment, and it should not be dissipated in sup- 
porting higher personal consumption or an increase 
in nondevelopment current expenditure by the 
government. 

If external assistance were to be used for bolster- 
ing current consumption, the scope for the use of 
foreign capital would obviously be practically 
boundless. When this use is excluded, the econom- 
ically warranted demand for capital is limited by 
the country’s ability to use capital productively— 
that is, by the country’s “capital absorptive capac- 
ity.” In the short run, even if not in the long run, 
there may be a limit to how much capital assistance 
can be effectively used when the investment must 
not only cover its costs but also yield a reasonable 
increase in income. It should also be recognized
that the effective use of capital assistance may depend
in large part upon the effectiveness of technical
assistance, for the purpose of contributing
technical aid is essentially to raise absorptive capacity.
Once the pace of development gains momentum,
the absorptive capacity will be higher, and
foreign capital can then be utilized more effectively. 

When capital assistance is not confined to spe- 
cific investment projects but is available on a gen- 
eral-purpose basis, the allocation of the foreign 
capital is decisive in determining whether it con- 
tributes as much as possible to raising the growth 
potential of the recipient economy. An efficient allo- 
cation of investment resources then depends upon 
the application of investment criteria based on the 
country’s entire development program and the 
adoption of appropriate domestic policy measures 
to supplement the use of foreign aid. Since the 
formation of capital depends, in the last resort, on 
domestic action, there can be no simple equivalence 
between the amount of aid received and the sub- 
sequent rate of development. The effectiveness of 
the use made of foreign aid in the country’s entire 
development program will be more decisive in de- 
termining whether development is to be sustained 
than is the initial amount of aid received. 

It is therefore proper to emphasize the necessity 
of self-help measures, as American aid programs 
have increasingly done. Unless recipient govern- 
ments adopt measures to mobilize fully their own 
resources and to implement their plans, the max- 
imum potential from aid will not be realized. It has 
become increasingly evident that to realize this ob- 
jective in many poor countries it is necessary to 
undertake basic reforms in land tenure systems, 
the tax structure, education, and governmental ad- 
ministration. Additional attention must therefore 
be given to the sociocultural and political aspects 
of aid. 

Burden of aid. Significant as the benefits from 
aid may be, they are not without their costs. For 
the donor there is a burden in mobilizing aid; for 
the recipient, a burden of debt servicing and re- 
payment. 

Although the mobilization of aid is not limited 
by a physical incapacity on the part of donor coun- 
tries to provide the needed resources, there is a 
problem of paying for the transference of the re- 
sources. Unless inflation is to be endured, it is nec- 
essary to withdraw—by taxation or government 
borrowing—the money equivalent of the resources 
to be transferred. 

Moreover, the donor nation may be intensifying 
the difficulty of maintaining equilibrium in its balance
of payments, unless it requires that its loans
and grants be expended in the country of origin.
If such “tied” aid is necessary out of balance of 
payments considerations, and the alternative to 
tied funds would be no aid at all, then the larger 
amount of tied aid is to be preferred, even though 
the condition that the aid be expended in the donor 
country may diminish the value of the aid to the 
recipient country as compared with freedom to 
import from a third country. 

The willingness of an individual country to bear 
the burden of providing resources out of public 
funds may depend on the willingness of other coun- 
tries to share in the burden. An increase in the total 
flow of aid becomes politically more feasible if 
there is a belief that the burden is being shared 
equitably. But there is no unambiguous formula 
for determining what constitutes an equitable dis- 
tribution among the contributors of aid. A refined 
proposal would have the developed countries con- 
tribute according to a progressive tax schedule 
applied to the country’s per capita real gross na- 
tional product. Perhaps the most practicable solu- 
tion is to accept the commonly advocated proposal 
of a flat-rate levy on national income—say, one 
per cent of the national income of the donor 
nations.
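
The two burden-sharing rules mentioned above can be compared with a small numerical sketch. Everything in it is an assumption made for illustration: the three donors, their income and population figures, the bracket thresholds, and the marginal rates are hypothetical, and a single income figure stands in for both national income and gross national product.

```python
import math

# Hypothetical donors: national income in billions, population in millions,
# so per capita income comes out in thousands. Figures are illustrative only.
DONORS = {
    "A": {"income": 500.0, "population": 180.0},
    "B": {"income": 80.0, "population": 50.0},
    "C": {"income": 20.0, "population": 25.0},
}

# Hypothetical progressive schedule on per capita income (thousands):
# (upper bound of bracket, marginal rate applied within the bracket).
BRACKETS = [(1.0, 0.00), (2.0, 0.01), (math.inf, 0.02)]


def flat_levy(income, rate=0.01):
    """Flat-rate contribution: a fixed share of national income."""
    return income * rate


def progressive_levy(income, population, brackets=BRACKETS):
    """Contribution under marginal rates applied to per capita income."""
    per_capita = income / population
    levy_per_head = 0.0
    lower = 0.0
    for upper, rate in brackets:
        if per_capita > lower:
            levy_per_head += (min(per_capita, upper) - lower) * rate
        lower = upper
    # Scale the per-head levy back up to the national total.
    return levy_per_head * population


if __name__ == "__main__":
    for name, d in DONORS.items():
        flat = flat_levy(d["income"])
        prog = progressive_levy(d["income"], d["population"])
        print(f"{name}: flat={flat:.2f}  progressive={prog:.2f}")
```

Under these assumed numbers the progressive schedule exempts the poorest donor entirely, while the flat levy asks every donor for the same share of income. The absence of an agreed schedule is precisely the ambiguity the text points to.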

For the recipient country the burden of foreign 
aid depends upon the country’s capacity to service 
its external obligations. When the return flow of 
interest and of amortization payments exceeds the 
inflow of new capital assistance, the country con- 
fronts a transfer problem in servicing the debt. It 
will have to generate an export surplus to cover 
the net outward transfer of amortization on capital 
account and of interest payments on current ac- 
count. This entails a transformation of the econ- 
omy such that resources can be reallocated to the 
sector producing exports or import-substitutes. To 
accomplish this the country may have to endure 
internal and external controls or experience ex- 
change depreciation. The adverse effects of these 
measures of balance of payments adjustment might 
be interpreted as the indirect costs of foreign aid, 
to be added to the direct costs. 
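
The transfer condition described above reduces to simple arithmetic, sketched here under stated assumptions. The function names and the sample amounts are hypothetical and are not drawn from the article.

```python
def net_transfer(new_inflow, interest, amortization):
    """Net resource flow to the recipient in a given year.

    Positive: new capital assistance exceeds debt service.
    Negative: the return flow exceeds the inflow.
    """
    return new_inflow - (interest + amortization)


def required_export_surplus(new_inflow, interest, amortization):
    """Export surplus needed to cover any net outward transfer."""
    return max(0.0, (interest + amortization) - new_inflow)


if __name__ == "__main__":
    # Hypothetical years: inflow, interest, amortization (same currency units).
    for inflow, interest, amort in [(100.0, 30.0, 50.0), (40.0, 30.0, 50.0)]:
        print(
            inflow,
            net_transfer(inflow, interest, amort),
            required_export_surplus(inflow, interest, amort),
        )
```

In the second assumed year the return flow exceeds the inflow, so the country would need an export surplus equal to the shortfall. That is the point at which the indirect costs of adjustment discussed in the text begin to arise.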

The direct cost of servicing capital assistance, 
however, is not a burden in itself. Even though part 
of the increased production from the use of foreign 
capital has to be paid abroad in interest—and this 
is a deduction that would not be necessary if the 
savings were provided at home—nonetheless the 
country has benefited from additional investment. 
What does matter in mitigating the burden of the 
debt is the country’s ability to avoid the necessity 
of adopting restrictive measures in order to find 
sufficient foreign exchange for the remittance of 
the external service payments. To minimize these 
indirect costs the country’s development program 
must give due consideration to the debt-servicing 
capacity of the country. 

This becomes part of the problem of selecting 
appropriate investment criteria. To provide for ade- 
quate servicing of the foreign debt the capital 
should increase productivity sufficiently to yield an 
increase in real income greater than the interest 
and amortization charges. If this is done, the econ- 
omy will have the capacity to raise the necessary 
funds—either through a direct commercial return 
or greater taxable capacity. Moreover, to provide a 
sufficient surplus of foreign exchange to avoid a 
transfer problem, the capital should be utilized in 
such a way as to generate a surplus in the other 
items of the balance of payments equal to the trans- 
fer payments abroad. If it is realized that the ability 
to create a sufficiently large export surplus depends 
on the operation of all industries together, not 
simply on the use made of foreign investment, it 
is then apparent that a project financed by foreign 
aid need not itself make a direct contribution to the 
balance of payments. Instead of such a narrow bal- 
ance of payments criterion, the basic test for the 
allocation of capital aid is simply that it should be 
invested in the form that yields the highest social 
marginal product. As long as capital is distributed 
according to its most productive use and the ex- 
cess spending associated with inflation is avoided, 
the necessary export surplus can be created in- 
directly. The essential point is that the allocation 
of foreign aid according to the criterion of produc- 
tivity will also be the most favorable for debt serv- 
icing, since it maximizes the increase in income 
from a given amount of capital and thereby 
contributes to the growth of foreign exchange 
availabilities.

The problem of debt servicing will also be eased 
for the aid-receiving country when capital assist- 
ance is offered at lower interest, for longer terms, 
and with more stability and when the creditor coun- 
try follows a more expansionary domestic policy 
and a more liberal trade policy.
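
How much lower interest and longer terms ease the servicing burden can be illustrated with the standard level-payment (annuity) formula. This is a sketch only: the article does not specify a repayment pattern, and the principal, rates, and maturities below are hypothetical.

```python
def annual_debt_service(principal, rate, years):
    """Level annual payment (interest plus amortization) on a loan.

    Uses the standard annuity formula; a zero rate falls back to
    straight-line repayment of principal.
    """
    if rate == 0:
        return principal / years
    return principal * rate / (1 - (1 + rate) ** -years)


if __name__ == "__main__":
    principal = 100.0  # hypothetical loan amount
    hard = annual_debt_service(principal, 0.06, 10)  # assumed "hard" terms
    soft = annual_debt_service(principal, 0.03, 30)  # assumed "soft" terms
    print(f"hard terms: {hard:.2f} per year")
    print(f"soft terms: {soft:.2f} per year")
```

With these assumed terms the softer loan cuts the annual payment to well under half, which reduces the export surplus the recipient must generate in any one year.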

Need for further research. Foreign aid has been 
extended mainly in an ad hoc fashion in response 
to immediate policy situations and without the op- 
portunity for a substantial amount of prior research. 
Its practice, however, has suggested several areas 
of basic research that merit considerably more in- 
vestigation. Of fundamental importance is the need 
for clear criteria in determining aid requirements, 
its optimal allocation among recipients, and its 
most effective use. 

Since foreign economic aid is only a single 
component of the total flow of resources from rich to 
poor countries, it is important to investigate more 
thoroughly the relationships between capital assist- 
ance, private foreign investment, and international 
trade. Under what conditions can loans and grants 
stimulate private foreign investment? And under 
what conditions are they competitive? To what ex- 
tent are the benefits of foreign aid being offset by 
adverse foreign trade policies? Might not more be 
gained from policies that stabilize the poor coun- 
tries’ foreign exchange earnings than from addi- 
tional capital assistance? 

Insofar as the effective use of foreign aid may 
depend on social and political reforms, it would 
be instructive to examine more systematically the 
sociocultural and political aspects of foreign aid. 
This should serve to correct the tendency to over- 
emphasize capital productivity and technical pro- 
ductivity at the expense of noneconomic variables 
that are strategic in the development process. [See 
ECONOMIC GROWTH, article on NONECONOMIC AS- 
PECTS.] 

The impact of technical assistance might also 
be greater if additional research were undertaken 
on the special problems of adapting techniques of 
production to the special circumstances of the poor 
countries. Not only should research be directed 
toward new scientific advances of particular utility 
to underdeveloped economies, but the problems of 
diffusing the new technology and gaining its ac- 
ceptance in the recipient society must also be 
solved. 

Further, attention should be given to the possible 
means of coordinating the many different actual 
and potential sources of foreign assistance and to 
feasible arrangements for giving greater continuity 
to the flow of aid. 

Finally, on the basis of accumulated experience 
in a variety of countries it would be useful to have 
a number of case studies of foreign aid as a basis 
for a comparative study of the conditions under 
which different types of aid have proved effective. 


GERALD M. MEIER 


[See also COMMUNITY, article on COMMUNITY 
DEVELOPMENT; ECONOMIC GROWTH; PLANNING, ECONOMIC, 
article on DEVELOPMENT PLANNING; TECHNICAL 
ASSISTANCE.] 
FOREIGN POLICY 


For half a century the making of foreign policy 
has been studied in Western democratic countries 
as a field of specialization separate from the mak- 
ing of public policy in general. This intellectual 
differentiation rests upon implicit and explicit 
assumptions about the way the foreign policy field 
differs from other areas of public policy. The lead- 
ing assumption is that foreign policy is “more im- 
portant” than other policy areas because it concerns 
national interests, rather than special interests, 
and more fundamental values. A second assump- 
tion builds upon the first: since foreign policy 
questions evoke a different political response, it is 
assumed that political institutions function differ- 
ently when they confront foreign policy issues. In 
addition, of course, different institutions are also 
involved, in that some governmental agencies are 
concerned exclusively or substantially with foreign 
policy. There is growing uncertainty among politi- 
cal scientists, in the United States at least, as to 
the validity of these assumptions. There is also 
considerable skepticism concerning the theoretical 
value of treating foreign policy processes as analyt- 
ically distinctive—a skepticism that will undoubt- 
edly draw all the different public policy research 
fields closer together in the future. Nevertheless, 
the present state of our knowledge reflects, for 
better or worse, a set of beliefs about the unique- 
ness of foreign policy processes within the political 
order.

This orientation toward foreign policy making 
can be traced to World War I and the events 
leading up to it—although it might have developed in 
the United States in any case, since these were 
America’s first years of great-power status and a 
period in which increasing attention was given to 
foreign relations. In the United States the war ex- 
perience convinced some people that isolationism 
was no longer an appropriate response to interna- 
tional problems and that the United States must 
remain actively involved in the affairs of the world. 
Foreign policy became, for these people, the most 
important area of public policy, and responsible 
international participation the only viable political 
stance. And in both England and the United States 
the war gave rise to movements to end the tradi- 
tional secretive practices of the world’s statesmen 
and diplomats, movements that found concrete 
expression in Wilson's plea for “open covenants of 
peace, openly arrived at” and in the conference 
diplomacy of the League of Nations. Points One 
and Fourteen, and Wilsonian philosophy generally, 
were major building blocks in the progressive 
“democratization” of foreign policy. 


Logically, this struggle for greater public parti- 
cipation in, and control over, foreign relations 
could have resulted in the assimilation of foreign 
policy to the working principles and practices of 
everyday politics in democratic countries. That it 
has not may be attributed to the prior claim for the 
“primacy of foreign policy,” to the ineluctable re- 
quirements of secrecy imposed by the demands of 
national political and military strategies, and to the 
small amount of interest in foreign policy on the 
part of the general public. 

Nevertheless, the study of foreign policy making 
was permanently affected by the liberal, rationalist, 
democratic ideology, with its twin convictions that 
the people, once they were educated, would make 
liberal and wise decisions in foreign policy and that 
man could improve his political institutions and 
their policy outputs if he would only think about 
them intelligently. These convictions explain the 
focus of attention, down to the present day, on the 
search for “better” ways to handle foreign policy, 
both organizationally and politically, so that public 
values and common sense might be more freely 
admitted into policy-making circles, and “better” 
policies (that is, responsible international partici- 
pation) might eventuate. 

The bulk of scholarly enterprise in the field of 
foreign policy making lies along one or the other 
of these two lines of concern, which I shall sum- 
marize here as “rationality” and “ideology.” By 
“rationality” I mean a concern for the organization 
and structure of policy making which generally 
(but by no means exclusively) reflects a belief 
that certain policy-making relationships or arrange- 
ments can be found that are for one reason or 
another more “rational” than any others—that is 
to say, better designed to implement the values and 
decisions of intelligent and responsible foreign pol- 
icy makers. By “ideology” I mean a concern for 
democratic control of foreign policy making which 
generally (although, again, not exclusively) reflects 
a belief that foreign policy decisions should be 
made by politically responsive or responsible indi- 
viduals or groups and that the foreign policy of a 
democratic society can be wise and strong only to 
the extent that it commands the understanding and 
support of the public. Cutting across these two 
major approaches to the subject of foreign policy 
making are three different levels of analysis: the 
individual, the institutional, and the systemic. 
There are thus six major classifications, or clusters, 
into which the political scientist’s study of foreign 
policy making may be conveniently divided for 
explanatory purposes. We shall look at each of 
these in turn, largely in the American context, 
since by its intellectual antecedents and history 
this subject has been of interest predominantly to 
American scholars. Having done this, we will be 
in a better position to identify and discuss research 
on foreign policy making in other countries and 
research through cross-national comparisons. 


The rational approaches 


Rational-institutional approach. The central 
focus of the study of foreign policy making has 
long been on the institutions of government that 
are chiefly responsible for the formulation and 
administration of foreign policy. And in the present 
period, as in the past, the motivating force seems 
to be the uncovering of more efficient, effective 
ways of organizing the policy-making machinery 
so that it rationally serves the interests or purposes 
of the analyst, whether these purposes are econ- 
omy, speed of decision, singularity of authority, or 
coherence in execution. The executive departments 
and agencies—the Executive Office of the Presi- 
dent, the Department of State, the Foreign Service, 
the Department of Defense, the Central Intelligence 
Agency, the U.S. Information Agency, the Peace 
Corps—have been studied and, in many cases, re- 
studied from different perspectives in the various 
historical periods that mark America’s development 
as a major power in world affairs. And certain of 
the functions of these institutions have come in 
for some scrutiny—diplomacy, for one, because the 
work of the diplomat is involved in so many of the 
processes of foreign policy making and execution, 
and intelligence, for another, because of the steady 
enlargement in the scope of covert political 
operations. Some examples of studies in this area are the 
works by Ransom (1958), Ilchman (1961), and 
McCamy (1964) on the executive side and Carroll 
(1958) on the congressional side. 

In recent years, also, growing attention has been 
paid to the planning function in foreign affairs; 
this has been in response to the fear that without 
the assignment of adequate institutional responsi- 
bilities for thinking about the future, no one will 
have the time to give advance thought to emerging 
problems and to their avoidance or solution. (Cur- 
rent concern with foreign policy planning repre- 
sents, perhaps better than anything else, the faith 
that political problems, like scientific or engineering 
problems, can be solved by the timely application 
of rational thought and organization.) 

Because of the complexity of the institutional 
framework for making foreign policy and the fre- 
quency with which the institutions are altered 
either formally or through usage, the rational- 
institutional outlook has also focused on interagency 
coordination as a technique for rationalizing the 
continuously unwieldy processes of foreign policy 
formulation and agreement within the government. 
Actually, the phrase “administration of foreign af- 
fairs” refers as much to the coordination of power- 
ful and nearly autonomous foreign policy agencies 
both at home and “in the field” as it does to the 
processes of policy making and execution within 
these agencies (Brookings Institution 1960). The 
development of high-level coordinating mechanisms 
like the National Security Council has drawn the 
interest of scholars, government officials, and mem- 
bers of Congress (see Hammond 1961) who have 
feared that the coordinating institutions may take 
over important policy-making functions. 

Rational-individual approach. The development 
of the social sciences, especially of psychology, 
under the impetus of the political and social crises 
of World War II resulted in a new orientation 
toward the individual as the organizing point for 
political research. In opposition to what was felt 
to be the empty formalism of traditional institu- 
tional and administrative analyses, some students 
of foreign policy making shifted their focus to in- 
dividuals and the ways in which they perceived the 
internal and external factors that came to bear on 
their institutional settings. The major products of 
this new line of inquiry have been decision-making 
studies (e.g., Snyder et al. 1962) that attempt to 
understand both the factors in human choice and 
the logic of administrative structure in the foreign 
policy field by examining the processes of individ- 
ual and group decision making in institutional 
settings. 

Recent developments in organization theory, al- 
though based on research in other types of institu- 
tions (e.g., Cyert & March 1963), have considerable 
relevance to the study of behavior in foreign policy 
agencies and should be an important influence on 
future research. The behavior of individuals when 
they occupy policy-making roles is central to the 
analysis of decision making, in game models as 
in the real world, and it explains the continuing 
interest in biographies and memoirs of policy of- 
ficials. Students of foreign policy look to political 
biography, both conventional and psychoanalytic 
(see George & George 1956), for life experiences 
that help to illuminate the relations between indi- 
viduals possessing particular personality types, 
backgrounds, and the like and the operating norms 
of particular policy-making offices. 

Rational-systemic approach. At still another 
level, the search for more rational arrangements of 
organization and structure in the foreign policy 
field has led scholars to study the patterned inter- 
actions of individuals and institutions that consti- 
tute the universe of foreign policy making. These 
studies of interactions or systems cover dyadic 
relationships between political actors as well as 
more inclusive studies of how men and institutions 
handle problems or produce policies. And if the 
focus, on the individual level, is on reality as 
perceived by persons in policy-making roles, here 
the focus may be said to be on the objective reality 
created by the interactions of two or more political 
institutions. 

The interactions of individuals and institutions 
involve accommodations of power and interest that 
we customarily associate with politics rather than 
administration. This is the case whether the in- 
dividuals or institutions stand in some hierarchical 
relationship to one another or are relatively auton- 
omous, deriving their power independently from 
other sources. Yet much of the scholarly work at 
the systemic level overlooks these sources of con- 
flicts of power and interest, approaching the sub- 
ject instead as if there were some arrangements 
of the foreign-policy-making institutions of govern- 
ment that were rationally or optimally designed to 
achieve a superior kind of foreign policy. Some 
commonly studied partial systems of foreign policy 
making, described in the phrases “executive-legis- 
lative relations” and “bipartisanship,” are often 
treated in this way. The three-cornered relationship 
of the president, his party in Congress, and the 
opposition party in Congress has often been 
examined (e.g., Woodrow Wilson Foundation 1952) 
to see if a way could be discovered to remove the 
friction in the system that is presumed to interfere 
with the smooth functioning of a unified, national- 
interest-oriented policy machinery. [See BIPARTI- 
SANSHIP.] 

Recent scholarship, however, has begun to 
change this emphasis on the structural aspects of 
foreign policy interactions. Instead of attempting 
to eliminate or bypass the political features of 
foreign-policy-making systems by emphasizing ad- 
ministrative arrangements, contemporary scholars 
have been concentrating precisely on these features 
(Huntington 1961; Robinson 1962; Schilling et al. 
1962). They have begun to study the interactions 
of individuals within and between foreign policy 
agencies as sets of bargaining relationships, which 
are similar to the bargaining processes that take 
place in other areas of public and political life 
where competitions of power and of interest are 
more overt and accepted. This bargaining orienta- 
tion is thus less committed to the “rational” aspect 
of foreign policy making; its normative implica- 
tions tend rather to be those of science in general 
—that is, one needs to understand how a system 
operates before one can calculate the consequences 
of attempting to make any given change in it. 


Another body of literature that might appropri- 
ately be described in this category consists of “case 
studies” that reconstruct the way specific issues, 
policies, or events were handled by the 
governmental participants in the foreign-policy-making 
process. These are more likely to be historical-descriptive 
(e.g., Jones 1955) than analytical in nature, 
and they find their rationale in the assumption that 
the mysteries of the governmental process will 
yield to the accumulated weight of information on 
how the system has actually worked in specific 
instances in the past. 


The ideological approaches 


The second major approach to the study of for- 
eign policy making I have called “ideological,” be- 
cause in one form or another it is concerned with 
the democratization of foreign policy—with the 
possibilities and procedures for introducing a sub- 
stantial measure of public values into every stage 
in the formulation of foreign policy. This approach 
starts from the general premise that the foreign 
policy area has different roots and different re- 
quirements of interest and support in the political 
system than do other policy areas. It then branches 
out in two directions. One line of inquiry, based 
explicitly on the primacy of the foreign policy area 
over other policy areas, seeks to alter the bases of 
interest and support in order to optimize public 
knowledge of, participation in, and control over, 
foreign policy decision making. A second line of 
inquiry seeks to discover what the different roots 
and requirements of interest and support are, in 
order to understand the operative political proc- 
esses, sometimes for the purpose of facilitating 
democratic control and sometimes for more general 
social science purposes. In both of these lines of 
inquiry, the central objects of study are most often 
either nongovernmental groups or the interactions 
between the nongovernmental and the official el- 
ements. 

Ideological-institutional approach. Just as the 
foreign policy institutions of government have been 
the central focus of study for those seeking better 
modes of organizing the foreign policy machinery, 
so the institutions of nongovernmental foreign pol- 
icy activity have come in for a large share of the 
attention of those interested in a greater democratic 
perspective for foreign policy. In this context, for- 
eign policy has benefited from an inclination in 
political science generally to explore the group basis 
of politics; thus, the role of organized interest 
groups in foreign policy making has become an 
important area of study (Cohen 1959). Much of 
this research has been limited to the positions, 
interests, and demands of particular organizations, 
and thus it has shared in the empty formalism of 
some of the rational-institutional studies. But in- 
creasingly, interest-group activities are being ex- 
amined at the points where they intersect the 
domains of other participants in the process of 
foreign policy making: that is, they are more and 
more being treated as parts of a political system. 
And attention is also being turned to skill groups 
of a comparatively unorganized nature, such as 
scientists and “academic strategists,” which are 
playing a larger role in the technical complexities 
of foreign policy making in a nuclear environment 
(Gilpin & Wright 1964). 

The widespread conviction that a sound and 
stable foreign policy requires an interested, in- 
formed, intelligent public opinion has led to ex- 
tensive, although far from satisfactory, investiga- 
tions of the major channels through which foreign 
policy attitudes and information are transmitted 
to public audiences. The role of the universities as 
factors in the development of interest in, and 
knowledge about, foreign affairs has been a subject 
of study for some years, although precise measures 
of the impact of educational institutions at all 
levels on the foreign policy thinking of students are 
still lacking. The media of mass communication 
have been intensively scrutinized for many years 
(see Hero 1959), although their impact on foreign 
policy attitudes and information is still largely a 
matter of intelligent speculation (guided by deep 
concern) rather than of demonstrable fact. 

It was noted above that most of the ideologically 
oriented studies of foreign policy making were non- 
governmental in nature. An important exception 
concerns the treatment of the U.S. Congress. Part 
of the study of Congress as a foreign-policy-making 
institution is approached from the point of view 
of clarifying its internal organization and proce- 
dures and of understanding its interactions as part 
of the governmental process; hence, it was dis- 
cussed under the rational-institutional heading. 
But important parts of this study of Congress are 
approached from the point of view of understand- 
ing and improving its representative role in the 
foreign policy field, and so they merit inclusion 
here also. This ideological perspective on the Con- 
gress is concerned with that body as a major object 
of public representation on foreign policy matters 
and as a channel through which public preferences 
and aspirations are made a constituent ingredient 
of foreign policy (Dahl 1950). 

Ideological-individual approach. One of the 
most important and durable approaches to the 
subject of popular participation in foreign policy 
making has been through public opinion studies— 
through sampling and aggregating the attitudes of 
ordinary citizens. These studies find their rationale 
in the belief that if foreign policy were to be formu- 
lated both democratically and wisely, the average 
man would have to be interested, well-informed, 
and enlightened. And in order to discover the condi- 
tions of public interest, information, and attitudes 
at any point in time, as well as possible changes in 
these conditions over time, it has been necessary 
to explore the many dimensions of public opinion 
on foreign policy. In fact, the phrase “public opin- 
ion and foreign policy” generally refers to the 
foreign policy outlook of ordinary citizens rather 
than of those groups or publics that might be polit- 
ically more effective in presenting their views. 

After thirty years of public opinion research, the 
data on public attitudes toward foreign policy ques- 
tions have reached mountainous proportions and 
have formed the bases for many studies of the 
public opinion process, both in its political and its 
psychological dimensions (see especially Almond 
1950; Scott & Withey 1958; Deutsch & Edinger 
1959). Normatively, these data, which continually 
show the large majority of the population to be 
uninterested and uninformed on foreign policy, 
have been used to justify and guide governmental 
and private efforts to capture public attention and 
to enlighten the public on foreign policy matters. 

A growing feeling that data drawn from samples 
of the general population have already yielded 
whatever significance they contain is leading to new 
directions in research on public opinion. Scholars 
are beginning to use opinion data more imagina- 
tively, seeking to relate the foreign policy opinions 
of attentive and interested individuals to more 
formal steps in governmental foreign policy mak- 
ing. Similarly, students of public opinion are focus- 
ing their attention more narrowly on the leader- 
ship elements among nongovernmental groups—on 
those persons who, by virtue of their special posi- 
tions or qualifications, have an especially powerful 
role in shaping the views and the policies of public 
groups. [See PUBLIC OPINION; see also Rosenau 
1963.] 

Ideological-systemic approach. Careful study 
of the processes of public participation and control 
in foreign policy making has led rather directly to 
the systemic level, since it has been readily ob- 
served that any effort to explore the influence of 
public factors on governmental decision makers 
almost automatically compels the investigator to 
study the patterns of interaction between public 
and governmental actors. The broadest studies of a 
systemic variety are, once again, case studies of 
the interactions of the governmental and the 
nongovernmental spheres as they have actually taken 
shape in a particular policy-making circumstance 
(e.g., Cohen 1957; Bauer et al. 1963) or as they 
may be traced out through a detailed analysis of 
how a single nongovernmental group or institution 
confronts the governmental process (Cohen 1963). 
Still lacking in this effort to understand how the 
public influences foreign policy making, however, 
are studies at the decision-making points to estab- 
lish with some accuracy the constraints and op- 
portunities that foreign policy officials read into the 
public mood: too often the subject is approached 
as if “the public” were always a source—if not the 
source—of constraints on wise governmental ini- 
tiative and decision making in foreign policy. 

A more circumscribed but still striking pattern 
of interactions that has been studied from the per- 
spective of “who is to control” is the civil-military 
relationship. For many years the main concerns of 
these studies were the devices and techniques for 
maintaining civilian—hence, ultimately, public— 
control over military personnel and institutions, 
which were generally presumed to pose a threat to 
democratic government by their very existence, if 
not by their hidden aspirations to power. This sim- 
plistic viewpoint has given way in recent years to 
more complex approaches to the question of civil 
versus military contributions to public policy, since 
the military have perforce been accorded a larger 
role even while civilian authority has been substan- 
tially strengthened (Stein 1963). 


Research needs 


The persistent and increasingly imaginative ex- 
ploitation of the approaches to the study of foreign 
policy making discussed above has taught us a great 
deal about the subject, but in a rather segmented 
way. This review points up how little there is in 
the way of explicit general models of the foreign- 
policy-making process—or, more precisely, proces- 
ses, since there is no single combination of forces 
that is always operative and no single path that is 
followed in every instance. While much necessary 
work remains to be done in each of the above clas- 
sifications before we can lay claim to confident 
understanding of how the parts of the foreign- 
policy-making system operate, such work would be 
helped immeasurably by more elaborate theories 
governing the whole system of foreign policy 
making.

In addition to a dearth of general theories, and 
perhaps as a contributing factor to that dearth, 
there has been very little comparative analysis in 
the study of foreign policy making. Rather, our 
analyses of institutions, individuals, and even sys- 
tems have been singular and for that reason not 
always cumulative. At the present time we cannot 
have very great confidence in the generality of our 
hypotheses and insights. Comparisons are possible 
between similar institutions or similar policy-mak- 
ing functions in different countries or different 
political settings, or with respect to the same in- 
stitutions or functions operating in different his- 
torical periods. It is also possible to compare the 
behavior and interactions of participants across a 
representative range of policy issues arising in any 
one political system. Such comparative analyses 
seem to be a very promising means of formulating 
a generally applicable model of foreign policy 
making. 

We noted at the beginning of this article that the 
subject of foreign policy making has been most 
extensively pursued in the United States. Much less 
work—and much less sophisticated work—has 
been done on foreign policy making elsewhere in 
the world. How does one account for this situation? 

In the first place, the historical circumstances 
and the political setting that have inspired this 
research in the United States have not been present 
to any important extent in other countries. Even in 
Great Britain, where the movement for open diplomacy
had some force during World War I, the
stronger political traditions of executive discretion 
in foreign affairs have discouraged an open ex- 
ploration of foreign policy making, especially where 
one could detect in that exploration some aspira- 
tions for greater public participation. In countries 
that did not undergo political experiences compa- 
rable to those of the United States, the political cul- 
ture has directed the attention of scholars to other 
kinds of problems. 

Second, interest in modern methods of social 
science research has developed more slowly else- 
where in the world than in the United States. Apart 
from public opinion research, which has had an 
extensive development, research on foreign policy 
making that has been done in other countries has 
usually been historical-descriptive in nature and
can be characterized as rational-institutional. With
few exceptions (e.g., Frankel 1963), the major 
works on foreign policy making in countries other 
than the United States have dealt with the formal 
administrative apparatus for the conduct of foreign 
relations. 

Third, the lack of a tradition of comparative 
analysis in this field has meant that even American 
scholars have been slow to turn their attention
to aspects of foreign policy making in other countries.
Quality studies dealing with individual countries
have only recently started to appear (e.g.,
Speier & Davison 1957; Bishop 1961; Mendel 1961); 
genuinely comparative studies have still to be 
made. 

There is no reason to believe that our growing 
knowledge of foreign-policy-making processes in 
the United States will help us to understand what 
these processes are like in other countries: we have 
long since learned that propositions drawn from the 
American political system are of limited relevance 
to other political contexts. The comparative study 
of political systems is now expanding rapidly; the 
comparative study of foreign-policy-making proc- 
esses could be made an important part of that en- 
deavor. 

BERNARD C. COHEN 


[See also CIVIL-MILITARY RELATIONS; INTERNATIONAL
RELATIONS; MILITARY POLICY; NATIONAL INTEREST;
NATIONAL SECURITY; PUBLIC POLICY; SCIENCE, article
on SCIENCE-GOVERNMENT RELATIONS.]
FORGETTING


It seems quite unnecessary to be concerned with 
a definition of “forgetting.” Each of us has had 
innumerable experiences, often painful, with this 
phenomenon, and we know that, in general, the 
longer the period between the point at which we 
learn something and the point at which we try to 
remember it, the less likely we are to remember 
correctly. The growth of living organisms is always 
accompanied by deterioration processes. Forgetting 
is the behavioral deterioration process that may be 
likened to organic deterioration and is in opposi- 
tion to the growth process called “learning.” We do 
not, in fact, know much about the organic or 
physiological correlates of forgetting, and it is not 
easy to understand why, in the evolutionary proc- 
ess, forgetting should remain so powerful a process 
in man when, from almost any point of view, it is 
unadaptive. It is almost as if we have “room” for 
only so much learning, and when more learning 
is to be added than the room will hold, some of 
the old learning must go. Indeed, in a manner of 
speaking, the theories of forgetting are based on 
this notion. 

Learning must occur before forgetting can oc- 
cur. This fact, seemingly trivial, is the key to an 
understanding of the processes causing forgetting 
and must also be the foundation of any applied 
program aimed at retarding forgetting. 


Sources of evidence 


A review of the experimental studies of forget- 
ting can but lead to the impression that the 
evidence on which we base our knowledge of 
forgetting comes from quite a restricted set of 
materials and procedures, and thus our generali- 
zations about forgetting may be severely restricted. 
It is a fact that most of our data come from what 
is classically known as rote-learning tasks, and 
the subjects for experimentation have been pre- 
ponderantly college students. The study of the 
forgetting of motor skills has lagged; the memory 
for ideas, concepts, and principles has never been 
systematically investigated. Yet, when probes into 
these areas have been made, the results have given 
us no reason to believe that the basic laws are dif- 
ferent from those derived from rote-learning tasks. 
The extensive use of college students—whose 
use has not been dictated by theoretical concern 
but has been largely a matter of convenience— 
may make us wary of generalizing these conclu- 
sions to all ages and ability levels. But again, the 
evidence available does not indicate that any serious
distortions in the laws of forgetting are attendant
upon the particular subjects that have been
used in experimental work. 

Nonsense syllables and nonwords. We must 
look to the tasks and materials from which most 
of our contemporary knowledge of forgetting is 
derived. The materials are simple verbal units. 
These may be words, or they may be a sequence 
of letters not found in English dictionaries. Calling 
these latter units “nonsense syllables,” or “nonsense 
material,” has produced considerable misunder- 
standing. It has often been assumed that a non- 
sense syllable (for example, RUH) is an artificial 
unit which removes the study employing such units 
from contact with real-life situations. It might be 
argued that experimentation is always, in fact, arti- 
ficial (a vacuum does not exist in nature) and that 
it is only by such procedures that fundamental 
laws are obtained, because these procedures allow 
for the control of variables. In fact, the original 
use of nonsense syllables by Ebbinghaus in his 
pioneering studies on memory was, to a large 
extent, based on this premise. But there are three 
other facts that must be set forth and that may be 
more critical than an argument based on a philos- 
ophy of science. 

To say that a nonsense syllable is not related 
to real life is simply erroneous. A word common 
to an adult must be, in essence, a nonsense sylla- 
ble to a child before its meaning is acquired. The 
use of verbal units not in dictionaries may be 
tapping the same or similar processes that are 
involved when the child changes from a nonverbal 
organism to an organism with a verbal repertoire 
that can be correctly applied. Or a nonsense syllable
may be likened to a word in a foreign language,
and thus to the college student it is anything
but removed from real life.

The second point is that a derogation of the 
use of nonsense syllables assumes a dichotomy 
between what is meaningful and what is nonsense. 
But measurements of meaning, by any of many 
techniques, do not support such a schism. There
is a continuum between words and nonwords. 
Many so-called nonsense syllables have greater 
meaning than some words. Nonsense words, like 
“real” words, have varying affective connotations. 
Indeed, it could probably be demonstrated that 
affective reactions to such nonsense syllables as 
WOR are much more intense than those to such 
common words as THE. A nonsense unit, there- 
fore, should mean only a word that does not appear 
as an entry in a dictionary, but this distinction 
per se between words and nonwords appears to 
have little psychological relevance. 

The third reason for not viewing nonwords as
something significantly apart from words is probably
the most critical when reference is made to
the use of these materials only in the study of 
forgetting. This reason is simply that the evidence 
indicates that words and nonwords “behave” the 
same as far as forgetting is concerned. This point
is elaborated later in the discussion.

Manner of presenting verbal units. In the bulk 
of the studies on forgetting, verbal units are 
formed into lists, so that in a single task the 
subject learns several items. The items might be 
formed into a serial list in which the subject must 
learn to reproduce not only the items but also a 
particular order of the items. A second widely 
used type of list is the paired-associate list. The 
subject is presented a series of pairs and is re- 
quired to learn to reproduce the second member 
of each pair when the first member is shown alone. 
Acquiring the order of the pairs is not a part of 
the task; indeed, the investigator usually prevents 
his subject from learning the order by presenting 
the pairs in a different order from time to time. 
The paired-associate task is a counterpart of the 
task of learning a vocabulary of a foreign language, 
in which the first member of the pair is a word in 
the learner’s native language and the second mem- 
ber is its foreign equivalent. 
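
The paired-associate procedure just described can be sketched in a few lines of Python. This is only an illustrative sketch: the word pairs, the function names `presentation_order` and `score_trial`, and the scoring rule are assumptions introduced here, not part of any published procedure. The sketch shows the two features the text emphasizes: the pairs are presented in a different order on each trial, and the subject is scored on giving the second member when the first is shown alone.

```python
import random

# Hypothetical pairs (native-language cue, foreign-language response),
# chosen only for illustration.
PAIRS = [("dog", "chien"), ("house", "maison"), ("book", "livre"), ("water", "eau")]

def presentation_order(pairs, rng):
    """Return the pairs in a fresh random order for one learning trial,
    so that the order of the pairs cannot itself be learned."""
    order = list(pairs)
    rng.shuffle(order)
    return order

def score_trial(pairs, answers):
    """Count the pairs whose second member was reproduced correctly
    when only the first member (the cue) was shown."""
    return sum(1 for cue, response in pairs if answers.get(cue) == response)

rng = random.Random(0)
for trial in range(3):
    # Show only the cues, in this trial's order.
    print([cue for cue, _ in presentation_order(PAIRS, rng)])
```

A serial list would differ only in that the order is held fixed and must itself be reproduced.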

Other tasks that have also been used include 
prose passages, sentences, and lists of words to be 
recalled in any order. But for reasons that need 
not be detailed here, at the present stage in the 
development of psychological research, the paired- 
associate task possesses properties that make it a 
superior vehicle for studying forgetting, and much 
of the discussion will be cast around this task. 
This does not imply that there is evidence that 
forgetting varies as a function of the task. Such 
comparisons are difficult to make, although insofar 
as they have been made there is no convincing 
evidence that the type of task is a critical variable 
in the study of forgetting. 

Rapid single-unit technique. Most of the re- 
search on forgetting has been with multiple-unit 
tasks (for example, several pairs of items in a 
paired-associate task). What must be considered 
a most amazing technical breakthrough was made 
in 1959 by Lloyd R. Peterson and Margaret J. 
Peterson. These investigators were able to devise 
a situation in which a single verbal unit was pre- 
sented for learning and the subsequent course of 
forgetting could be measured over a period of time. 
A syllable, such as RZL, was presented for study 
for two seconds. Then, in order to prevent the 
subject from rehearsing the syllable, he was given 
a neutral task (for example, counting) for a few
seconds. Finally, he was asked to recall the syllable 
originally shown to him. The startling fact is that 
lawful and reliable forgetting curves were obtained 
in which forgetting was essentially complete (the
subject could not reproduce the unit) after 20
seconds. The exploitation of this technique has
been, and is being, pursued by many investigators. 
The classical studies have used forgetting intervals 
of hours or days, and thus the sheer labor and 
mechanical problems in the performance of these 
studies were great. Although it is perhaps too early 
to say with assurance that the laws of forgetting 
observed under the Peterson-Peterson technique
are the same as those for classical studies, the 
evidence thus far points strongly to this conclu- 
sion. If this is true, the study of forgetting may 
proceed at a pace hitherto impossible with classical 
procedures. In the Peterson-Peterson technique,
seconds are comparable to hours or even days with 
the classical list method. 
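
A forgetting curve of the kind obtained with the single-unit technique is simply the proportion of trials on which the unit is recalled, tabulated by the length of the filled retention interval. The sketch below shows that tabulation. The trial records, the particular intervals, and the function name `proportion_recalled` are all invented for illustration and should not be read as data or procedure from the original study.

```python
from collections import defaultdict

# Hypothetical trial records: (retention interval in seconds, recalled correctly?).
# These values are invented for illustration and are NOT data from any study.
RECORDS = [(3, True), (3, True), (9, True), (9, False), (18, False), (18, False)]

def proportion_recalled(records):
    """Return {interval: proportion of trials recalled correctly},
    with intervals in ascending order."""
    totals = defaultdict(int)
    correct = defaultdict(int)
    for interval, recalled in records:
        totals[interval] += 1
        correct[interval] += int(recalled)
    return {interval: correct[interval] / totals[interval] for interval in sorted(totals)}

for interval, proportion in proportion_recalled(RECORDS).items():
    print(f"{interval:>2} s: {proportion:.2f}")
```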


Single-task studies of forgetting 


We may turn next to an examination of the 
reliable experimental facts in single-task studies 
of forgetting. A distinction must be drawn between 
single-task and multiple-task studies if we are to 
focus on the processes believed critical for an 
understanding of forgetting. 

The laboratory situation can be described as 
follows. A subject who has not previously served 
in a laboratory experiment involving verbal learn- 
ing is given a single list of verbal units to learn. 
These are normally presented over and over, each 
unit being exposed for a short interval of time 
(for example, two seconds), until the subject is 
just able to reproduce the list perfectly. Assume, 
further, that the subject returns 24 hours later and 
is asked to recall all of the units he can. We ask 
now about the general quantitative aspects of for- 
getting in such a situation and, in particular, what 
variables cause the amount of forgetting to vary. 

Amount forgotten. Over the 24-hour period, 
the subject will forget from 15 to 20 per cent of 
the units. Thus, if the list contained ten paired 
associates, we will expect about eight of these to 
be recalled correctly. Under the conditions specified 
above, this amount of forgetting remains fairly 
constant across a wide range of materials and 
tasks. Some gross measures may indicate a degree 
of forgetting as great as 30 per cent, but when 
refined techniques of measurement are used, the 
15 to 20 per cent loss will usually be found. With 
intervals longer than 24 hours, more forgetting 
will, of course, occur, but we do not know with 
any precision the rate of fall for many longer
intervals with various materials. Because of the 
relatively large amount of data available on forget- 
ting after 24 hours, we will use this interval as a 
basic reference interval. 
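
The arithmetic behind the "about eight of ten" expectation can be made explicit. The short sketch below applies the 15 to 20 per cent loss stated above to a list of any length; the function name `expected_recall` and its parameters are hypothetical conveniences, and the calculation assumes the loss applies uniformly to the whole list.

```python
def expected_recall(list_length, loss_low=0.15, loss_high=0.20):
    """Return (fewest, most) items expected to be recalled after 24 hours,
    assuming a proportional loss between loss_low and loss_high."""
    fewest = list_length * (1 - loss_high)
    most = list_length * (1 - loss_low)
    return fewest, most

fewest, most = expected_recall(10)
print(f"{fewest:.1f} to {most:.1f} of 10 pairs recalled")
```

For a ten-pair list the range runs from 8 to 8.5 items, which is the "about eight" cited in the text.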

Significance of the degree of learning. We may 
now ask about variables that influence the amount 
of forgetting during the 24-hour period. The facts, 
as viewed at the present time, provide a conclusion 
that seems not only to be opposite to that expected 
as research in an area proceeds but also opposite 
to that often expected according to “common 
sense.” For we can name only one variable that 
substantially influences the rate of forgetting in 
the single-task situation. Obviously, considerable 
clarification and elaboration are needed. 

Characteristics of the verbal units will produce 
enormous differences in the rate at which the list 
is learned. As might be expected, a list of common 
words is learned much more rapidly than a list of 
difficult nonwords. The differences are so large that 
even the casual reader will be convinced that this 
is true. Thus, it is clear that a list of five 3-letter 
words, such as CAT, PEN, BUS, FAR, and ELK, 
would be learned more rapidly than a list of five 
nonwords, such as RZL, DBQ, HFG, BJX, and 
PCR. Technically, we say the words have a higher 
level of meaningfulness than do the nonwords. But 
if both sets are learned to an equal degree, they 
will be equally remembered 24 hours later. The 
critical point is the “equal degree of learning,” 
because degree of learning is the one variable that 
does influence the amount of forgetting. This can 
be demonstrated in two ways. First, when the 
number of practice trials is varied within a given 
task, it will be found that the greater the number of 
practice trials, the less the forgetting. Second, when 
units within the task are examined, one may note 
that those units which have been given correctly 
many times during learning will prove to be re- 
membered better after 24 hours than those given 
correctly only a few times. So we say that if the 
degree of learning is equal for materials of widely 
different meaningfulness, they will be forgotten at 
the same rate. 

Meaning related to degree of learning. The 
foregoing conclusion might seem contradictory to 
the experience of many, because most of us have 
long believed that retention of more meaningful 
material is better than that of less meaningful 
material. But this belief may have arisen as fol- 
lows. If we spend an equal amount of time in 
learning the two kinds of material, retention of 
the list of words would assuredly be better after 
24 hours than the retention of the list of nonwords. 
Even if a subject spent twice as much time studying
the nonword list, retention of the words might
still be better. This is because rates
of learning such lists differ so markedly that even
after a longer study period on the nonwords the 
degree of learning will still be less than the degree 
of learning for words. But under such circum- 
stances, the differences in amount remembered 
after 24 hours must be ascribed to the degree of 
learning achieved and not to the differences in the 
material. Differences in the material produce dif- 
ferences in learning, but if the level of learning 
achieved for materials of different difficulty prior 
to the introduction of the retention interval is 
equivalent, we may expect essentially equal 
amounts of forgetting, which, as noted earlier, 
will be about 15 to 20 per cent over 24 hours. 

Similarity related to degree of learning. Let us 
consider another variable that produces wide dif- 
ferences in learning. If the number of duplicated 
letters among nonwords is varied within a list of 
paired associates, the greater the number of dupli- 
cated letters, the more difficult it is to learn the 
task. Technically, this is called “variation in intra- 
list formal similarity.” But again, if the level of 
learning achieved for a list with high similarity is 
equivalent to that for a list with low similarity, 
the amount of forgetting will not differ measurably 
over a 24-hour period. The same situation will 
prevail if we use a list of synonyms and a list 
of words with very little similarity of meaning. 
The former will be more difficult to learn, but dif- 
ferences in forgetting cannot be demonstrated if 
the levels of learning attained are equivalent. 

Individual differences in rate of learning. The 
general principle that degree of learning is the only 
critical variable in the single-task situation can be 
pursued one step further. There is evidence leading 
to the conclusion that there are no individual dif- 
ferences in forgetting. One proposition that has 
been ubiquitous in psychological research is that 
any set of measurements will produce a distribution
of reliable individual differences. Some individuals
are slow learners; others are fast learners.
Some have excellent depth perception; others have 
poor depth perception. Measurements on living or- 
ganisms of all levels have supported the proposition 
that there is a consistent range of individual differences.
To suggest that this proposition does not
hold true for forgetting requires a careful exposi- 
tion of the extent of the data available for its 
support. 

For one characteristic of individuals, the data 
are quite clear. The rate at which a person learns 
a given task is not related to rate of forgetting. 
A subject who learns a list very slowly will show 
no more forgetting than a subject learning the
same task very rapidly, if the level of learning
attained is equivalent. Such a fact is not without 
some support in incidental observations. It is not 
unusual for students in the author's classes to 
observe (when discussing their performance in the 
course) that they do not learn rapidly, but once 
they have learned something well, they remember 
it well. A slow-learning student, therefore, may take 
much longer to achieve a level of learning attained 
by a rapid-learning student, but given the equiva- 
lent degree of learning, forgetting does not differ. 

Rate of learning is, of course, only one of 
hundreds of characteristics on which people differ 
reliably. Some people are more emotional than 
others; some are more conservative than others. 
Do none of these many possible characteristics 
produce differences in rate of forgetting? Here the 
evidence is less convincing, but it is pointing 
toward the conclusion that if these characteristics 
are related to rate of forgetting, the magnitude of 
their effect is small. 

Consider, for example, the following situation. 
Subjects of widely different learning abilities are 
given the same period of time to study a given 
task, and immediately following this study period 
they are tested. The test scores will show a wide 
range, indicating that rate of learning differed 
markedly among the subjects. After 24 hours, a 
recall test is given. The recall scores are then 
correlated with the scores taken immediately after 
learning. Such a procedure will produce very high 
correlations across a wide variety of materials. 
The high correlations can be taken to indicate 
again the strong relationship between degree of 
learning and retention. If there were wide indi- 
vidual differences in rate of forgetting that vitiate 
the relationship between learning and retention, 
such high correlations would not be found. This is 
to say that a given person's retention score can be 
predicted with a high degree of accuracy if only 
his learning score is known. Under such circum- 
stances, it does not seem possible to attribute to 
individual differences a role of consequence in 
forgetting. That such a conclusion runs contrary 
to the conclusions concerning the role of individual 
differences in many other situations is indeed a 
puzzle, and perhaps future research will change 
the conclusion. But the available data allow at 
least a tentative conclusion that individual differ- 
ences in forgetting are, at best, of very small 
magnitude. 
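
The correlational argument above rests on an ordinary product-moment correlation between each subject's immediate test score and his 24-hour recall score. A minimal sketch of that computation follows. The scores are hypothetical and were invented solely to illustrate the calculation; they are not data from any study, and the function name `pearson_r` is likewise an assumption of this sketch.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores (items correct out of 10) for six subjects;
# invented for illustration, NOT data from any study.
immediate = [3, 5, 6, 8, 9, 10]
recall_24h = [2, 4, 5, 6, 8, 8]

print(round(pearson_r(immediate, recall_24h), 3))
```

A coefficient near 1 means that the recall score is almost fully predictable from the learning score, leaving little variance for individual differences in rate of forgetting to explain.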


Multiple tasks and forgetting theory 

The reason for making the distinction between 
single-task forgetting and multiple-task situations 
may now be examined. Very early in the period in 
which the study of learning and memory was
brought into the laboratory, it was discovered that 
if two tasks were learned in immediate succession 
and the subject's retention of the first task was 
then tested, severe forgetting was observed. That 
the second task was in some way responsible for 
this forgetting was shown by control conditions 
in which only the single task was learned and 
tested for retention. 

Retroactive and proactive inhibition. The 
amount of forgetting produced by the learning of 
a second task, interpolated between the learning 
and recall of a first task, is called “retroactive 
inhibition.” The amount of retroactive inhibition 
that can be produced in the laboratory is very 
great; under appropriate conditions recall of the 
first task can be reduced to zero. 

The discovery of retroactive inhibition had two 
important consequences. First, it became an object 
of study per se, and hundreds of experiments have 
been performed in which such variables as the 
similarity between tasks, the degree of learning of 
the interpolated task, and various time relation- 
ships have been studied. 

The second consequence of the work on retro- 
active inhibition is that it has largely set the theo- 
retical thinking of investigators. This thinking has 
preponderantly tended toward the use of inter- 
ference as a fundamental cause not only of for- 
getting observed in the retroactive situation but 
of all forgetting. Certainly the role of interference 
in retroactive inhibition cannot be disputed: the 
higher the similarity between the two tasks, the 
greater the amount of retroactive inhibition. Fur- 
thermore, subjects are regularly observed to give 
items from the interpolated task when they are 
asked to recall the first task. A theory built around 
an interference notion seemed quite appropriate. 

In more recent years, a second source of inter- 
ference was discovered that has only served to 
refine and reinforce the notion that interference 
must be a major cause of forgetting. This second 
source of interference is identified as “proactive 
inhibition.” Again, a subject learns two tasks, but 
this time he is asked to recall the task subse- 
quently learned. Of course, some time must elapse 
between the learning of the second task and the 
request for its recall to expect forgetting to occur, 
but given this situation, there will be greater for- 
getting than would occur for a single list. It is as 
if the originally learned task interferes with the 
recall or retention of the subsequently learned 
task. The magnitude of proactive inhibition in- 
creases with time between the learning of the 
second task and its recall. After 24 hours, we may 
expect the amount of forgetting produced by proaction
to be as great as that produced by retroaction.
The combined effects of both retroaction
and proaction may be studied by having the sub- 
ject learn three lists and then subsequently re- 
questing the recall of the second. 

Interference theory. The refinements in the 
interference theory have consisted of increased 
specification of the mechanisms involved. In order 
to study these mechanisms, certain fairly standard 
interference paradigms have been devised. The 
most widely used is known as the “A-B, A-C 
paradigm.” This may be visualized as representing 
a paired-associate arrangement in two successive 
lists in which the left-hand unit (stimulus term) 
is identical in both lists and the right-hand unit 
(response term) differs. With the first list the 
subject learns to give a response to a specific stim- 
ulus, but with the second list he must learn a 
new response to the same stimulus. When the 
several items in the two lists have this relationship, 
interference or negative effects are observed in 
learning the second list. 
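
The structure of the A-B, A-C paradigm can be stated concretely: the two lists share every stimulus term and differ in every response term. The sketch below builds such a pair of lists and checks that relationship. The particular syllables and words, and the function names `build_ab_ac` and `is_ab_ac`, are hypothetical and serve only to illustrate the arrangement.

```python
# Hypothetical stimulus and response terms, for illustration only.
STIMULI = ["RZL", "DBQ", "HFG"]            # the shared "A" terms
FIRST_RESPONSES = ["cat", "pen", "bus"]    # the "B" terms (first list)
SECOND_RESPONSES = ["far", "elk", "sun"]   # the "C" terms (second list)

def build_ab_ac(stimuli, b_terms, c_terms):
    """Pair each stimulus with its first-list and second-list response."""
    list_ab = list(zip(stimuli, b_terms))
    list_ac = list(zip(stimuli, c_terms))
    return list_ab, list_ac

def is_ab_ac(list_ab, list_ac):
    """True when the stimuli are identical across lists and every
    response differs, which is the defining A-B, A-C relationship."""
    same_stimuli = [s for s, _ in list_ab] == [s for s, _ in list_ac]
    new_responses = all(b != c for (_, b), (_, c) in zip(list_ab, list_ac))
    return same_stimuli and new_responses

list_ab, list_ac = build_ab_ac(STIMULI, FIRST_RESPONSES, SECOND_RESPONSES)
print(list_ab)
print(list_ac)
print(is_ab_ac(list_ab, list_ac))
```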

Extinction and spontaneous recovery. It is an 
easy step to assume that the negative effect in 
learning the second list must somehow be related 
to the forgetting observed in the retention tests 
for either list at some later point in time. The 
evidence points to the fact that something akin 
to an extinction process occurs in learning the 
second list. More particularly, the associations 
developed in learning the first list are extinguished 
or unlearned in the process of learning the second 
list. Therefore, if immediately after learning the 
second list, recall of the first list is requested, 
serious retroactive inhibition should occur. It does; 
it is as if the response terms of the first list are 
not available to the subject. But now, assume that 
24 hours elapse before recall of the first list is 
requested. Retroactive inhibition will occur, but it 
will be less in magnitude than the amount meas- 
ured immediately after second-list learning. Inde- 
pendent evidence suggests that following extinction 
a form of spontaneous recovery occurs, just as in 
the case of an extinguished conditioned response. 
Such spontaneous recovery would account for the 
decreased retroactive inhibition observed after the 
24-hour retention interval. 

Next, let us examine the role of extinction and 
spontaneous recovery in proactive inhibition. If
following learning of the second list there is a 
recovery over time of the associations in the first 
list, these recovered associations should interfere 
with the recall of the second list. The greater the 
recovery, the greater the interference; hence, such 
a mechanism would account for the fact that pro- 
active inhibition increases with time. 


The use of extinction and recovery mechanisms 
in an interference theory of forgetting has further 
implications. Suppose the second list is learned 
by distributed practice; that is, we carry out 
a trial or two, let an interval elapse, give another 
trial or two, another interval, and so on. These 
intervals, inserted in learning, should allow some 
spontaneous recovery of the first-list associations. 
But the next learning trials should produce some 
extinction of the first-list associations. Thus, over 
a series of such learning and rest cycles, we would 
have successive recovery and extinction cycles. 
Again, generalizing from work in other areas of 
learning, we find that such extinction-recovery
cycles may lead to more permanent extinction of 
the first-list associations. Distributed practice on 
the second list, therefore, should decrease the 
amount of proactive inhibition. Tests of this propo- 
sition are positive. Indeed, when the distribution 
intervals are long (such as 24 hours), enormous 
reduction in proactive inhibition will occur. In 
general, it is fair to say that the use of extinction-recovery
notions, as subprocesses under a general
interference theory of forgetting, has proved very
useful. 

Rapid single-unit technique. Earlier we dis- 
cussed the technique of studying retention of single 
items over very short intervals and, in addition, 
noted that near-complete forgetting had been ob- 
served in as short a period as 20 seconds. Experi- 
mental evaluations of this phenomenon have now 
made it fairly reasonable to assume that this rapid 
forgetting is produced by proactive interference. 
It is common to use a single subject many times 
in the single-item studies. The subject is given 
different units at different times, and retention is 
tested at various intervals following presentation. 
Thus, in effect, the subject builds up a repertoire 
of units as the experiment proceeds. If it is as- 
sumed that the previously presented units inter- 
fered proactively, the rapid forgetting of the item 
of the moment may be accounted for. The inter- 
pretation seems to hold. Given the appropriate 
arrangement of conditions, it can be shown that 
the greater the number of previous items pre- 
sented, the greater the rate of forgetting the item 
at hand. The first item presented to the subject 
shows little, if any, forgetting over time. It seems 
reasonable to conclude, therefore, that we are not 
confronted with new principles of forgetting when 
studying retention of singly presented items over 
short intervals of time. 

Pre-experimental associations. We now come 
to the final step in the analysis of the interference 
theory of forgetting. As previously discussed, when 
the subject learns a single list, forgetting will be
approximately 15 to 20 per cent over 24 hours.
The facts of retroactive and proactive inhibition 
produced in the laboratory made it reasonable to 
suspect that interference also accounted for this 
15 per cent to 20 per cent loss. More particularly, 
it would appear that the interference must arise 
proactively from sources outside the laboratory. 
The reasoning is as follows. Assume that the sub- 
ject is 20 years old at the time he learns this single 
list. If forgetting the list is to be attributed to 
interference, it is much more probable that he 
would have learned “something” that would inter- 
fere during these previous 20 years than during 
the 24-hour retention interval. The latter, of course, 
would be a source of retroactive interference. The 
longer the retention interval, the more important 
would interference from retroactive sources be- 
come. 

What could be the source of interference from 
outside the laboratory? The subject has a large 
verbal repertoire that he has built up during the 
years. This includes not only a great many verbal 
units but also associations among those units. It 
is known, furthermore, that there are varying 
strengths of associations among letters, and a 
major reason for the difficulty of learning non- 
words is that the required associations are con- 
trary to the existing stronger habits or associations. 
In effect, then, when a subject is required to learn 
a nonword, he must extinguish, or at least sup- 
press, the associations he brings to the laboratory 
session. If one were to apply the extinction- 
recovery theory, it would be expected that during 
the retention interval the older associations would 
recover and interfere with those built up in the 
laboratory, thus producing the forgetting observed 
at recall. Or, if the materials involved words, pre- 
experimental associations among those words and 
with other words would have to be extinguished 
if the task of the laboratory required new associa- 
tions. But again, the old habits should recover with 
time and interfere with recall. Thus the interfer- 
ence theory, as refined and shaped by the facts 
derived from the studies of retroactive and pro- 
active inhibition in the formal laboratory setting, 
was given a direct counterpart in the single-list 
situation, with the interference stemming from 
habits the subject had learned outside the labora- 
tory. This translation seemed quite appropriate, 
and the magnitude of the interference to be ex- 
pected from outside sources could surely account 
for the 15 per cent to 20 per cent forgetting ob- 
served. 

To test the application of the theory to the 
single-list situation requires the construction of 
learning tasks that will be differentially interfered 
with by associative habits that the subject brings 
to the laboratory. This, in turn, requires a knowl- 
edge of the associative habits that the subject has, 
in fact, developed over the years. From various 
word-association tables and letter-association ta- 
bles, something is known of the associative net- 
work of the average college student. For example, 
it is known that words occurring with great fre- 
quency in our language have more and stronger 
associations than do words occurring with low 
frequency. It might be expected, therefore, that if 
a verbal task were constructed of frequently used 
words in such a way that the already established 
habits were inappropriate or contrary to the habits 
that must be acquired to master the task, an effec- 
tive proactive situation would be devised. Further- 
more, proactive inhibition should be greater than 
for a task constructed of infrequently used words, 
since the pre-experimental associations among the 
infrequently used words are minimal. Clearly, it 
would be predicted that more of the task con- 
structed of frequently used words would be forgot- 
ten; forgetting should be greater because proactive 
interference after a day, a week, or a month should 
be greater. 

Tests of such expectations have been disappoint- 
ing. Little, if any, difference in the forgetting of 
the various tasks has been observed. The same has 
been true when associations among letters have 
been used to construct nonwords that should be 
differentially susceptible to previous habits. 

We can see that although the interference theory 
has been quite successful in accounting for ob- 
served facts in the formal retroactive-inhibition 
and proactive-inhibition paradigms, the translation 
of this theory to account for the forgetting ob- 
served for a single list has not. In study after study 
we find that 15 to 20 per cent of the task is for- 
gotten over a 24-hour period; we cannot get 
differential forgetting of any appreciable amount 
for different materials. The success of the inter- 
ference theory in other contexts will not allow it 
to be given up easily. Rather, it would appear that 
the more fruitful approach would be to continue 
to devise tasks in which the amount of interference 
from already established habits would be different. 
This, in turn, may require further extensive assess- 
ment of the habit repertoires of the subjects. 

Implications of interference theory. There are 
further untested implications of the interference 
theory as outlined above. If proactive inhibition 
from outside sources is responsible for the forget- 
ting of a single list, we would expect more forget- 
ting for a given task for older people than for 
younger people. For example, a college student 
should show more forgetting of a verbal task than 
should a first-grader. This should follow because 
the number of potential interfering associations 
should be greater for the college student. We might 
also predict that a genius would show more forget- 
ting than an imbecile. The genius has, in his life- 
time, learned infinitely more than the imbecile; 
hence, the sources of potential interference should 
be greater. Other similar propositions can be de- 
duced from the interference theory. It will prob- 
ably not be abandoned despite its inadequacy in 
the single-list situation until a variety of tests of 
the theory have been negative. 


BENTON J. UNDERWOOD 


[See also LEARNING. Other relevant material may be 
found in the biography of EBBINGHAUS.] 
FOUNDATIONS 


The foundation may be defined as an instrument 
for the contribution of private wealth to public 
purpose. In this broad sense, foundations existed 
in antiquity and include perpetuities set up by the 
pharaohs for religious purposes, Greek and Roman 
endowments, and the ecclesiastical and charitable 
trusts made in Tudor England and later. 

Early charitable trusts. For example, Plato es- 
tablished his famous Academy on land that Cimon 
the Athenian had given for public use. Before his 
death in 347 B.C., Plato directed that the income 
of his own adjacent fields should be used for the 
perpetual support of the school. This foundation 
survived nearly nine hundred years, until it was 
finally suppressed by the Christian Emperor Jus- 
tinian in A.D. 529 for teaching pagan doctrines. 

The corporate personality, able to receive and 
hold property in perpetuity, was primarily a Roman 
legal concept. In the Roman Empire during the 
first two centuries of the Christian era, foundations 
to aid the needy—usually associated with munici- 
palities but sometimes private associations—be- 
came common. Under Constantine special laws 
were passed extending this principle to the 
churches. 

The foundation in perpetuity (vaqf, vakif) has 
been common in Islamic countries. Examples of 
deeds of trust appeared in the early Hittite civiliza- 
tion (c. 1200 B.C.), but after the foundation prin- 
ciple was given specific approval by Muhammad 
it spread widely. A vaqf has been defined as the 
appropriation of a particular article in such man- 
ner that it becomes subject to the rules of divine 
property: the appropriator’s right in such objects 
is extinguished and they become the property of 
God, who returns the derivative benefits to his 
people. Three broad categories of obligation are 
recognized: first and foremost, a man’s duty to his 
own family; second, the maintenance of God’s wor- 
ship according to the tenets of Islam; and, finally, 
charity in the everyday sense, including works of 
public utility. No world estimates of either the 
number or accumulated wealth of vakiflar are 
available, but they are reported to exist in consider- 
able numbers in Turkey, Saudi Arabia, India, Iran, 
and other countries with substantial Muslim popu- 
lations. In 1964 the Turkish administrator of 
vakiflar reported more than seventeen thousand 
properties under his supervision. 

In England, after Henry VIII dissolved the mon- 
asteries, the secular charities grew to such num- 
bers that in 1601, under Elizabeth I, a special act 
was passed. Commonly called the Statute of Char- 
itable Uses, it became the legislative cornerstone 
for the creation, control, and protection of such 
funds. The registry of charitable trusts compiled 
in the 1960s in England may record as many as 
200,000, many of them surviving from Tudor and 
Stuart times. 

As early examples of foundations in the United 
States, consider the two funds established in Bos- 
ton and Philadelphia in 1791, under the will of 
Benjamin Franklin, to assist “young married artif- 
icers of good character.” Portions of these funds 
were to accumulate for two hundred years and are 
still accumulating. But the first substantial fund 
that corresponds to the more modern concept of a 
foundation was the Peabody Education Fund, set 
up in 1867 by George Peabody, with a principal 
sum of over $2 million to “aid the stricken South.” 

Development of the foundation concept. About 
the beginning of the twentieth century the founda- 
tion idea began to take deeper root in American 
soil, and with a significant difference from the old 
concept of the fixed charitable trust. Large endow- 
ments were set up, often in perpetuity, but with 
wide latitude in their use. The new doctrine as- 
serted that the funds of foundations are largely 
the venture capital of philanthropy, best invested 
in activities requiring risk and foresight that are 
not likely to be supported either by government or 
private individuals. The usual purpose is not relief 
or even cure; it is prevention, research, and dis- 
covery “to promote the well-being of mankind.” 
Characteristically, the larger foundations have 
great freedom of action, with their trustees de- 
voting less time to conserving money than to ex- 
ploring new and enterprising ways of spending it. 

Andrew Carnegie was a chief architect of these 
ideas, first through his essay The Gospel of Wealth 
(first published as Wealth in 1889; see Carnegie 
1886-1899) and then through the foundations he 
himself established. In 1902 he set up his first im- 
portant foundation, the Carnegie Institution of 
Washington, “. . . to encourage, in the broadest and 
most liberal manner, investigation, research, and 
discovery, and the application of knowledge to the 
improvement of mankind” (“Articles of Incorpora- 
tion” [1902] 1965, p. 577). In the same year 
John D. Rockefeller’s General Education Board was 
established with Mr. Carnegie as an active trustee. 

Others that are now well known—like the Mil- 
bank Memorial Fund and the Russell Sage Foun- 
dation—followed rapidly. In 1911 various Carnegie 
benefactions culminated in the Carnegie Corpora- 
tion of New York, the largest and most general of 
the Carnegie group. The Rockefeller Foundation, 
giant of the early foundations and still the world’s 
second largest, was initiated in 1913; in 1914 ap- 
peared the Cleveland Foundation, the first of the 
community foundations inviting multiple trust 
funds. 

The first three decades of the twentieth century 
constituted a period of increasing but gradual 
growth. Of the 5,050 foundations with known 
dates of origin included in the latest Foundation 
Directory, some 285, or less than 6 per cent of the 
group, had been established by the end of the lush 
1920s. The fourth decade, the depression 1930s, 
seemed scarcely conducive to accumulations of 
surplus wealth; and some foundations did disap- 
pear or find their assets severely impaired. How- 
ever, the Foundation Directory tabulates 288 new 
foundations originating in that decade, including 
the incorporation in 1936—with an initial endow- 
ment of $25,000—of the now gigantic Ford 
Foundation. 

Postwar growth. By the middle 1940s a new 
group of foundations had sprung up in various 
sections of the United States, induced in part by 
high postwar levels of taxation resulting from 
World War II. Many of these were family founda- 
tions, set up by a living individual, with contribu- 
tions and direction closely held within the family 
group. Another large segment was composed of 
company-sponsored foundations set up by business 
corporations to receive substantial contributions in 
years of good profits—or especially high taxes— 
and to disburse these funds, through good years 
and bad, in customary patterns of corporation giv- 
ing. Both of these types of foundations differ in 
one significant respect from the older, traditional 
type: they usually have no large initial corpus but 
carry on their often substantial programs with 
moneys received currently. They may eventually 
accumulate substantial assets, but presently they 
are conduit foundations rather than the fixed- 
endowment type. 

By 1964, date of publication of the second edi- 
tion of the Foundation Directory, it was estimated 
that there were somewhat more than fifteen thou- 
sand foundations in the United States; but nine 
thousand were too small to meet the Directory’s 
size qualifications. Assets totaled $14,500 mil- 
lion, and grants were in the neighborhood of $800 
million annually. These, however, are not large 
sums in the total American economy or even 
in its philanthropic segment. All foundations to- 
gether are able to spend only about 8 cents of the 
annual dollar of private philanthropy. The sig- 
nificance of the foundation contribution lies in its 
special qualities and directions rather than in its 
gross total. 

Moreover, although foundations are numerous 
and have been increasing at a rate of about 1,200 
a year, most assets are in the hands of a relatively 
small number of large foundations. The 176 large 
foundations, each possessing $10 million or more, 
accounted for $11,000 million in assets, or 76 per 
cent of the total for the whole group. Among this 
group were 10 foundations with assets exceeding 
$150 million, headed by the gigantic Ford Founda- 
tion which, at market values of its stock holdings 
in late 1964, had assets of over $4,000 million. 

The foundation as it exists in America has not 
spread extensively in other countries. England has 
less than a dozen foundations of substantial size 
which have freedom in programming. Canada has 
a growing number of foundations. A few examples 
exist in continental Europe, South America, Aus- 
tralia, India, and the Far East. In 1964 New 
Zealand issued its first directory (A Directory of 
Philanthropic Trusts 1964), including 63 trusts. 
Evidence increases of lively interest in other coun- 
tries, and as surplus wealth becomes available and 
changes are made in tax structures, this typically 
American development may be extensively copied. 

Types of foundations. Foundations differ so 
widely that description is difficult without some 
effort at classification. One such division is into 
grant-making and operating foundations. The 
former carry out their objectives by making gifts 
to independent agencies or persons and usually 
have small staffs merely to evaluate (or, in some 
cases, stimulate) requests for support. The latter 
themselves conduct operations in the fields of their 
choice and may have professional staffs of con- 
siderable size. 

General foundations. Nearly all the larger, 
well-known foundations such as Ford, Rockefeller, 
Carnegie, Kellogg, and Sloan are general founda- 
tions. Operating under broad charters, they sup- 
port the research projects in education, health, and 
welfare that characterize foundation work in the 
public mind. Usually they have large endowments, 
but some smaller foundations also support pro- 
grams of national significance and properly fall 
within this category. Foundations of this type 
have a board of trustees (directors, managers) 
with broad interests, and a trained professional 
staff to serve as the “eyes” of the foundation, seek- 
ing out promising new ventures, evaluating pro- 
jects offered, handling details of grants—or actual 
operations, if the foundation is of the operating 
type—and following through on results, so that 
future programs may be guided by past experience. 

Special purpose foundations. Special purpose 
foundations are created—many of them by will or 
trust instrument rather than by incorporation—to 
serve a charitable purpose closely detailed, usually 
in the charter or perhaps in a letter of gift. The 
purpose may be as narrow as that of the Prairie 
Chicken Foundation of Illinois, devoted to preserv- 
ing remnants of prairie chicken flocks, or as broad 
as advancement of education. Criticism of such 
foundations is not directed against program con- 
centration on particular needs or projects, as some 
of these foundations efficiently serve useful ends, 
but against rigid restrictions freezing concentration 
for a long period or perpetually. 

Community foundations. Community founda- 
tions are composite foundations, usually set up as 
trusts rather than as corporations, and functioning 
under community control in a way seldom found 
in other philanthropic endowments. Capital gifts 
are received, and the principal is administered by 
the trust departments of local banks. Income is 
distributed, together with such portions of the 
principal as may be authorized in any trust, under 
the supervision and control of a distribution com- 
mittee. The foundation reserves the power to trans- 
fer to other similar purposes any funds which can 
no longer be effectively used for the ends originally 
designated. The donors may be numerous; each 
donor may specify how his gift is to be used or 
leave it to the discretion of the distribution com- 
mittee, but usually the use of the donation is lim- 
ited to the city or county within which the com- 
munity foundation is situated. 

Company-sponsored foundations. Typically, the 
company-sponsored foundation is a tax-exempt, 
nonprofit legal entity separate from the parent 
company but with a trustee board consisting 
wholly or principally of corporation officers and 
directors. Except in the case of foundations associ- 
ated with large national companies, their programs 
are likely to be confined to the communities with- 
in which the parent company has offices and to 
center on philanthropies that benefit the corpora- 
tion, its employees, its stockholders, or its business 
relationships. They are conduit foundations in that 
they not only disburse their investment income, as 
do nearly all foundations, but typically receive 
substantial new funds every year, or at least every 
year of good corporate profits. 

Family foundations. Family (or personal) foun- 
dations are usually established by a living person 
or persons rather than by bequest. Generally they 
are initially small and may have no administrative 
organization or headquarters other than the office 
of the donor or a law firm. The trustees are apt to 
be the donor or donors, his immediate family, and 
perhaps his lawyer, banker, or business associate. 
At the outset, programs may differ little from the 
personal giving of wealthy men and women who 
have not incorporated their charity and probably 
include gifts to the local community fund, hospital, 
the donor's college or church, etc. Assets are no 
adequate measure of potential; such foundations 
may make substantial grants out of new annual 
gifts. Many of these initially small foundations 
may become recipients of large bequests upon the 
deaths of donors; and as experience accumulates, 
programs may change. Nearly all the large gen- 
eral foundations of today began as family foun- 
dations with limited funds oriented toward per- 
sonal charities. 

The relative numbers, assets, and grant activi- 
ties of these five types of foundations are indicated 
by Table 1, taken from the Foundation Directory 
and including all American foundations of sub- 
stantial size at the time of its publication. 


Table 1 — Assets and grants of 6,007 foundations by type of foundation, 1964 

                                          ASSETS                   GRANTS 
                                     Amount      Per          Amount      Per 
CLASS                     NUMBER  (in millions)  cent      (in millions)  cent 

General                      190     $ 9,289      64           $384        49 
Special purpose              479       1,277       9             62         8 
Community                    102         425       3             18         2 
Company-sponsored          1,716       1,177       8            143        19 
Family and miscellaneous   3,520       2,343      16            173        22 

Total                      6,007     $14,511     100           $779*      100 

* Entries do not add to this total because of rounding. 

Source: Foundation Directory 1964, p. 22. 
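
The contrast between endowed and conduit foundations can be read off Table 1 by comparing each class's grants with its assets. The following is a minimal sketch in Python, assuming only the figures printed in the table; the names TABLE_1, asset_shares, and grants_per_100_assets are illustrative and do not come from the Foundation Directory.

```python
# Figures transcribed from Table 1 (amounts in millions of dollars).
# The dictionary layout and all names below are illustrative assumptions.
TABLE_1 = {
    "General": {"number": 190, "assets": 9289, "grants": 384},
    "Special purpose": {"number": 479, "assets": 1277, "grants": 62},
    "Community": {"number": 102, "assets": 425, "grants": 18},
    "Company-sponsored": {"number": 1716, "assets": 1177, "grants": 143},
    "Family and miscellaneous": {"number": 3520, "assets": 2343, "grants": 173},
}


def asset_shares(table):
    """Each class's share of total assets, as a whole-number per cent."""
    total = sum(row["assets"] for row in table.values())
    return {name: round(100 * row["assets"] / total) for name, row in table.items()}


def grants_per_100_assets(table):
    """Dollars granted in the year per 100 dollars of assets held, by class."""
    return {
        name: round(100 * row["grants"] / row["assets"], 1)
        for name, row in table.items()
    }


shares = asset_shares(TABLE_1)
ratios = grants_per_100_assets(TABLE_1)

for name in TABLE_1:
    print(f"{name:26s} asset share {shares[name]:3d}%   "
          f"grants per $100 of assets: {ratios[name]}")
```

On these figures the company-sponsored class grants far more per dollar of assets than any other class, which is consistent with the description of such foundations as conduits that disburse current receipts as well as investment income.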


Fields of foundation activity. Table 2 shows 
education to be the most favored field of foundation 
activity, although before large governmental ex- 
penditures were channeled into medical research, 
grants made in the field of health were equally im- 
portant. Similarly, expansion of social security, 
private insurance and retirement plans, and in- 
creased governmental involvement in relief and 
similar fields formerly supported largely by volun- 
tary contributions have resulted in declining foun- 
dation support in social welfare areas. The great 
recent expansion of research within educational 
institutions makes them logical channels for an 
ever-broadening stream of foundation activities in 
the sciences and international affairs. 

The rise of international activities into second 
place is the most notable recent change, and later 
data indicate continuing growth in this area. Not 
all of these sums go abroad; many of these grants 
are made to American universities for area studies 
and like projects. It needs to be added that the 
Ford Foundation is the major factor in this increase 
in internationally oriented dollars, although other 
foundations have been entering the field or expand- 
ing their programs within it. 

Table 2 — Grants of 6,007 foundations, by major fields, 1964 

                               Amount 
Field                       (in millions)   Per cent 

Education                       $315           40 
International activities         106           14 
Welfare                           96           12 
Health                            90           12 
Sciences                          86           11 
Religion                          46            6 
Humanities                        40            5 

Total                           $779          100 

Source: Foundation Directory 1964, p. 44. 

Welfare and health receive only modest foundation 
support. Closer analysis of the data reveals 
that this pattern is characteristic of the larger 
rather than the small foundations. Some five thou- 
sand small foundations, for example, gave 34 per 
cent of their grants in the field of welfare, as com- 
pared with a mere 6 per cent for foundations with 
assets above $10 million. 

The sciences have been an expanding research 
interest. In the Directory tabulation, support for 
the physical sciences mounted to 40 per cent of the 
total science expenditure, with the life sciences— 
formerly the unchallenged leader—at 32 per cent. 
The social sciences received 28 per cent of the total. 

Religion receives only 6 per cent of foundation 
grants, although as a category it receives an es- 
timated half of all private philanthropic gifts in 
the United States. But most of this giving is done 
personally, rather than through a foundation, even 
where an individual could choose either channel. 

The humanities come last in the Directory tabu- 
lation, at 5 per cent of the total. However, still 
more recent figures indicate a substantial rise in 
this area, stimulated through such special projects 
as Lincoln Center in New York. 

Foundations in American society. Because foun- 
dations are numerous and some of them bear the 
names of wealthy families, a popular misconcep- 
tion exists that they have tremendous assets and 
are able to make almost unlimited expenditures. 
Relatively, the resources of foundations are not 
large in the American economy and may be shrink- 
ing. In 1913 the federal government's total expendi- 
tures for education came to $5 million. The Car- 
negie Corporation of New York in that year spent 
a total of $5.6 million, so that if it had devoted all 
its funds to education, it could have more than 
equaled the federal government total. The Rocke- 
feller Foundation is known around the world for 
its extensive fellowship program, on which it has 
spent $61 million in the past fifty years; but this 
total is less than the amount the American govern- 
ment spends in a single year today for its fellow- 
ship program alone. 

Although in relative terms foundations consti- 
tute a minor segment of the American economy, 
their still considerable growth in numbers and in 
wealth has at times occasioned concern. As early 
as 1915 the Walsh Committee of the U.S. Senate 
charged that foundations were dominated by big 
business and were exerting strong conservative and 
reactionary influences. In the late 1940s it became 
evident that business enterprises in some numbers 
were seeking tax shelter under the foundation um- 
brella. To correct this and other abuses, the Rev- 
enue Act of 1950 set up “prohibited transactions” 
involving dealings between foundations and their 
donors or related persons. It taxes at ordinary cor- 
poration rates foundation profits from business 
enterprises not substantially related to the founda- 
tion’s tax-exempt purposes, and it makes unreason- 
able accumulations of income a basis for loss of 
exempt status. 

In the House of Representatives in 1952 the 
Cox Committee, and again in 1954 the Reece Com- 
mittee, held special hearings on foundations, con- 
centrating on program; but the charge then was 
the opposite of the Walsh charges. The Reece Com- 
mittee attempted to demonstrate that foundations 
and certain educational institutions were engaged 
in a “diabolical conspiracy” to weaken and discredit 
the capitalistic system in the United States and to 
promote Marxist socialism. Neither the Walsh nor 
the Reece thesis was supported by substantial evi- 
dence, and no legislation followed. 

In the 1960s the spotlight turned back to finan- 
cial aspects, with allegations that certain founda- 
tions were serving primarily as tax shelters and 
business conveniences for their donors. 

Some present tendencies. A few foundations 
now have more than a half century behind them, 
and a substantial record of experience is available 
for all foundations. The administration of founda- 
tions is becoming professionalized. While there are 
no institutions of special training for this field, na- 
tional and regional conferences, informal luncheon 
groups, and other similar devices increasingly offer 
opportunities for exchange of experience. Founda- 
tions have resisted the formation of any closely 
knit national association, perhaps because of the 
high degree of individualism that characterizes the 
field and perhaps partially to avoid charges of col- 
lusion and wealth concentration. 

Literature in the field, once sparse, is rapidly in- 
creasing. About two hundred foundations issue 
printed reports, usually annually; and histories of 
a number of the older foundations have recently 
been published. A few critical studies have ap- 
peared, and more are in progress. 

Foundations are in some respects unique among 
American social institutions. They are the only im- 
portant agencies in America free from the political 
controls of legislative appropriations and pressure 
groups, and free from the lay controls of having 
to temper programs with the judgments and preju- 
dices of current contributors. Because of this posi- 
tion of unusual freedom, the foundations have an 
opportunity—and perhaps a special responsibility 
—to attack the longer range, more difficult, and 
often controversial questions which face the nation 
and the world; and many of them do spend a sub- 
stantial part of their funds in pioneering ventures 
that would have difficulty in obtaining support 
from other private sources or from government. 


F. EMERSON ANDREWS 


[See also PHILANTHROPY.] 
FOURIER, CHARLES 


Charles Fourier, French socialist thinker, was 
born at Besançon in 1772 and died in Paris in 
1837. The sources of his thought remain obscure. 
To a great extent he was an autodidact, who owed 
much to his reading and still more to his reflec- 
tions on what he read. Fourier’s inclination was 
to become a military engineer, but his family— 
wealthy merchants—made him a tradesman in 
Marseilles. A setback in business in 1799, com- 
bined with his lack of business ability, reduced 
Fourier to the role of civil servant at Lyons. After 
1816, when he inherited an income from his 
mother, he was finally able to devote himself en- 
tirely to his writing. In 1823 he settled perma- 
nently in Paris. Although he left a large body of 
published writings as well as many unpublished 
manuscripts, he never did write the “Grand Traité” 
that was to lay out his whole system in detail. 

To use the language of the time, Fourier was a 
capitalist whose personal experience made him 
sensitive to the social plagues and the vices of a 
civilization based on commercial lies. The sole 
thought of his life was to find a “way out” that he 
might propose to his contemporaries. His first ma- 
jor book (1808) painted a cosmic fresco, showing 
blind humanity the path toward happiness and 
abundance. All his life he waited for the bene- 
factor (king, minister, or financier) who would 
enable him to make even a partial test of his social 
“invention,” or scheme. 

This scheme assumes a correspondence between 
the cosmic order and the social order. It applies to 
the latter order the law of gravity established for 
the former by Newton and then applies to both 
orders a law of evolution, with eight ascending 
periods followed by the same number of descending 
periods. The peak is the reign of “harmony,” in 
which “association” will be fully triumphant; how- 
ever, humanity is still immersed in the fifth period, 
called “civilization” to distinguish it from the bar- 
barism from which humanity has only recently 
emerged. The goal is the free governance of things 
by individuals who are at once completely free, fully 
mature, and highly organized. This goal explains 
the importance that Fourier attached to education, 
his criticism of the family and of the relations 
between the sexes, and finally his theory that the 
“passions” should be put to use instead of being 
repressed. The means of achieving this goal is the 
“phalanstery,” a basic social unit whose organiza- 
tion rests on the mathematical rationality of social 
phenomena (the dimensions of groups, the balance 
of age groups, the alternation of activities) and 
which encompasses all aspects of the life of its 
members. 

Fourier, an antirevolutionary and a strong op- 
ponent of Saint-Simon and Owen, believed his 
social system to be consistent with any form of 
government, including the monarchical one. Had 
not the French monarchy already shown great 
adaptability, having been feudal in the Middle 
Ages, absolute under Louis XIV, and bourgeois un- 
der Louis Philippe? There seemed to Fourier no 
reason why it could not also adapt to the new 
industrialism. This strange man thus combined 
traits that in others are generally mutually ex- 
clusive—political conservatism, social reformism, 
moral anarchism (at least in his conception of free 
love), and anti-Christian religiousness. Fourier be- 
lieved in the goodness of human nature and re- 
jected the dogma of original sin. He saw harmony 
as the law of the cosmos and held that what is 
true for nature must be true for society. This is 
why he gave to Newton’s work the same impor- 
tance that Marx was to see in that of Darwin; 
however, Marx abstracted from Darwinism a proc- 
ess of struggle, while Fourier found in Newton’s 
theory a mechanism of attraction. 

Fourier had only a limited influence on the 
development of society: his realm was that of 
thought, not of action. Yet the only thinkers who 
acknowledged a direct debt to Fourier were the 
Russian revolutionaries Herzen, Petrachevsky, and 
Tchernichevsky. While he was in some ways far 
ahead of his time, in many other respects he was 
a man of the past: his views apply much more to 
an agricultural and trading society than to one 
that is industrial and technological. Marx, of 
course, scorned him as a “utopian socialist.” Nevertheless,
Fourierism did not remain an abstract
utopia. For thirty to forty years Fourierist ideas 
were realized in a series of community experi- 
ments, both in France and in a number of other 
countries, especially in the United States. For the 
most part, these experiments were attempts at the 
collective operation of a large rural estate, and 
they foreshadowed more modern forms of social 
organization. One of these enterprises took on an 
urban and industrial aspect—the familistère at
Guise (in northern France). Today there remains 
not only a factory (like any other factory) but also 
a set of dwellings that is a unique example of 
Fourierist architecture. Some of Fourier’s disciples 
put little trust in the success of these partial reali- 
zations of his scheme and held that a total trans- 
formation of society was called for. Rallying around 
Victor Considérant, 1808-1893, they entered into 
political life and became a real force for some 
years, but with the advent of Napoleon III they
could do no better than merge into the bourgeois 
opposition to the Second Empire. As Charles Gide 
pointed out some time ago (1924), there is a direct 
relationship between Fourierism and the coopera- 
tive movement which played an important role be- 
fore 1914 and of which there are now new develop- 
ments, for example, in Yugoslavia, Israel, and the 
countries of the Third World. 


ÉMILE POULAT


[For the historical context of Fourier’s work, see SOCIALISM;
UTOPIANISM; and the biographies of MARX;
OWEN; SAINT-SIMON.]
FRANK, JEROME


Jerome N. Frank (1889-1957) was an Ameri- 
can lawyer, public official, judge, law teacher, 
and writer on law and social philosophy. As an 
active participant in public affairs, first as a middle- 
ranking figure in the New Deal and later as a 
United States circuit court judge, Frank was re- 
spected as an effective, if sometimes turbulent, 
influence for change. He had opponents but few 
enemies. He was a generous person, kind, unpre- 
tentious, enthusiastic in behalf of his friends, and 
a witty companion as well. His passion was writing, 
and even in the most demanding periods of his 
service in Washington, he produced books and ar- 
ticles on a wide variety of subjects. They remain 
the chief source of his influence. 


Frank was born in New York but grew up in 
Chicago; he attended the University of Chicago, 
taking his law degree in 1912. The university left 
in Frank’s mind strong traces of attitudes that were 
typical of the institution’s early outlook: a sense 
of identification with the liberal and progressive 
theme in the spirit of the nation; a wide-ranging 
enthusiasm for ideas, particularly new ideas; and a 
preference for innovations, especially if they could 
be called “reforms.” After leaving the university, 
Frank practiced law for 17 years in Chicago, where 
he was a conspicuous and successful counselor and 
attorney, mainly in the field of commercial and 
financial transactions. He and his wife, a well- 
known poet, also belonged to the literary world of 
Chicago. 

Intellectual interest and a sense of immediate 
need led Frank to undergo psychoanalysis, an event 
in his personal history that had a far-reaching im- 
pact on his thinking. In 1930 Frank published his 
most famous book, Law and the Modern Mind, 
which views the law in a Freudian perspective and 
interprets the methods and processes of law as so- 
cial magic. Thus, judges are shown as father figures 
of mystical authority, dressed in robes and required 
to sit on high benches so that they evoke irrational 
deference. This kind of magic, according to Frank, 
helps maintain social order. The book came at a 
moment when Freudian insights were just begin- 
ning to touch the literature of political science, 
anthropology, literary criticism, sociology, and law; 
and it created a storm. Its sparkle and force made 
it a conspicuous factor in the acceptance of psy- 
choanalysis as an important element in American 
views of man as a social being. 

Frank had moved to New York in 1929; there he 
continued to practice law and joined a number of 
intellectual circles. He lectured at the New School 
for Social Research and in 1930 began an inter- 
mittent association with the Yale Law School, 
which lasted until his death. 

The Yale Law School of the early 1930s was a 
lively place, well suited to Frank’s personality and 
interests. The staff included William O. Douglas, 
Walton H. Hamilton, Thurman Arnold, Abe Fortas, 
Underhill Moore, and a number of psychologists, 
economists, and statisticians, as well as more 
orthodox law professors. The school imbued Frank 
with the sense that the world was about to be re- 
made, if not reborn. It was one of the centers of 
American “legal realism,” a controversial philo- 
sophical movement, which was concerned with 
directing attention from the “law in the books,” 
the doctrinal law of the appellate courts, to “the 
law in action,” the patterns of usage and practice
that actually prevail in daily life, in the lower 
courts, and in business. Frank fought zestfully for 
the latter view of the law, a view which was 
strongly resisted.

After this stimulating experience he joined 
several members of the Yale faculty in the ad- 
venturous first wave of President Roosevelt's New 
Deal and was appointed general counsel of the 
Agricultural Adjustment Administration. This radical
experiment in direct controls was headed by
George Peek, a rural conservative and, like most 
of his agrarian staff, suspicious of urban, volatile, 
intellectual types like Frank himself and the 
extraordinary group he assembled, a group which 
included Arnold and Fortas from Yale, Adlai 
Stevenson, and Alger Hiss, among many others. 
The conflict between this group, led by Frank, and 
the older breed of agricultural experts lasted nearly 
two years; it ended in one of the spectacular ex- 
plosions of the New Deal, with Frank and a con- 
siderable number of his allies in the Department 
of Agriculture being dismissed. Frank was soon 
reappointed, first to the staff of the Reconstruction 
Finance Corporation and then as a member—from
1937 to 1939—and as the chairman—from 1939 
to 1941—of the Securities and Exchange Commis- 
sion, where he played an active part in the develop- 
ment of policy, particularly under the Public Utility 
Holding Company Act of 1935. 

During this period he published Save America 
First (1938), a far-ranging comment on the eco- 
nomic element in social experience, which included 
an argument against American participation in the 
European war. It was written in the spirit of sena- 
tors Borah and Norris and the older tradition of 
agrarian Populism; before June 1940 Frank took 
part as an isolationist in the national debate over 
America’s interest in the war. 

In 1941 he was appointed judge of the United 
States Court of Appeals for the Second Circuit, 
sitting in New York; he later moved to New Haven 
when he began to teach regularly at the Yale Law 
School. Frank found stimulus in the company of 
his judicial colleagues, especially judges Learned 
Hand, Augustus Hand, and Charles E. Clark, as 
well as at the law school, where he maintained an 
active office and saw students and faculty mem- 
bers regularly. His opinions were colorful and often 
unorthodox in form, reflecting the range of his 
reading and the mannerisms of a style that had be- 
come unusually free and personal in tone. He did 
important judicial work in several fields of law, 
including procedure, finance, criminal law, and 
civil liberties. He was widely quoted on obscenity,
immigration, the rights of the accused in criminal 
trials, and other subjects; and his views were much 
discussed in the law reviews. 

The books and articles of the final period in 
Frank's life embody the unresolved contradictions 
of his view of law and of the social process. Quick, 
agile, and known for his brilliant aperçus, Frank did
not have the discipline of a systematic scholar and 
permitted himself the luxury of inconsistent posi- 
tions. In his teaching at Yale and in his writing, 
especially in Courts on Trial (1949), he stressed 
the importance, as well as the uncontrollable irra- 
tionality, of fact-finding in the processes of law, 
arguing that fact-finding constitutes a step in the 
trial of cases that dominates the apparently su- 
perior legal rules laid down by appellate judges. He 
deplored and ridiculed man’s yearning for strict 
and predictable legal rules, but in the end his pro- 
posals for law reform aimed to bring trials under 
the more effective control of those rules and to 
make the rules clearer, more uniform, and more 
predictable. He argued that legal education should 
be brought closer to trial practice and also that 
legal practice should correspond more closely to 
the law taught in law schools. For many years 
he saw “natural law” as a dangerously vague and 
misleading slogan, but he never ceased to contend 
that the administration of law should give a larger 
place to ideas of ethics and individual justice. 

In his work Frank revealed the strengths and 
shortcomings of the legal and social philosophy 
of which he was a militant partisan. He spoke 
persuasively for sensible reform, for simplicity, and 
for humane values in the law and in the social 
order generally. By preaching the need to relate 
theories to facts he helped men to build the law 
anew on the foundation of social need. If he never 
realized the equal need of relating facts to theories, 
he shared his faith in the autonomy of fact with 
many pragmatists. 


EUGENE V. ROSTOW


[See also JUDICIARY; JURISPRUDENCE; the detailed 
guide to related articles under Law; and the biographies
of HAMILTON, WALTON H.; LLEWELLYN; POUND.]
FRAZER, JAMES GEORGE


Sir James George Frazer (1854-1941), British
social anthropologist, folklorist, and classical 
scholar, was born in Glasgow. He attended the 
University of Glasgow from 1869 to 1874 and then 
went to Trinity College, Cambridge. He became a 
fellow of the college in 1879 and remained at 
Cambridge for the rest of his life. For one year, 
1907/1908, he visited Liverpool University as pro- 
fessor of social anthropology, being the first to 
hold a chair with that name. Throughout his career 
he made little or no direct contact with the remote 
peoples who figure so extensively in his writings, 
and much of his simplification of their ideas 
(which explains some of his popular success) 
may be attributed to this absence of personal 
experience. 

It is difficult now to appreciate Frazer's theories 
of magic and religion, since they have either been 
thoroughly assimilated or else outdated by ex- 
tensive field research, yet in the intellectual life 
of his time, and particularly in the field of social 
anthropology, his work was of great importance. 
He himself appears to have anticipated the obso- 
lescence of his work when he stressed that first- 
hand observation of foreign societies would furnish 
the science of man with a solid foundation which 
could never be shaken, and which would endure 
when many of the theories of his time, his own 
included, were forgotten or remembered only as 
curiosities. And with a characteristic skepticism 
that may now appear sententious he wrote that 
magic, science, and religion are nothing but theories 
of thought. . . . And as science has supplanted its pred- 
ecessors, so it may hereafter be itself superseded by 
some more perfect hypothesis, perhaps by some totally
different way of looking at the phenomena—of regis-
tering the shadows on the screen—of which we in this 
generation can form no idea. The advance of knowl- 
edge is an infinite progression towards a goal that for 
ever recedes. (1890, pp. 712-713 in the abridged 
edition)


Although Frazer was directly in that current of 
opposition to established clerical orthodoxy that 
swept so many thinkers of his time (notoriously 
his friend William Robertson Smith) into socially 
painful controversy, he himself was consistently 
held in respect amounting to veneration. He treated 
Christianity as comparable with pagan religions 
and thus, at least by implication, deprived Chris- 
tianity of its uniqueness, but he did so too tact- 
fully to give real offense. He undermined rather 
than attacked the doctrinal convictions of his con- 
temporaries. He was rewarded in his lifetime with 
public and academic honors, among them a civil 
list pension granted in 1905 for services to litera- 
ture and anthropology. (It is interesting to note 
that he received his honorary D.C.L. at Oxford in 
the company of Lord Kitchener and Cecil Rhodes.)
His wife was single-mindedly devoted to his repu- 
tation; indeed, R. R. Marett likened her to the 
guardian-wife of a priest of ancient Rome.

To his admirers Frazer appeared as a modern 
seer, a role that he accepted. He was revered by 
colonial administrators and missionaries as few 
anthropologists have been, and his extensive cor- 
respondence with them, together with his pub- 
lished questionnaire (1907), produced firsthand 
information about many peoples of the world. Men 
of letters (Kipling, Tennyson, T. S. Eliot, Ezra 
Pound, and D. H. Lawrence, to mention only a 
few) also took account of Frazer’s writings, which 
have probably influenced the poetry of this century 
as much as they have the anthropology. A. E. 
Housman identified some of the characteristics 
that account for the popular appeal of Frazer’s 
work when he described it as “. . . a museum of 
dark and uncouth superstitions invested with the 
charm of a truly sympathetic magic” and said it 
illuminated “the forgotten milestones of the road 
which man has travelled” (Dawson 1932, p. xi). 
Frazer’s success in popularizing his subject and 
consolidating its practical relevance must be seen 
as one of his great claims to fame in the social 
sciences.

The development of the psyche. The Golden 
Bough, a reconstruction of the whole history of 
modes of human thought, orders a vast range of 
exotic beliefs and customs in terms of man’s search 
for true knowledge and effective control of his
environment and his condition. Frazer posited 
three elements in the development of the human
psyche, and in the spirit of the evolutionary think- 
ing of his time, he saw these as characterizing 
three stages in human mental advance: magical, 
religious, and scientific thought. 

Magical thought assumed that the universe is 
regulated by impersonal and unchanging laws (in 
this respect, Frazer believed, magic is like science). 
These laws were known to the magician through 
his art and applied by him in a quasi-technical 
way to control events. But magical beliefs and 
procedures (unlike scientific ones) were derived 
from faulty reasoning by analogy and from super- 
ficial associations: the qualities of one object were 
supposed to induce similar qualities in another. 
(One of the many examples given by Frazer is the 
ancient Greek custom of eating ravens’ eggs to 
produce black hair.) This homeopathic magic 
assumed a “law of sympathy,” which operates in 
such a way that like produces like. In addition, 
there was “contagious magic,” which assumed that 
things once in intimate contact would subse- 
quently act upon one another: a man’s hair or 
nail clippings, for example, were used to work evil 
upon him. Although Frazer oversimplified the 
problems of symbolic thought that such beliefs 
now present to anthropologists, his distinctions con- 
tinue to have some elementary taxonomic value. 

Magical thought, which was gradually discred- 
ited, in Frazer’s opinion, because its failures be- 
came apparent, gave way to religious thought. 
In this phase superhuman beings were thought to 
control the world. The uniformity of nature ceased 
to be taken for granted, since the occurrence of 
natural events was assumed to depend upon the 
will of conscious personal agents. Man sought to 
gain the help of these agents by acts of supplica- 
tion and propitiation. 

Finally, recognizing more clearly the limits of 
his own powers of control and applying logico- 
experimental methods, man achieved the scientific 
stage. 

Assessment of “The Golden Bough.” The main 
value of The Golden Bough for many of Frazer's 
contemporaries lay in its wide range of reference, 
its bold ordering of complex and varied informa- 
tion, and its perception of similarities in beliefs 
and customs that to them might have appeared 
quite distinct from one another. Frazer greatly 
overstressed the part played by deliberate reason- 
ing in religious and magic belief and created a 
primitive man whose intellectual ambitions and 
processes of thought were those of a scholar like 
himself. (Even his evolutionary sequence reflects, 
psychologically, the process of growing up as many
people with a pious upbringing like Frazer’s experienced
it: magical fantasies of power in early
childhood, followed by adolescent religious belief, 
yielding in maturity to “scientific” agnosticism.) 

Often Frazer preached the dangers of misin- 
terpretation that he himself had failed to avoid. 
What he said of Robertson Smith’s Religion of the 
Semites may be said of his own explanations of 
primitive thought: namely, that what now seems 
to be their inherent plausibility is in fact a presump- 
tion against them. Similarly Rousseau’s views on 
the origin of society commended themselves to the 
most reasonable people of the previous century just 
because, if they had to reconstruct society from 
its foundations, they would have proceeded much 
as Rousseau supposed primitive men did. Thus 
Frazer and many of his contemporaries explained 
magical and religious thought by imputing a 
“natural” form of their own reasoning to very differ- 
ent peoples. Moreover, lacking knowledge of local 
languages and cultures, they were satisfied to take 
reports of isolated customs and beliefs quite out 
of their social context, and in this process they 
sometimes distorted even such evidence as they 
had. 

An early critique of Frazer’s work is that of 
W. Ridgeway (1924), who discussed divine king- 
ship, one of Frazer’s central themes in The Golden 
Bough. The divine king, according to Frazer, has 
a vitality that is believed to be the source of vi- 
tality in society and nature, and therefore, when 
his powers fail, he must be put to death so that 
a vigorous successor may continue to ensure pros- 
perity. Ridgeway admitted that Frazer's exposition 
of the sacredness of kingship and the symbolism 
surrounding it did draw attention to these subjects 
and stimulate further investigation, but he charged 
that for many years Frazer’s mode of analysis also 
distracted attention from the practical politics of 
ancient tribal monarchies. A more recent examina- 
tion of Frazer’s interpretation is Evans-Pritchard’s 
Divine Kingship of the Shilluk of the Nilotic Sudan 
(1948). This shows in a particularly interesting 
way the hold Frazer’s imagination had even on pro- 
fessional field observation: thus C. G. Seligman’s 
investigations among the Shilluk, which used 
Frazer's concept of divine kingship, revealed just 
the set of beliefs and practices that fitted the Frazer 
argument. As Evans-Pritchard points out, there is 
very slight evidence for the actual ritual killing 
of the kings of Shilluk, while there is considerable 
evidence for the important role played by political 
conflicts within the ruling house. Many anthropolo- 
gists thus became “enslaved,” as Seligman was and 
as Malinowski once claimed to have been, by 
Frazerian anthropology (see also Leach 1965). 


Influence of Frazer’s ideas. Frazer's detailed con- 
tributions to anthropological thinking—his studies 
of the soul, of death, and of totemism and taboo, 
for example—belong to a framework of discussion 
that has now largely been abandoned, although it 
should be granted to him that in his time his was
a more modern view than many others. He rec- 
ognized, for example, that the notion of taboo 
has something to do with the Roman concept of 
sacer, with its dual connotation of “sacred” and 
“accursed,” and though his general theory of totem- 
ism was crudely mechanistic, he did go some way 
toward identifying the bond of a common life that 
frequently links men and their totemic species. 

But the significance of Frazer does not depend 
finally upon his anthropological theories. What he 
did was to introduce a comparative approach to the 
study of human social institutions. Here the very 
ambivalences and contradictions in his attitudes 
toward “the savage” may have served a purpose. 
While he had a certain ethnocentric contempt for 
the “dark and uncouth superstitions” he recorded, 
he also constantly implied that primitive thought 
is part of a single body of human tradition, link- 
ing his contemporaries (and himself) with much 
they despised. His self-deceived magicians and 
priests were also agents of human progress, the 
ablest and most intelligent of their time. In keep- 
ing with his own political philosophy, he also 
commended them for establishing intellectual 
despotism—and any statesman of his time who 
agreed with Frazer in principle was thereby ac- 
cepting comparison with savages. 

Frazer denied that all savages have an equal 
propensity to accept mystical explanations, assert- 
ing that in “savage as in civilized society there are 
sceptics as well as mystics” ([1923] 1931, p. 416); 
indeed, he believed that the “primitive mentality” 
that Lévy-Bruhl attributed exclusively to savages 
also characterizes such modern thinkers with 
mystical proclivities as Pascal, Newton, and Hegel. 
And on various occasions he stressed not only the 
common qualities of the thought of savage and 
civilized men but the positive contributions the 
former have made to the latter. Thus, in Psyche’s
Task (1909) he made a plea for the value of 
“superstitions”: he saw their function in the main- 
taining of such institutions as civil order, private 
property, marriage, and the sanctity of human life. 
In a sense he anticipated the interest in “function” 
of Malinowski and Radcliffe-Brown. And in The 
Golden Bough he wrote that 


of [all] the benefactors whom we are bound thankfully 
to commemorate, many, perhaps most, were savages. 
For when all is said and done, our resemblances to the 
savage are still far more numerous than our differences
from him; and what we have in common with him,
and deliberately retain as true and useful, we owe to 
our savage forefathers who slowly acquired by experi- 
ence and transmitted to us by inheritance those seem- 
ingly fundamental ideas which we are apt to regard as 
original and intuitive. (1890, p. 264 in the abridged 
edition)


It is, then, as a powerful solvent of contemporary 
prejudices that Frazer’s work stands out. Without 
him, the struggle to introduce anthropological 
knowledge and anthropological viewpoints to au- 
thorities in academic and public life would have 
been longer and harder. In an essential way, he 
made modern anthropology possible. 


R. G. LIENHARDT 


[For discussion of the subsequent development of 
Frazer's ideas, see ANTHROPOLOGY, article on THE 
COMPARATIVE METHOD IN ANTHROPOLOGY; CULTURE; 
MYTH AND SYMBOL; RELIGION; RITUAL; and the 
biographies of LÉVY-BRUHL; MALINOWSKI; SELIGMAN,
C. G.; SMITH, WILLIAM ROBERTSON.]
FRAZIER, E. FRANKLIN


The most significant contributions of E. Frank- 
lin Frazier (1894-1962) to the literature of soci- 
ology are embodied in his writings in the fields of 
family behavior and race and culture contacts. 
Although these fields are commonly demarcated as
separate areas of study, they were not always so 
conceived by Frazier. 

Frazier’s major contribution to the literature of 
the family is The Negro Family in the United 
States (1939). Building on earlier research (1932a; 
1932b), the book analyzes the impact first of 
slavery and then of emancipation and urbanization 
upon the Negro family. These experiences pro- 
duced in the Negro family variations from the 
dominant American family pattern—to wit, a more 
important role for the female; attachment of great 
significance to variations in skin color; and a higher 
incidence of illegitimacy, of common law relation- 
ships, and of other forms of family disorganization. 
Frazier’s viewpoint that the structure and values 
of the Negro family in the United States are to be 
understood, except in the most isolated instances, 
as products of the Negro’s American experiences 
involved him in a lively controversy with the an- 
thropological scholar and Africanist, Melville Hers- 
kovits, whose studies led him to the conclusion 
that the major institutions of Negro life, including 
the family, incorporate African survivals to a 
significant extent. 

Frazier’s sociological conceptions were shaped 
mainly by his graduate training at the University 
of Chicago, from which he received the doctoral 
degree in 1931. There he studied with Ellsworth 
Faris, Robert E. Park, William F. Ogburn, and 
Ernest Burgess. He became associated with the pro- 
gram of research on the urban community and on 
race relations, directed by Park and conducted by 
a group of brilliant graduate students and young 
instructors that included Louis Wirth, Everett C. 
Hughes, and Herbert Blumer. Although critics 
labeled this group the “Chicago ecological school,” 
its basic conception of sociology was in fact much 
broader than the study of ecological phenomena. 
The group believed that any social phenomenon 
may be understood within the context of the larger 
social system and that the larger social system may 
be coterminous with society itself. 

The influence of this approach is reflected in 
Frazier’s work on culture contacts (1949a; 1957). 
In his Race and Culture Contacts in the Modern 
World (1957), he analyzed the ecological and 
demographic relationships that result from con- 
tacts between people of diverse racial and cultural 
backgrounds and the effects of these relationships 
on economic, political, and social organization. 

Perhaps the most interesting result of Frazier’s 
work on culture contacts is his Black Bourgeoisie 
(1955), an analysis of the evolution, composition, 
and style of life of the Negro middle class in the 
United States. The Negro middle class, according 
to Frazier, differs from middle classes in general
not only in composition but also in values. Frazier 
pointed out that the Negro middle class lacks the 
strong entrepreneurial tradition that has been the 
backbone of the middle classes in general. Negro 
business is small business, mainly of the service 
variety—restaurants, beauty parlors, food stores, 
undertaking establishments. Negro insurance com- 
panies and the Negro press may be exceptions, but 
even these are small compared to white organiza- 
tions of the same type. Similarly, Negro banks are 
few and possess limited capital. 

The black bourgeoisie, therefore, “is constituted 
of those Negroes who derive their principal income 
from services they render as white-collar workers.” 
This class of white-collar workers, mainly profes- 
sionals and clerical and sales personnel, has ac- 
quired a dominant position among Negroes. Al- 
though their incomes are limited, they have lost 
much of the old virtues identified with the middle 
class—industry, thrift, belief in the substantive 
values of education—and have, instead, empha- 
sized conspicuous consumption and attractive so- 
cial life, values more commonly associated with a 
leisure class. This emphasis upon society and 
social life represents “status without substance.” 
This theme, that Negro values are distorted, recurs 
in many of Frazier’s articles on Negro life in the 
United States: he saw the racial system as forcing 
the Negro to live in isolation and as endowing him 
with a sense of dependency and inferiority. 

Frazier was born in Baltimore, Maryland. An 
apt and intellectually curious student, he received 
a B.A. degree with honors from Howard University 
in 1916. His first serious, formal encounter with 
sociology was as a graduate student at Clark Uni- 
versity, from which he received an M.A. degree in 
1920. There he studied with Frank Hankins, whom 
he credited with opening up to him the possibilities 
of sociology as a systematic study. In addition to 
graduate study at Chicago, his formal education 
included a year of study at the New York School 
of Social Work, 1920-1921, and a year in Denmark
as a fellow of the Scandinavian-American
Foundation, 1921-1922. His major academic affili- 
ation was with Howard University, where he was 
professor of sociology from 1934 until he died.

Frazier was president of the American Socio- 
logical Society in 1948 and was awarded honorary 
degrees by Morgan College, Baltimore, in 1955 and 
by the University of Edinburgh in 1960. 


G. FRANKLIN EDWARDS 


[For the historical context of Frazier’s work, see the 
biographies of BURGESS; HANKINS; HERSKOVITS;
OGBURN; PARK; for discussion of the subsequent
development of his ideas, see RACE RELATIONS.] 
See INTERNATIONAL TRADE and INTERNATIONAL
TRADE CONTROLS. 


FREEDOM 


The word “freedom,” with its synonym “liberty,” 
has a strong laudatory connotation. It has therefore 
been applied to whatever actions, policies, or insti- 
tutions may be deemed valuable, from obeying the 
law to attaining economic affluence. Political writ- 
ings seldom provide explicit definitions of “freedom” 
in descriptive terms, but it is often possible to infer 
descriptive definitions from the context. If this is 
done, it will be seen that the concept of freedom 
refers most frequently to social freedom, which 
must be distinguished from other descriptive and 
valuational usages. Descriptive definitions of “free- 
dom” designate empirically specifiable states of 
affairs and can therefore be accepted by anyone, 
regardless of his normative views on liberty. “Free- 
dom” in a valuational sense is used to commend 
rather than to describe; it therefore means different
things to writers committed to different ethical
standards. 


Social freedom 


The concept of interpersonal or social freedom 
refers to relationships of interaction between per- 
sons or groups, namely, that one actor leaves an- 
other actor free to act in certain ways. This concept 
is best defined by reference to another interaction 
relation, that of interpersonal or social unfreedom. 

Social unfreedom defined. With respect to actor 
B, actor A is unfree to perform action x if and only 
if B makes it either impossible or punishable for 
A to do x. “B makes it impossible for A to do x” 
means that B performs some action y such that 
were A to attempt x, his attempt would fail. By 
denying a citizen a passport, the government makes 
him practically unable to travel abroad and, hence, 
unfree to do so. With respect to the United States, 
Communist China is unfree to conquer Formosa, 
and vice versa, since U.S. forces would presumably 
prevent either power from invading the other. If 
the Ku Klux Klan forcibly prevents Negroes from 
entering a public school, the Negroes are unfree to 
do so with respect to the Klan, but not with respect 
to the government. “B makes it punishable for A 
to do x” means that were A to carry out x, B would 
perform some action y that would deprive A. Gov- 
ernmental sanctions against illegal acts are only 
one example of punishability as an instance of social 
unfreedom. With respect to a union, a company 
would be unfree to withhold certain benefits if the 
union were to picket the company. Residents in a 
typical block of modern suburbia are unfree to 
deviate from certain tacit norms with respect to 
the “neighborhood,” which tends to penalize non- 
conformists. 

Social freedom defined. Social freedom is not 
the contradictory of social unfreedom. I am not offi- 
cially unfree to pay income taxes, yet I am not 
free to pay them either; rather, I am unfree to 
withhold payment. A relationship of freedom refers 
to a set of at least two alternative actions or types 
of actions. I am unfree to do this; I am free to do 
this or that. An actor is free to act in any one of 
several ways, provided there is no other actor who 
makes him unfree to perform any of these actions. 
Thus, with respect to B, A is free to do either x or z 
if and only if B makes it neither impossible nor 
punishable for A to do either x or z. “Freedom to 
vote” means freedom either to vote or to abstain; 
but “freedom to propagate the truth” really means 
unfreedom to spread “erroneous” views. Further- 
more, I may be free to act in one way or another 
with respect to one person or group, whereas an- 
other actor makes me unfree to engage in one of 
these activities. Officially, Americans are free to 
adopt any religion or to adhere to none, but many 
Americans are unfree to be agnostics with respect 
to certain nonofficial groups who subject “atheists” 
to all kinds of informal sanctions. 
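
The two definitions above can be restated as a small sketch. It is offered only as an illustration under stated assumptions: the names `Constraint`, `is_unfree`, and `is_free`, and the actor and action labels, are hypothetical and do not come from the article. The sketch treats social unfreedom as a relation among two actors and one action, and social freedom as a relation among two actors and a set of at least two alternative actions.

```python
from typing import Iterable, NamedTuple, Sequence


class Constraint(NamedTuple):
    """One way in which actor `by` limits actor `on` (hypothetical model)."""
    by: str      # actor B, who imposes the limitation
    on: str      # actor A, who is limited
    action: str  # action x
    kind: str    # "impossible" or "punishable"


def is_unfree(constraints: Iterable[Constraint], a: str, b: str, action: str) -> bool:
    """A is unfree to do `action` with respect to B if and only if
    B makes it either impossible or punishable for A to do it."""
    return any(
        c.by == b and c.on == a and c.action == action
        for c in constraints
    )


def is_free(constraints: Sequence[Constraint], a: str, b: str,
            alternatives: Sequence[str]) -> bool:
    """A is free, with respect to B, to do any one of `alternatives`
    if and only if B makes none of them impossible or punishable."""
    if len(alternatives) < 2:
        raise ValueError("freedom refers to at least two alternative actions")
    return not any(is_unfree(constraints, a, b, x) for x in alternatives)


# Illustrative data loosely based on the examples in the text.
constraints = [
    Constraint("government", "citizen", "withhold taxes", "punishable"),
    Constraint("nonofficial group", "citizen", "be agnostic", "punishable"),
]

# Not unfree to pay taxes, yet not free to pay or withhold either.
print(is_unfree(constraints, "citizen", "government", "pay taxes"))
print(is_free(constraints, "citizen", "government", ["pay taxes", "withhold taxes"]))

# Free with respect to the government, unfree with respect to another actor.
religion = ["adopt a religion", "be agnostic"]
print(is_free(constraints, "citizen", "government", religion))
print(is_free(constraints, "citizen", "nonofficial group", religion))
```

The sketch makes the asymmetry explicit: "not unfree to do x" concerns a single action, whereas "free" is asserted only of a set of alternatives, and both are always relative to a particular second actor.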

Testing statements about social freedom. 
Whether an actor was unfree to do what he actu- 
ally did can be determined with certainty, but only 
ex post facto. If A’s attempt to do x was frustrated 
by B, or if A succeeded in doing x but was penal- 
ized by B for having done so, it follows by definition 
that A was, with respect to B, unfree to do x. That 
A is unfree to do x, or that A was or is or will be 
free to do x or z, are empirical hypotheses that can 
be asserted only with a certain degree of prob- 
ability, depending on the answers to such questions 
as: were A to carry out x, would B penalize him? 
If 60 per cent of all speeders in France are con- 
victed, every French driver is to that extent unfree 
to speed, regardless of how many comply. A per- 
son’s social freedom does not depend on his actual 
behavior. We often perform actions that we are 
unfree to do (for example, speeding) and refrain 
from actions that we are free to perform (for ex- 
ample, driving at any speed lower than the speed 
limit). 

Social freedom and political freedom. Relation- 
ships of interpersonal or social freedom and un- 
freedom may hold between any two persons or 
groups, for example, members of a family, buyers 
and sellers, legislature and executive, pope and 
emperor, members of the Common Market. A gov- 
ernment’s freedom may or may not be limited by 
an international organization, another government, 
a church, its own citizens, some interest group 
within or outside of its jurisdiction, etc. Political 
freedoms are a subclass of social freedoms and usu- 
ally refer to the freedom of citizens or associations 
with respect to the government. Interest in political 
liberty has in various periods of history centered 
on freedom of religion, of speech and writing, of 
association (religious, political, economic), and of 
participation in the political process (suffrage). 
The idea of political freedom has been extended to 
cover demands for economic liberty, “freedom from 
want,” national self-determination, etc. 

Social freedom and control. Social unfreedom 
and control are overlapping categories. By prevent- 
ing A from doing x, B makes A unfree to do x and 
controls his behavior. If B punishes A for having 
done x, one may infer that A was, with respect to 
B, unfree to do x; but B did not control A’s be- 
havior, since his threat of punishment failed to 
deter A from doing x. Influence is another form of 
control; however, if B succeeds, for example, in 
persuading A to vote Democratic, B does not there- 
by restrict A’s freedom to vote Republican (or 
Democratic). Here, both control and freedom rela- 
tionships hold between B and A. 

Social freedom and power. Although there may 
be unfreedom without control and control without 
unfreedom, the concept of power had best be taken 
as comprising both control and unfreedom relation- 
ships. If B either has influence over A’s not doing 
x, or prevents A from doing x, or makes it punish- 
able for him to do so, B may be said to have power 
over A in this respect. The example of B persuading 
A to vote Democratic illustrates that power and 
freedom relationships may hold between the same 
pair of actors. The same is true in the following 
situations: B has power over A with respect to a 
limited range of alternatives; A is free within that 
range. Government, for example, has the power to 
compel citizens to serve in the armed forces but 
may leave them free either to submit to the draft 
or to volunteer. A may be, with respect to B, free 
to do x, either because B has no power to limit A’s 
freedom or because he permits A to do x. The United 
States Congress is free to legislate as it pleases as 
far as the president is concerned, to the extent that 
he chooses not to exercise his veto power. To affirm 
that freedom of speech prevails in a given society 
is to refer to the following relationships of both 
freedom and unfreedom (and power) between any 
two of its members, A and B: A and B each leave 
the other free to say what he wants; with respect 
to B, A is unfree to prevent him from expressing 
his views, and vice versa; A and B are unfree to do 
so, not only with respect to each other but also with 
respect to the government, which protects every- 
body's right to free speech. 

Social freedom and legal rights. It is true both 
that liberty depends “on the silence of the law” 
(Hobbes) and that “where there is no law there is 
no freedom” (Locke). I am socially free to act in a 
certain way only if (1) there is no effectively en- 
forced law prohibiting or ordering me to do so and 
if (2) I have an effectively protected legal right to 
that effect, that is, if all others are unfree to hinder 
me from doing so. We must distinguish between 
freedom in the behavioral sense and in the legal 
sense. All drivers have the legal duty not to speed, 
but they are socially unfree to speed only to the 
extent that speeders are actually fined. Thus, driver 
A, who sped on a particular occasion without being 
detected, was socially free to do so on that occasion, 
even though he had no legal right to that effect. 
If 40 per cent of all speeders in France escape con- 
viction, French drivers are to that extent socially 
free to speed. 


Other descriptive meanings 


Freedom of choice. Whereas social freedom re- 
fers to two actors and their respective actions, free- 
dom of choice signifies a relationship between one 
actor and a series of alternative potential actions. 
“A has freedom of choice as to x or z” means that 
it is possible for A to do either x or z, or that both 
x and z are open as well as avoidable to A, or that 
A will bring about x provided he chooses to do x. 
Conversely, if it is either impossible or necessary 
for A to do x, A has no freedom of choice as to x. 
It is in this sense that Hume defines liberty as “the 
power of acting or not acting, according to the 
determinations of the will.” Freedom of choice is 
neither a necessary nor a sufficient condition for 
social freedom. If A cannot do x, he is unfree to 
do so only if his inability has been caused by some 
other actor B. Otherwise, A remains free to do x, 
even though he has no freedom of choice as to 
x. Most men are unable, yet at liberty, to become 
millionaires or to run for high political office. Un- 
employment during a recession diminishes freedom 
of choice, not social freedom, unless the recession 
itself can be causally linked, for example, to spe- 
cific governmental policies. The high cost of tele- 
vision time renders this medium inaccessible to 
most; this restricts freedom of choice for prospec- 
tive broadcasters, not their freedom of speech. 
Everybody is socially “free to sleep under bridges” 
or at home, including the homeless, who have no 
choice in the matter. (In all such cases, the illusion 
of paradox arises because the actor is likely to value 
the opportunity he lacks, not the freedom he has.) 
Conversely, we do have freedom of choice with 
respect to most punishable actions; we can be made 
unfree to do them precisely because they are open 
to us. 

Free will. Indeterminists often hold that human 
beings have “free will,” that is, that their actual 
choices and resulting behavior are not causally de- 
termined but constitute chance events. Determin- 
ists can with perfect consistency deny this doctrine 
and yet affirm that men often have freedom of 
choice. They argue that the fact that A has the 
choice of doing either x or z does not preclude the 
possibility of explaining and predicting A’s actual 
choice by virtue of causal (for example, psycho- 
logical or sociological) laws. 

Free actions. Of an action itself, it can be said 
that it was either a free or an unfree one, as when 
we say: “this murder was a free action,” or “he paid 
his taxes, but not freely.” Involuntary behavior is 
unfree, and so are nondeliberate actions, for exam- 
ple, those that the actor has been conditioned to 
perform. Voluntary actions are free, unless they 
are motivated by fear of punishment. A’s handing 
over his money to B, who points a gun at him, is 
an unfree action (yet it is a voluntary action, de- 
termined partly by B’s threat and partly by A’s 
desire to save his life). But if A refuses to comply 
with B, then A acts freely. One may do freely what 
one is unfree to do. Again, if B persuades A to do 
x without threats of punishment, A’s action x is a 
free one. Sometimes, however, “free” is used more 
broadly to refer to actions that are autonomous, 
that is, determined exclusively by the actor's own 
decisions and not by the influence of others, as 
when John Stuart Mill said: “The only freedom 
which deserves the name is that of pursuing our 
own good in our own way.” 

Free persons. “Free” often refers to a character- 
istic, not of actions but of persons. A person may 
be said to be free to the extent that he has the 
disposition to act freely, or to act autonomously, or 
to develop his capacities to the fullest. Marx, for 
example, prophesied a society “in which the free 
development of each is the condition for the free 
development of all.” “Freedom” becomes a synonym 
for “self-realization.” Laski uses the term almost 
exclusively in this sense in his article on liberty 
(1933). 

Feeling free. Liberty is often said to consist in 
doing what one desires. It would be more accurate 
to say that an actor feels free to the extent that he 
does what he wants. Freedom as a state of mind 
must be distinguished from freedom as a state of 
affairs. Among the things I want to avoid doing, 
there may be some I am free to do and others I am 
unfree to do. Some persons derive a feeling of 
freedom from the fact that they are left free to act 
out any one of several alternatives. Others feel free 
when they “escape from freedom” into submission 
to some authority that conditions them to want to 
do its will. Dostoevski’s grand inquisitor plays on 
these two meanings of the word: “Today, people 
are more persuaded than ever that they have per- 
fect freedom; yet they have brought their freedom 
to us and laid it humbly at our feet.” 

A free society. Is it legitimate to use “free” as 
a characteristic of a group, such as when democ- 
racy is held to be a free society? There is no such 
thing as freedom in general; every organized soci- 
ety consists of an intricate network of specific rela- 
tions of both freedom and unfreedom. Citizens in 
a democracy have the political freedom to partici- 
pate in the governmental process through “free” 
elections. Voters, parties, and pressure groups are 
thereby empowered to limit the freedom of their 
elected officials. Democracy also requires that “civil 
liberties” be protected by legal rights and duties, 
and these duties again imply limitations of free- 
dom. In a perfect dictatorship, the ruler has unlim- 
ited freedom with respect to his subjects, whereas 
they are totally unfree with respect to him. In a 
democracy, both liberties and restrictions of free- 
dom are distributed more evenly, for example, 
among the various branches of government, be- 
tween government and governed, majority and 
minority. Equal freedom, not more freedom, is the 
essence of democracy. (Strictly speaking, it is not 
meaningful to say that there is “more” freedom in 
one society than in another; but it is possible to 
define degrees of social freedom in the sense that 
one actor has greater freedom in a certain respect 
than another.) A society in which liberties are 
evenly distributed may be called a free society. 
However, here we come close to using “freedom” 
in a valuational sense: a society is free in which 
those and only those freedom relations hold that 
are desirable. 


Valuational meanings 


Because of the laudatory connotation of the word 
“freedom,” writers have been inclined to define it to 
cover those and only those relationships of both 
social freedom and unfreedom that they happen to 
value and wish to commend to others. Such per- 
suasive definitions of freedom are useful not as 
tools of the empirical social sciences but as rhe- 
torical devices; they enable the writer to express 
his normative views in assertive form. For example, 
by stating that “to obey the laws laid down by so- 
ciety is to be free,” Rousseau in effect exhorts citi- 
zens to obey such laws; he is not trying to expli- 
cate the meaning of freedom. Persuasive definitions 
of freedom have been used to propound almost 
every political ideology, as is illustrated by the 
following examples. 

Freedom as protection of basic rights. Classical 
liberalism from Locke to Spencer and his followers 
advocated that government ought to restrict a per- 
son’s freedom when and only when necessary to 
protect another person’s basic rights (often held to 
correspond to natural rights). Accordingly, “no 
society in which these liberties are not, on the 
whole, respected, is free” (Mill). Conversely, a so- 
ciety is free, provided it is based on these laissez- 
faire principles. And a person who enjoys these 
legal rights and is subject to the corresponding 
legal duties is free, however unfree he may be in 
other regards and with respect to actors other than 
the government, for example, because of economic 
exploitation or social pressure. Thus, the United 
States Supreme Court once held that minimum- 
wage and maximum-hour laws violate the constitu- 
tional principle of liberty, because such regulations 
are not necessary to the protection of basic rights 
but constitute “arbitrary” limitations of “freedom 
of contract” of both employer and employee. 

Freedom as satisfaction of basic needs. Neo- 
liberals point out that the right to acquire the 
necessities of life is of little value to those who lack 
the opportunity to acquire them, that government 
ought to make them available to all, and that this 
may require governmental restriction of individual 
freedom through regulations concerning public 
health, education, and welfare. Social welfare, not 
social freedom, is their ultimate goal; but they still 
use the word “freedom” to designate this end. “Per- 
sonal freedom means, in fact, the power of the 
individual to buy sufficient food, shelter, and 
clothing” (Sidney and Beatrice Webb). And so, 
“the distinction between welfare and liberty breaks 
down altogether” (Ralph Barton Perry). Converse- 
ly, those who are unable to bring about what so- 
ciety ought to enable them to achieve, but who are 
free with respect to the government to make the 
attempt, are said to lack “true freedom.” “Freedom 
from want,” unlike freedom of speech, does not 
refer directly to social freedom, but to absence of 
want and presence of a satisfactory living standard 
for all. It is only in an indirect sense that “neces- 
sitous men are not free men” (Franklin D. Roose- 
velt). They have little freedom of choice and are 
socially unfree with respect to the economically 
powerful. “Freedom” is applied not only to the wel- 
fare goal itself but also to whatever restrictions of 
social freedom are deemed necessary to achieve it. 
The Supreme Court now interprets liberty to be 
compatible with minimum-wage laws and other 
“reasonable regulations and prohibitions imposed 
in the interest of the community.” “Freedom” in- 
cludes desirable social unfreedom and excludes 
undesirable social freedom. 

Freedom as government by consent. This per- 
suasive definition of freedom is used to express the 
norm that government ought to be based on con- 
sent of the governed, and this usually means rep- 
resentative government and majority rule. For 
example, “the liberty of man in society is to be 
under no other legislative power but that estab- 
lished by consent in the commonwealth” (Locke). 
Under such a system, men are free because their 
freedom is limited only by measures in the enact- 
ment of which they were free to participate. With a 
slight shift in emphasis, “freedom” stands no long- 
er for the government's duty to be responsive to the 
will of the citizens but for the citizen’s duty to obey 
governmental enactments reflecting the will of the 
majority or the “general will.” According to Rous- 
seau, the citizen is free whether he fulfills this 
obligation freely or whether he has been “compelled 
to be free.” And so “freedom” comes to refer no 
longer to having the choice of acting in one way or 
another, but to acting in no other way than that 
prescribed by authority. 

Freedom as moral constraint. The definitions 
of freedom taken up so far, including even the 
persuasive ones, are made up entirely of descrip- 
tive terms. However, definitions of freedom often 
include ethical words, such as “right,” “ought,” or 
“virtue.” In such cases, not only the term to be 
defined (freedom) but also the defining expression 
has valuational meaning. For example, “Liberty 
can consist only in the power of doing what we 
ought to will” (Montesquieu). Similarly, a person 
is often said to be free, not if he acts freely or de- 
velops his capacities, but if he realizes his “best” or 
“essential” self. For example, “Liberty may be de- 
fined as the affirmation by an individual or group 
of his or its own essence” (Laski 1933, p. 444). 
Some have held that a person is most likely to 
realize his essence if he is left free to choose for 
himself. 

According to another tradition, which extends 
from Plato via the Stoics and Christian thought to 
Neo-Hegelianism, man reaches the highest form of 
self-realization by submitting to some moral norm 
imposed by his own “higher self,” which is usually 
identified with faith, reason, or moral conscience. 
“I call him free who is led solely by reason” 
(Spinoza). “Obedience to a law which we prescribe 
to ourselves is liberty” (Rousseau). Freedom no 
longer signifies the absence of unwelcome, but the 
presence of welcome restraints. “For freedom is not 
acquired by satisfying yourself with what you de- 
sire, but by destroying your desire” (Epictetus). In 
short, freedom is unfreedom to do wrong, whereas 
freedom to deviate from the prescribed path is 
license. “If unbridled license of speech and of writ- 
ing be granted to all, nothing will remain sacred 
and inviolate. . . . Thus, license will gain what 
liberty loses” (Encyclical Libertas 1888). 


If “freedom” becomes a label for anybody’s moral 
or political ends, then everybody’s value commit- 
ment to freedom will be vacuous. All will agree that 
liberty is the supreme good, but they will agree on 
nothing else. Meaningful disagreement about the 
value of freedom presupposes agreement about 
the meaning of freedom in nonvaluational terms. 
The concept of social freedom provides an adequate 
basis for a fruitful discussion of the normative, as 
well as the empirical, aspects of liberty. In this dis- 
course, the divergent views about which social free- 
doms ought to be extended or limited will depend 
on the value one assigns to such other social goals 
as equality, justice, or welfare, which may com- 
pete with the goal of freedom. 

FELIX E. OPPENHEIM 


[See also DEMOCRACY; EQUALITY; JUSTICE; NORMS; 
POLITICAL THEORY; VALUES.] 
FRENKEL-BRUNSWIK, ELSE 


Else Frenkel-Brunswik (1908-1958) was born of 
Polish-Jewish parents in the town of Lemberg, then 
part of the Austro-Hungarian Empire, and received 
her doctorate in psychology in 1930 at the Uni- 
versity of Vienna. She continued at the university, 
as lecturer and research associate, until shortly 
after the Nazi Anschluss in 1938. With her hus- 
band, Egon Brunswik, she then moved to the Uni- 
versity of California at Berkeley, where she re- 
mained until her death. 

The professional life of Else Frenkel-Brunswik 
covers a period that was not one of great new the- 
ories or startling discoveries in psychology. It was, 
rather, a time of expansion and ferment. The 
boundaries of academic psychology were being ex- 
tended from the study of segmental psychological 
processes in the experimental laboratory to a more 
profound conception of human personality in its 
development over time and its engagement with a 
complex sociocultural environment. Psychologists 
sought to grasp the implications of the revolution- 
ary changes stemming from psychoanalysis, from 
the physical sciences, and from cataclysmic soci- 
etal developments. To an unusual degree, Frenkel- 
Brunswik’s career reflected these major intellectual 
and social developments and at the same time 
influenced the course of psychology. 

Her doctoral thesis (1931) under Karl Bühler 
attempted a rapprochement between traditional as- 
sociationism and the newer concepts of gestalt psy- 
chology. Subsequently, on the staff of the Psycho- 
logical Institute, headed by Charlotte Bühler, she 
collaborated in a series of developmental studies 
(see, for example, Frenkel & Weisskopf 1937) de- 
rived from the biographies of 400 persons selected 
from various historical periods and walks of life. 
This research, although theoretically limited, exem- 
plifies enduring features of her approach: the de- 
velopmental view of personality; the combined use 
of phenomenological and behavioral data and of 
multiple assessment methods; and the functional 
view of behavioral striving in terms of adaptation 
and goal achievement. 

The situation in Vienna provided three additional 
formative influences. First, there was the philo- 
sophical influence of the logical positivism of the 
“Vienna circle,” which included such men as Neu- 
rath, Schlick, and Carnap. This produced an endur- 
ing concern with problems of theory development, 
conceptual and operational definition of terms, and 
the role of inference in personality measurement. 

Second, there was the influence of psychoanal- 
ysis, which was of fundamental importance in 
shaping her future work. She had a personal anal- 
ysis in Vienna. Although she did not become a prac- 
ticing psychoanalyst, a clinical mode of thought 
continually informed her research, her teaching, 
and her experience of life. One of her initial studies 
drawing upon psychoanalytic theory was “Mecha- 
nisms of Self-deception” (1939). 

A third influence derived from the sociopolitical 
situation during the 1930s. Frenkel-Brunswik was 
deeply affected by the tragic events of that period. 
However, it was only after she came to the United 
States that these concerns were explicitly reflected 
in her intellectual work. 

Although the emigration from Austria in 1938 
was in many respects painful, her situation in the 
United States offered tremendous opportunity and 
stimulation. She flourished there. Her first position 
in California was at the Institute of Child Welfare, 
and she maintained this association for the rest of 
her career. 


Personality theory 


One of her first publications in the United States 
was the paper “Psychoanalysis and Personality Re- 
search” (1940), which was read at the now-his- 
toric Symposium on Psychoanalysis by Analyzed 
Experimental Psychologists. In this paper, she 
wrote, not as an “analyzed experimental psycholo- 
gist,” but as a personality theorist in the broadest 
sense. She recognized that there was complemen- 
tarity as well as conflict between the psychoana- 
lytic, phenomenological, and behaviorist schools. 
Rather than starting with one of these theoretical 
positions, she sought to identify, in terms meaning- 
ful to all of them, the basic theoretical domains in 
the study of personality. She posited four such do- 
mains: the central and the peripheral levels of per- 
sonality (following and enlarging upon Henry A. 
Murray’s formulation); and the proximal and distal 
effects of behavior upon the environment (a 
perspective on behavior utilizing the distinction 
made by gestalt theorists and by Egon Brunswik). 
A truly comprehensive personality theory, she main- 
tained, must encompass relationships between all 
four domains rather than being encapsulated within 
one or two. She argued also for the combined use 
of clinical and experimental modes of investigation. 
Although many of her then-radical proposals are 
now widely accepted, this paper stands as a fresh 
statement of the theoretical and methodological 
tasks of dynamic personality research. 

In her monograph “Motivation and Behavior” 
(1942) Frenkel-Brunswik applied and developed 
further the theoretical position outlined in her 1940 
article. This empirical study of adolescents made 
use of concepts and measures in several theoretical 
domains. There were self-reports obtained through 
questionnaires, observers’ ratings of behavior in 
specific social situations, projective test material, 
and clinical ratings of underlying (central) motives 
based upon inferences from a wide variety of ob- 
servational and test data. The concept of alternative 
manifestation was developed and operationalized, 
to help account for the relationships of underlying 
motives to overt behavior and conscious self-report, 
and theoretically sophisticated use was made of 
partial and multiple correlation techniques. In this 
study Frenkel-Brunswik provided one of the earliest 
models of systematic personality research. She dealt 
with various levels of personality, seen as dynam- 
ically interdependent, and she contributed to the 
methodology of inferential clinical ratings, relating 
them to more directly behavioral measures and giv- 
ing them a legitimate place within the academic 
psychological investigation of personality. Her work 
on the clinical ratings was innovative in at least two 
respects. On the one hand, she developed techniques 
to increase the explicitness and reliability of clin- 
ical inferences. On the other, she anticipated the 
later research on social perception in her study of 
the influence of the rater’s personality on his judg- 
ments of subjects’ motivation and behavior. 

Her sociopsychological interests took form in the 
early 1940s. In 1942 she had a Social Science Re- 
search Council fellowship for work in sociology and 
anthropology at the University of Chicago, Harvard, 
the Langley Porter Institute, and with Kroeber and 
others at the University of California. In 1943 she 
joined Sanford and Levinson in a study of per- 
sonality and prejudice, and Adorno became their 
collaborator soon after. The study expanded over 
the next few years and was published under the 
title The Authoritarian Personality (1950). During 
this period she was also engaged in a related study 
of prejudice in children and families (see Frenkel- 
Brunswik & Havel 1953). 


“The Authoritarian Personality” 


Frenkel-Brunswik made certain distinctive and 
specifiable contributions to The Authoritarian Per- 
sonality; these will be noted below. She also con- 
tributed in significant but less identifiable ways to 
the character of that work as a whole. The basic 
theoretical conception and methodology of the re- 
search were an emergent product of intimately col- 
laborative effort by the four coauthors and reflected 
their common intellectual concerns. Without at- 
tempting to allocate specific credit to individual 
authors, I shall simply indicate the major features 
of this comprehensive project. It will be evident 
that they express Frenkel-Brunswik’s earlier inter- 
ests and are of importance in her subsequent work. 

Theoretically, The Authoritarian Personality rep- 
resented a convergence of previously disparate 
viewpoints: attitude theory in academic psychol- 
ogy; the sociological analysis of ideology; and psy- 
choanalysis and related personality theories, nota- 
bly that of Murray. Consciously held attitudes, 
values, and ideologies were regarded as aspects of 
personality. They were related to contemporaneous 
character traits, cognitive-emotional styles, ego de- 
fenses, and less conscious wishes and fantasies. 
The formation of ideology was seen in developmen- 
tal perspective and was related to more general 
processes of ego and superego development. In ret- 
rospect, the approach may be seen as a form of 
psychoanalytic ego theory; that is to say, it drew 
upon traditional psychoanalytic concepts and at 
the same time emphasized and conceptualized ego 
processes that link the person more directly with 
his social environment. This approach, now widely 
held in various forms, was at the time a major in- 
novation in social psychology. 

Methodologically, the research on authoritarian- 
ism involved the conjoint use of previously dis- 
parate techniques: scales and other questionnaire 
devices, projective techniques, and semistructured 
clinical interviews. Intensive case studies were used 
to develop hypotheses and to guide the explicit for- 
mulation of variables that were then measured by 
means of questionnaires administered to groups. 
The continuing interplay of clinical and statistical 
analysis was an essential feature of the methodol- 
ogy. The Authoritarianism (F) Scale, perhaps the 
most widely used and most poorly understood prod- 
uct of the study, was one result of this interplay. 

The most clearly identifiable contributions by 
Frenkel-Brunswik are in the chapters bearing her 
name. These deal with the construction and anal- 
ysis of the intensive, semistructured interviews. 
The interviews were standardized in the sense that 
they covered a predefined set of topical areas and 
theoretical issues, and they required considerable 
advance training of the interviewers. At the same 
time they had a clinically “open” quality in that 
the interviewer tried to follow the threads of the 
subject’s thought and feeling; within the loose struc- 
ture of the topical areas and issues, the content and 
sequence of the interview were largely determined 
by the interviewee. This was, in short, a quali- 
tatively distinct hybrid—a cross between the sur- 
vey and the therapy interview—and an important 
contribution to personality-social research. 

In the analysis of the interviews Frenkel-Bruns- 
wik developed an extensive series of categories that 
distinguish “high-authoritarian” from “low-authoritarian” 
subjects. The rating of each category required 
clinical inference on the part of the rater. 
To help control for the subjectivity in this process, 
the categories were defined in some detail and were 
scored by two independent raters, so that reliability 
could be determined. Categories were developed in 
three broad domains: parents and childhood; sex, 
people, and self; and dynamic and cognitive per- 
sonality organization. The interpretive integration 
of findings in these domains led to the formulation 
of a smaller number of major themes and patterns. 

One of the concepts developed here (and given 
greater prominence in her subsequent research) is 
intolerance of ambiguity (1949). This concept has 
a complex, many-faceted character. Its referents 
include a tendency toward oversimplification in perceptions 
and conceptions of the external world; a 
deep uneasiness about ambiguity and lack of order 
in personal relationships and social structures; a 
proclivity to engage in moralistic, all-good or all-bad 
value judgments, without recognition of moral am- 
biguities; an inability to acknowledge ambivalence 
(emotional ambiguity) in one’s experience of par- 
ents and significant others. This concept thus has 
motivational, emotional, cognitive, and moral as- 
pects, and it refers to the person’s experience of 
both self and external world. Although subsequent 
evidence has not fully supported Frenkel-Bruns- 
wik’s initial formulation of intolerance of ambiguity 
as a relatively unified, pervasive trait, it does indi- 
cate the usefulness of this kind of multiple-compo- 
nent concept as a genotype underlying diverse overt 
behaviors. 


Later career 


The period from 1950 to 1955 was a lively and 
productive one for Frenkel-Brunswik. She visited 
Europe for the Rockefeller Foundation and reported 
on the condition of European psychology. She wrote 
a number of papers that reflected the growing 
breadth of her theoretical interests: on the inter- 
action of sociological and psychological variables 
(1952; 1954b); on perceptual-cognitive functioning 
as an aspect of personality (1951); on psychoanalysis 
and the unity of science (1954a; 1957). 
In 1953 she began a study of aging; this work was 
completed and published by colleagues (Reichard 
et al. 1962) after her death. In 1954/1955, she was 
a fellow at the Center for Advanced Study in the 
Behavioral Sciences. 

This period of great vitality was cut short by the 
tragic death of her husband, Egon, in 1955—a blow 
from which she never fully recovered. Her final 
dream had been a major treatise on values, a book 
that would bring together her lifelong interests in 
psychoanalysis, sociology, and philosophy. This 
hope was ended by her own premature death in 
1958. 

Frenkel-Brunswik’s intellectual commitment, 
warmth, and intensity were reflected in her rela- 
tionships with students. Although she did not do 
much formal teaching, she developed unusually 
close relationships through seminars, research col- 
laboration, and informal contacts. She had a charis- 
matic femininity. The number of present-day psy- 
chologists who feel a special gratitude to her and 
who regard her as a major formative influence in 
their personal and professional development is large 
indeed. 

The contributions of Frenkel-Brunswik can perhaps 
best be grasped within the context of the radical 
new developments in American psychology during 
the period of roughly 1935 to 1950. She was a 
founding figure in the establishment of the new 
fields of personality psychology and social and clin- 
ical psychology. She forged important links between 
psychoanalysis and academic personality research. 
She contributed to the development of a conception 
of man that takes account of both reason and pas- 
sion, of the most primitive and the most mature, 
of the autistic and the socially embedded aspects 
of his nature. Finally, she was multidisciplinary in 
the best sense of the word. She was ready to engage 
in disciplined search for the relevant, no matter 
how far it led from her disciplinary origins. 


DANIEL J. LEVINSON 


[For the historical context of Frenkel-Brunswik’s work, 
see GESTALT THEORY; PERSONALITY: CONTEMPORARY 
VIEWPOINTS, article on COMPONENTS OF AN EVOLVING 
PERSONOLOGICAL SYSTEM; and the biographies 
of BRUNSWIK; BÜHLER; SCHLICK. For discussion of 
the subsequent development of Frenkel-Brunswik’s 
ideas, see ATTITUDES; PERSONALITY, POLITICAL, article 
on CONSERVATISM AND RADICALISM.] 


FREUD, SIGMUND 


Sigmund Freud was born May 6, 1856, in Frei- 
berg, Moravia (now Czechoslovakia), and died 
September 23, 1939, in exile in London. When 
the boy was three, his father, a small wool mer- 
chant, was forced by economic reverses to move 
for a year to Leipzig and thence to Vienna, where 
Freud spent the rest of his life—1860 to 1938— 
except for his last year. His biographers agree that 
the unusual structure of the family into which he 
was born was partly responsible for his interest in 
intimate human relationships: Freud’s father had 
two sons by his first wife; when he remarried after 
her death, it was to a woman of their age. Sig- 
mund, her first child, often played with his year- 
older nephew. A brother who was born when 
Sigmund was not yet a year old died after eight 
months; then came four sisters and another 
brother. 

A dedicated student, Freud graduated summa 
cum laude from the Gymnasium at age 17 and 
entered the University of Vienna medical school. 
After three years Freud became deeply involved in 
research, which delayed his M.D. until 1881; re- 
search was to remain his main interest. In 1882 he 
met and became engaged to Martha Bernays, and 
he began clinical training in order to be able to 
earn a living from the practice of medicine. He 
continued research and publishing, was made Do- 
zent, and received a grant in 1885 to study for sev- 
eral months with Charcot in Paris. The next year 
he married and began practicing neurology; three 
sons and three daughters were born between 1887 
and 1895. 

Since existing therapies were not effective for 
his patients, most of whom were neurotic, he turned 
to hypnotic suggestion and in 1889 briefly visited 
Bernheim and Liébeault to perfect his technique. 
He learned a more helpful method, however, from 
a close friend, Josef Breuer, whose patient “Anna 
O.” (Bertha Pappenheim) had managed to over- 
come some hysterical symptoms by talking freely 
about the circumstances of their first occurrence. 
Freud’s successful experiences with and modifica- 
tions of this “cathartic” treatment were reported in 
the book he wrote jointly with Breuer, Studies on 
Hysteria (1893-1895). For the next five years, he 
continued to develop this psychotherapeutic meth- 
od into psychoanalysis, gradually withdrawing 
from neurology, although by then he had an inter- 
national reputation in that field. [See HYSTERIA; 
PSYCHOSOMATIC ILLNESS.] 

The way his own self-analysis contributed to the 
growth of his ideas during this period may be seen 
in the letters and drafts of papers sent to a Berlin 
colleague, Wilhelm Fliess, who became a close 
friend and confidant (see 1887-1902; 1895). The 
first major statement of his theories was The Inter- 
pretation of Dreams (1900). In 1902 he was made 
professor extraordinarius at the University of 
Vienna, and about that time his publications and 
lectures began to attract a group of followers, 
which became in 1908 the Vienna Psychoanalytical 
Society. 

The principal focus of Freud's life thereafter was 
the growth of psychoanalysis—as a theory, a form 
of treatment, and a movement. The movement did 
not remain monolithic: dissident followers who 
withdrew and formed their own schools included 
Adler (in 1911), Stekel (in 1912), Jung (in 1913), 
and Rank (in 1926). As Freud’s ideas began to 
become more widely known, they attracted respect 
and scientific interest, but also met with a great 
deal of hostility as well as extreme rejection. A 
truly objective weighing of these two kinds of reac- 
tions at various periods of Freud’s career has yet 
to be done, but in any event Freud seems to have 
been more keenly aware of the negative than of 
the positive reception. An especially welcome early 
sign of recognition was the award of an honorary 
degree by Clark University in 1909, on which occa- 
sion he visited America with Jung, Ferenczi, and 
Jones and delivered a series of lectures. 

In 1923 he had the first of many operations for 
cancer of the upper jaw, which finally proved fatal. 
During his last 16 years, Freud suffered almost 
constant pain and difficulty in speaking because 
of an awkward prosthesis, but he continued psycho- 
analyzing and writing into his final year. Only after 
the Nazi Anschluss could he be persuaded to leave 
Vienna, though he often had declared his detesta- 
tion for the city. Long before the end, he had 
achieved world-wide acclaim and recognition as 
one of the decisive shapers of the twentieth century. 


The development of Freud’s ideas 


Freud’s first scientific contribution was pub- 
lished in 1877; his last was written only a few 
months before his death. Only the most superficial 
sketch of the development of his thought in the six 
hundred-odd papers and books he produced over 
these 63 years can be given here. There were four 
major and overlapping phases of that development. 

(1) His prepsychoanalytic work, which lasted 
about twenty years, may be subdivided into an 
initial ten years of primarily histological-anatomical 
research and a partly overlapping 14 years of 
clinical neurology, with increasing attention to 
psychopathology, beginning in 1886 when he re- 
turned from Paris. 

(2) The first theory of neurosis dates from the 
decade of the 1890s, when Freud used hypnosis 
and Breuer’s cathartic method of psychotherapy, 
gradually developing the psychoanalytic methods 
of free association, dream interpretation, and the 
analysis of transference. The first dozen truly psychoanalytic 
papers appeared during this time, expounding 
the view that neurosis is a defense against 
intolerable memories of a traumatic experience— 
infantile seduction at the hands of a close relative. 
With the discovery of his own Oedipus complex, 
however, Freud came to see that such reports by 
his patients were fantasies, which led him to turn 
his interest away from traumatic events in external 
reality and toward subjective psychic reality. A 
notable but only recently discovered event in the 
development of Freud’s thought occurred in 1895 
after the publication of the book he wrote with 
Breuer: he wrote but did not publish a “Psychology 
for Neurologists” (or “Project for a Scientific Psychology”; 
see Freud 1895), presenting a comprehensive 
anatomical-physiological model of the 
nervous system and its functioning in normal be- 
havior, thought, and dreams, as well as in hysteria. 
He sent it to Fliess in high excitement, then quickly 
became discouraged by the difficulties of creating 
a thoroughgoing mechanistic and reductionistic 
psychology, tinkered with the model for a couple 
of years in letters to Fliess, and finally gave it up. 

The turn of the century marked many basic 
changes in Freud’s life and work: he severed his 
close and dependent friendships with colleagues 
(first Breuer, then Fliess) and his contacts with 
the Viennese medical society; his father died; his 
last child was born; he psychoanalyzed himself; 
he gave up neurological practice, research, and con- 
ceptual models; and he created his own new pro- 
fession, research method, and theory, in terms of 
which he worked thereafter. 

(3) Freud’s topographic model was the founda- 
tion of two decades of work, during which he pub- 
lished his major clinical discoveries, notably, The 
Interpretation of Dreams (1900) and Three Essays 
on the Theory of Sexuality (1905); his papers on 
the technique used in psychoanalytic treatment; 
his five major case histories; the central works of 
metapsychology; and a series of important surveys 
and popularizations of his ideas, in addition to his 
principal applications of his theories to jokes, liter- 
ature and art, biography, and anthropology. A com- 
plete or metapsychological explanation, Freud 
wrote in 1915, requires “describing a psychical 
process in its dynamic, topographical and economic 
aspects”—that is, in terms of a theoretical model 
in which the central concepts are psychological 
forces, structures, and quantities of energy (Rapa- 
port & Gill 1959). Hence, we speak of three meta- 
psychological points of view. The topographic 
model, which was first set forth in Chapter 7 of 
The Interpretation of Dreams and was further elab- 
orated in the metapsychological papers (1915), 
conceptualizes thought and behavior in terms of 
processes in three psychological systems: the conscious, 
preconscious, and unconscious (none of 
which has an explicit locus in the brain). 

(4) In the final period, extending between the 
two world wars, Freud made four main types of 
contributions: the final form of his theory of instinctual 
drives (Beyond the Pleasure Principle, 
1920); a group of major modifications of both general 
and clinical theory, most notably the structural 
model of the psychic apparatus (The Ego and 
the Id, 1923) and the theory of anxiety and defense 
(Inhibitions, Symptoms and Anxiety, 1926); appli- 
cations of psychoanalysis to larger social problems; 
and a group of books reviewing and reformulating 
his theories. 

To grasp the structure of Freud’s work, it is 
useful not only to adopt such a developmental 
approach but also to view his theories from the 
perspective of the following threefold classification: 

First and best known is the clinical theory of 
psychoanalysis, with its psychopathology, its ac- 
counts of psychosexual development and character 
formation, and the like. The subject matter of this 
type of theorizing consists of major events (both 
real and fantasied) in the life histories of persons, 
events occurring over spans of time ranging from 
days to decades. This theory is the stock in trade 
of the clinician—not just the psychoanalyst, but 
the vast majority of psychiatrists, clinical psychol- 
ogists, and psychiatric social workers. Loosely re- 
ferred to as “psychodynamics,” it has even pen- 
etrated into general academic psychology via 
textbooks on personality. 

Second, there is what Rapaport (1959) has 
called the general theory of psychoanalysis, also 
called metapsychology. Its subject matter—proc- 
esses in a hypothetical psychic apparatus or, at 
times, in the brain—is more abstract and imper- 
sonal; and the periods of time involved are much 
shorter—from fractions of a second up to a few 
hours. The processes dealt with are mostly those 
occurring in dreams, thinking, affect, and defense; 
Freud’s reasoning in working out this theory is 
much closer, and he made more use of theoretical 
models of the psychic apparatus. The main works 
are the “Project for a Scientific Psychology,” Chap- 
ter 7 of The Interpretation of Dreams, and the meta- 
psychological papers. 

Third is what might be called Freud’s phylogenetic 
theory. The subject matter is man as a 
species or in groups, and the periods of time involved 
range from generations to eons. Here are 
Freud’s grand speculations, largely evolutionary 
and teleological in character; they contain no ex- 
plicit models of a psychic apparatus, employing 
instead many literary, metaphorical concepts. The 
principal works of this type are Totem and Taboo 
(1913), Beyond the Pleasure Principle (1920), 
Group Psychology and the Analysis of the Ego 
(1921), The Future of an Illusion (1927), Civiliza- 
tion and Its Discontents (1930), and Moses and 
Monotheism (1934-1938). 

His clinical contributions are among the earliest 
of Freud’s papers that are still being read, and he 
continued to write in this vein all of his life. As 
far as the other two types of theory are concerned, 
however, they overlap fewer developmental periods: 
the major metapsychological works came early, the 
main phylogenetic ones late. As Freud’s concepts 
became more metaphorical and dealt with such 
remote issues as man’s ultimate origins and the 
meaning of life and death, he became less con- 
cerned with describing or systematically accounting 
for the course and fate of an impulse or thought. 

The rest of this article will concentrate on hy- 
potheses and observations about two groups of 
Freud’s ideas: what now appear to have been his 
major, lasting contributions, those that have been 
most influential on and most assimilated into the 
behavioral sciences (not to mention literature, art, 
and other aspects of contemporary Western cul- 
ture); and his major errors, those concepts that 
have been most cogently criticized by psychoan- 
alysts and other scientists. Finally, it will discuss 
historical antecedents of his ideas and influences 
upon them. The literature on these topics is al- 
ready large and growing rapidly, so this survey 
must be highly selective. 


Major contributions and weaknesses 


Contributions. Freud may be said to have made 
five major contributions. 

(1) He based his work on the assumption of 
psychic determinism: the lawfulness of all psycho- 
logical phenomena, even the most trivial, including 
dreams, fantasies, and slips of the tongue. 

(2) The lastingly valued aspects of Freud’s com- 
plex doctrine of the unconscious include the gen- 
eral proposition that cognitive and other psycho- 
logical events can go on outside of awareness; the 
influence of unconscious motivation on behavior; 
and the special qualitative characteristics of un- 
conscious processes—the primary process and 
symbolism. The primary process is the kind of 
primitive functioning of the “psychic apparatus” 
that characterizes the unconscious id; indeed, it is 
the principal property by means of which the latter 
is defined. Processes characterized by magical 
rather than rational logic and by wishfulness—a 
seeking for immediate gratification of crude sexual 
or aggressive impulses—are called primary. Freud 
emphasized the concepts of displacement and con- 
densation of psychic energy in his conceptualiza- 
tion of the primary process and noted that it often 
makes use of symbols, which differ from other 
types of displacement substitutes in having been 
shared by many persons for generations. These 
were the main theoretical resources Freud called 
upon to explain dreams, neurotic symptoms, psychotic 
thought and language, normal character 
traits, myths, creative thought, art, and humor. 

(3) Of the many contributions Freud made to 
our understanding of sexuality, the following seem 
to enjoy the most acceptance: his stress on its great 
importance in human life generally; his broad defi- 
nition, which includes oral, anal, and other bodily 
pleasures and links them to the phallic-genital; 
his conception of its plasticity—it can be delayed, 
transformed, or fixated, and interest can be shifted 
from one “component drive” or “partial instinct” 
to another; his discovery that it appears early in 
human life (infants and young children masturbate, 
have sexual curiosity, etc.) and follows a 
typical developmental sequence; his insistence that 
bisexuality and “polymorphous perversity” are uni- 
versal endowments or potentialities; his explana- 
tion of sexual perversions as pathological develop- 
ments, not (or not wholly) as constitutional givens 
and not as sins; and his elaborations of many 
aspects of the Oedipus complex—the fact of inevi- 
table but tabooed incestuous attraction in families, 
the associated phenomena of anxiety about castration 
(or, more generally, mutilation), and of intrafamilial 
jealousy, hatred, and envy, much of it 
unconscious. 

(4) Three of Freud’s concepts—conflict, anxi- 
ety, and defense—are so interrelated that we may 
look on them as constituting one major contribu- 
tion. He saw the pervasive importance of conflict 
(not merely the traditional opposition of reason 
and passion, or ego versus id, but also ego versus 
superego and superego versus id) in both normal 
and abnormal behavior. One of his earliest insights 
was that defenses—structuralized means of con- 
trolling impulse and preventing the outbreak of 
anxiety, thus being in effect resolutions of conflict 
—are major factors in the formation of symptoms 
and character traits and are shaping influences on 
the organization of thought. He also described the 
specific mechanisms of defense, such as repression, 
projection, reaction formation, isolation, and mas- 
tery via the turning of passivity into activity. [See 
ANXIETY; CONFLICT, article on PSYCHOLOGICAL 
ASPECTS; DEFENSE MECHANISMS.] 

(5) A number of Freud’s lasting discoveries and 
insights make up the genetic point of view. He 
showed the necessity of knowing facts of develop- 
ment in order to understand personality; the im- 
portance of the events of early life for the main 
features of character, including the specific syn- 
dromes of the oral and anal character types as 
outgrowths of events at the corresponding psychosexual 
stages; the role of identification as a principle 
of learning and development; the importance 
of drive delay and control in development; and the 
nature of psychopathology as regression along a 
developmental path. 

As Shakow and Rapaport (1964) have pointed 
out, in each instance it is the general conception 
and the observations that have been accepted, not 
the specific concepts and the explanatory theory in 
which they are embedded. But this is to be ex- 
pected: theories necessarily age, and any theory 
in the behavioral sciences formed as long ago as 
Freud's is bound to contain many anachronisms, 
obsolete assumptions, and unfortunate turns of 
thought. 

Flaws. Three weaknesses in Freud’s work seem 
to have had the most extensive negative effects 
upon theory, research, and practice (see Holt 
1965a; 1966). However necessary they may have 
been to his positive contributions, they are logically 
separable from them. 

(1) His basic operating model (at first, of the 
central nervous system; later, of the psychic ap- 
paratus) was a passive reflex apparatus with no 
energies of its own, operating only to rid itself of 
inputs from the body (instinctual drive) and from 
the environment (reality), these inputs being con- 
ceptualized as quantities of energy (subjectively 
experienced as tension), and the regulative rule 
formulated as the principle of constancy. Conse- 
quently, the fundamental principle of motivation 
and affect is tension reduction. 

Even in Freud's fourth period, when he said little 
directly about it, the passive reflex model seems to 
have operated as a silent pressure in the following 
directions: 

(a) Freud tended to overemphasize quantitative 
as against qualitative aspects of behavior and 
thought, though only the latter were observable. 
The quantitative emphasis would have been more 
defensible if it had actually led to measurement, 
which is needed, but it did not. The result was a 
relative neglect of the phenomenology of affects in 
favor of a primary emphasis on pleasure and un- 
pleasure, including anxiety (Kardiner et al. 1959), 
and a relative neglect of the phenomenology of the 
primary process, such as the various specific forms 
taken by condensation and displacement, in favor 
of an elaborate theory about unmeasurable energies 
of various qualitatively and directionally specific 
types. 

(b) Similarly, Freud tended to reduce motiva- 
tion to the somatically based; for example, he 
assumed that love and affection are derivative 
forms of a seeking after sensuous pleasure. 

(c) There was a relative neglect of motives that 
do not easily fit the tension-reduction conception, 
such as curiosity and positive interest in stimuli 
and the seeking of challenges to master (White 
1963). 

(d) The passive reflex model suggests a sim- 
ple theory of pleasure and unpleasure as perceived 
concomitants of rises and falls in energic tension. 
Yet from the beginning, Freud was aware of con- 
flicting data, and he was never able either to 
abandon the original theory entirely or to account 
for the anomalous observations in a way that was 
consonant with the model. 

(e) There were other shortcomings in the meta- 
psychological economic point of view—the notion 
of a fixed and limited amount of energy which has 
to be withdrawn from one locus if used at another: 
for example, a scarcity economics of love, accord- 
ing to which the more one loves others the less 
self-esteem is possible—and in the quasi-vitalistic, 
unmeasurable, and overelaborated concept of psy- 
chic energy (Holt 1966). 

(f) The model was hospitable to the death in- 
stinct and nirvana principle as ultimate extensions 
of tension reduction. But outwardly directed ag- 
gression was difficult to fit into the model and was 
relatively neglected for years. 

(2) The second fundamental flaw was Freud's 
originally physicalistic conception of reality: basi- 
cally, as “masses in motion and nothing else” 
([1895] 1954, p. 369); more generally, as a welter 
of dangerous energies in which may be found some 
tension-reducing objects. In this conception, reality 
lacks significant organization on a large scale, as 
in the social structures or value systems (especially 
the latent ones) postulated by modern sociology 
and anthropology. To be sure, Freud did not con- 
sistently hold to such reductionism; he always dealt 
with meanings as such. Nevertheless, a physicalistic 
notion of reality lingered in his mind as an implicit 
conceptual ideal, with several consequences: 

(a) Without a way to conceptualize an endur- 
ing structure of society and culture, Freud needed 
to assume individual genetic transmission of stable 
but latent cultural themes by way of a Lamarckian 
inheritance of acquired characteristics and to as- 
sume also that much of what he observed was 
inherited and universal, not culturally specific (for 
example, the Oedipus complex, or the inferior 
status of women as determined by anatomical 
differences). 

(b) Freud tended to neglect the problems of 
adaptation and the relations to reality studied more 
recently in ego psychology. 

(c) There were problems in the theory of object 
relations: for example, the “primary hate of objects,” 
which the model requires, conflicts with the 
facts of infant observation; the theory also had 
difficulty in accounting for sustained interest and 
affection in a sexually consummated relationship, 
and psychoanalysts were led to a relative neglect of 
the natural history of subtleties in human relations. 

(3) Freud's third basic error was his unclear 
and ambivalent handling of the mind-body prob- 
lem, alternating between psychophysical parallel- 
ism and interactionism. The consequences began 
to show up as soon as he gave up the neurological 
model of 1895: he carried over its basic assump- 
tions (the passive reflex model and the implicitly 
physicalistic concept of reality) in his later, osten- 
sibly psychological, theories. There were several 
consequences of Freud's failure to take a consistent 
position on the mind-body problem. 

(a) The status of the basic model remained un- 
developed, unclear with regard to the existential 
status both of psychic energies and forces and of 
psychic structure. 

(b) Since the basic model was not made fully 
explicit, there followed a relative neglect of struc- 
tural considerations in favor of a “motivational 
reductionism” (Gill 1959), the explanation of the 
control and restraint of impulses (especially ag- 
gressive ones) in terms of instinctual fusion and 
defusion instead of in structural terms, and the 
neglect of adaptive and health-maintaining capac- 
ities in favor of an emphasis on pathology. 

(c) A further consequence was Freud’s tend- 
ency to reify functions as structures and to per- 
sonify theoretical entities of uncertain existential 
status. 

(d) Psychoanalysis became isolated from prog- 
ress in medical and physiological sciences because 
of the difficulty in assimilating their findings. 

It should be emphasized that none of these short- 
comings was as crippling as it might have been if 
the underlying assumptions had been applied rig- 
idly and consistently. As a group, they did steer 
Freud’s thinking to a demonstrable extent, but 
thanks to the creative looseness of his cognitive 
style (Holt 1965b) he was able to observe much 
and to develop many specific theories that were 
logically incompatible with them. 


Historical background 

Before examining the antecedents of those ideas 
of Freud’s that have been outlined above, it will 
be helpful to sketch in some of the grand trends of 
intellectual history in the nineteenth century. 

“Naturphilosophie” and its rejection. The way 
for the romantic revolt that broadly characterized 
all aspects of intellectual life in the early 1800s 
had been prepared by Naturphilosophie, a mystical 
and often rhapsodic view of Nature as perfused 
with spirit and with conflicting unconscious forces 
and as evolving according to an inner, purposive 
design. Not a tightly knit school, its constituent 
thinkers included (in chronological order) Kant, 
Lamarck, Goethe, Hegel, Schelling (perhaps the 
central figure), Oken, and Fechner. With the ex- 
ception of Fechner, who lived from 1801 to 1887, 
they all lived athwart the eighteenth and nine- 
teenth centuries. Naturphilosophie encouraged the 
recrudescence of vitalism in biology, championed 
by the great physiologist Johannes Miiller, and 
stimulated a humanistic school of romantic medi- 
cine (Galdston 1956). In psychiatry, the early part 
of the century was dominated by the reforms of 
Pinel, Esquirol, and their followers, who introduced 
an era of “moral treatment”: firm kindness in place 
of restraints, therapeutic optimism based on etio- 
logical theories of a more psychological than or- 
ganic cast, and an attempt to involve inmates of 
asylums in constructive activities. 

The tough-minded reaction to this tender-minded 
era was greatly aided by the strides being made in 
physics and chemistry. Three of Müller’s students, 
Brücke, du Bois-Reymond, and Helmholtz, met 
Carl Ludwig in 1847 and formed a club (which 
became the Berlin Physical Society) to “constitute 
physiology on a chemico-physical foundation, and 
give it equal scientific rank with Physics” (Ludwig, 
quoted by Cranefield 1957, p. 407). They did not 
succeed in their frankly reductionist aim but did 
attain their other objectives: to promote the use of 
scientific observation and experiment in physiology 
and to combat vitalism. Among themselves, they 
held to the following program: 


No other forces than the common physical-chemical 
ones are active within the organism. In those cases 
which cannot at the time be explained by these forces 
one has either to find the specific way or form of their 
action by means of the physical-mathematical method, 
or to assume new forces equal in dignity to the chemical-physical 
forces inherent in matter, reducible to the 
force of attraction and repulsion. (du Bois-Reymond, 
quoted by Bernfeld 1944, p. 348) 


In Germany especially, this materialistic ferment 
of physicalistic physiology, mechanism, and reduc- 
tionism became the mode, gradually putting roman- 
tic medicine and other aspects of Naturphilosophie 
to rout. Where earlier there had been Psychic, 
Psycho-somatic, and Somatic schools in German 
psychiatry (Earle 1854, see in Hunter & Macalpine 
1963, pp. 1015-1018), the Somatic gradually won 
out; Meynert, for example, conceived mental dis- 
orders to be diseases of the forebrain. Despite its 
therapeutic successes, moral treatment was banished
along with its psychogenic (often sexual)
theories as “old wives’ psychiatry,” in favor of 
strictly organic-hereditarian views and very little
by way of therapy (Bry & Rifkin 1962). 

The University of Vienna medical school was 
an outpost of the new hyperscientific biology, with 
one of its promulgators, Brücke, holding a major
chair and directing the physiological institute 
(Bernfeld 1944). Ironically, Freud tells us that his 
decision to enter medical school was determined 
by hearing the “Fragment on Nature” attributed to 
Goethe read aloud at a public lecture. This short 
prose poem is an epitome of Naturphilosophie, and 
it must have swayed Freud because of his long- 
standing admiration for Goethe and perhaps be- 
cause of a “longing for philosophical knowledge,” 
which had dominated his early years, as he said 
later in a letter to Fliess. Evolution had been a 
major tenet of Naturphilosophie; so it is not sur- 
prising that this 1780 dithyramb could be part of 
a lecture on comparative anatomy, the discipline 
that furnished much of the crucial evidence for 
Darwin's Origin of Species (1859). 

Energy and evolution. Perhaps the two most 
exciting concepts of the nineteenth century were 
energy and evolution; both of these strongly influ- 
enced Freud’s teachers at the medical school. Helm- 
holtz had read to the 1847 group his fundamental 
paper on the conservation of energy—presented as 
a contribution to physiology. Thirty years later, 
Brücke’s lectures were full of the closely related
(and still poorly differentiated) concepts of energy 
and force. To use these dynamic concepts was the 
very hallmark of the scientific approach; Brücke
taught that the “real causes are symbolized in science
by the word ‘force’” (Bernfeld 1944, p. 349).
It seems obvious that the first of Freud’s three 
metapsychological points of view, the dynamic (ex- 
planation in terms of psychological forces), had its 
origins in this exciting attempt to raise the scientific 
level of physiology by the diligent application of 
mechanics and especially of dynamics, that branch 
of mechanics dealing with forces and the laws of 
motion. The heavily quantitative emphasis of the 
school of Helmholtz and its stress on energy are 
clearly the main determinants of metapsychology 
seen from the economic point of view (explana- 
tions in terms of quantities of energy). The fact 
that, among authors Freud respected most, such 
disparate figures as Fechner and Hughlings Jack- 
son held to dynamic and economic viewpoints no 
doubt strengthened Freud’s unquestioning convic- 
tion that these viewpoints are absolutely neces- 
sary elements of an explanatory theory. 

Despite its physicalistic program, the actual
work of Brücke’s institute was largely classical
physiology and histology. Freud had had his Dar- 
winian scientific baptism under Claus in a micro- 
scopic search for the missing testes of the eel, and 
his several attempts at physiological and chemical 
experiments under other auspices were fruitless. 
He was happy, therefore, to stay at the microscope 
where Brücke assigned him neurohistological studies,
inspired by and contributing to evolutionary
theory. When he worked with Meynert, it was again 
in a structural discipline with a genetic method— 
the study of brain anatomy using a series of fetal 
brains to trace the medullar pathways by following 
their development. His subsequent clinical practice 
was in neurology, a discipline which, as Bernfeld 
(1951) has noted, was “merely a diagnostic appli- 
cation of anatomy.” Moreover, Freud's first full- 
scale theoretical model, the “Project” of 1895, is 
foremost a theory about the structural organization 
of the brain, both gross and fine. His early training 
thus demonstrably convinced him that a scientific 
theory has to have a structural (or topographic) 
base. 

Of Freud’s enduring contributions listed above, 
the two that are most plausibly traced to the intel- 
lectual climate of physicalistic physiology and to 
specific teachings of Brücke and Meynert are psychic
determinism and the genetic point of view.
The evolutionary surge of Naturphilosophie, given 
a modern, scientific, and nonteleological form in 
Darwinism, inspired all the biological sciences of 
the late nineteenth century with a conviction that 
phenomena of life cannot be understood without 
the elucidation of how the organism develops—out 
of its own parental germ plasm and out of a phylo- 
genetic series. This point of view pervaded all of 
Freud’s work in this first period; it would have been 
surprising if he had not carried it over and ex- 
tended it when he turned to psychopathology, as 
Spencer was doing in academic sociology and 
psychology. 

The assumption of exceptionless determinism 
was so fundamental a principle of mechanistic sci- 
ence as hardly to need discussion. Freud was ex- 
posed to it on all sides in the university and in 
much of his reading, then and later. Doubtless, this 
conviction that all phenomena are lawful and are 
thus legitimate subjects of scientific interest helped 
Freud to pay attention to the trivia of mental life 
and underlay his conviction that even if a patient 
relaxed the controls of conscious purpose in favor 
of free association, the material he produced would 
not be random but would betray an inner organi- 
zation, a deeper and more meaningful set of 
psychological laws. 

The unconscious. But the existence of psychological
forces determining a meaningful inner
organization required the assumption of a dynamic 
unconscious realm of the mind, besides conscious- 
ness. Many writers have shown how much a part of 
the thought of the time one part of this assumption 
was: “the general conception of unconscious men- 
tal processes was conceivable . . . around 1700, 
topical around 1800, and fashionable around 1870- 
1880” (Whyte 1960, pp. 168-169). Among the 
scientists known to have been familiar to Freud 
during his formative years (before 1900)—Charcot,
Bernheim, Breuer, Lipps, H. Jackson, Galton, Fech- 
ner, and Helmholtz—all had one or another con- 
cept of the unconscious; other such concepts are 
to be found in writings known to have influenced 
Freud: those of the philosophers Plato, Kant, and 
Spinoza (Aron 1963-1964), the Bible, and the 
works of his favorite writers of fiction, Goethe, 
Shakespeare, Cervantes, and Dostoevski. Moreover, 
the Herbartian psychology taught in all Austrian 
secondary schools when Freud was in the Gym- 
nasium was presented “not as one of several schools 
of psychology but as a well-established semi-official 
psychology” (Andersson 1962); together with asso- 
ciationism, it was adopted by Meynert and trans- 
lated into neurological terms. It was a theory of 
the dynamic interaction of ideas, some of which 
may repress others below the threshold of con- 
sciousness, whence they may be hindered from 
rising by the resistance of the more or less inte- 
grated masses of ideas. This is, not surprisingly, 
precisely the terminology used by Breuer and Freud 
in 1893 ([1893-1895] 1955, chapter 1) in their
first attempt at a psychological explanation of 
hysteria. 

It is possible to find some predecessor who ex- 
pressed one or another of most components of 
Freud’s conception of the unconscious (though it is 
not established that any of them were known to 
Freud before he formed his own hypotheses)—for
example, symbolism in dreams (von Schubert, 
Scherner), the role of dream symbols in myths 
(Carus, von Schubert), dreams as wish fulfillments 
(Plato, Maass), and the unconscious as the source 
of powerful motives (Herder, Richter) and of artis- 
tic creativity (Goethe, Schiller). Nevertheless, these 
were mostly isolated aperçus; even von Hartmann, 
whose Philosophy of the Unconscious touches on all 
of them, was not able to integrate these fragmen- 
tary insights into a coherent theory, like the psycho- 
analytic theory of the primary process. Further- 
more, Freud differed from all his “anticipators” in 
that they remained outside the cave and made 
remarks—often profound ones—on what they 
glimpsed inside, but Freud went boldly in and devoted
himself for decades to its painstaking (and
painful) empirical exploration. Although the idea 
of unconscious processes had been around for cen- 
turies, Freud forced us to see the true power and 
pervasiveness of these processes in man’s thought, 
feeling, and behavior. 

Sexuality. The enormous importance of sex- 
uality as a basic human motive had long been ex- 
plored by poets and dramatists, and there had been 
many times when it was freely discussed in science 
and medicine as well as in everyday conversation. 
The prevailing belief today is that the second half 
of the nineteenth century, when Freud grew to ma- 
turity, was an era of unusually strong shame about 
sexuality and of moralistic attempts to suppress 
even its scientific study in the name of Christian 
morality; this climate of opinion influenced Freud’s 
own personality and behavior, so characterized by 
propriety, self-control, and conventional monog- 
amy. Yet our stereotype of Victorian prudery is 
probably oversimple; in any case, by the end of 
the nineteenth century the tide had begun to turn; 
Bry and Rifkin (1962) and Rieff (1959, chapter 
10) have documented many social, literary, artistic, 
and scientific countermovements. In addition, it 
might be mentioned that sexology was already an 
established (if minor) science when Freud’s con- 
tributions first appeared. Though Freud and Fliess 
felt like lonely pioneers, Magnus Hirschfeld and 
Krafft-Ebing had preceded both of them, and 
Havelock Ellis and G. Stanley Hall were their contemporaries.
We know little about how a Zeitgeist
of slowly liberalizing attitudes about sex may have 
been transmitted to Freud. We do know that there 
were contemporary moves to broaden the concept 
of sexuality and extend it backward in the life 
span, such as the observations of diffuse sexuality 
in children by Bell (1902), the relation of sucking 
and masturbation in the first year by Roehmer 
(1891), and the plasticity of the sexual drive in 
children (Barnes 1892).

Again, diligent historical scholarship can un- 
cover predecessors for many, perhaps most, of 
Freud’s lastingly valid formulations about sex. For 
example, Shakow and Rapaport (1964) have 
pointed to James’s (1890) recognition of the poten- 
tiality for perversion in all of us; and many authors 
have discussed the similarities between Freud’s and 
Plato's broad concept of eros. Freud himself ac- 
knowledged that the idea of universal bisexuality 
had first been suggested to him by Fliess. In the era
of moral treatment, various sexual etiologies had 
been suggested for a number of neurotic conditions, 
which made Freud’s first psychoanalytic papers
seem more reactionary than radical to many of his
organically minded psychiatric colleagues. 

As with the difference between Freud and earlier 
writers on the unconscious, so here again it was he 
alone who really devoted himself to prolonged, 
focused empirical as well as theoretical work on 
sexuality. In addition, however, Freud differs from 
most of the others who have helped to liberalize 
sexual mores in his mode of presenting his con- 
clusions: he always refused to pull his punches, to 
make any concessions to prudishness in hopes of 
gaining a more sympathetic hearing; but at the 
same time, he had not the slightest tendency to 
glorify or romanticize sexuality. He presented his 
facts and theories dryly and as directly as his own 
distaste allowed him. In the short run, he antag- 
onized the bluestockings and undermined the fem- 
inists, while disappointing the libertines and dis- 
illusioning the sexual utopians. By overstating the 
centrality of sexuality and stretching the concept 
of libido, he laid himself open to misunderstanding 
as a pansexualist and attained the unwelcome notoriety
of a succès de scandale. In the long run,
however, the net effect was to force the world’s 
attention to the problems of sex, greatly advancing 
the contemporaneous anti-Victorian movements 
and, despite vicious opposition and vilification, 
probably getting his ideas a hearing and ultimate 
acceptance faster than he could have done in any 
other way. 

Role of conflict. Views of man’s behavior as 
the outcome of interior conflicts have been pro- 
pounded since ancient times. The pervasive human 
tendency to think in terms of dichotomies is specu- 
latively traceable to anatomical bilateral symmetry 
and to the diurnal cycle of light and dark. In any 
case, a probable shaping influence on Freud's con- 
ceptual orientations is the centrality of conflict 
between unconscious purposive forces in the world 
view of Naturphilosophie. More immediately, the 
Herbartian psychology of Lindner that Freud 
studied in the Gymnasium treats the life of the 
mind in terms of conflicting ideas, which could be 
smoothly absorbed into the paradigm of parallelo- 
grams of forces in physicalistic physiology. Hugh- 
lings Jackson, whom Freud greatly admired, held 
to even more proto-Freudian views on the role of 
conflicting forces (Stengel 1954a; 1954b).

The substance of the conflicting forces in Freud’s 
final model is clearly related to traditions in West- 
ern thought going at least as far back as Em- 
pedocles, for whom love alternated with its anti- 
thesis, strife, and to religious sources: God versus 
Satan and the doctrine of conscience as the opponent
of the base passions. In the era of moral
treatment, there had been no dearth of psychiatrists
who saw these inner battles as a cause of neurosis. 
Carter, writing in 1853, observed the conflict be- 
tween sexual desire and moral scruple in chaste 
hysterics of both sexes and even blamed “the mod- 
ern necessity of [a single woman's] entire conceal- 
ment” of her sexuality as the social cause of most 
hysteria (see Hunter & Macalpine 1963, pp. 1002- 
1003). 

Defense and repression. Freud's conception of 
defense, more particularly his ideas about defense 
against anxiety, seems to be a more original clinical 
innovation than most other parts of psychoanalytic 
theory. Perhaps its model was the medical concep- 
tion of the body’s defenses against pathogenic in- 
vasion; and the proposition that ideas can be 
repressed was familiar in Herbartian psychology. 
Nietzsche described the general outlines of several 
specific defenses—according to Brandt (1955), re- 
pression, isolation, reaction formation, sublimation, 
and projection—but there is no evidence that Freud 
knew about it. These conceptions and the closely 
related signal theory of anxiety are as directly 
derived from clinical observation as any in psycho- 
analysis and are among Freud’s most original 
contributions. 

Regression. The Darwinian origins of the ge- 
netic viewpoint in psychoanalysis have been briefly 
mentioned. An important specific mediator of Dar- 
winism was Hughlings Jackson, from whose view 
of the evolutionary organization of the brain Freud 
took the conception of a hierarchic organization of 
psychic structures (Rapaport 1959), and from 
whose correlated theory of neurological diseases as 
“reversals of evolution,” Freud derived the concept 
of regression as an explanation of psychopatho- 
genesis (Stengel 1954a; 1954b). The genetic dis- 
cipline of embryology had been given an enormous 
impetus by Darwin; Freud’s use of fetal materials 
in his medullar researches must have caused him 
to study this discipline with particular care and to 
become familiar with the epigenetic principle that 
is implicit in his doctrine of psychosexual develop- 
ment. The phenomenon of identification and its 
role in interpersonal and social relations of all 
kinds, as well as its outstanding importance in 
development, appears to be largely the outgrowth 
of clinical observation; Rieff’s (1959) hint that it 
may owe something to the “sympathy” of earlier 
social theorists (for example, Adam Smith) re- 
mains unexplored. By contrast, it is easy to find 
plenty of antecedent recognition of the importance 
of impulse control in the growth of character. 

Passive reflex model. Freud put his personal 
stamp upon all the conceptions he drew—directly
or indirectly—from his broad intellectual heritage.
His reworking of the ideas of others often
brought to the forefront their latent potential- 
ities for advancing the understanding of the central 
problems—the innermost longings and agonies— 
of real persons. At times, however, the usefulness
of Freud's ideas was marred by his need to fit them 
into his set of basic working assumptions, the 
passive reflex model of the nervous system. 

This model was synthesized by Freud from the 
physicalistic physiology of his teachers and was 
more clearly enunciated in the “Project” than in 
any of the works of Brücke, Meynert, Breuer, or
Exner, in which its elements may be clearly dis- 
cerned (Amacher 1965). Anticipations of the dy- 
namic, economic, and topographic points of view 
are plainly to be seen in books that Freud cited, 
for example, those by Fechner (Ellenberger 1956) 
and Jackson (Stengel 1954a; Spehlmann 1953); 
the economic point of view is also traceable to 
Darwin (Andersson 1962). It seems safe to con- 
clude, therefore, that this set of principles was 
concurrently developed by many leading scientists 
in Germany and Austria in their attempts to apply 
the physics of Helmholtz and his school to biology 
and to generalize the reflex arc as a model of all 
mental processes in order to close the doors to such 
romantic notions as spontaneity, free will, and 
vitalism. Although virtually all of the assumptions 
underlying the passive reflex model are demon- 
strably untrue, much of the same doctrine underlies 
the behavioristic schools of psychology and is only 
now being painfully modified. 

When Freud put aside explicit neurological the- 
orizing, he was unable to give up the passive reflex 
model and the closely related physicalistic concept 
of reality (Spehlmann 1953; Holt 1965a). Although 
he explicitly postponed any attempt to relate the 
terms of metapsychology to processes and loci in 
the body, he substituted psychological theories 
that carry the same burden of anachronistic 
assumptions. 

Nevertheless, Freud did not make a clean break 
from neurology, which fact contributed to his un- 
clear and inconsistent stand on the mind-body 
problem (Rubinstein 1965; Holt 1965). Jones 
(1953-1957, vol. 1) and a number of other authors 
portray him as a consistent follower of Jackson in 
a psychophysical parallelism that makes mind an 
“independent concomitant” of brain. But neither 
Jackson himself, who at times postulated that 
physical energy is directly transformed into psychic 
energy (Spehlmann 1953), nor Freud’s teachers 
held to a consistent position (Amacher 1965); it 
was common to slip into interactionism, and Freud 
followed suit (Andersson 1962, p. 107). The concept
of hysterical conversion is the most obvious
instance of interactionism. 

Speculative elements. The concept of Freud's 
most frequently rejected, at least among psycho- 
analysts, is the death instinct; not surprisingly, 
many of them have attributed this concept to 
Freud’s own needs and conflicts. While the evi- 
dence is impressive that for many years Freud had 
an unusual and probably pathological preoccupa- 
tion with death, there are plenty of less tendentious 
hypotheses about sources for this notion. Freud 
himself traced it to the ancient Greeks; it has been 
likened to the Christian concept of original sin; pes- 
simistic, late romantic philosophers like Nietzsche 
contributed a strain of thought of which Freud 
could not have been unaware, however little he 
read them directly. More generally, the works in 
which he developed the theory of the repetition 
compulsion (clearly related as it is to the ancient 
conception of “the eternal return”) and the virtu- 
ally mythic concepts of life- and death-instincts 
are part of his phylogenetic theory. It is tempting 
to interpret this last development in Freud’s thought 
as a kind of return of the repressed: that youthful 
speculative bent and yearning for broad philosophi- 
cal knowledge which he repeatedly admitted fear- 
ing in himself and suppressed for many years (Holt 
1963). His exposure to and enthusiasm for Natur- 
philosophie antedated his conversion to physical- 
istic physiology, which helped him put such unsci- 
entific ways of thought out of mind. Even the 
philosopher with whom he studied, Brentano, 
strongly opposed Naturphilosophie and advocated 
the scientific method as the only valid source of 
knowledge. 

Yet the old mode of thought remained an under- 
current in his thinking, as it did in that of his 
friend Fliess (Galdston 1956). Finally, after Freud’s 
change of identity at the turn of the century, after 
his great discoveries, the establishment of the psy- 
choanalytic movement, and his first international 
recognition, the old mode of thought may have 
seemed less threatening. In Totem and Taboo 
(1913), one of his favorites among his own books, 
he created what he himself called a “scientific 
myth” (the slaying of the Darwinian primal horde’s 
father). Shortly thereafter, he first read Lamarck, 
who was strongly identified with Naturphilosophie. 
It is Fechner the Naturphilosoph, not the psycho- 
physicist, whom he cited in Beyond the Pleasure 
Principle (1920), and the loose, analogical, teleo- 
logical, and personifying mode of thought he em- 
ployed there is in marked contrast to the cognitive 
style of his metapsychological works (Holt 1965b).


Many properties of his concept of psychic energy 
can be traced to the vitalism that was a prominent 
feature of Naturphilosophie (Holt 1966). Some of 
the specific fallacies in his phylogenetic works can 
be traced to sources on the more mechanistic-materialistic
side: Haeckel’s “biogenetic law” that
ontogeny recapitulates phylogeny seemed to provide
respectable scientific auspices for the extension
of genetic speculations based on individuals to
the development of all mankind; Spencer and various
post-Darwinians were the source of his linear-evolutionary
conception of anthropology; and the
inheritance of acquired characters had been a general
assumption of the organic-hereditarian psychiatrists
of Freud’s formative period (Andersson
1962), as well as having been given some currency 
by Darwin himself (Ritvo 1965). Nevertheless, the 
major flaws of both manner and matter in Freud’s 
speculative works show the unmistakable earmarks 
of Naturphilosophie. 


Freud’s stature is too great for his errors to re- 
quire any whitewashing; it is not an apology for 
him but a fact that the least tenable aspects of his 
theories are traceable to influences that shaped his 
basic outlook when he was still a student, while 
his enduring achievements seem to owe less to 
antecedent influences. It appears to be one of the 
marks of a genius that he finds more sources of 
ideas in his reading and observation and subjects 
them to a truer integration and transmutation than 
does the less gifted man. The great contributor to 
knowledge (who may or may not be a genius) does 
not only toss off brilliant sparks of ideas; he lights 
and tends an enduring fire. Freud had both of these
innovative capacities and more: his ideas have the 
remarkable property of being self-transcending. He 
taught us how to know more than he could, how to 
find and use the best in what he left by testing it 
against reality. 

ROBERT R. HOLT 


[For a listing of articles describing Freud's influence 
upon psychology, psychiatry, and the other social 
sciences, see the detailed guide under the entry PSYCHOANALYSIS.
Directly related to Freud's work and
influence are the biographies of ABRAHAM; ADLER; 
ALEXANDER; CHARCOT; ELLIS; FECHNER; FERENCZI; 
HELMHOLTZ; HORNEY; JONES; JUNG; KLEIN; MÜL- 
LER, JOHANNES; RANK; RAPAPORT; REICH; RÓHEIM; 
SULLIVAN.] 


WORKS BY FREUD 
(1887-1902) 1954 The Origins of Psychoanalysis: Letters 
to Wilhelm Fliess, Drafts and Notes: 1887-1902. New 
York: Basic Books. 


(1893-1895) 1955 BREUER, JOSEF; and FREUD, SIGMUND
Studies on Hysteria. Volume 2 of The Standard Edi- 
tion of the Complete Psychological Works of Sigmund 
Freud. London: Hogarth; New York: Macmillan. 

(1895) 1954 Project for a Scientific Psychology. Pages 
347-445 in Sigmund Freud, The Origins of Psycho- 
analysis. New York: Basic Books. 

(1900) 1953 The Interpretation of Dreams. Volumes 4-5 
of The Standard Edition of the Complete Psychologi- 
cal Works of Sigmund Freud. London: Hogarth; New 
York: Macmillan. 

(1905) 1953 Three Essays on the Theory of Sexuality. 
Volume 7, pages 123-245 in The Standard Edition of 
the Complete Psychological Works of Sigmund Freud. 
London: Hogarth; New York: Macmillan. 

(1913) 1959 Totem and Taboo. Volume 13, pages ix-162
in The Standard Edition of the Complete Psychologi- 
cal Works of Sigmund Freud. London: Hogarth; New 
York: Macmillan. 

(1915) 1957 Papers on Metapsychology. Volume 14, 
pages 109-243 in The Standard Edition of the Com- 
plete Psychological Works of Sigmund Freud. Lon- 
don: Hogarth; New York: Macmillan. 

(1920) 1955 Beyond the Pleasure Principle. Volume 18, 
pages 7-66 in The Standard Edition of the Complete 
Psychological Works of Sigmund Freud. London: Ho- 
garth; New York: Macmillan. 

(1921) 1955 Group Psychology and the Analysis of the 
Ego. Volume 18, pages 69-134 in The Standard Edi- 
tion of the Complete Psychological Works of Sigmund 
Freud. London: Hogarth; New York: Macmillan. 

(1923) 1961 The Ego and the Id. Volume 19, pages 12- 
63 in The Standard Edition of the Complete Psycho- 
logical Works of Sigmund Freud. London: Hogarth; 
New York: Macmillan. 

(1926) 1959 Inhibitions, Symptoms and Anxiety. Vol- 
ume 20, pages 77-178 in The Standard Edition of the 
Complete Psychological Works of Sigmund Freud. 
London: Hogarth; New York: Macmillan. 

(1927) 1961 The Future of an Illusion. Volume 21, 
pages 5-58 in The Standard Edition of the Complete 
Psychological Works of Sigmund Freud. London: Ho- 
garth; New York: Macmillan. 

(1930) 1961 Civilization and Its Discontents. Volume 21,
pages 64-148 in The Standard Edition of the Complete 
Psychological Works of Sigmund Freud. London: Ho- 
garth; New York: Macmillan. 

(1934-1938) 1964 Moses and Monotheism: Three Es- 
says. Volume 23 in The Standard Edition of the Com- 
plete Psychological Works of Sigmund Freud. London: 
Hogarth; New York: Macmillan. 

Gesammelte Schriften. 12 vols. Leipzig, Vienna and Zurich: 
Internationaler Psychoanalytischer Verlag, 1924-1934. 

Gesammelte Werke. 18 vols. London: Imago, 1940-1952. 

The Standard Edition of the Complete Psychological Works 
of Sigmund Freud. 24 vols. London: Hogarth; New 
York: Macmillan, 1953-1964. 


SUPPLEMENTARY BIBLIOGRAPHY 

AMACHER, PETER 1965 Freud’s Neurological Education
and Its Influence on Psychoanalytic Theory. Psycho- 
logical Issues, Vol. 4, No. 4; Monograph No. 16. New 
York: International Universities Press.

ANDERSSON, OLA 1962 Studies in the Prehistory of Psy- 
choanalysis: The Etiology of Psychoneuroses and 
Some Related Themes in Sigmund Freud’s Scientific 
Writings and Letters, 1886-1896. Stockholm: Svenska 
Bokförlaget Norstedts.


ARON, WILLIAM 1963-1964 Freud and Spinoza. Harofé
haivri [1963] no. 2:260-242; [1964] no. 1:284-265; 
[1964] no. 2:260-242 

BARNES, E. 1892 Feelings and Ideas of Sex in Children.
Pedagogical Seminary 2:199-203. → Now called the
Journal of Genetic Psychology. 

BELL, SANFORD 1902 A Preliminary Study of the Emotion
of Love Between the Sexes. American Journal of
Psychology 13:325-354.

BERNFELD, SIEGFRIED 1944 Freud's Earliest Theories
and the School of Helmholtz. Psychoanalytic Quar- 
terly 13:341-362. 

BERNFELD, SIEGFRIED 1951 Sigmund Freud, M.D.:
1882-1885. International Journal of Psycho-analysis 
32:204-217. 

BRANDT, RUDOLF 1955 Freud and Nietzsche: A Comparison.
Ottawa, University of, Revue 25:225-234.

BRY, ILSE; and RIFKIN, ALFRED H. 1962 Freud and the
History of Ideas: Primary Sources, 1886-1910. Pages 
6-36 in Academy of Psychoanalysis, Science and 
Psychoanalysis. Volume 5: Psychoanalytic Education. 
New York: Grune. 

CRANEFIELD, PAUL F. 1957 The Organic Physics of
1847 and the Biophysics of Today. Journal of the History
of Medicine 12:407-423.

DARWIN, CHARLES (1859) 1964 On the Origin of Species.
Cambridge, Mass.: Harvard Univ. Press.

ELLENBERGER, HENRI F. 1956 Fechner and Freud. Men- 
ninger Clinic, Bulletin 20:201-214. 

GALDSTON, IAGO 1956 Freud and Romantic Medicine.
Bulletin of the History of Medicine 30:489-507.

GILL, MERTON M. 1959 The Present State of Psychoanalytic
Theory. Journal of Abnormal and Social Psychology
58:1-8.

HOLT, ROBERT R. 1963 Two Influences on Freud's Scientific
Thought: A Fragment of Intellectual Biography.
Pages 364-387 in Robert W. White (editor), The
Study of Lives. New York: Atherton. 

HOLT, ROBERT R. 1965a A Review of Some of Freud's
Biological Assumptions and Their Influence on His 
Theories. Pages 93-124 in Norman S. Greenfield and 
William C. Lewis (editors), Psychoanalysis and Cur- 
rent Biological Thought. Madison: Univ. of Wisconsin 
Press. 

HOLT, ROBERT R. 1965b Freud's Cognitive Style. American
Imago 22:163-179.

HOLT, ROBERT R. 1966 Beyond Vitalism and Mechanism:
Freud’s Concept of Psychic Energy. Unpublished
manuscript. 

HUNTER, RICHARD A.; and MACALPINE, IDA (editors)
1963 Three Hundred Years of Psychiatry, 1535-1860:
A History Presented in Selected English Texts. Oxford
Univ. Press. → See especially pages 1002-1003 and
1015-1018. 

JAMES, WILLIAM (1890) 1962 The Principles of Psychology.
2 vols. New York: Smith.

JONES, ERNEST 1953-1957 The Life and Work of Sigmund
Freud. 3 vols. New York: Basic Books. → Volume
1: Formative Years and the Great Discoveries,
1953. Volume 2: Years of Maturity, 1955. Volume 3: 
Last Phase, 1957. 

KARDINER, ABRAM; KARUSH, AARON; and OVESEY, LIONEL
1959 A Methodological Study of Freudian Theory. 
Journal of Nervous and Mental Disease 129:11-19, 
133-143, 207-221, 341-356. 

PARSONS, TALCOTT (1952) 1953 The Superego and the
Theory of Social Systems. Pages 13-29 in Talcott
Parsons, Robert F. Bales, and Edward A. Shils, Working
Papers in the Theory of Action. Glencoe, Ill.:
Free Press.

PARSONS, TALCOTT (1958) 1964 Social Structure and
the Development of Personality: Freud’s Contribution 
to the Integration of Psychology and Sociology. Pages 
78-111 in Talcott Parsons, Social Structure and Per- 
sonality. New York: Free Press. 

RAPAPORT, DAVID 1959 The Structure of Psychoanalytic
Theory: A Systematizing Attempt. Pages 55-183 in 
S. Koch (editor), Psychology: A Study of a Science. 
Volume 3: Formulations of the Person and the Social 
Context. New York: McGraw-Hill. 

RAPAPORT, DAVID; and GILL, MERTON 1959 The Points 
of View and Assumptions of Metapsychology. Inter- 
national Journal of Psycho-analysis 40:153-162. 

RIEFF, PHILIP 1959 Freud: The Mind of the Moralist. 
New York: Viking. 

RITVO, LUCILLE B. 1965 Darwin as the Source of Freud's 
Neo-Lamarckianism. Journal of the American Psycho- 
analytic Association 13:499-517. 

ROEHMER, A. 1891 Ueber psychopatische Minderwertigkeiten
des Säuglingsalter. Medizinisches Korrespondenzblatt
für Würtemberg 61:265-269, 273-279, 281-285, 289-292. 

RUBINSTEIN, B. B. 1965 Psychoanalytic Theory and the 
Mind-Body Problem. Pages 35-56 in Norman S. 
Greenfield and William C. Lewis (editors), Psycho- 
analysis and Current Biological Thought. Madison: 
Univ. of Wisconsin Press. 

SHAKOW, DAVID; and RAPAPORT, DAVID 1964 The Influence
of Freud on American Psychology. Psychological 
Issues, Vol. 4, No. 1. New York: International Uni- 
versities Press. 

SPEHLMANN, RAINER 1953 Sigmund Freuds neurolo- 
gische Schriften: Eine Untersuchung zur Vorgeschichte 
der Psychoanalyse. Berlin: Springer. 

STENGEL, E. 1954a A Re-evaluation of Freud’s Book On 
Aphasia: Its Significance for Psychoanalysis. Interna- 
tional Journal of Psycho-analysis 35:85-89. 

STENGEL, E. 1954b The Origins and Status of Dynamic 
Psychiatry. British Journal of Medical Psychology 27: 
193-200. 

WHITE, ROBERT W. 1963 Ego and Reality in Psychoanalytic
Theory: A Proposal Regarding Independent 
Ego Energies. Psychological Issues, Vol. 3, No. 3. New 
York: International Universities Press. 

WHYTE, LANCELOT L. 1960 The Unconscious Before 
Freud. New York: Basic Books. 


FRIENDSHIP 


Friendship is a voluntary, close, and enduring 
social relationship. The behavior of friends varies 
greatly among societies and situations and accord- 
ing to personality variables. Values about friend- 
ship vary less and can be summarized as involving 
closeness, solidarity, absence of ulterior ends, reci- 
procity, impulsiveness in mutual choice, and, per- 
haps, independence of social distinctions such as 
age, sex, and class. Friendship is intimate but less 
so than love and some family ties. Supplementing 
sexual and familial ties, friendship is a residual 
cultural category subsuming close and expectedly 
enduring ties. Since friendship involves voluntary 
commitment, intimacy, and spontaneity, its consequences
for the individual and for society, through 
individual growth and security, are presumably 
crucial. Possibly for this reason, to be without 
friends often involves shame. 

Most other important social relationships exclude 
friendship. Even highly compatible and close broth- 
ers are brothers rather than friends, and friendship 
tends to be incompatible with such relationships 
as those of mother and child, lovers, and employer 
and employee. This incompatibility is probably due 
to the fact that the obligations and rights of friends 
typically are subject to overruling by other ties. 
The impulsiveness of choice and the reciprocity or 
symmetry of the friendship relation also rule out 
various choices. If friends are impulsively chosen, 
few brothers will be friends. Reciprocity and 
symmetry imply rough equality in mutual rights 
and obligations and in qualities and performances, 
requiring fairly equal status in significant respects 
between friends. In general, friendship can be 
logically and culturally expected to occur only 
when there is a low probability of higher or 
strongly sanctioned obligations intervening directly 
between friends. 

Like distant kinship terms, “friend” is a rela- 
tional designation: “friend” and “friendship” refer 
to a relationship between two or more persons 
rather than to the characteristics of one or more 
persons. This is in contrast to close kinship terms, 
which are simultaneously relational and categori- 
cal, and to occupational designations, which are 
on the whole categorical. Accordingly, while friend- 
ship is significant in personal terms, it is no less an 
interpersonal structure. It follows that variants of 
friendship structure may well be a characteristic 
of collective units. 

In theoretical terms, friendship is definitely a 
relational phenomenon: it is impossible to assign 
meaning to statements such as “He is a friend” 
without implying to whom “he” is related in this 
way. However, empirically the matter is less clear, 
partly because the friendship role is vague. Also, 
the friendship relationship as a social type is made 
ambiguous by personal descriptions such as “He 
is friendly,” “He is a great friend,” or “He is every- 
body’s friend.” The suggestion has been made that 
a highly differentiated society with a high degree 
of mobility and an emphasis on specific perform- 
ance cannot also support enduring and important 
intimate relationships beyond those of the nuclear 
family. This may be the structural source of 
“pseudo Gemeinschaft,” that is, appeals, usually 
commercial, that use a presumption of closeness 
to transfer modes of behavior from a friendship 
setting to strangers, with the consequent growth 
of values of superficial friendship, friendliness, and 
popularity. 

Friendship is a distinct institution. In Western 
societies (loosely the basis for the above considera- 
tions) it is a vague institution, whereas in other 
societies friendship is often more salient and dif- 
ferentiated. In either case, friendship is a low- 
order (as well as a crosscutting) institution: it is 
found everywhere but is not a distinct, compre- 
hensive segment of society. It is comparable to 
money, language, and love rather than to religion, 
the family, or the economy. Partly for this reason, 
friendship is not at present a specialized field of 
inquiry in sociology. While few studies focus on 
friendship, many find it, since closeness to others 
is a pervasive potentiality in man. 


Friendship as an institution 


To say that friendship is an institution is to ask 
the cross-cultural question: What links are there 
between variations in the structure of friendship 
and in the structure of society? Two studies have 
initiated the general cross-cultural study of friend- 
ship (Eisenstadt 1956; Cohen 1961). At the least, 
these studies provide a spectrum of variation in 
friendship institutions; at their best, they provide 
hypotheses or findings about the place of friendship 
in encompassing social structures. 

Variations between societies. S. N. Eisenstadt’s 
subject is “ritualized personal relations,” including 
blood brotherhood, blood friendship, “best” friends, 
compadre and godparent relations, and cases of 
contractual servitude. All these relationships are 
particularistic, personal, voluntary, and fully in- 
stitutionalized (usually in ritual terms). They are 
both diffusely affective and instrumental, always 
in the economic sphere, often more broadly (poli- 
tics, etc.). Eisenstadt hypothesizes that these kin- 
shiplike but voluntary relations are to be found in 
predominantly particularistic (that is, kinship- 
dominated or caste-dominated) societies because 
they alleviate strains in and between the groups 
that constitute such societies. In effect, Eisenstadt 
proposes that ritualized personal relations, similar 
to friendship in Western terms in that they are 
voluntary and personal or intimate, are mecha- 
nisms of social integration. They are parallel to 
such institutions as kinship extension, extralineage 
kinship obligations, various types of associations, 
hospitality toward strangers, and joking relation- 
ships in providing ties cutting across groups and 
categories. Ritualized personal relations are also 
mechanisms of social control. [See KINSHIP, article 
on PSEUDO-KINSHIP.] 

Yehudi A. Cohen presents a typology of friend- 
ship institutions that loosely expresses a dimension 
of degree of commitment between friends. Inalienable
friendship is a variant characterized by ritual 
or ceremonial entry and, ideally, permanence—in 
the main corresponding to Eisenstadt’s ritualized 
personal relations. Three other somewhat separate 
categories are close, casual, and expedient friend- 
ships. In a sample of about sixty societies, twenty 
have the inalienable type of friendship as the domi- 
nant form, thirty have close friendship, three have 
casual friendship, and four have expedient friend- 
ship (there are no data for ten societies, and some 
are counted twice because of structural changes). 
Activities in the friendship relation commonly in- 
clude material exchange (and/or economic as- 
sistance) and sociopolitical and emotional support; 
in some cases they include the more specialized 
activities of go-between in love affairs and mar- 
riage arrangements, homosexuality, sponsorship 
in rites of passage, mourning obligations, and ex- 
change of children. Formalized friendship is far 
more frequent among men than women and is 
rare across sex lines. Inalienable friendship carries 
with it incest taboos in half the cases. It is almost 
invariably joined with just one partner, who in 
some cases has to be chosen inside the local soli- 
dary group and in other cases outside of it; in a 
few societies either choice is open. 

Cohen predicts covariation between type of 
friendship and the nature of the significant soli- 
dary grouping to which the individual is attached. 
He expects inalienable friendship to coincide with 
the maximally solidary community (generally, lo- 
calized descent groups where nuclear families and 
households are socially, physically, and emotion- 
ally close as a societal nucleus sharply distin- 
guished from other groupings). Close friendship 
is expected to be associated with the solidary- 
fissile community (where solidarity is split between 
kinship group and community). The nonnucleated 
society (where isolated, solidary nuclear families 
are tied loosely together) is expected to be asso- 
ciated with casual friendship. Finally, an expedient 
type of friendship institution is expected to occur 
in individuated social structures (where there is 
emphasis on individual amassing of wealth and rel- 
atively little solidarity even in the nuclear family). 

Roughly speaking, Cohen’s hypotheses are sup- 
ported by his data. There is an association be- 
tween the degree of solidarity in the local com- 
munity and the degree of commitment in the 
friendship institution (although the number of 
cases in extreme types of friendship and solidarity 
is small). Of 13 societies with maximally solidary 
communities, 11 have predominantly the inalien- 
able type of friendship; of 35 societies with solidary- 
fissile communities, 27 have close friendship. 
Cohen’s interpretation of this finding is in terms 
of holistic compatibility in culture and personality. 
A dimension of personal generosity versus with- 
holding seems to be the basic variable that unites 
community structure with friendship structure. 
There is a correlation between childhood gratifica- 
tion versus deprivation, on the one hand, and adult 
food sharing versus individual amassment of food 
or money on the other. For the present, it remains 
an open question as to how this holistic pressure 
can be translated into detailed questions about the 
operation of social arrangements. 
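
The association reported above can be restated as simple proportions. The following is a minimal sketch that uses only the two counts quoted in the text (11 of 13 and 27 of 35). The labels and the function name are illustrative assumptions, not Cohen's own terms or procedure.

```python
from fractions import Fraction

# Counts quoted in the text: (societies with this community type,
# societies among them showing the predicted friendship type).
reported = {
    "maximally solidary -> inalienable": (13, 11),
    "solidary-fissile -> close": (35, 27),
}


def share_as_predicted(total, matching):
    """Return the share of societies that fit the predicted pairing."""
    return Fraction(matching, total)


shares = {
    label: share_as_predicted(total, matching)
    for label, (total, matching) in reported.items()
}

for label, share in shares.items():
    # Print the exact fraction and a rounded percentage.
    print(f"{label}: {share} = {float(share):.0%}")
```

Both shares lie well above one half, although the caution about the small number of cases in the extreme types still applies.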

Cohen's study raises formal questions about his 
variables: To what extent are his four-point vari- 
ables adequate and reliable in use? Is inalienable 
friendship a more significant category than formal- 
ized friendship? Further, under what conditions, 
and with what effects, are intragroup and inter- 
group friendships mandatory or preferred? The 
most crucial and stimulating question would seem 
to be this: Is institutionalized friendship (i.e., 
ideal and/or dominant forms) a more relevant 
variable than distribution of actual forms of friend- 
ship? This question leads to interesting problems 
in Western societies. What, in various settings, is 
the distribution of close, casual, and expedient 
friendships? Is degree of differentiation of societies 
correlated with dominance of expedient friendships? 
Is a possible dominance of expedient friendships 
to be contrasted with some of our values concern- 
ing friendship rather than with our past—or has 
there been a real change in the friendship institu- 
tion over, say, the last fifty years, concomitant with 
industrialization and increasing differentiation? 

Variation within one society. Additional dem- 
onstrations of the dependence of norms regulating 
friendship on an encompassing social structure 
emerge if we compare separate settings within one 
society rather than comparing different societies. 
In a study of the merchant marine in a society 
where it is a significant element economically and 
culturally, Aubert and Arner (1959) found that 
the culture and the social structure of the Norwe- 
gian merchant marine include a near taboo on 
personal friendships. This trait is probably a conse- 
quence of (as well as a contribution to) other 
structural elements: the ship is a “total institution” 
(Goffman 1958); top positions can only be reached 
from the bottom; there is an extraordinarily high 
rate of turnover; work roles dominate (to the ex- 
tent that terms of address and reference are largely 
job titles, the alternative being home region); 
there is a cultural and realistic emphasis on crises 
requiring discipline; and, finally, there is a peculiar 
combination of equality and inequality (in a number
of respects the crew are sailors “in the same 
boat,” yet each man and his position is unique 
through pay and shift arrangements). In outline, 
the occurrence and forms of friendship among the 
crew are reinforcing consequences of the nature 
of the ship as a place of work, in particular 
through its arrangements for interaction, physical 
closeness, and recruitment. In a significant con- 
trast, friendship is a standard occurrence among 
the crew in the Hull distant-water fishing fleet 
(Tunstall 1962); the distant-water trawler lacks 
the character of the total institution that is inher- 
ent in the Norwegian merchant vessel. 


Friendship as interaction 


Perhaps the most penetrating study of friendship 
yet to appear is W. F. Whyte’s Street Corner So- 
ciety (1943), a description and analysis of life in 
an immigrant slum in the late 1930s. Whyte’s 
topic is the interaction between young men, the 
significance of this interaction for individuals, and 
its relationship to career, welfare work, and poli- 
tics. In the first place, Whyte gives a vivid picture 
of voluntary association among “Cornerville” young 
adults. This association is marked by a strong in- 
formal structure. Loosely integrated gangs, con- 
sisting of small cliques, have a clearly hierarchic 
structure in terms of influence and prestige. Par- 
ticipation and acceptance in these groups are cru- 
cial for the balance of individual personalities. 
Changing and stable group structure is symboli- 
cally expressed in interaction. The typical group 
ties the individual to his community in many ways; 
it takes membership in atypical groups to foster 
career ambitions beyond Cornerville. In the first 
place, then, Whyte describes the social realities, 
of which friendship is the predominant one, for 
young men in Cornerville. Second, he analyzes the 
interrelations of friendship and the Cornerville 
culture with racketeering and politics, which are 
intimate, as well as with traditional welfare work, 
which cuts itself off from the mainstream of Cor- 
nerville life. 

The kind of friendship structure that Whyte 
describes is, in all likelihood, unusual. Friendship 
groups are rarely as highly structured in a hier- 
archy as in Whyte’s community; friends are rarely 
as dependent on leadership; they are rarely as 
significant for each other and so often and so 
regularly in each other's presence; friends are 
rarely so sharply segregated from nonfriends; and 
friends of one person are rarely to such an extent 
also friends of each other. Whyte described a situ- 
ation that was unusual, arising as it did from the 
historical accident of a major depression in a 
lower-class environment of second-generation
immigrants. 

Several themes above have been investigated in 
later, more specific studies of friendship. Elizabeth 
Bott (1957) has investigated friendship networks, 
stressing the difference between connected net- 
works, where one’s friends are also friends of each 
other, and open ones, where friends do not make 
up an interconnected group. Her exploratory study 
of families and their friends indicates that if a 
married couple is involved in a close-knit set of 
friends, the couple tends to have a rigid separation 
of roles in the household. On the other hand, if the 
network of friends is loosely connected, separation 
of roles among husband and wife is at a minimum. 
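
The contrast between connected and open networks can be made concrete with a simple density measure: the share of pairs among a couple's friends who are themselves friends. The sketch below is an illustration only. Bott's study was exploratory, and both the measure and the sample data here are assumptions rather than her procedure.

```python
from itertools import combinations


def connectedness(friends, ties):
    """Share of pairs among `friends` who are themselves friends.

    `friends` is the set of a couple's friends; `ties` is a set of
    frozensets, each naming two friends who know each other.
    Returns a value from 0.0 (fully open) to 1.0 (fully connected).
    """
    pairs = [frozenset(p) for p in combinations(sorted(friends), 2)]
    if not pairs:
        return 0.0
    return sum(p in ties for p in pairs) / len(pairs)


# Hypothetical data: the same four friends in two different networks.
friends = {"A", "B", "C", "D"}
close_knit = {frozenset(p) for p in combinations(sorted(friends), 2)}
loose_knit = {frozenset({"A", "B"})}

print(connectedness(friends, close_knit))  # every pair is linked
print(connectedness(friends, loose_knit))  # one pair out of six is linked
```

On Bott's hypothesis, a couple whose network scores near the top of this scale would be expected to show the more rigid separation of household roles.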

Adolescent friendship. In his study of friend- 
ship among young men, Whyte related variants of 
friendship to career ambitions in distinguishing 
between “corner boys” and “college boys.” Friend- 
ship among adolescents may be more significant 
than at earlier and later ages, both in general and 
specifically for career choice. What is the nature 
and significance of friendship among adolescents? 
In a discussion of David Riesman’s hypothesized 
“other-directed” personality type, with its assumed 
peer dependence among adolescents, Parsons and 
White (1961) see adolescent friendship as a mech- 
anism for loosening children’s dependence on par- 
ents and as a channel for the testing of career 
choice. Relying on findings from studies of friend- 
ship choices in high schools, they conclude that the 
adolescent’s dependence on peers is far from a 
seeking of free-floating approval; relations among 
peers partially consist of commitments to normative 
standards. Parsons and White emphasize that there 
are two kinds of normative culture among high 
school students: one is rather hedonistic, charac- 
terized by much value on popularity and reluctance 
to accept the achievement orientation in the adult 
world, and the other, which is somewhat less fre- 
quent, includes a strong commitment to mastery 
of this achievement orientation. The availability of 
friendship cliques (of either type of culture) is both 
a sorting mechanism and a testing ground for long- 
term educational and career commitments. 

The finding of two distinct normative cultures 
among adolescents, one centering on hedonism 
and the other, nearly as important, on scholastic 
achievement, is unusual. Coleman (1961) found 
widely varying social climates in the ten high 
schools that he studied. In all of them, athletic 
achievement was the major basis for recruitment 
to the informal elite of the school; scholastic 
achievement was always a decidedly minor basis; 
the strongest position of all was held by the
all-rounder. Gordon (1957), in a case study of a
Midwestern high school, found a salient structure of 
dominant cliques, within which close friendship 
might occur. Cliques centering on scholastic 
achievement were quite unimportant, although 
students showed increasing fulfillment of school 
expectations for scholastic achievement with each 
additional year in school. The students’ school and 
social life seemed to be dominated by a fierce 
competition for prestige that was achieved on the 
basis of conformity to highly developed normative 
patterns. Adolescent life in school was dominated 
by efforts to achieve a differentiated social status 
—that is, essentially a search for identity. Mem- 
bership in a group gives the protection of being 
visibly and actively “somebody,” but friendship 
appears to be largely an instrumental and regulative
structure rather than a supportive and permissive
one. However, the detailed regulation of behavior
implied in the adolescent culture is partly 
(as in dress) a symbol of autonomy relative to 
adult society and partly (as in puritan morals) an 
acceptance of explicit adult values. 

In spite of variation in content of adolescent 
culture and friendship groupings, Parsons and 
White's main points are valid: adolescent culture, 
with cliques forming around variant values, affords 
standards of right and wrong, affects career choice, 
and enforces a degree of independence from par- 
ents. Nevertheless, the central place of popularity 
in this adolescent culture leaves Riesman’s claim 
of a growing “other-directedness” an open issue. 

Friendship in work groups. Adolescent friend- 
ship approaches being a way of life, although a 
transitional one, related to love and a future family 
as well as to work and a future career. Friendship 
is also important in careers. The influence of 
friendship on recruitment to and work in a career 
has been shown in community studies (Warner 
& Lunt 1941, pp. 188-199; Seeley et al. 1956, p. 
135). The importance of friendship in one line of 
work was shown dramatically by Coleman, Katz, 
and Menzel (1957): doctors’ adoption of new 
therapeutic drugs depends to a considerable extent 
on membership in informal friendship cliques. 

The theme of friendship in work has been a 
significant one in social science at least since the 
Hawthorne studies in the 1930s. In one setting it 
was found that a differentiated set of friendship 
relations among male workers supported a broad 
normative orientation, including a norm limiting 
productivity (Roethlisberger & Dickson 1939, part 
4). In another setting, friendly relations among 
female workers were suggested as an explanatory 
variable behind a continuously climbing curve of 
productivity under varying and controlled external 
work conditions (ibid., part 1). The concept of an 
informal social structure in organizations has been 
with us since these studies. With respect to friend- 
ship or friendly human relations, these two find- 
ings have been duplicated over and over again, in 
industrial and bureaucratic settings as well as in 
research on problem solving in laboratory groups: 
friendly relations may lead to either increased or 
decreased productivity. The normative basis of 
friendship, particularly the definitions of relations 
to higher authorities, is presumably one decisive 
differentiating variable. 


Selection of friends 


The initial view of friendship given in this ar- 
ticle is a romantic one; it is empirical in that it 
formulates surface values concerning friendship in 
our culture. However, while it may be true that 
friends are culturally expected to be chosen im- 
pulsively, it is certain that choice follows socially 
structured paths. Friends tend to share social posi- 
tion. The tendency of friends to be alike is well 
illustrated by a finding in a study of a political 
election that compared voters’ choices with those 
of their best friends. A sample of voters was di- 
vided into four subgroups according to whether 
three, two, one, or none of their best friends in- 
tended to vote for the Republican party; among 
those whose friends were all intending to vote 
Republican, 61 per cent expressed strong intentions 
of voting Republican, and the percentage dropped 
to 37, to 23, and to 2 for the other three subgroups, 
respectively (Berelson et al. 1954, p. 99). Two 
explanatory principles are needed: friends select 
each other on the basis of similarity, and they in- 
fluence each other to become similar. The system- 
atic study of similarity and dissimilarity between 
friends was opened up by Lazarsfeld and Merton 
(1954) when they asked how selectivity comes 
about and how it varies for different kinds of at- 
tributes and within different kinds of social struc- 
tures. While they clarify many issues in their 
article (for example, by distinguishing between 
status-homophily and value-homophily), their con- 
tribution has remained programmatic. 

Similarity in attitude is one basis on which 
friendships are formed. Longitudinal studies (e.g., 
Newcomb 1961) indicate that the explanatory 
principle of selectivity in terms of similarity out- 
weighs the principle of similarity resulting from 
friendship. The balance of the evidence for the 
view that friends select each other on the basis 
of complementarity of needs rather than on the 
basis of similarities is negative (Secord & Backman 
1964). Findings based on sociometric studies of 
friendship concerning other bases of choice have 
been summed up as follows: 


. . . a person is likely to choose the following individuals:
(1) those with whom he has a greater opportunity
to interact, (2) those who have characteristics 
most describable in terms of the norms and values of 
the group, (3) those who are most similar to him in 
attitudes, values, and social-background characteristics, 
and (4) those whom he perceives as choosing him or 
assigning favorable characteristics to him, (5) those 
who see him as he sees himself, and (6) those whose 
company leads to gratification of his needs. (Secord 
& Backman 1964, p. 247) 


Two theoretical formulations with implications 
for the study of friendship have appeared in recent 
years (Thibaut & Kelley 1959; Homans 1961). 
Both are exchange theories of interaction, where 
basic terms are costs, rewards, outcomes, and com- 
parison levels. This framework allows the analysis 
of steps in attraction processes (Secord & Backman 
1964, chapter 7): friendship is the outcome of 
sampling and estimation, bargaining, commitment, 
and, finally, institutionalization. 
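
The basic terms of the exchange framework lend themselves to a small worked example. The sketch below is a simplification under stated assumptions: the outcome of a tie is taken as rewards minus costs, and it is judged against two comparison levels, one for what the person has come to expect and one for the best available alternative. The numeric scale, the class, and the function names are hypothetical and come from none of the cited works.

```python
from dataclasses import dataclass


@dataclass
class Relationship:
    rewards: float  # what the person gains from the tie (hypothetical units)
    costs: float    # what the tie demands of the person (same units)

    @property
    def outcome(self) -> float:
        # Outcome is taken here as rewards minus costs.
        return self.rewards - self.costs


def evaluate(rel, comparison_level, comparison_level_alt):
    """Classify a tie against two comparison levels.

    comparison_level: the outcome the person has come to expect.
    comparison_level_alt: the best outcome available elsewhere.
    """
    return {
        "satisfying": rel.outcome > comparison_level,
        "likely_to_continue": rel.outcome > comparison_level_alt,
    }


tie = Relationship(rewards=8.0, costs=3.0)
result = evaluate(tie, comparison_level=6.0, comparison_level_alt=4.0)
print(tie.outcome, result)
```

With these made-up numbers the tie falls short of expectations yet still beats the alternatives, so it would be classed as unsatisfying but likely to continue.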


Problems for research 


Both sociometric findings on friendship and 
Secord and Backman’s exchange analysis of friend- 
ship focus on the initiation of friendship struc- 
tures; that is, their central theoretical topic is 
attraction rather than established friendship rela- 
tions. Research on friendship needs more concen- 
tration on the substantive contents of friendship 
itself. How, in fact, are rights and obligations in 
friendship experienced in various social environ- 
ments? What are the institutional and actual en- 
couragements and limits to friendship in various 
contexts—politics, business, everyday life? What 
are the major rewards and strains in friendships? 
Is the ambiguity of friendship a circumstance that 
serves to initiate as well as to terminate other 
kinds of relationships? Under what conditions will 
friendships end? Is friendship, more often than 
other types of relationships, a subjectively sus- 
tained reality in the face of decreased overt inter- 
action? For partial answers to such questions, and 
for the generation of other specific and significant 
questions about friendship, analysis of detailed re- 
ports on the behavior and orientations of friends, 
acquaintances, strangers, and enemies in everyday 
life is required. It would seem that such studies, 
which demand more descriptive patience than we 
now see in social science, can give a rich yield, 
since friendship, as a kind of cement in personality 
and social fabrics, is probably more strategically 
related to other social relationships than research 
has indicated so far. 


Odd Ramsøy 


[Other relevant material may be found in LEADERSHIP 
and SOCIOMETRY.] 


FROBENIUS, LEO 


Leo Frobenius (1873-1938) was one of the last 
of the great explorers that the nineteenth century 
turned out in such profusion; he was also one of 
the first ethnologists who did not confine himself 
to an ethnographic description of the facts but 
elaborated a method for organizing in space and 
time the confused welter of discrete observations 
about the world’s nonliterate peoples. 

Frobenius was born in Berlin. In his early youth, 
he devoted his time to the ethnology of Africa, at 
the age of 21 publishing a bulky volume on African 
secret societies (1894). As a scientist, he was self- 
taught. And although he combined an intuitive 
mind with tremendous industry, emotional involve- 
ment with his material often led him to lapse into 
pathos. 

His works on cultural morphology grew out of 
a profound knowledge of primitive cultures. In 
1893 he began a collection of photographs and 
other ethnographic materials that later formed the 
basis of the Africa Archives. Frobenius made 12 
expeditions to Africa from 1904 to 1935, devoting 
himself to ethnological field work and the photo- 
graphing of rock pictures. He also made a trip to 
India. The holdings of the Africa Archives were 
greatly expanded by his expeditions, and in 1922 
the archives became the Forschungsinstitut für 
Kulturmorphologie. In 1932 he was invited to teach 
cultural anthropology at the University of Frank- 
furt am Main as an Honorarprofessor. In 1934 he 
was appointed director of the Municipal Museum 
of Ethnology in Frankfurt am Main. He died in 
Biganzolo on Lago Maggiore. 

As a historian, Frobenius endeavored through his 
research to provide historical background for the 
civilizations that had formerly been regarded as 
having no history because their past was not illuminated
by any written records, and thereby to incorporate
them into world history. This work stimulated
an expansion of the historical perspective, 
which at the turn of the twentieth century had been 
almost exclusively confined to the advanced and 
literate civilizations of Europe and the Near East. 
The historical field of vision was expanded not only 
in spatial-geographical terms, by the inclusion of 
previously neglected parts of the world, but in eth- 
nological terms as well; many primitive cultures 
have preserved forms of thought, of behavior, and 
in general, of cultural patterns that are obviously 
much older than those of the earliest advanced and 
literate civilizations of the Near East. In his works 
on the philosophy of history and civilization, Fro- 
benius sought to comprehend not only a particular 
culture, but culture as such, and therewith the en- 
tire history of the world. He tried to grasp and to 
explain the forces and motives that lead to the 
origin of a culture, the laws governing its course, 
the relationship between man and culture, and the 
meaning and goal of historical development. The 
broad scope of his research contributed insights 
and findings of fundamental importance to historically
oriented ethnology. 

“Kulturkreise.” At the close of the nineteenth 
century, there were two conflicting doctrines in 
ethnology, represented in the German-speaking 
world by Adolf Bastian on the one hand, and by 
Richard Andree and Friedrich Ratzel on the other. 
Both doctrines proceeded from the same set of 
facts, i.e., the observation that elements of mental 
and material culture identical in form and function 
may be found in regions of the world that are far 
apart, frequently separated by oceans, and among 
peoples speaking different languages and often be- 
longing to different races. The doctrine advocated 
by Bastian in Der Mensch in der Geschichte: Zur 
Begründung einer psychologischen Weltanschauung 
(1860) explained such occurrences by assuming 
that the evolution of mankind has always, and in 
all parts of the world, been subject to the same 
laws; that this evolution has been a linear one; and 
that, consequently, upon reaching a certain stage 
of development, all societies are independently 
compelled by the rigid law of evolution to create 
and shape the cultural objects that correspond to 
the given stage of development. 

This evolutionary view was opposed by the doc- 
trine advocated by Andree in Ethnographische 
Parallelen und Vergleiche (1878) and by Ratzel in 
“Geschichte, Völkerkunde und historische Perspective”
(1904), who attributed the occurrence of identical
elements of culture in different regions of the 
world to the migrations either of entire cultures or 
of individual cultural elements and to direct or 
indirect cultural contact. Frobenius was always a 
convinced champion of this latter doctrine, which 
is known as the diffusion or migration theory. 

Building on the ideas of Andree, Ratzel, and his 
own teacher, H. Schurtz, Frobenius made a giant 
step through his two pioneering works “Der west- 
afrikanische Kulturkreis” (1897) and “The Origin 
of African Civilizations” (1898a), which cleared the 
way for a new scientific approach in ethnology. He 
demonstrated that many elements of material and 
mental culture are by no means scattered at ran- 
dom over the world but are always densely con- 
centrated in certain areas and always occur in a 
characteristic combination with other cultural mani- 
festations. Frobenius inferred from the identical 
geographical distribution of certain elements of cul- 
ture that these could not be fortuitous combinations, 
but that there had to be a close relationship among 
several of the elements. He therefore grouped areas 
of identical distribution into what he called Kultur- 
kreise; these, in turn, he arranged in relative chron- 
ological order so as to provide a historical back- 
ground for nonliterate cultures. The core of this 
method was cultural comparison, and its technical 
tool was cartography. As the creator of the theory, 
Frobenius became the trail-blazer of the historical 
approach in ethnology, for even today, cultural 
comparison constitutes an important means to the 
construction of historical accounts of cultures with- 
out written traditions. Later on, the theory of 
Kulturkreise was extended by Fritz Graebner and 
Bernhard Ankermann, and it finally became part 
of the theory of the “Vienna school” of Wilhelm 
Schmidt. However, this school did not develop the 
concept of Kulturkreise as Frobenius understood it 
but instead allowed it to become static, thus impairing
its effectiveness.

Although the concept of Kulturkreise continued 
to play a major role in Frobenius’ work, he soon 
came to realize clearly the methodological short- 
comings of his own early works that dealt with it. 
He sharply attacked the use of statistical methods 
to establish Kulturkreise—methods which amount- 
ed only to a summation of isolated and often quite 
heterogeneous elements, often totally unrelated and
of different historical significance. What he had 
described as Kulturkreise in his initial works were 
in fact bloodless, empty constructions, mere accu- 
mulations of data. Frobenius demanded a morpho- 
logical mode of considering cultures—a meaning- 
ful combination of individual elements into an 
organic whole and the comprehension of a culture 
in its complexity and its historical context. As Frobenius
saw it, Kulturkreise obtained from a cartographic
picture of distribution were not cultures
in the real sense of the term, but merely starting 
points for further research, skeletons that must be 
fleshed out, or auxiliary constructions that might 
enable him to penetrate to the core of the problem: 
the question of the nature of culture. The provi- 
sional nature of his Kulturkreise becomes especially 
manifest when they are compared to the Kultur- 
kreise of Graebner and the Vienna school, which 
were by no means intended to have a provisional, 
skeletal character. The latter were of a definitive 
nature and of fixed, rigid magnitudes; arranged 
chronologically and combined, they were even sup- 
posed to furnish an outline of the history of man- 
kind. 

Development of culture. Frobenius’ intellectual 
objective was to understand the essential nature of 
culture. His first book devoted to this problem was 
Die naturwissenschaftliche Kulturlehre (1899, Pro- 
bleme der Kultur, vol. 1). It foreshadowed the con- 
cepts of his subsequent theory of paideuma (1921), 
in which he endeavored to provide an answer to 
the question of the nature, the morphology, and the 
development of a culture. Frobenius regarded the 
several cultures as living organisms in the biolog- 
ical sense that every culture is subject to the laws 
of the organic world, springing up like seed, grow- 
ing, and attaining its apogee at maturity, after 
which it begins to age and finally dies. He used the 
terms Ergriffenheit (emotional involvement), Ausdruck
(expression), and Anwendung (application)
to characterize the stages of youth, maturity, and 
age traversed by a culture, comparing them to a 
life curve. 

Frobenius regarded the Ergriffenheit of man as 
the crucial event in the emergence of a culture. 
Once man is gripped by the world about him, the 
particular nature of the things in his world and the 
existential order within which he lives are revealed 
to him. In this process, man plays a rather passive 
role, being object rather than subject, affected by 
the phenomena that overcome him, move him, and 
shake him to his innermost being. It would be 
wrong, however, to exaggerate his passivity: in the 
last analysis, only alert, active, and creative spirits 
are open to the phenomena of the world around 
them; only they are capable of reproducing these 
phenomena and putting together a comprehensive 
and valid picture of the reality that surrounds them. 

Frobenius attributed decisive importance to the 
creative qualities of man. In his eyes, Ergriffenheit 
and creative will are the motive forces that give rise 
to the emergence of a culture. The emergence of 
culture occurs almost exclusively in the realm of 
ideas, for the process begins not simply with objects
but with the inner nature of the objects and phe- 
nomena in man’s environment. Therefore the Er- 
griffenheit, the youth stage, of a culture is always 
characterized by fundamental and magnificent 
spiritual-religious creations. More simply stated:
Religion stands at the beginning of a culture. Because
the rise and formation of the spiritual-religious
foundations generally take place quietly and
without any visible indications, it is very difficult to 
establish precisely the emergent phase of a culture; 
in the case of nonliterate peoples, precision is al- 
most impossible. As a rule, the historian can iden- 
tify and describe only those fully developed cultures 
that already possess a stamp of their own and have 
reached the stage of maturity, while the decisive 
mental processes that form a culture are not ac- 
cessible to observation; at best they can be recon- 
structed from its mature state. 

If the Ergriffenheit of man is the crucial event 
in the origin of a culture, man’s environment must 
be of decisive importance, for it alone is able to 
move him. That is why Frobenius regarded environ- 
ment as a factor of overriding significance and rec- 
ognized the primary bond between every culture 
and its location, while the factor of race seemed to 
him unimportant. He attributed differences be- 
tween various cultures largely to the differences in 
environmental conditions. The cartographic method 
he employed in mapping Kulturkreise and the pat- 
terns of distribution he obtained greatly aided him 
in reaching this conclusion. Thus, he believed that 
the extensive rain forests of the equatorial regions, 
with their uniformity of scenery and climate, give 
rise to a different form of Ergriffenheit than that of 
the savanna and steppes, with their pronounced 
alternations of drought and rainy periods. 

The second stage that a culture must pass 
through is that of Ausdruck, maturity, the apex of 
the life curve. At this point of development, men 
continue to be affected by things, but they have now 
assimilated the experience of their environment 
and mastered it in the philosophical sense. They 
are able to express the essential nature of things 
and of the existential order, as well as of their own 
existence, in religious, artistic, and social con- 
structs. Only when man has risen above the level 
of things can he make them completely his own 
and shape them creatively. In the period of the ma- 
turity of a culture, ideas take on visible shape; they 
are manifested in religious rites, in the forms of 
social order, in artistic creations, as well as in the 
complexity of daily life and the economy. A culture 
in the stage of maturity is in a state of harmony 
and draws on abundant resources. 

Frobenius’ researches were always aimed at comprehending
the spiritual center of a culture, from
which its impulses proceed and all actions of the 
culture are controlled. He called this spiritual center 
the paideuma, the soul of the culture, which per- 
meates man and gives his action a direction and 
goal. Every movement and every expression of a 
culture, even the simple implements of daily use, 
are related to the spiritual center and bear the 
stamp of the particular paideuma. All elements are 
conceived as intimately interrelated in function, 
which is not true of the elements of the early con- 
ception of Kulturkreise; this is why Frobenius re- 
fused to regard the early Kulturkreise as true 
cultures in the paideuma sense. 

The last stage is called by Frobenius Anwendung 
(application) because in this phase he saw the ra- 
tional aspect of reality and the question of the use- 
fulness and possible applications of a cultural as- 
set coming increasingly to the fore. An implement 
or a social institution is no longer evaluated solely 
as the expression of a central spiritual idea but is 
regarded only from the standpoint of its utility. The 
rational utilitarian aspect of many cultural assets, 
especially of the material objects, is no doubt in- 
herent from the very outset, but in the Anwendung 
phase it predominates over ideal values. The bonds 
linking the individual elements of culture to the 
spiritual center become ever looser, and their pro- 
gressive isolation finally results in the disintegra- 
tion of the entire culture. The Anwendung stage of 
a culture is characterized by overemphasis on technical-civilizing
forces. The nonpurposeful striving
for knowledge of the Ergriffenheit phase recedes, 
and creative spiritual ideas are “applied” and 
“turned to account” according to their usefulness. 
An increasing semantic depletion makes itself felt 
in the religious field, and symbols and religious rites 
are dominated by routine. When the creative forces 
are paralyzed, meaningful content dwindles away, 
cultural forms become petrified, and the culture 
gradually dies away.

As Frobenius saw it, the various cultures are 
closed organisms, subject to the same law that gov- 
erns all living things in this world. Every culture 
has a soul of its own, a unique character, and an 
individuality otherwise found only in animate na- 
ture. Frobenius therefore attributed to every culture 
an autonomous set of laws that determines its de- 
velopment, largely independent of its particular 
human members. This is an inevitable although 
often misunderstood conclusion if cultures are re- 
garded as living organisms that are governed by 
the laws of life. Man cannot stem the natural development
of a culture. For example, he cannot
prevent the aging of a culture; all he can do is prolong
or shorten the process. When Frobenius ventured
the opinion that culture might exist even
without human beings, he meant that culture is not 
a fiction but a reality that “takes hold of” men just 
as do the phenomena of their environment. This 
notion is supported by the historical fact that cul- 
tures tend to expand beyond the boundaries of their 
original centers, especially in the stage of maturity, 
and “seize,” as it were, people of alien cultures, put- 
ting them under their sway. 

Although Frobenius was the target of violent at- 
tacks, his thoughts and ideas fell upon fertile soil 
in the German-speaking world. His scientific work 
in the field of culture founded cultural morphology 
in ethnology, which is the special concern of the 
Frobenius Institute. The work of his disciple Adolf 
E. Jensen on comparative religion refined and 
advanced Frobenius’ description of the history of 
culture and increased its depth. All work in the 
cultural-historical aspects of ethnology has been 
profoundly influenced by Frobenius’ doctrines. For 
a time, he was closely associated with Oswald 
Spengler, who advocated similar ideas about the 
essential nature of culture. Outside of Germany, 
and especially in the English-speaking world, Fro- 
benius was accepted only with reservations. 

[For the historical context of Frobenius’ work, see the
biographies of BASTIAN and RATZEL. For discussion
of the subsequent development of Frobenius’ ideas,
see CULTURE; CULTURE AREA; HISTORY, article on
CULTURE HISTORY; and the biographies of GRAEBNER;
KOPPERS; NORDENSKIÖLD; SCHMIDT.]

I
STRUCTURAL-FUNCTIONAL ANALYSIS 


Few concepts in modern social science history 
have generated as much discussion as those of 
structure and function and the type of analysis 
associated with them. The main difficulty in speak- 
ing of structural-functional analysis in general 
arises from five sources. One, there is the feeling 
in many quarters that there is something new and 
special about structural-functional analysis. Two, 
in general usage elementary procedures in defini- 
tion have not been observed. The same term has 
frequently been used for more than one distinct 
referent. Three, teleology in the sense of scientific 
fallacy—in this case structural teleology, functional 
teleology, or both—has frequently been committed 
in connection with such analysis. Four, the use of 
stability assumptions in models generally has been 
both misunderstood and misconstrued. Five, unin- 
tentionally, evaluations have been written into the 
analysis, thereby raising questions about its ob- 
jectivity. 

Structural-functional analysis is not new in 
either the social or the natural sciences; it has a 
pedigree that stretches indefinitely back in both 
fields. The only new aspect of it is its formidable 
new name, structural-functional analysis. Discus- 
sion of it as something new under the sun is the 
social scientist’s counterpart of M. Jourdain’s discovery
that he had been speaking prose. The only
possible novelty associated with this form of analy- 
sis is the attempt in recent years to be carefully 
explicit in the use of these concepts and to differ- 
entiate special subsidiary forms of the analysis, 
although none of the latter are substantively new 
either. Shorn of careless uses of definitions and of 
teleology, structural-functional analysis is simply
a synonym for explicit scientific analysis in gen- 
eral. In scientific fields marked by greater theoret- 
ical development and associated applications of 
mathematical forms of expression, the increased 
verbal explicitness that has recommended the 
various concepts of structure and function to many 
in the biological and social sciences is more cum- 
bersome than available alternatives. Correspond- 
ing increases in theoretical development in the 
biological and social sciences will lead to similar 
alternatives there. 

Simply speaking, structural-functional analysis 
consists of nothing more complicated than phrasing
empirical questions in one of the following several
forms or some combination of them: (1) What
observable uniformities (or patterns) can be dis- 
covered or alleged to exist in the phenomena 
studied? (2) What conditions (empirical states of 
affairs) resultant from previous operations can be 
discovered or alleged to exist in the phenomena 
studied? (3) When process (or action, i.e., changes 
in the patterns, conditions, or both, depending on 
one’s point of view, are discernible between any two 
or more points in time) can be discovered (or 
alleged) to take place in terms of observable uni- 
formities, what resultant conditions can be dis- 
covered? The first question asks, “What structures 
are involved?” The second asks, “What functions 
have resulted (or have been performed)?” And the
third asks, “What functions take place in terms of 
a given structure(s)?” Many special forms of these 
three questions are useful and necessary for the 
analysis of different types of problems; for exam- 
ple, one might ask about the possibility of adjust- 
ment in terms of a system, the normative content 
of a system, the “necessary” features of a system, 
the degree of planning or consciousness involved, 
etc. All of these are variants of the three basic 
questions or some combination of them. In addition 
to the most general form of the concepts of struc- 
ture and function, six subsidiary sets of concepts 
most generally associated with this form of analy- 
sis, either explicitly or implicitly, are defined and 
discussed below in terms of the special purposes 
generally associated with them. 

Function and structure. These explicit concepts, 
ostensibly in their most general form, are frequently
encountered in the biological and social
sciences. In both fields somewhat similar difficulties 
have been associated with the use of the concepts. 
Joseph H. Woodger in biology and Robert K. Mer- 
ton in the social sciences (Woodger 1924, pp. 326- 
330; Merton 1949, pp. 21-28) have pointed to the 
profusion of different referents given to the term 
“function” even in the scientific sense. These differ- 
ent referents have for the most part been left im- 
plicit by both authors although the confusion is 
noted. Perhaps the major difficulty associated with 
the general concept of function has been the use 
of a single term to cover several distinctly different 
referents. This difficulty is much greater with the 
concept of function than with the associated con- 
cept of structure. Most of the general discussions 
in the literature have been largely preoccupied with 
the concept of function. The concept of structure 
has, more often than not, been left undiscussed. To 
prevent confusion, the most general form of the 
concepts “structure” and “function” used for pur- 
poses of scientific analysis must be defined in such 
a way that the other uses of the terms represent 
specific subcategories. The term “function” may be 
defined as any condition, any state of affairs, re- 
sultant from the operation (including in the term 
“operation” mere persistence) of a unit of the type 
under consideration in terms of a structure(s). In 
the case of the biological sciences that unit is usu- 
ally an organism or a subsystem of an organism. 
In the case of the social sciences the unit is usually 
a system of action involving a set of one or more 
persons (actors). The term “structure” may be de- 
fined as a pattern, i.e., an observable uniformity, 
in terms of which action (or operation) takes place.

In ordinary usage the term “function” is most 
generally identified with the term “eufunction” de- 
fined below. Similar confusion of the term “struc- 
ture” with “eustructure” is not common. The con- 
cept of structure in this general form explicitly 
covers a wide range of possibilities from highly 
stable uniformities to highly fleeting ones. Any 
event may contain an element indicative of a struc- 
ture if it is considered with regard to nonunique 
aspects or parts. 

Much of the interest of scientific social analysis is 
centered on the structure of societies and other social 
systems (or the structures of social action in gen- 
eral), that is, on the interrelationships among dif- 
ferent kinds, aspects, and parts of structures. The 
relationship between the concepts of function and 
structure is close. Structure refers to an aspect of
empirical phenomena that can be divorced from 
time. The patterns of action, qua patterns, do not 
exist as concrete objects in the same sense that 
sticks and stones do. The patterns of action in this
sense are abstractions from concrete empirical phe- 
nomena, and they “exist” and are empirically veri- 
fiable in the same sense that the squareness of a 
box exists and is empirically verifiable. What has 
been said here of patterns qua patterns does not 
apply to the patterns when they are considered in 
operation. Structures in operation (i.e., the exem- 
plifications of particular patterns) are empirical 
in the same sense as sticks and stones. In this sense 
the term “structure” in social science is no depar- 
ture from the usage of the natural sciences. 

The concepts of structure and function fall into 
a peculiar set of concepts. Classification of a refer- 
ent as a function or a structure depends in part 
on the point of view from which the phenomena 
concerned are discussed. What is a function from
one point of view is a structure from another. The 
concepts of consumption and production are more 
familiar examples of this peculiar set. The manu- 
facture of automobiles is production from the point 
of view of an automobile user and consumption 
from the point of view of a steelmaker. Thus func- 
tions in this sense are themselves structures (i.e., 
patterns) or have important structured (i.e., pat- 
terned) aspects, and all structures are the results 
of operations in terms of other structures (i.e., they 
are functions). The politeness of small children 
may be considered as a structure of their behavior 
or as a function of operation in terms of the struc- 
tures (i.e., patterns) of discipline indulged in by 
their parents. 

Functional and structural requisites. The con- 
cepts functional and structural requisites are pri- 
marily oriented to the development of systems of 
analysis for any cases of a particular type of unit. 
A functional requisite may be defined as a general- 
ized condition necessary for the maintenance of 
the type of unit under consideration, given the 
level of generalization of the definition and the 
most general setting of such a unit. In seeking to 
discover the functional requisites of a unit one asks 
the question, “What must be done to maintain the 
system concerned in its setting on the level under 
consideration?” A given condition is a functional 
requisite if its removal (or absence) would result 
in (a) the total dissolution of the unit, or (b) the 
change of one of the structural elements of the 
unit on the level under consideration (i.e., one of 
the structural requisites).

A structural requisite may be defined as a pattern 
(or observable uniformity) of action (or operation)
necessary for the continued existence of the type
of unit under consideration, given the level of generalization
of the definition and the most general
setting of such a unit. In trying to discover the
structural requisites of a unit one seeks an answer 
to the question, “For a given unit, what structures 
(i.e., patterns) must be present such that operations
in terms of these structures will result in the
functional requisites of the unit?” Briefly, func- 
tional requisites are answers to the question, “What 
must be done?”; structural requisites are answers 
to the question “How must what must be done, be 
done?” Both questions involve the qualifying phrase 
“if the unit is to persist in its setting on the level 
of generalization given.” Most of the misunder- 
standing about stability mentioned above has been 
associated with this question of persistence. One 
utilizes an assumption of stability to get at a list 
of requisites, but this does not imply that any unit 
analyzed must in fact be stable. To make such 
units stable by definition is to reduce most discus- 
sions of this sort to trivia. To use a stability as- 
sumption as an element in a model (i.e., paradigm) 
does not do so. Structural-functional requisite analysis
includes the following steps in any specific case: 
(1) define the unit of phenomena to be studied; 
(2) discover (or hypothesize about) the setting 
(i.e., those factors determining the limits within
which the ranges of variations of the unit con- 
cerned take place); (3) discover what general con- 
ditions must be met (i.e., functional requisites) if 
the unit is to persist in its setting without change 
(i.e., alteration of structures) on the level under
consideration; (4) discover what structures must 
be present in the system, as a minimum, if action 
in terms of the system is to result in the persistence 
of the unit in its setting without any change on 
the level under consideration (i.e., the structural 
requisites).

Several things should be kept in mind. One, al- 
though the definition of the unit is arbitrary, 
whether empirical referents of such a unit exist 
is not. Two, the setting of such a unit is not a mat- 
ter of definition but of discovery. Three, neither 
structural requisites nor functional requisites can 
be alleged to exist “because they are requisites.” 
Such allegations constitute the commission of struc- 
tural or functional teleology. To assert that requi- 
sites exist because they are requisites is to imply 
that the unit must continue to exist for some pre- 
ordained reason. Scientifically speaking, nothing 
can be alleged to exist “because it is a requisite.” 
The fact that this form of teleology is fallacious is 
in no way contradicted by the fact that it is fre- 
quently a useful element of models of action to 
assume or allege that the action concerned is 
oriented to future states of affairs. Four, the deter- 
mination of the functional requisites of a unit is 
the determination of the minimum implications of
interrelationships between the factors setting the 
limits of variation of the unit and the unit 
itself, and in this type of analysis it is never neces- 
sary to deal with more than these minimal impli- 
cations. However, when material is collected in 
these terms, more than the minimum will always 
be collected, since such minimal materials never 
exist neatly separated from all others. Five, there 
is a systematic test for error in requisite analysis. 
If a structure is alleged to be a requisite of the unit 
concerned, and if examination of a particular case 
of such a unit uncovers no material on this score, 
one of three or some combination of three things is 
an explanation of the lack of data. First, the hy- 
pothesis that the structure is a requisite of the unit 
concerned may be an incorrect one. Second, the 
observer may have misobserved; there may be data 
he has overlooked. Third, the unit concerned, al- 
though it may closely resemble the unit as defined, 
may in fact not be a case of such a unit. 

There is nothing new about requisite analysis. 
There has never been a time in which people failed 
completely to ask questions as to whether given 
conditions or patterns were not in some sense nec- 
essary for the continued existence of certain types 
of units. 

Functional and structural prerequisites. The 
requisite concepts are not in and of themselves 
oriented to questions of change; the concepts of 
functional and structural prerequisites are. All 
questions of change implicitly or explicitly involve 
comparisons between at least two of at least three 
possible distinctions with regard to the units under 
consideration. These distinctions are those of an 
initial, a transitional, and a resultant stage. Sys- 
tematic knowledge about any two of these stages 
makes possible systematic derivation (prediction or 
postdiction) of knowledge about the third. Requisite
analysis can be used for examining any two of 
these three or more stages in terms of a constant
frame of reference. Constants and variables are 
therefore more easily detected. 

A functional prerequisite may be defined as a 
function that must pre-exist if a given unit is to 
come into being in a particular setting. Correspond- 
ingly, a structural prerequisite may be defined as 
a structure that must pre-exist if a given unit is to 
come into being in a particular setting. The closer 
two stages of a given unit under consideration are 
in point of time the greater is the probability that 
the requisites and the prerequisites of a given unit 
will be identical. To illustrate this, one of the 
commonest mistakes in trying to understand the 
problems of development in “underdeveloped areas”
is the implicit assumption that requisites and prerequisites
must or do coincide. It is neither obvious
nor tenable to take the position that all of the 
structures that must be maintained if the United 
States is to continue as a highly modernized so- 
ciety are identical with the structures that have to 
pre-exist if Nigeria is to become a highly modern- 
ized society, or even that they are identical with all 
the structures that had to pre-exist, say, in the 
beginning of the nineteenth century, if the United 
States was to become a highly modernized society. 

Implicitly or explicitly, some form of requisite 
analysis always underlies any form of prerequisite 
analysis. Teleology must be avoided in uses of the 
concepts of structural and functional prerequisites 
as well as in uses of the concepts of structural and 
functional requisites. To assert that some structure 
must pre-exist because it is a structural prerequi- 
site of a given unit is to fall into teleological dy- 
namics as distinguished from the teleological statics 
described in the case of requisites. 

Eufunction, dysfunction; eustructure, dysstruc- 
ture. These concepts focus attention on questions 
of adjustment and maladjustment of the units un- 
der consideration. Although the term “function” is 
ordinarily used in several different senses in the 
social sciences, most often it refers to the concept 
of eufunction as defined here. With respect to a 
given unit, “eufunction” may be defined as any 
function that increases or maintains adaptation or 
adjustment of the unit to the unit’s setting, thus 
making for the persistence of the unit as de- 
fined in its setting. With respect to a given unit, 
“dysfunction” may be defined as any function that 
lessens the adaptation or adjustment of the unit to 
its setting, thus making for lack of persistence 
(i.e., change or dissolution) of the unit as defined 
in its setting. The terms “eustructure” and “dys- 
structure” are similarly defined, mutatis mutandis. 
Alternatively, eustructures may be defined as struc- 
tures in terms of which operations result in eu- 
functions, and dysstructures may be defined as 
structures in terms of which operations result in 
dysfunctions. Eufunctions or dysfunctions and the 
corresponding variants of structure may exist, as
far as a given unit is concerned, as elements of that 
unit (i.e., internal to it) or as elements of the set- 
ting of the unit concerned. Not all eufunctions for 
a unit are eufunctions of the unit, although ordi- 
narily, when one uses the concept of eufunction, 
attention is focused on functions associated with 
the unit itself rather than on functions of operation 
in terms of other units in that setting. 

The concept of “nonfunction,” in the sense of 
being neither a eufunction nor a dysfunction, is 
inutile. Where questions of adjustment are not involved,
the general concept of function as used here
or another of its special forms will serve. Where 
questions of adjustment are involved, nothing is 
less probable than a precisely poised function with 
no implications for adjustment or maladjustment. 
Indeed, such considerations define precisely what 
might be called the category of irrelevant functions, 
i.e., those lacking any implications for the focus
of interest at the time. Such functions would also 
have to lack any interdependencies with the eufunc- 
tions or dysfunctions for or of the systems con- 
cerned. 

It is in connection with the terms “eufunction” 
and “dysfunction” (and the corresponding forms of 
the concept of structure) that most allegations of 
building evaluations into structural functional 
analysis are made. In rough terms, a eufunction is 
a function that tends to preserve the unit as de- 
fined and a dysfunction is one that tends to dis- 
solve it. However, loose usage of these concepts 
results in the use of eufunctional to refer to con- 
ditions making for “good adaptations” and dys- 
functional to refer to conditions making for “bad 
adaptations.” Such judgments are a function of 
one’s evaluation of the unit concerned—if one cares 
to make such an evaluation. No condition or as- 
pect of a condition is inherently eufunctional or 
dysfunctional. Without identification of the unit 
concerned and its setting, no judgment of the eu- 
functional or dysfunctional character of the condi- 
tion can be made. The same condition that is 
eufunctional from one point of view may be dys- 
functional from another. For example, conditions 
that were eufunctional for Meiji Japan were dys- 
functional for the continuation of Tokugawa Japan 
and vice versa. 

While these concepts are susceptible to the un- 
critical inclusion of value judgments, they are also 
a useful vehicle for the explicit consideration of 
policy-oriented analyses. Either implicitly or explic- 
itly, in seeking to maximize a given policy goal, one 
asks the questions, “What conditions make for max- 
imal adjustment of the system concerned to that 
state of affairs (i.e., the eufunctions for that sys- 
tem in that setting)?” and “What conditions should 
be avoided as interfering with the maintenance of 
that system in that setting (i.e., are dysfunctional 
for it)?” Two other points should be carefully kept 
in mind. First, teleology is to be avoided in this 
connection too. No function or structure exists 
because it is a eufunction or a eustructure, nor is 
it tenable to hold that any function or structure that 
persists must be eufunctional or eustructural. Second,
a given element with important eufunctional
or eustructural implications for a given unit in its
setting may also contain aspects with dysfunctional 
or dysstructural implications. A functional requisite 
of a given unit is certainly at least in part eufunc- 
tional for that unit, but a given function, even 
though it is a requisite, may also contain dysfunc- 
tional implications as well, although presumably in 
this case they would not be sufficiently pronounced 
to overcome the requisite nature of the function. 

The concepts of eufunction, dysfunction, eu- 
structure, and dysstructure focus on the question of 
the maintenance or lack of maintenance of a sys- 
tem in its setting. The requisite concepts focus on 
the question of what a system is like if it is main- 
tained. The prerequisite concepts focus attention 
on what conditions must pre-exist before a given 
type of unit can come into being. The requisite con- 
cepts are useful primarily for static theories, al- 
though dynamic analysis is involved in discovery 
of the requisites of a given system in its setting. 
Like the prerequisite concepts, those of eufunction, 
dysfunction, eustructure, and dysstructure focus 
attention on dynamic interrelationships—on the 
implications of the operation of a particular struc- 
ture or the presence of a particular function for the 
state of the system concerned at some future point 
in time. 

Latent, manifest, UIR, and IUR functions and 
structures. This set of terms is adapted from 
Merton's suggestion many years ago about latent 
and manifest functions. Usage here is somewhat 
changed, since Merton did not envisage their use 
apart from the concepts of eufunction and dysfunc- 
tion as defined here, nor did he develop the implica- 
tions of the fact that his defining conditions could 
vary independently. Following Merton's usage, an
element will be termed “manifest” if it is intended 
and recognized by the participants in the system 
of action concerned. It will be termed “latent” if 
it is neither intended nor recognized. It will be 
termed “UIR” if it is unintended but recognized 
and “IUR” if it is intended but unrecognized. These 
concepts focus attention on the level of explicitness 
and sensitivity of the members of a given system 
to the structures in terms of which they operate. 
Such distinctions are vital to any discussions in- 
volving rationality, planning, and so forth. The 
concept of manifest dysfunction is explicitly in- 
cluded as a possibility here. Failure to consider it 
explicitly has built implicit evaluations into a great 
deal of analysis. Social reformers usually concen- 
trate all their efforts on being manifestly dysfunc- 
tional for the system they are trying to reform and 
manifestly eufunctional for what they consider to 
be a better world. 
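
The four terms amount to a two-by-two classification on the conditions
"intended" and "recognized." The following Python sketch lays that out; it
is illustrative only, and the function name classify and its boolean
parameters are assumptions introduced here, not part of the original
terminology.

```python
def classify(intended: bool, recognized: bool) -> str:
    """Return the label for an element, given the two defining conditions."""
    if intended and recognized:
        return "manifest"   # intended and recognized
    if not intended and not recognized:
        return "latent"     # neither intended nor recognized
    if recognized:
        return "UIR"        # unintended but recognized
    return "IUR"            # intended but unrecognized


# Enumerate all four combinations of the two conditions.
for intended in (True, False):
    for recognized in (True, False):
        print(intended, recognized, classify(intended, recognized))
```

Because the two conditions vary independently, all four cells are possible,
and each label applies equally to eufunctions and dysfunctions.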

The concepts of function and structure in general,
the requisites, the prerequisites, and eufunction-dysfunction,
eustructure-dysstructure, are
concepts generally applicable to social and non- 
social phenomena and to human as well as to non- 
human phenomena. Concepts of latent, manifest, 
IUR, and UIR specifically focus attention on the 
point of view of the actor and involve the assump- 
tions that such actors can and do orient their be- 
havior to future states of affairs about which they 
are capable of explicit thinking and observation, 
and that the presence or absence of such explicit 
thinking, observation, or both has, in turn, implica- 
tions for their subsequent behavior. Although it is 
conceivable that such concepts be utilized with re- 
gard to nonsocial phenomena, it is not likely that 
they will prove useful in such a field nor have they 
generally been so employed. It is quite conceivable 
that they could be employed with regard to some 
animate but nonhuman phenomena, but by far 
their greatest applications are to human social 
phenomena. 

Concrete and analytic structures. The distinc- 
tion between concrete and analytic structures is 
oriented to the type of abstraction involved in cer- 
tain concepts useful for empirical analysis. Con- 
crete structures may be defined as those structures 
(i.e., patterns) that define the units that are at least 
in theory capable of physical separation (in time, 
space, or both) from other units of the same sort. 
As the term will be applied for social analysis, it 
refers more specifically to the structures that de- 
fine the membership units involved in social action, 
i.e., units in regard to which any given individual 
may be classified as included or excluded, or some 
combination of the two. Society, as that term is 
generally used, is a concrete structure considered 
in operation. So are organizational contexts gener- 
ally, such as families, business firms, and govern- 
ments. Concrete structures other than societies are 
the structures of action that define the membership 
units within a given society or those relating to 
two or more societies, i.e., they are all other social 
systems. All concrete structures are social systems; 
all societies are social systems; and all social sys- 
tems other than societies are either subsystems of 
a given society or the results of interdependencies 
among two or more societies. The family, as that 
term is often used, is a concrete structure; its struc- 
tures define a membership unit. So are business 
firms. A given individual may at one and the same 
time be a member of both a given business firm 
and a given family; nevertheless, the social struc- 
tures characterizing the business firm are not part 
of the social structures characterizing the family 
concern, however complex the interrelationships
may be, unless the firm is specifically and completely
a family affair. It is at least conceivable that
all the members of a given family could be put in 
one room, that the members of a given business 
firm could be put into another, and if necessary, 
those individuals who are members of both could 
be removed from the two rooms and put into a 
third. 

Analytic structures are defined as structures (i.e., 
patterns) that define the aspects of units that are 
not even theoretically capable of concrete separa- 
tion from other structured aspects. If one defines 
the economic aspect of action as having to do with 
the allocation of goods and services and political 
aspects as having to do with the allocation of power 
and responsibility, then economic and political
structures are analytic structures, since there are 
no social systems that are totally devoid of either 
economic or political aspects. 

Failure to keep this sort of distinction straight 
constitutes the fallacy of reification (or misplaced
concreteness). Thus the terms “economy” and “polity”
as generally used cannot occupy the same position
in a system of analysis as the term “family.” As
these concepts are generally defined, they represent 
analytic structures, and the concept “family” refers 
to a concrete one. This distinction, though difficult 
in the context of social phenomena, is a generally 
familiar one in the natural sciences. Concepts such 
as atoms, molecules, cells, organs, and elementary 
particles are concrete structures. Aspects such as
mass, shape, color, temperature, and mitosis are 
analytic structures. 

In actual analysis one always uses some combi- 
nation of both sorts of concepts. One cannot iden- 
tify analytic structures without some specification 
sooner or later of the concrete structures of which 
they are aspects. Correspondingly, one cannot discuss
the nature of concrete structures without
sooner or later making reference to aspects that cut 
across such units. Careful observance of the dis- 
tinction and the avoidance of the fallacy of reifica- 
tion is the key par excellence to the avoidance of 
much disciplinary parochialism in the social 
sciences. 

Apart from considerations of methodological 
elegance there is another reason for being careful 
about this distinction. One of the most general dis- 
tinctions between concrete structures or organiza- 
tions is that between those which are specialized 
in the predominant orientation of their members 
to one or another of these analytic structures and 
those which are not specialized in this way. (Organ- 
izations may, of course, be specialized in other 
ways, e.g., in terms of the product orientations of 
the members.) Thus, a business firm may be spoken
of as a predominantly economically oriented structure
(or perhaps more accurately, though more
cumbersomely, as a specially economic-analytic-structure-oriented
concrete structure). A family
unit is not specialized in any one of these aspects 
in any clear-cut predominance over the others. 

Institutions, tradition, and utopian structures. 
The three terms “institution,” “tradition,” and “uto- 
pian structure” refer to different types of structure. 
The concept “institution” may be defined as any 
normative pattern, conformity to which is generally 
expected and failure to conform with which is gen- 
erally met with moral indignation or some other 
form of sanction by the individuals who are in- 
volved in the same general social system and are 
aware of the failure. This is the sense of the term 
employed by Talcott Parsons in his early work. A 
given structure may be more or less well-institu- 
tionalized to the degree to which conformity with 
the structure is generally to be expected and the 
degree to which failure to conform to the structure 
is met by the moral indignation or sanctions of the 
individuals who are involved in the system and 
who are aware of the failure. Thus, the two sources 
of variability with regard to the level of institution- 
alization may be referred to as conformity aspects 
and sanction aspects. An institution may be re- 
garded as crucial or more or less strategic. A given 
institution will be called a crucial institution if it 
is a structural requisite of the system in which it 
appears. It may be regarded as a more or less strategic
institution to the extent that: (1) it is the institutionalized
form of all or a portion of a structural
requisite; and (2) the structure concerned may not 
or may be altered without destroying the structural 
requisite involved. A completely strategic institution 
is a crucial institution. The first of the two aspects 
of the strategic quality of an institution will be 
called its substantive aspect, and the second will be 
called its critical aspect. The critical aspect is a 
special form of consideration of the general prob- 
lem of functional substitutes, equivalents, or alter- 
natives (i.e., the question of the possibility of a 
given condition being the result of behavior in 
terms of one or more alternative structures).

A great deal of nonsense is talked and written 
as a result of misunderstanding the problem of 
functional substitutability. Many use this nonsense 
to argue the severe limitation of what can be done 
in these terms. Much of the confusion results from 
initially posing a question implicitly on one level
of generality and in the course of the discussion 
proceeding, still implicitly, on a different and less 
general level. If the argument is correct that role 
differentiation on the basis of absolute age is a 
requisite of all societies, there can be no functional
substitutability for that. The fact that the members
of one society handle role differentiation in terms 
of five absolute age distinctions whereas those of 
another handle it in terms of ten simply indicates 
the fact that a particular solution to absolute age 
distinctions is not determinant for all societies. For 
a particular society, however, either five or ten 
distinctions may be just as much a requisite as 
some sort of role differentiation on the basis of 
absolute age is for all societies. Some discussions 
of functional substitutability imply that the struc- 
tures involved can be changed without other 
changes on the level of generality on which the 
treatment has been focused. There are, of course, 
instances in which a given function may result 
from operation in terms of various alternative 
structures on specific lower levels of generality 
without changes on the most general level under 
scrutiny, but that question is by no means settled 
by the analyst's simply being able to conceive of 
another structure from action in terms of which 
the function in question could result—and so as- 
serting his conclusions. 

The term “tradition” may be defined as an insti- 
tution whose perpetuation is institutionalized—that 
is to say, as a special form of institution. An insti- 
tution may be considered more or less traditional, 
or more or less traditionalized, to the extent that its 
perpetuation is institutionalized without regard to 
changes of the functional implications of opera- 
tions in terms of it—whether these be eufunctional 
or dysfunctional implications. In this sense, mo- 
nogamous marriage in the United States would 
seem to be much more highly traditional and tra- 
ditionalized than the structure of driving on the 
right-hand side of the road. A tradition in this sense 
is a double institutionalization: (1) the structure 
concerned is an institution; and (2) the perpetua- 
tion of the structure is also an institution. Tradi- 
tions may vary in at least two radically different 
ways. One, they may, of course, vary with regard 
to the institution that is traditionalized, although 
implicit in the concept of any institution in its 
conformity aspects is some degree of traditionalization.
More important, traditions, like other institutions,
may vary with regard to their combinations
of conformity and sanction aspects. For
example, the tradition of driving on the right-hand 
side of the road in the United States has very high 
conformity aspects and relatively minor sanction 
aspects, whereas the tradition of the incest taboo 
has very high conformity aspects and very high 
sanction aspects. 

Utopian structures may be defined as those par- 
ticular sets of normative patterns which, though 
not institutionalized, do require allegiance to them 
as institutionalized ideals. Allegiance to them as
ideals is highly traditionalized in both conformity 
and sanction aspects. Thus, one does not in fact 
expect conformity to the principle of “Love thy
neighbor as thyself,” but expression of it as an ideal 
is certainly institutionalized in some social con- 
texts, and its perpetuation is institutionalized. In 
the social systems in which they are found, utopian 
structures often have the function of making it
easier to teach the actual institutional structures 
of the system and of setting the framework in terms 
of which actual conformity to less extreme expec- 
tations is, in fact, expected. 

Ideal and actual structures. Ideal structures 
may be defined as those structures that the mem- 
bers of any given system (or systems) feel should 
be the structures of their behavior or those of 
others. Actual structures may be defined as the 
structures (i.e., patterns) in terms of which the
members of the system in fact behave as they 
would be described by an observer with theoreti- 
cally perfect scientific knowledge. The terms “ideal” 
and “actual” in these respects may be applied in 
other connections, but they are most generally ap- 
plied to the concepts of structure. The following 
generalizations in terms of them are relevant in 
almost any analysis. One, there are no peoples who 
do not make some distinction between ideal and 
actual structures. Two, ideal and actual patterns 
of a given system never coincide perfectly. Three, 
the members of the systems concerned are always 
to some extent aware of the fact that the ideal and 
actual structures do not coincide perfectly. Four, 
some sources of stress and strain in any social
system inhere in the fact that the ideal and actual 
structures do not coincide perfectly. Five, not para- 
doxical relative to the fourth generalization, some 
of the possibilities of integration in terms of any 
social system inhere in the fact that the ideal and 
actual structures do not coincide perfectly. Six, 
failure of the ideal and actual structures to coin- 
cide perfectly is never explicable solely in terms 
of hypocrisy. Seven, perfect coincidence of the 
ideal and actual structures would require both 
perfect knowledge and perfect motivation on the 
part of all of the members of the system concerned 
in all possible situations. This would (a) overload
any known set of cognitive capacities of individ- 
uals; and (b) were it possible to have perfect co- 
incidence of the ideal and actual structures, the 
resultant system would be totally brittle, since any 
change of setting factors necessitating or causing 
any change internal to the system would require 
complete resetting of all of the cognitive capacities 
of the members of the system. 


These concepts can be further elaborated; e.g., 
there may be ideal ideal structures (i.e., utopian 
structures) as well as many pseudo or pretended 
ideal structures (i.e., hypocrisies which are ideal 
structures which are not actual ideal structures).


Structural-functional analysis in the most general
sense, shorn of confusion of terminology, misuses
of stability assumptions, teleology, and implicit
evaluations, is synonymous with scientific
analysis in general. The special forms of
structural-functional analysis are not new, although the
attempt to be explicit about them in general is recent.
Of the various special forms of structural-functional
analysis, there is one sense in which requisite
analysis underlies the rest. It is oriented to the
derivation of a system of analysis for any unit of the
sort under discussion, and all of the other forms
of structural-functional analysis, whether in terms
of prerequisites, manifest, latent, UIR, and IUR
elements, eufunction-dysfunction, eustructure-dysstructure,
etc., presuppose two or more states
of some sort of unit(s) which presumably can be
analyzed systematically. Of the various special
forms of structural-functional analysis, those
having to do with the latent, manifest, UIR, and
IUR elements, the concepts of institutions, traditions,
and utopian structures, and the ideal and
actual structures are the most specifically oriented
to the analysis of human social phenomena. The
other special forms are much more generally
oriented.

Marion J. Levy, Jr. 
II
VARIETIES OF FUNCTIONAL ANALYSIS 


For the past few decades functional analysis has 
been a major approach to understanding the or- 
ganization of society. But at the same time it has 
become the target of very serious criticism and has 
been attacked as illogical, value-laden, and incapa- 
ble of explaining anything. As one would expect, 
there is a serious problem of definition. 

The term “functional analysis” or “structural- 
functionalism” has been applied to a great variety 
of approaches (Merton 1949; Firth 1955) that 
share only one common element: an interest in re- 
lating one part of a society or social system to an- 
other part or to some aspect of the whole. Within 
this variety, three aspects or types of functionalism 
can be distinguished: the first is based on the con- 
cepts and assumptions of sociology; the second, on 
the theoretical orientation that all major social pat- 
terns operate to maintain the integration or adap- 
tation of the larger social system; the third, on a 
model of self-regulating or equilibrating systems. 
Most functional analyses contain all three aspects, 
but a clear description and evaluation of function- 
alism requires their separate consideration. 

A nondistinctive type. The first type of func- 
tionalism consists in stressing sociological analyses 
as opposed to psychological or historical analyses. It 
is based on the assumption that the social traits 
existing in a society at a given time are interrelated 
in a systematic way and that ordered relationships 
can be discovered among “social facts” or social in- 
stitutions without necessarily bringing in psycho- 
logical or historical factors. This guiding assump- 
tion is accompanied by an implicit consensus on 
the important elements of social structure and by 
a developing set of hypotheses on their interrelations.
The crucial elements of social structure include:
the composition and formation of groups,
the organization of economics and religion, and the 
distribution of authority and prestige. These are 
the now-traditional concerns of sociology and social 
anthropology, the elements stressed by the major 
theorists from Emile Durkheim and Max Weber to 
A. R. Radcliffe-Brown, Talcott Parsons, and George 
C. Homans. 

This type of functional analysis is most charac- 
teristic of the monographs of British social anthro- 
pologists and their adherents in the United States, 
and of the essays of sociologists, especially Kingsley 
Davis and Talcott Parsons, on some aspect of mod- 
ern societies. The anthropologists typically present 
a careful description of a primitive society. The 
sociologists begin with a more impressionistic ac- 
count of some institution in their own society. Then 
both go on to show how the various parts of the 
society or institution are interrelated, how the 
whole system “hangs together” despite some strains 
and inconsistencies. This type of work has led to 
many valuable hypotheses, such as those relating 
the isolated nuclear family and the modern eco- 
nomic system, or matrilocal residence and divorce 
rates. 

Careful description and a search for general pat- 
terns are obviously necessary to social science, and, 
as Davis (1959) points out, functionalism has been 
an effective banner under which to fight for these 
qualities. In anthropology, functionalism success- 
fully opposed the diffusionists and the empiricists, 
who tended to ignore general patterns, and the evo- 
lutionists and the rigid monocausalists, who tended 
to ignore the necessity of careful description. 

However, as Davis also argues, this type of func- 
tionalism is essentially equivalent to sociological 
analysis, and it is misleading to distinguish it by a 
special name. The functionalist position that socie- 
ties should be viewed as systems having definite 
structure or organization means primarily that the 
parts of a society are interrelated in some nonran- 
dom way; i.e., that there are regular patterns and 
relationships, and, therefore, it is reasonable to 
try to construct a science of society. This position 
underlies all sociological analysis, although there 
is considerable disagreement on whether all parts 
of a society are interrelated or whether there are 
relatively autonomous subsystems. The elements 
of society that these functionalists consider are 
also not distinctive, nor are the kinds of rela- 
tionships among elements. Functionalists do stress 
reciprocal relationships, but the typical monograph 
or essay also includes relationships based on similarity,
one-way causation, fulfillment of a psychological
or social need, etc. The problems and accomplishments
of this kind of functional analysis
are the same as the problems and accomplishments 
of sociology and social anthropology in general. 
Therefore, the discussion below will concern only 
the other two types of functionalism, both of which 
are distinct forms of analysis. 

Two distinctive types. The first of the two re- 
maining types of functional analysis will be called 
traditional functional analysis because it is the 
most widely used type. It is based on the theoreti- 
cal orientation that all major social patterns op- 
erate to maintain the integration or adaptation of 
the larger system, and it is further distinguished 
by two crucial attributes, both of which make it 
very difficult to construct adequate explanations. 
First, a social pattern is explained by the effects or 
consequences of the pattern, and, second, these 
consequences must be beneficial and necessary to 
the proper functioning of society. The traditional 
approach includes those functionalists who focus 
on a few aspects of a society at a time and attempt 
to link one social pattern with one need and there- 
by “explain” the pattern. It also includes those who 
deal with a complex system of many elements and 
try to show how these elements are interrelated so 
as to form an adaptive and consistent system. 

Traditional functional analysis can be contrasted 
with the second type: formal functionalism. This 
approach is concerned with homeostatic, or equili- 
brating, systems, and so with feedback and self- 
regulation. It abandons the two distinctive attri- 
butes of the traditional approach. The effects of a 
trait for some part of the system are used to ex- 
plain that part and not to explain the trait; and 
there is no restriction on the kinds of consequences 
to be considered. They may or may not be bene- 
ficial or necessary to society. This approach is 
called “formal” because it does not include a theo- 
retical orientation or a substantive hypothesis about 
empirical events. It is a model of kinds of rela- 
tionships between elements, like a mathematical 
model. 

The distinctiveness of traditional functionalism 
from other forms of analysis can be readily illus- 
trated. For example, a traditional functionalist ex- 
planation of adolescent rebellion in the United 
States might point to the positive effects or func- 
tions of rebellion in securing independence from 
parents, which is crucial in our society. A non- 
functionalist explanation might focus on causes of 
rebellion, such as parental ambivalence about the 
desirability of being stable and mature. Formal 
functionalism would produce an interpretation different
from both of the above, since it is concerned
with equilibrating or feedback systems and not
with relationships of one-way effect or cause. It 
should be noted again that these different types of 
analysis are usually combined in practice. Thus 
Parsons’ discussion of adolescence (1942) includes 
both of the arguments in the above example, as 
well as many more. 

Functional analysis clearly is not distinct in 
some ways. It does not have any necessary mo- 
nopoly on the empirical problems of analyzing total 
societies or comparing institutions cross-culturally, 
even though historically these fields have been 
dominated by functionalists. It is also not the case 
that complex, formal models of functional systems 
include some logically special types of proposi- 
tions. Any model that can be communicated to 
social scientists can be reduced to a combination 
of simple statements such as: “if a then b,” and 
“a = f(b).” Nor does traditional functional analysis
involve a distinctive form of explanation; rather, 
it must be evaluated according to the general 
standards of adequate scientific explanations. 


Traditional functional analysis 


Scientific explanation. The criteria of an ade- 
quate scientific explanation are extremely difficult 
to state precisely, and the philosophers of science 
continue to struggle with the problem. However, 
Nagel (1961), Hempel (1965), and other phi- 
losophers have stated the criteria in a way that 
seems close to the actual practice of scientists and 
that is based on the idea that explaining a statement
necessarily involves deducing it from some
more general statement or statements. This is the 
view of explanation that will be used here. 

A scientific explanation consists of deducing 
the proposition to be explained (the explicandum) 
from some true or plausible general assumptions 
in conjunction with a statement of initial or ante- 
cedent conditions. The statement of initial condi- 
tions connects the explicandum to the assumptions. 
For example, if one wanted to explain the existence 
of witchcraft among the Navajo, using the two 
assumptions: (1) “if a society is subjected to ex- 
ternally imposed change then there will be a high 
level of aggression within the society,” and (2) “if 
high aggression then witchcraft”; then the appro- 
priate statement of initial conditions would be 
“imposed change has occurred among the Navajo.” 
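
The deductive structure of this example can be written out schematically.
In the sketch below the symbols are labels introduced here for
illustration and do not appear in the original: C(s) stands for "society s
is subjected to externally imposed change," A(s) for "there is a high
level of aggression within s," W(s) for "witchcraft exists in s," and n
for the Navajo.

```latex
\begin{align*}
&(1)\quad \forall s\,\bigl(C(s) \rightarrow A(s)\bigr) && \text{assumption}\\
&(2)\quad \forall s\,\bigl(A(s) \rightarrow W(s)\bigr) && \text{assumption}\\
&(3)\quad C(n) && \text{statement of initial conditions}\\
&(4)\quad A(n) && \text{from (1) and (3)}\\
&(5)\quad W(n) && \text{from (2) and (4): the explicandum}
\end{align*}
```

Without step (3) nothing about the Navajo follows from the two general
assumptions, which is the sense in which the initial conditions connect
the explicandum to the assumptions.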

The minimal requirements for an adequate ex- 
planation are (1) all the propositions must be 
clearly stated, that is, stated clearly enough to 
enable recognition of a negative case; (2) the as- 
sumptions must not be empirically false; (3) the 
statement of initial conditions must be true; and
(4) the explicandum must be derivable from the
assumptions and initial conditions according to 
the rules of logic. Formulating an explanation is 
equivalent to constructing a theory, for if the as- 
sumptions are general it will be possible to deduce 
from them many verifiable propositions and not 
only the explicandum. In other words, the logical 
structure of explanation is identical with predic- 
tion if the prediction is based on a theory. In both 
cases one demonstrates that the phenomenon in 
question had to occur, given the assumptions. The
only difference is that in prediction one starts with 
the assumption and proceeds to deduce testable 
propositions; in explanation one begins with a 
tested proposition and searches for some appro- 
priate assumptions from which it can be deduced. 
Explanation, no less than prediction, requires pre- 
cise propositions with clear empirical implications. 

Traditional functional “explanations” usually 
fail to meet most of the minimal requirements of 
an adequate explanation. It is, of course, difficult 
to construct adequate explanations or theories in 
social science, regardless of the type of analysis 
that is used. However, the concepts and strategy 
of traditional functionalism create formidable prob- 
lems, and the problems are especially tenacious. 
Many contemporary functional analyses contain 
the same errors that have been pointed out for 
decades in numerous papers (for an excellent 
presentation of the problems of functional analysis, 
see Durkheim 1895, chapter 5). 

Vacuous functional explanations. The problems 
of functional explanation are illustrated in Kluck- 
hohn’s sophisticated and complex interpretation 
of Navajo witchcraft (1944). He uses the three 
major types of functional arguments: (1) a vacu- 
ous explanation in terms of various unrelated, 
adaptive effects of witchcraft; (2) a more adequate 
explanation in terms of a functional prerequisite 
theory; and (3) an interpretation of witchcraft as 
part of an equilibrating system, using the model of 
a functional system. 

Kluckhohn first explains Navajo witchcraft pat- 
terns simply by demonstrating that they have a 
variety of beneficial effects. He states that the 
major question (explicandum) is why witchcraft 
patterns have survived through a given period of 
time and that his basic postulate is “that no cul- 
tural forms survive unless they constitute responses 
which are adjustive or adaptive, in some sense, for 
the members of the society or for the society con- 
sidered as a perduring unit” (1944, p. 46). This 
postulate is very similar to Merton’s statement of 
the three basic postulates of functional analysis: 
“Substantially, these postulates hold first, that 
standardized social activities or cultural items are 
functional for the entire social or cultural system; 
second, that all such social and cultural items ful- 
fill sociological functions; and third, that these 
items are consequently indispensable” ([1949] 
1957, p. 25). These postulates have been criticized 
for leading to a static, conservative orientation 
(Dahrendorf 1958); for if every pattern helps to 
maintain every other pattern, then the system will 
not generate conflict or change; and if every social 
pattern is indispensable, then it is clearly unwar- 
ranted and probably dangerous to change any 
pattern. 

The postulates also lead to vacuous explanations 
like Kluckhohn’s. After stating the major question 
and basic postulate, Kluckhohn proceeds to de- 
scribe many adjustive and adaptive consequences 
of witchcraft. They include (1) expressing aggres- 
sion outside the group, since accused witches are 
generally outsiders; (2) retarding the accumula- 
tion of wealth, since rich people are frequently 
accused; and (3) preventing adultery, since fear 
of witches keeps people from leaving their homes 
at night. 

The form of this type of explanation is: 


(1) Social patterns persist if and only if they 
are adaptive (or functional or fulfill needs); 

(2) Pattern X is adaptive; 

Therefore, pattern X has persisted. 


Such explanations are vacuous because the term 
“adaptive” or “needs” or “functional” is so loosely 
defined that all social patterns can be viewed as 
adaptive (or maladaptive). “Adaptive” is used to 
include anything that is beneficial to some groups 
or system. With a little imagination, one can show 
that any social pattern has adaptive consequences 
for some states of some systems and maladaptive 
consequences for some states of the same or other 
systems. Therefore, the presence or absence of any 
social pattern can be “explained” by this assump- 
tion, and any social pattern can be used to confirm 
or falsify the assumption. For example, one could 
focus on the maladaptive consequences of witch- 
craft and argue that since witchcraft has persisted 
and is maladaptive, the assumption is falsified. 
This problem has been pointed out clearly in 
many critiques of functional analysis (Merton 
1949; Nagel 1957); it is usually stated as the ne- 
cessity for clear system reference or for defining 
“functional” or “adaptive” relative to specific states 
of specific systems. Kluckhohn is aware of the 
problem, and Merton discusses it explicitly and 
at length in his famous paper “Manifest and Latent 
Functions” (1949). Nonetheless, Kluckhohn, 
Merton in his explanation of city bosses (1949), 
and many less competent functionalists often point 
to the functions or beneficial effects of the phe- 
nomenon being considered and write as if this 
constituted an explanation of the phenomenon. 
These interpretations often contain valuable hy- 
potheses about how the phenomenon is related to 
other parts of the society, but they typically ignore 
the maladaptive consequences of the phenomenon 
and hinder the development of theory by implying 
that the task of explanation has already been par- 
tially accomplished. 

The form of functional explanations. Kluck- 
hohn’s second line of argument in explaining 
Navajo witchcraft is based on the beginnings of 
a theory of functional prerequisites which limits 
the relevant consequences of witchcraft to the 
effective management of anxiety and hostility. The 
outline of his argument, leaving out many com- 
plications, is (1) if a society is to survive as an 
integrated system, it must provide a relatively non- 
disruptive way of expressing or dealing with hos- 
tility and anxiety; (2) hostility and anxiety can 
be managed through witchcraft and related scape- 
goating behavior or through withdrawal, passivity, 
sublimation, or other functionally equivalent pat- 
terns; (3) because of various conditions of Navajo 
society, the functional equivalents of witchcraft 
cannot be the main ways of dealing with anxiety 
and hostility; and finally (4) Navajo society has 
survived. From these assumptions (1 and 2) and 
initial conditions (3 and 4), Kluckhohn derives 
his explicandum: Navajo witchcraft patterns have 
survived. 

The form of the argument is represented in the 
following scheme: 


(1) If a society survives (A), then hostility is 
managed (B); 

(2) If hostility is managed (B), then witchcraft 
(C) or a functional equivalent (D) is present; 

(3) The functional equivalents of witchcraft 
(D) are not present in Navajo society; 

(4) Navajo society has survived (A); 

Therefore, witchcraft is present in Navajo 
society (C). 


Kluckhohn’s argument here is one of the most 
sophisticated and precise attempts to construct a 
general functional explanation. It also illustrates 
one form of an adequate functional explanation, 
which henceforth will be referred to as the paradigm 
of functional explanations: (1) if a system 
survives or is integrated, then a functional pre- 
requisite is fulfilled; (2) if this prerequisite is ful- 
filled, then pattern X or its equivalent is present 
in the system; (3) the functional equivalents are 
not present in system S; (4) S has survived or is 
integrated, and therefore X is present in S. There 
are other forms of explanation that use the basic 
concepts of functionalism, but this form seems to 
be implicit in most traditional functional analyses. 

Let us first consider the simplest question: the 
logic of the explanation. Substituting symbols for 
the phrases in Kluckhohn’s argument, it is appar- 
ent that the logic is sound: (1) if A then B; (2) 
if B then C or D; (3) D is false; (4) A is true, and 
therefore C is true. Hempel (1959) has argued 
that in the typical functional explanation the sec- 
ond assumption is usually stated in reverse. Ignor- 
ing the functional equivalent term D for purposes 
of clarity, this means changing assumption (2) 
to “if C then B.” The argument is then logically 
incorrect and contains the fallacy of affirming the 
consequent: if A then B; if C then B and A is true; 
therefore C is true. However, Hempel has misin- 
terpreted the functionalists, which is easy to do 
given the discursive-essay style of most functional 
explanations. 
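
The two validity claims just made can be checked mechanically with a truth table. The following is only a minimal sketch in Python and is not part of the original argument; the helper names `implies` and `is_valid` and the encoding of each premise as a function are illustrative assumptions. An argument form is treated as valid when no assignment of truth values makes every premise true and the conclusion false.

```python
from itertools import product

def implies(p, q):
    """Material conditional: 'if p then q'."""
    return (not p) or q

def is_valid(premises, conclusion, n_vars):
    """True if no truth assignment makes all premises true and the conclusion false."""
    for values in product([True, False], repeat=n_vars):
        if all(p(*values) for p in premises) and not conclusion(*values):
            return False
    return True

# Paradigm form: (1) if A then B; (2) if B then C or D; (3) not D; (4) A; therefore C.
paradigm_premises = [
    lambda a, b, c, d: implies(a, b),
    lambda a, b, c, d: implies(b, c or d),
    lambda a, b, c, d: not d,
    lambda a, b, c, d: a,
]
paradigm_valid = is_valid(paradigm_premises, lambda a, b, c, d: c, 4)

# Hempel's reading, ignoring D: if A then B; if C then B; A; therefore C.
reversed_premises = [
    lambda a, b, c: implies(a, b),
    lambda a, b, c: implies(c, b),
    lambda a, b, c: a,
]
reversed_valid = is_valid(reversed_premises, lambda a, b, c: c, 3)

print(paradigm_valid)   # True
print(reversed_valid)   # False
```

The second form fails on the assignment in which A and B are true and C is false, which is the fallacy of affirming the consequent.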

In logic, the statement “if A then B” or “A → B” 
is identical with the statement “if not-B then 
not-A” (Kemeny, Snell & Thompson 1956). But 
when a functionalist says, “witchcraft results in the 
effective management of hostility,” he does not 
mean “if witchcraft, then management of hostility.” 
This would be identical to saying, “if hostility is 
not managed, then there will not be witchcraft,” 
and most functionalists would not want to say that, 
since they would hypothesize that managing hos- 
tility depends on several factors in addition to 
witchcraft (and its functional equivalents). What 
they mean to say is “if witchcraft (or its equivalent) 
is not present, then hostility is not managed,” 
which is logically identical with assumption (2) in 
the above paradigm. In other words, a given social 
pattern and its equivalents are typically interpreted 
as necessary but not sufficient conditions for ful- 
filling a functional prerequisite. 
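
The equivalences used in this paragraph can also be verified by exhaustive enumeration. The sketch below is illustrative only; the helper `equivalent` and the variable names are assumptions introduced here, with B standing for "hostility is managed," C for "witchcraft is present," and D for "a functional equivalent is present."

```python
from itertools import product

def implies(p, q):
    """Material conditional: 'if p then q'."""
    return (not p) or q

def equivalent(f, g, n_vars):
    """True if f and g agree on every assignment of truth values."""
    return all(f(*v) == g(*v) for v in product([True, False], repeat=n_vars))

# "if A then B" compared with its contrapositive "if not-B then not-A".
contrapositive_holds = equivalent(
    lambda a, b: implies(a, b),
    lambda a, b: implies(not b, not a),
    2,
)

# What the functionalist means: "if neither C nor D is present, then not B".
intended = lambda b, c, d: implies(not (c or d), not b)
# Assumption (2) of the paradigm: "if B then C or D".
assumption_2 = lambda b, c, d: implies(b, c or d)
# Hempel's reading: "if C then B" (C as a sufficient condition).
hempel = lambda b, c, d: implies(c, b)

intended_matches = equivalent(intended, assumption_2, 3)
hempel_matches = equivalent(hempel, assumption_2, 3)

print(contrapositive_holds)  # True
print(intended_matches)      # True
print(hempel_matches)        # False
```

The intended statement treats C or D as a necessary condition for B, whereas Hempel's version treats C as a sufficient condition; the enumeration shows that these are different claims.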

The major problems in Kluckhohn’s explanation 
concern the basic concepts of functional theory: 
adaptation or survival, integration, functional prerequisites, 
and functional equivalents. An adequate 
theory or explanation requires clear definition of 
these concepts and direct or indirect verification 
of the assumptions in which they are interrelated. 
However, this requirement is so difficult that almost 
all functionalists, including Kluckhohn, fail 
to meet it. It is so difficult that at this stage of the 
development of social science it might be advisable 
to abandon the attempt to construct functional 
theories or explanations at the total society level. 

System reference. In Kluckhohn's first assump- 
tion, the crucial terms are survival and manage- 
ment of hostility. Neither term is defined precisely 
enough to allow discrimination between societies 
that have survived or have managed hostility and 
those that have not. Some of the problems in- 
volved in conceptualizing survival, integration, and 
functional prerequisites will be discussed in later 
sections. 

The remaining term in the first assumption is 
“society,” and here again we run into difficulties. 
Any theory must specify its scope (the units to 
which it applies and under what conditions), and 
“society” is a unit that is notoriously difficult to 
define. Kluckhohn himself states that it is ques- 
tionable whether the Navajo as a whole constitute 
an integrated society, and a crucial part of his 
interpretation refers not to Navajo society, but to 
the consumption unit of several families. Witch- 
craft accusations are generally made outside the 
consumption unit but within Navajo society; there- 
fore the statement that witchcraft manages hos- 
tility by directing aggression outside the group is 
true only of the consumption groups. Yet Kluck- 
hohn’s first assumption refers to a different unit— 
society. This shift invalidates his argument. His 
assumptions here are essentially: if a society sur- 
vives (A), then hostility is directed outside of the 
society (B); if hostility is directed outside con- 
sumption groups (C), then witchcraft is present 
(D). It is impossible to deduce anything from these 
assumptions since B and C are not the same. 
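
The effect of the shift in system reference can be made concrete by searching for counterexamples. The following sketch is an illustration under simplifying assumptions: the function `counterexamples` is hypothetical, the letters follow the paragraph above, and the "consistent" version simply identifies B with C to show what a uniform unit of reference would permit.

```python
from itertools import product

def implies(p, q):
    """Material conditional: 'if p then q'."""
    return (not p) or q

def counterexamples(names, premises, conclusion):
    """List every truth assignment where all premises hold but the conclusion fails."""
    found = []
    for values in product([True, False], repeat=len(names)):
        env = dict(zip(names, values))
        if all(p(env) for p in premises) and not conclusion(env):
            found.append(env)
    return found

# Shifted reference: if A then B; if C then D; A holds. Does D follow?
shifted = counterexamples(
    ["A", "B", "C", "D"],
    [lambda e: implies(e["A"], e["B"]),
     lambda e: implies(e["C"], e["D"]),
     lambda e: e["A"]],
    lambda e: e["D"],
)

# Consistent reference (B and C are the same statement): if A then B; if B then D; A.
consistent = counterexamples(
    ["A", "B", "D"],
    [lambda e: implies(e["A"], e["B"]),
     lambda e: implies(e["B"], e["D"]),
     lambda e: e["A"]],
    lambda e: e["D"],
)

print(shifted)     # [{'A': True, 'B': True, 'C': False, 'D': False}]
print(consistent)  # []
```

The single counterexample describes a surviving society in which hostility is directed outside the society but not outside consumption groups, and in which witchcraft is absent; nothing in the shifted premises rules it out.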

Functional equivalents. In Kluckhohn’s second 
assumption we come to the most admirable part 
of his argument, for, unlike most functionalists, 
Kluckhohn explicitly faces the issue of functional 
equivalents. His inability to handle it adequately 
indicates the difficulty of the task. 

The concept of functional equivalents is based 
on the idea that several social patterns can fill 
the same function or have the same consequence; 
to put it colloquially, “there’s more than one way 
to skin a cat.” The concept has been useful in sug- 
gesting the classification of social patterns on the 
basis of common function. Such classification often 
reveals important similarities among apparently 
different social patterns, for example, the similarities 
between witchcraft and racial prejudice 
when both are viewed as ways of focusing aggression 
outside the group. However, the concept leads 
to great difficulties in functional explanations of 
a particular social pattern. 

Kluckhohn’s second assumption states that 
witchcraft or its functional equivalent is a neces- 
sary condition of managing hostility. The assump- 
tion is meaningful (directly or indirectly testable) 
only if it is possible to distinguish between patterns 
that are and are not equivalents of witchcraft. This 
requires listing all functional equivalents, or cate- 
gorizing them, or stating their distinctive attri- 
butes. This must be done in such a way that the 
assumption is falsifiable—in this instance, so that 
it is logically possible for a society to manage hos- 
tility and not contain witchcraft or any equivalent 
pattern. 

Kluckhohn’s approach is to list the functional 
equivalents of witchcraft, and he states: “With- 
drawal, passivity, sublimation, conciliation, flight 
and other responses” are alternative adjustive re- 
sponses to hostile feelings (1944, p. 51). One 
problem with this and all lists is that there is 
no reason not to add more items, thereby making 
assumption (2) virtually unfalsifiable. Kluckhohn 
also attempts to identify the distinctive attributes 
of all the patterns on this list: they are all adjust- 
ive responses to hostility. But this only makes 
assumption (2) a tautology: if hostility is man- 
aged, then there will be witchcraft or some other 
way of managing hostility. 
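
The difference between a falsifiable and a tautological version of assumption (2) can be shown in a short sketch. It rests on one reading of the passage, labeled here as an assumption: that "some other way of managing hostility" means no more than "hostility is managed and witchcraft is absent." The helper `is_tautology` and the variable names are illustrative.

```python
from itertools import product

def implies(p, q):
    """Material conditional: 'if p then q'."""
    return (not p) or q

def is_tautology(formula, n_vars):
    """True if the formula holds under every assignment of truth values."""
    return all(formula(*v) for v in product([True, False], repeat=n_vars))

# Equivalents identified independently of their effect: the claim can fail.
def independent_form(managed, witchcraft, equivalent):
    return implies(managed, witchcraft or equivalent)

# Equivalents defined only by their effect (assumed reading):
# "other way" = hostility is managed and witchcraft is absent.
def circular_form(managed, witchcraft):
    other_way = managed and not witchcraft
    return implies(managed, witchcraft or other_way)

independent_is_tautology = is_tautology(independent_form, 3)
circular_is_tautology = is_tautology(circular_form, 2)

print(independent_is_tautology)  # False
print(circular_is_tautology)     # True
```

Only the first version forbids some state of affairs (hostility managed with neither witchcraft nor an equivalent present), and so only it can be tested.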

The need for theory. After defining the func- 
tional equivalents of a pattern, another task re- 
mains. In order to explain why a particular social 
pattern is present instead of another, functionally 
equivalent pattern, it is necessary to develop a set 
of assumptions or a theory that specifies which 
pattern will occur under which conditions. Kluck- 
hohn implicitly states such theories as he elimi- 
nates the functional equivalents on his list on the 
basis of specific conditions of Navajo society or 
its environment. For example, he says that with- 
drawal is of limited effectiveness because Navajo 
“types of shelter and the needs for co-operation for 
economic ends sharply limit the privacy of indi- 
viduals” (1944, p. 52). Kluckhohn is moving in 
the right direction here, but he is clearly a long 
way from formulating a clear and explicit theory 
about the conditions determining the occurrence 
of a particular social pattern. 

It should be noted that if such theories are de- 
veloped, functional explanations would be ex- 
panded to include the cause of origin of social 
patterns and the cause of their persistence (Bredemeier 
1955). Most traditional functional explanations 
of a social pattern X refer to the causes of 
X only if there is a reciprocal relationship in which 
X causes itself, that is, if X has certain effects on 
Y that in turn cause X. In the paradigm of func- 
tional explanations, social pattern X is explained 
by its effects and not by its causes, “cause” here 
meaning the necessary or sufficient antecedent 
conditions for pattern X. But the paradigm can be 
expanded to include assumptions about the con- 
ditions determining the presence of a particular 
pattern, following Kluckhohn’s procedure, and 
these assumptions would probably take the form 
of stating the causes of the origin and persistence 
of pattern X. 

Such an expansion has the additional advantage 
of making it easier to include social change in 
traditional functional analyses. If one excepts the 
concept of functional differentiation (Parsons 
1964), which has limited applicability, it must be 
admitted that traditional functional analysis is not 
very useful in dealing with change. However, with 
the specification of antecedent conditions determin- 
ing particular social patterns, it would be possible 
to predict the change from one social pattern to 
another, functionally equivalent pattern. 

Kluckhohn’s explanation of Navajo witchcraft 
illustrates two major problems in traditional func- 
tional explanations: consistent system reference 
and functional equivalents. Other traditional ex- 
planations will be considered in the following dis- 
cussion of the concepts of functional prerequisites, 
adaptation, and integration. We will return to 
Kluckhohn in the discussion of models of func- 
tional systems. 

Functional prerequisites. The remaining prob- 
lems in traditional functional explanations concern 
the definition of the concepts in the first assump- 
tion of such explanations: if a system survives or 
is integrated, then a certain functional prerequisite 
is fulfilled. 

The best-known attempt to formulate a list of 
needs or prerequisites is that of Aberle and his 
co-workers (1950). They start with a definition 
of the survival of a society, which includes bio- 
logical survival of its members and the absence 
of general apathy, and then state nine functional 
prerequisites, such as regulating affect and con- 
trolling disruptive behavior. The list is presented 
as a tentative set of cross-cultural categories, an 
improvement on the standard chapter titles in 
monographs. 

This type of list is not very helpful in construct- 
ing functional explanations, because it does not 
limit the domain of functional prerequisites. The 
needs are so broad that almost any social process 
can be seen as contributing to at least one of them, 
and, because it is a list, there are few constraints on adding 
more needs. But a limited set of prerequisites 
is essential to functional explanation; without it, 
an imaginative investigator can “explain” any 
social pattern merely by describing some conse- 
quence or effect of the pattern and arguing that 
this consequence is a functional prerequisite. 

There are two main approaches to arriving at 
a limited set of prerequisites. One is to construct 
a set of prerequisites and empirically test whether 
they have been fulfilled in surviving societies, while 
other conceivable prerequisites have not. The sec- 
ond approach is to devise a theory that includes 
an assumption specifying social prerequisites. The 
assumption would not be directly testable. It would 
have to be combined with other propositions stating 
the empirical implications of (1) asserting that all 
societies must meet these prerequisites or (2) 
classifying social patterns according to the pre- 
requisites they meet. 

Parsons (1961) is using the second approach 
when he asserts that all systems must meet the 
prerequisites of latent pattern maintenance, inte- 
gration, goal attainment, and adaptation. He is not 
making an empirical statement but is proposing 
that it would be fruitful to categorize and interpret 
social structures in terms of their contributions to 
these functions. His system is not directly useful 
in explaining social patterns because, as yet, he 
has not developed a clear set of propositions that 
specify the empirical relevance of placing a struc- 
ture in one of the four categories as opposed to 
another. This crucial step was made in Bales’s 
general theory of functional prerequisites (1950). 
He specifies two basic functions—adaptation and 
integration—and supplements this nonempirical 
assertion with a complex set of propositions about 
the empirical processes that accompany each func- 
tion. For example, adaptation is associated with 
increasing division of labor, stratification, and 
formality, while the opposite characterizes integra- 
tion. This suggests specific predictions about the 
attributes of organizations oriented to adaptation, 
or of task leaders as opposed to expressive leaders, 
or of small groups in the adaptive phase. Although 
Bales’s theory still awaits precise formulation, it is 
a rare example of a theory that is both general and 
functional and that leads to adequate explanations 
or precise predictions [see INTERACTION, article on 
INTERACTION PROCESS ANALYSIS]. 

Adaptation and integration. The difficulty in 
defining the prerequisites for the adaptation or 
integration of social systems is based in part on 
the ambiguity of the concepts “adaptation” (or 
“survival”) and “integration.” These concepts are 
crucial to the first assumption in the paradigm of 
functional explanations: if a system survives or 
is integrated, then certain functional prerequisites 
are fulfilled. The assumption requires a definition 
that will discriminate between systems that are 
adapted or integrated and those that are not; and 
this has proved to be difficult. The concept of 
adaptation is further complicated by its association 
with evolutionary theory. 

The difficulty in defining adaptation has long 
been recognized (Radcliffe-Brown 1952). Adapta- 
tion may be equated with physical survival, but 
this runs into the problem that societies, unlike 
organisms, rarely fail to survive in the sense of 
keeping enough of their members alive and to- 
gether. Most recorded cases of the physical death 
of societies are the direct result of hostile action 
by overwhelmingly powerful outsiders who are 
new to the environment, and this is not the kind 
of situation with which functionalists are usually 
concerned. Since almost all societies survive, bar- 
ring sudden catastrophe, it is impossible to test a 
statement of the form “X is necessary for survival” 
if X is universal. It is also impossible to test a statement 
of the form “X₁ is necessary for the survival 
of social system S₁” if X₁ has always been present 
in S₁. 
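
The testability point can be illustrated with a small sketch. The function `assess_necessity` and all of the data below are hypothetical and serve only to show the logic: a claim that X is necessary for survival can be refuted only by a case that lacks X and nevertheless survives, so if no observed case lacks X the claim cannot be tested at all.

```python
def assess_necessity(cases):
    """Assess the claim 'X is necessary for survival'.

    cases: iterable of (has_x, survived) pairs, one per observed system.
    Returns 'untestable' when no case lacks X, 'refuted' when some case
    survived without X, and 'not refuted' otherwise.
    """
    survived_without_x = [survived for has_x, survived in cases if not has_x]
    if not survived_without_x:
        return "untestable"
    if any(survived_without_x):
        return "refuted"
    return "not refuted"

# Hypothetical data: X is universal, so there is nothing to compare against.
universal_x = [(True, True), (True, True), (True, False)]
# Hypothetical data: some systems lack X, and none of those survived.
varied = [(True, True), (False, False), (False, False)]
# Hypothetical data: one system survived without X.
counter = [(True, True), (False, True)]

print(assess_necessity(universal_x))  # untestable
print(assess_necessity(varied))       # not refuted
print(assess_necessity(counter))      # refuted
```

This is also why smaller systems that frequently fail to survive offer better ground for testing statements about prerequisites than total societies do.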

Adaptation or survival may also be defined as 
maintenance of the social organization or struc- 
tural continuity. Survival in this sense is by no 
means universal; in fact it might be difficult to 
find a society that has survived over the last cen- 
tury, i.e., one that has not undergone basic change. 
This leads to new problems, because functional 
explanations of the persistence of social patterns 
require the assumption that the system has survived 
(statement 4 in the paradigm). Precise definitions 
of “stability” or “structural continuity” 
would have to be worked out, and such explana- 
tions would be limited to societies that fit the 
definition of “stable.” Alternatively, adaptation 
can be defined as ensuring high reproduction 
rates, following the biologists’ revision of Darwin 
(Simpson 1949). This definition has the advantage 
of discriminating among societies, being easy to 
measure, and making it possible to distinguish 
different degrees of adaptation, as opposed to the 
all-or-nothing criterion of physical survival. These 
two latter definitions or others probably could be 
formulated precisely and could be used in speci- 
fying the prerequisites of adaptation. If the 
problem of functional equivalents could be solved, 
it would then be possible to explain social patterns 
according to the paradigm of functional 
explanations. 

However, many interpretations in terms of 
adaptation do not follow the paradigm. They often 
use the form of argument discussed in the section 
on vacuous explanations (patterns persist only if 
they are adaptive; pattern X is adaptive; there- 
fore X persists). This type of argument is not 
vacuous if “adaptive” is precisely defined, and 
some investigators seem to be moving toward a 
fairly precise definition which equates “adaptation” 
with “maintaining adequate levels of nutrition and 
health for most members of the society.” Harris’ 
interpretation of the sacred cows of India (1965) 
and other recent papers on economics and ecology 
in primitive societies (Leeds & Vayda 1965; 
Suttles 1960) are examples of this kind of argu- 
ment, in which a social pattern is interpreted by 
showing how it helps to ensure an adequate food 
supply. These interpretations are unobjectionable 
if they are viewed as illustrations of a specific hypothesis, 
such as Harris’ hypothesis that there is a 
“cross-cultural tendency to maximize production 
of food” (1959, p. 194). However, the interpreta- 
tions are often presented as if they were explain- 
ing a social pattern in terms of the assumption 
that “social patterns persist only if they are adap- 
tive.” Adequate explanations require plausible as- 
sumptions, and the plausibility of this assumption 
rests on shaky grounds. 

First, the assumption is plausible if one accepts 
the idea that there is a process of selection of the 
fittest social patterns or societies. But I doubt that 
many social scientists imagine, for example, that 
some early societies developed the family and 
therefore survived, while others did not and there- 
fore died out, or that some Northwest Coast In- 
dians developed the potlatch and survived, while 
the others became extinct. One alternative and 
somewhat more plausible notion is that some 
groups developed distinct methods of socialization, 
or the potlatch, and these groups reproduced faster 
than the others and eventually dominated the so- 
ciety. However, I know of no data that clearly 
demonstrate that either of these assumed processes 
ever occurred except in the few situations where 
extinction through disease or military aggression 
was imminent. Another alternative idea is that 
adaptive social traits are more widespread than 
less adaptive traits because the weak tend to imi- 
tate the strong (Dore 1961). Social patterns may 
diffuse to weaker groups from groups with more 
resources and power, that is, with patterns that 
are more adaptive. 

The assumption that only adaptive traits persist 
is also plausible if restricted to the manifest 
functions of social patterns, that is, to the effects 
of the pattern that are recognized and intended by 
members of the society (Merton 1949). If the de- 
cision makers of a society are aware of the adap- 
tive value of a social pattern, this will probably 
influence whether they want to maintain the pat- 
tern. However, the crucial variables here are the 
decision makers’ beliefs about the adaptive or in- 
tegrative consequences of the pattern. These events 
are a subset of the class of goal-oriented behavior; 
people behave so as to maximize certain goals and 
values, and one of these goals is usually successful 
adaptation to the environment. In these cases, it 
would be better to abandon functional terminology 
for a more cognitive, actor-oriented approach. 

The concept of integration is just as problematic 
as that of survival. Integration is defined in a wide 
variety of ways and often masks value judgments 
or unnecessarily complicates a clear nonfunctional 
statement. This section will focus on the difficul- 
ties in defining integration so that it can be used 
in traditional functional explanations. Many anal- 
yses involving integration or adaptation supple- 
ment or replace the traditional approach with the 
use of models of self-regulating systems, and this 
aspect of functionalism will be considered later 
in the discussion of formal functionalism. 

There are two major approaches to defining 
integration. One draws on system theory and de- 
fines integration in terms of interdependence and 
frequent interaction among the members of a 
bounded unit. Although the approach leads to a 
relatively precise and manageable definition, it 
excludes many aspects of the intuitive notion of 
integration. The second approach draws on the 
ideas of congruency and fit or the absence of con- 
flict, strain, or anxiety. The notion of fit is ex- 
tremely vague. The statement that pattern X fits 
with pattern Y means only that there is some rela- 
tionship between X and Y. The type of relationship 
is often left unspecified. 

The typical functionalist approach to the concept 
of integration is to equate fit with lack of conflict. 
Four kinds of definitions in terms of absence of 
conflict can be distinguished. The first defines integration 
as logico-meaningful consistency among 
social patterns (Sorokin [1937-1941] 1962, vol. 1, 
pp. 15, 23). Integration means that similar themes, 
values, or principles are used in all spheres of 
social life. One ambiguity in this definition is the 
meaning of absence of integration. It could mean 
conflict within the cultural system; however, many 
functionalists seem to mean conflict within individuals, 
and they make the dubious assumption 
that logical inconsistency among the beliefs will 
produce psychological conflict. The second defini- 
tion of integration focuses on the absence of ag- 
gression and exploitation of the powerless among 
members of a social system. An integrated system 
is marked by cooperation, consensus on mutual 
rights and duties, and loyalties that crosscut the 
subgroups within the system. The third definition 
focuses on the absence of conflicting demands on 
individuals as to the appropriate behavior or feel- 
ings in a specific situation; and the fourth defini- 
tion concerns the fit between an individual’s mo- 
tivation and the behavior required by the social 
system. 

The obvious general problem in using any of 
these definitions is that a functional explanation 
must assume that the society being considered is 
integrated. This is proposition (3) in the argu- 
ment: (1) if the system is integrated, then need 
N is filled; (2) if N, then pattern X; and (3) the 
system is integrated; therefore X. This limitation 
is implicitly recognized by functionalists, for they 
tend to shift to nonfunctional explanations when 
interpreting social patterns that result in increased 
conflict. One example of such a shift is Parsons’ 
discussion of the female role in the United States 
(1942). 

It seems very unlikely that any society or large 
social system is perfectly integrated, regardless of 
which definition is used. Therefore, integration 
needs to be defined in a relative way: if a system 
has Y amount of integration, then N will be ful- 
filled. This is not an impossible task but a diffi- 
cult one. It requires extensive theoretical and em- 
pirical work on the distribution and implications 
of different levels of integration, the prerequisites 
for a given level of integration, and the function- 
ally equivalent social patterns that fulfill each 
prerequisite. Until some such specification of inte- 
gration is made, the term cannot be used in ex- 
planations [see INTEGRATION, article on SOCIAL 
INTEGRATION]. 

Alternative approaches. Traditional functional- 
ism has been defined as those analyses that use 
certain concepts (adaptation, integration, func- 
tional prerequisites, and equivalents) and that 
explicitly or implicitly use the paradigm of func- 
tional explanation in which social patterns are 
explained by their effects. Because of the many 
problems discussed above, few functionalists suc- 
ceed in formulating adequate explanations or theo- 
ries. If some of the concepts are abandoned and 
the paradigm is changed, then alternative types of 
analysis are revealed that have a better prospect 
of leading to adequate explanations. Although 
some of these alternatives are frequently used, 
they are not labeled as “functional analysis” as 
often as the traditional approach. 

One frequently used alternative is to turn func- 
tional explanations around, that is, to reverse the 
explicans and the explicandum in the paradigm 
of explanation so that phenomena are explained 
by their causes, not by their effects or functions. 
Instead of explaining a social pattern by deducing 
it from the assumption that the society has sur- 
vived, one explains the survival of the society by 
the patterns whose effect is to ensure survival. 
For example, the statement that “witchcraft lessens 
in-group aggression” is used to explain rates of 
aggression, not witchcraft. This approach, com- 
bined with a focus on systems other than total 
societies, includes studies of the prerequisites for 
the survival of marriages, utopias, associations, or 
other social systems. It also includes studies of the 
prerequisites for the structural continuity of social 
systems, that is, of the conditions for maintaining 
a certain type of social organization in a specified 
environment. This is the approach described by 
Levy in his outline of “structural-functional requisite 
analysis” (1952, pp. 34 ff.). 

A second possible change of traditional func- 
tionalism, as mentioned above, is to consider 
smaller systems under specified conditions rather 
than total societies in general. One great advantage 
in this procedure is that systems such as therapy 
groups, unions, or lineages often fail to survive or 
seem to have low levels of integration; therefore, 
statements about the prerequisites for survival or 
integration can be tested. In addition, the possible 
functional prerequisites and equivalents are prob- 
ably more limited, and, finally, there would be less 
difficulty in maintaining consistent system refer- 
ence because there are fewer subsystems than in a 
society. 

A more radical change is to abandon the focus 
on adaptation and integration and to consider the 
prerequisites of any state of a system—such states 
as high suicide rates, for instance, or high or low 
levels of in-group aggression. With this revision, 
we have eliminated the distinguishing attributes 
of functional analysis: explaining patterns by their 
beneficial (adaptive or integrative) effects. Func- 
tionalism so defined includes all analyses of the 
prerequisites, conditions, or determinants of any 
attribute or state of human behavior. Very little 
information is transmitted by labeling all these 
analyses as “functional,” and very few social sci- 
entists use the term in this way. 

However, there is a type of functionalism that 
does not involve explaining patterns by their beneficial 
effects and is a distinct form of analysis. 
This is formal functionalism, which is distin- 
guished by the use of models of self-regulating or 
functional systems. Traditional functional analyses 
contain many examples of such systems: of re- 
ciprocal relationships, of one pattern varying so 
as to maintain another within a given range, and 
of complex feedbacks and causal chains that in- 
volve many parts of a system. But some traditional 
functional explanations do not utilize this type of 
model. For example, the explanation of the incest 
taboo in terms of its function of creating alliances 
among families, or the explanation of rites of 
passage by their function of reducing conflict be- 
tween old and new roles, does not involve recipro- 
cal or self-regulating relationships. Therefore, the 
theoretical orientation of traditional functionalism 
and the model of self-regulating or functional sys- 
tems should be kept distinct. 

The assumption that models of functional sys- 
tems fit society is the basis of some of the concepts 
and procedures of traditional functional analysis. 
The stress on latent functions—effects that are 
unintended and unrecognized—is based in part on 
the assumption that models of complex, self-regu- 
lating systems apply to many social systems. There- 
fore, a change in one social pattern may affect 
seemingly unrelated patterns or, because of com- 
pensating changes in other social patterns, may 
have effects that are opposite from those intended 
—or no effects at all. Another instance is the func- 
tionalist postulate that all social patterns “are 
functional for the entire social or cultural system” 
(Merton [1949] 1957, p. 25). This is another way 
of saying that the whole social system forms some 
sort of self-regulating system. These are prema- 
ture judgments about the extent to which models 
of functional or self-regulating systems accurately 
represent the relationships found in social systems. 
The applicability of the models is an empirical 
question for which the relevant data have not been 
collected. 


Formal functionalism 


Formal functionalism consists in the attempt to 
construct models that describe how the units of 
a system are interrelated so as to maintain each 
other or some other state of the system. In con- 
trast to traditional functionalism, it contains no 
theoretical orientation and has no empirical con- 
tent. The models must be supplemented by theory 
and research that specify the empirical units or 
variables that correspond to the abstract elements 
in the models. 

As an illustration of the use of such models, 
Kluckhohn’s analysis of Navajo witchcraft is useful 
again. Kluckhohn supplements his structural 
analysis with an examination of changes over 
time. He describes the deprivation and anxiety 
of the Navajo during their confinement at Fort 
Sumner and several decades later, during a period 
of lesser stress that was caused by the livestock 
reduction program. He notes that warfare was 
impossible for the Navajo after Fort Sumner, that 
witchcraft accusations and executions were most 
frequent shortly after Fort Sumner, and that ac- 
cusations also increased after the inception of the 
livestock program. In his interpretation, Kluckhohn 
assumes, first, that deprivation and frustration 
cause increased hostile feelings, and, second, that 
these impulses will cause in-group aggression un- 
less the hostility is directed outside the group; 
warfare and witchcraft are viewed as social pat- 
terns that direct aggression to outsiders. Given 
these assumptions, the changes in frustration, hos- 
tility, aggression, and witchcraft can be accurately 
represented as a self-regulating system directed to 
maintaining a low level of in-group aggression. 
When the environment (the United States govern- 
ment) imposes increased frustration, hostile im- 
pulses increase, but this is followed by increased 
witchcraft accusations, given the impossibility of 
warfare, and this aggression to outsiders reduces 
hostile impulses back to a level where they no 
longer threaten the maintenance of low in-group 
aggression. 

Nagel’s model. The model used in Kluckhohn’s 
analysis has been precisely described by Nagel 
(1957). Starting with Merton’s article “Manifest 
and Latent Functions” (1949), Nagel rigorously 
describes the model that seems to be implicit in 
functional analysis in the social sciences. 

A functional system, according to Nagel’s defi- 
nition, is made up of two types of variables: G’s 
and state coordinates. G is the property of the 
system that is maintained or is stable. State co- 
ordinates determine the presence or absence of G 
and may include parts of the system’s environ- 
ment. The values of the state coordinates vary to 
such an extent that the maintenance of G is threat- 
ened, but when one exceeds the “safe” limits for 
G, the other(s) compensates and G is maintained. 
Such a system of G and state coordinates may be 
called functional with respect to G, and the state 
coordinates may be described as having the func- 
tion of maintaining G. 

For example, a small task-oriented group could 
be treated as a functional system (Bales 1950). 
Let G be the solution of the group’s task or problem. 
Let the state coordinates be task-oriented activity 
and emotionally supportive activity. If these 
three variables can be usefully treated as a func- 
tional system, then: (1) problem solution is de- 
pendent on task-oriented activity and emotionally 
supportive activity; (2) at certain times, there will 
be such a preponderance of task-oriented activity 
that problem solution will be threatened because 
of decreased motivation or resentment over follow- 
ing others’ suggestions—at these times emotionally 
supportive activity will increase, and problem solu- 
tion will no longer be threatened; (3) at certain 
times, there will be such a preponderance of emo- 
tionally supportive activity that problem solution 
will be threatened. At these times task-oriented ac- 
tivity will increase to maintain problem solution 
or G. 

By definition, more than one state of the system 
leads to maintenance of G. Thus, in the preceding 
example, eventual problem solution might result 
both from initially high task-oriented activity and 
low supportive activity, followed by increased sup- 
portive activity, and from initially low task-oriented 
activity and high supportive activity, followed by 
increased task activity. In a functional system 
there is more than one combination of the values 
of certain parts of the system which will result 
in the same trait or will have the same conse- 
quences (maintenance of G). This is one way of 
stating the notion of functional equivalents. 

The other crucial elements in Nagel’s model 
concern the limitations of the variation of state 
coordinates. Three kinds of limits are relevant. 
First, there are the limits dictated by physical 
reality. For example, the amount of interaction 
in a given period cannot exceed the maximum 
number of acts possible in that time period. Sec- 
ond, within the limits of physical reality there are 
limits determined by the definition of the system 
under consideration. If a property is used to define 
a system, one cannot analyze conditions under 
which this property disappears. The third and most 
important set of limits are those beyond which 
compensation is impossible and G ceases to exist. 
The potential stability of a given G in a specific 
functional system can be expressed in terms of 
the discrepancy between two of these sets of limits: 
the possible variation for each state coordinate and 
the limits of variation within which compensation 
by other state coordinates will occur. G becomes 
less stable as the discrepancy between these two 
ranges increases. 

It should be noted that, contrary to the strategy 
of traditional functionalism, stability of G is not 
assumed. The traditional approach is to assume 
the stability of G (integration or survival) and 
explain state coordinates in terms of their efficiency 
in maintaining G; states of the system are 
explained by their effects. In Nagel’s model, the 
stability or disappearance of G is explained by the 
extent to which state coordinates compensate for 
each other's variation; states of the system are 
explained by their causes. A rough definition of 
“cause” is being used here: A causes B if the occurrence 
of B depends on the previous occurrence of A. 

Some functionalists have suggested that the 
notion of cause is not applicable to complex social 
systems, but this seems to mean only that social 
patterns are often caused by a complex set of 
interrelated factors and not by one simple factor. 
For instance, in the small group example, task 
solution depends on: (1) a certain amount of 
task-oriented activity; (2) emotionally supportive 
activity; (3) mechanisms which cause a switch 
from one type of activity to the other when one 
type is dangerously low; and perhaps additionally 
on (4) some reciprocal relationship whereby group 
perception of progress in solving the task is neces- 
sary to maintain the two types of activity. This 
is a complex system, but these four factors can 
be isolated as the determinants or causes of task 
solution. The unwillingness to use the term “cause” 
often indicates only that the analysis is incomplete 
and that it is not yet possible to clearly define 
the relevant parts of the system and to specify 
the direction and magnitude of the relationships 
among these parts. 

To summarize Nagel’s model, a functional sys- 
tem is one that satisfies the following conditions: 
(1) the system can be analyzed into a set of inter- 
dependent variables or parts; (2) the values of 
these variables determine whether or not a certain 
property G will occur in the system; (3) there 
are limits on the variation of state coordinates 
such that variation within the limits is followed 
by compensating variation of other state coordi- 
nates, and G is maintained; (4) variation beyond 
these limits is not compensated for by other state 
coordinates, and G disappears. 
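
Conditions (3) and (4) can be illustrated with a minimal sketch. It is not part of Nagel's formalization: the class name, the two numeric limits, and the rule that compensation exactly offsets the excess over the safe limit are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class FunctionalSystem:
    """Hypothetical toy version of conditions (3) and (4)."""
    safe_limit: float          # deviation that G tolerates without compensation
    compensation_limit: float  # deviation beyond which compensation fails

    def respond(self, deviation: float) -> Tuple[float, bool]:
        """Return (compensating variation, whether G is maintained)."""
        size = abs(deviation)
        if size <= self.safe_limit:
            return 0.0, True                     # G is not threatened
        if size <= self.compensation_limit:
            return size - self.safe_limit, True  # other coordinates offset the excess
        return 0.0, False                        # beyond the limits: G disappears


# Illustrative values only; they are not drawn from any study cited here.
group = FunctionalSystem(safe_limit=1.0, compensation_limit=3.0)
for deviation in (0.5, 2.0, 4.0):
    print(deviation, group.respond(deviation))
```

In this sketch the interval between the two limits is the range in which compensation occurs; a state coordinate that can vary far beyond `compensation_limit` corresponds to a less stable G.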

The type of analysis suggested by Nagel’s for- 
malization would probably avoid many problems 
of traditional functional analysis. As in any formal 
model, this one specifies the entities that must be 
clearly and consistently defined: the system, goal 
state, state coordinates, and the limits on their 
variation. Any state of any system might be viewed 
as a G state in a functional system; therefore the 
problematic concepts of survival and integration 
are not necessary, although they could be used as 
the G state (see, for example, the series of analyses 
in Leeds & Vayda 1965, in which the G state is 
the survival of the population in the face of varia- 
tion in the environment). The model also encom- 
passes various kinds of change: (1) goal states 
that are moving equilibria; (2) compensating 
changes in state coordinates; (3) compensating 
changes in subsystems that are treated as state 
coordinates of a higher level system; and (4) dis- 
appearance of G resulting from state coordinates 
exceeding the limits of compensation (Cancian 
1960-1961). 

The contribution of formal functionalism to the 
explanation of social patterns is quite different 
from that of traditional functionalism, because a 
formal model lacks substantive propositions and 
therefore, by itself, cannot explain anything. If the 
correspondence between the model and the parts 
of a specific system is demonstrated, this would 
constitute, at the least, a precise description of 
the system. The description might also be called 
a “little theory” (Cancian 1965, chapter 13)—a 
“theory” because it would explain and predict how 
variance in one part of the system would affect 
other parts of the system, and a “little theory” be- 
cause the propositions would be limited to one 
specific system. In order to construct a general 
theory or explanation, it would be necessary to 
add an assumption stating that the model fit a 
certain class of systems, of which the specific sys- 
tem was a member. 

Specification of functional equivalents. The 
major problem left unresolved in this model is that 
of functional equivalents. An example illustrates 
the difficulty. Assume that one is investigating the 
system of relationships between North American 
college students and their parents and that one 
attempts to explain independence from parents 
(G) by the state coordinates of peer-organized 
rebellion against authority, communication with 
parents, and other variables. How would one deal 
with the different types of organized rebellion, such 
as panty raids, burning draft cards, or picketing 
for civil rights? The approach suggested by the 
model would be to consider only the primary func- 
tion of these activities for the goal state of inde- 
pendence. Panty raids, burning draft cards, etc., 
would be ranked according to the degree of re- 
bellion involved, and all other differences among 
the activities would be ignored. The advantage of 
this method is that it results in a general explana- 
tion that includes college students participating in 
various types of peer-organized rebellion. Note, 
however, that the idea of functional equivalents 
then has no special meaning; it merely indicates 
that a general variable takes various empirical 
forms. For example, reinforcement can take the 
form of food or water. 

As it is usually used, “functional equivalents” 
does have a special meaning. There is often an 
implicit assumption that if one social pattern dis- 
appeared, a functionally equivalent pattern would 
appear or would be more likely to appear. This 
aspect of functional equivalents can be dealt with 
by treating the equivalent patterns as a subsystem, 
with the G of fulfilling the function of the patterns. 
In this subsystem, as one rebellious activity de- 
creases, some other activity increases so as to 
maintain peer-organized rebellion, if the subsystem 
is a functional system with respect to rebellion. An 
examination of this subsystem would indicate the 
conditions (limits of variance) under which a 
functional equivalent would take the place of a 
disappearing activity. The problem of predicting 
whether a specific rebellious activity would be 
maintained within certain limits could be handled 
by considering the subsystem in which the G state 
is that activity and the state coordinates are the 
variables that maintain the activity. 

Alternative models. Every functional system 
must contain certain minimal attributes: (1) var- 
iable A that is regulated or maintained within a 
range; (2) variable B that does the regulating; (3) 
a mechanism that communicates the variation of 
A to B. These minimal requirements are exempli- 
fied by the famous system: room temperature-fur- 
nace-thermostat, and by various systems in the so- 
cial science area (Lennard & Bernstein 1960). Note 
that the existence of feedback, or reciprocal or mu- 
tual causation between A and B, is not enough. The 
feedback must be of a special kind; it must operate 
so as to maintain at least one of the variables with- 
in a given range. For example, according to many 
observers, the system of “juvenile crime-reform 
school” is not a functional system with respect to 
maintaining low crime rates. There is a reciprocal 
relationship; reform school does feed back to crime 
rates, but its effect appears to increase crime. 
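
The difference between compensating feedback and feedback that merely exists can be sketched with the room temperature-furnace-thermostat example. The following is a toy simulation under stated assumptions: the function names, the constant heat loss, the gain, and all numeric values are hypothetical and chosen only to make the contrast visible.

```python
def simulate(compensating, start=15.0, target=20.0, loss=0.5, gain=0.5, steps=50):
    """Return the temperature history of a toy room-furnace-thermostat loop."""
    temp = start
    history = []
    for _ in range(steps):
        error = target - temp              # thermostat: communicates A's variation to B
        signal = error if compensating else -error
        furnace = max(0.0, gain * signal)  # B: furnace output, never negative
        temp = temp + furnace - loss       # A: room temperature after heating and heat loss
        history.append(temp)
    return history


def stays_within(history, low, high):
    """True if every recorded value lies in the range [low, high]."""
    return all(low <= value <= high for value in history)


print(stays_within(simulate(compensating=True), 16.0, 20.0))
print(stays_within(simulate(compensating=False), 16.0, 20.0))
```

With compensating feedback the temperature settles inside the range; with the sign of the feedback reversed, B still responds to A, but the loop no longer holds A within any range, which is the situation described for the reform school example.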

This minimal functional system can be elabo- 
rated in many ways, resulting in alternative func- 
tional models. The primary alternative to Nagel’s 
model is based on changing his notion of a G state. 
Nagel’s system is hierarchical, that is, it contains 
several variables that interact so as to maintain 
some state of the system within a given range (G), 
and the effect of G on these variables is not considered. 
In the alternative, horizontal model, the effects 
of the variation in G on the state coordinates are 
included in the analysis. Both G and the state coordinates 
vary so as to maintain each other within 
a certain range; therefore there is no reason to discriminate 
between the two types of variables. There 
are many illustrations of such systems in the area 
of complementary roles or interpersonal relations. 
Thus, an authoritarian professor and a dependent 
student or a sadistic-masochistic couple often main- 
tain each other in their respective roles. 

Within the general framework of a hierarchical 
or horizontal model, many specific varieties of func- 
tional systems can be constructed. The system may 
be simple and include only the minimal attributes 
of a functional system, or it may be very complex 
and contain many parts and subsystems, like Mc- 
Cleery’s description of a prison system (Barton & 
Anderson 1961) or Parsons’ model of the social sys- 
tem (1961). 

The system may operate so as to maintain one, 
or several, or all of its components within a given 
range. Systems also differ in the amount of varia- 
tion necessary to set off compensating processes, in 
the limits beyond which compensation is impos- 
sible, and in the time required for compensation. 
When it does occur, the compensation may be too 
much or too little or out of phase (Scientific Amer- 
ican 1955). Compensation for the variance of one 
component may take the form of pushing back that 
component within a range that is less threatening 
to the system, or it may consist of a proportional 
change in other components. There are many pos- 
sible complexities and variations in functional sys- 
tems, some of which are described in the literature, 
especially in the fields of economics, cybernetics, 
and systems engineering. 

Using models. There are two major procedures 
for utilizing functional models in constructing the- 
ories or general explanations. One approach is to 
begin with a careful investigation of a single sys- 
tem. After describing the major components of the 
system and how they are interrelated, the model im- 
plicit in the description can be examined. Some pre- 
viously developed model may fit the description and 
produce new statements about the system, or it may 
be necessary to construct a model. The next step 
is to verify the “fit” by examining the particular sys- 
tem over time, observing variations in the compo- 
nents, and seeing if there is, in fact, compensation 
that maintains some components within a given 
range. The final step is to try to generalize this “little 
theory” to other systems by examining the boundary 
conditions or parameters of the systems and by re- 
stating the components in more general terms. An 
example of this approach is Sweet's analysis of how 
bedouin camping units maintain a certain size 
camel herd by raiding, migration, splitting up the 
social unit, and other means (1965). Her descrip- 
tion could probably be restated as a theory of the 
bedouin camel complex, that is, as a limited set of 
propositions about herd size, raiding frequency, 
etc., and the relations between them. This theory 
could be tested against the historical record of 
bedouin behavior. If verified, it might be possible 
to restate the “little theory” in more general terms, 
so that, given certain conditions, it applies to camel 
herders in general, or pastoral peoples, or people 
dependent on material goods whose availability fluc- 
tuates widely. 

Another approach is to start with an abstract 
theory about a general class of phenomena that are 
interrelated so as to form a functional system. This 
procedure is exemplified in the research on balance 
theory in social psychology (Heider 1958). The 
basic idea in the theory is that individuals will vary 
their sentiments and evaluations so as to maintain 
a balanced state. Thus, if two individuals like each 
other and one of them likes a third person but the 
other does not, then the individuals will move into 
a balanced state, that is, they will both come to like 
or dislike the third person, or they will change their 
feelings about each other. The relationships among 
components in this theory correspond to the rela- 
tionships in mathematical graph theory, and the 
application of this mathematical model has made 
the theory more precise and has yielded new 
propositions (Berger et al. 1962). 
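
The three-person example can be restated as a signed graph. The sketch below assumes the usual sign-product criterion for a balanced triad (a triad is balanced when the product of its three signs is positive); the function names and the coding of liking as +1 and disliking as -1 are illustrative choices, not taken from the works cited.

```python
def is_balanced(ab, ac, bc):
    """A triad is balanced when the product of its three signs is positive."""
    return ab * ac * bc > 0


def single_changes_restoring_balance(ab, ac, bc):
    """List the balanced triads reachable by reversing exactly one sentiment."""
    candidates = [(-ab, ac, bc), (ab, -ac, bc), (ab, ac, -bc)]
    return [triad for triad in candidates if is_balanced(*triad)]


# +1 = liking, -1 = disliking.
# A and B like each other, A likes C, and B dislikes C.
triad = (+1, +1, -1)
print(is_balanced(*triad))
print(single_changes_restoring_balance(*triad))
```

Each of the three single changes corresponds to one of the outcomes named in the text: A and B change their feelings about each other, both come to dislike the third person, or both come to like the third person.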

These two approaches start with a specific or gen- 
eral empirical system and then examine the model 
or set of relationships in the system. An alternative 
would be to start with a model and then search for 
some empirical systems that fit it. This approach 
would be much less likely to succeed than the first 
two because the investigator might not encounter 
any phenomenon of interest that fits the model, and 
then the effort of constructing the model would be 
completely wasted. However, there are some guide- 
lines for locating empirical systems that operate 
like functional systems. 

A primary indicator of the applicability of a func- 
tional model is the persistence of a social pattern 
despite forces that tend to destroy the pattern. For 
example, the apparent persistence of European 
gypsies suggests the presence of compensating proc- 
esses, as does the persistence, despite reform move- 
ments, of deviant patterns such as city bosses or 
prostitution. Of course, it may be that such patterns 
persist because of some constant, powerful factor or 
factors, but it would probably be fruitful to try to 
analyze the persistence in terms of a functional 
model. Another relevant attribute is the presence of 
inherently runaway processes that are kept within 
limits. Some of the social processes that might increase 
indefinitely, but often do not, are the exploitation 
of followers by powerful leaders or the rejection 
of competing loyalties by a sexually attracted 
couple (Slater 1963). Still another attribute is the 
presence of reciprocal relationships—that is, of reg- 
ular sequences of social patterns that return to the 
point of origin. The most obvious class of phenomena 
in this category is self-other patterns, or complementary 
roles, but reciprocal relationships also 
occur in large social systems, as shown by Leach’s 
analysis of cyclic change between two political sys- 
tems in Burma (1954). These various attributes 
provide clues as to the applicability of a functional 
model, but in every case the appropriateness of a 
functional model and the fit of different types of 
models are questions that must be empirically in- 
vestigated. 

The usefulness of models in general cannot be 
disputed, and the importance of constructing models 
will surely increase as social scientists attempt to 
deal with complex systems of variables in a precise 
manner and as they shift from monocausal theories 
to the notion of interdependent systems. The use- 
fulness of models of functional systems in particular 
depends on the extent to which the phenomena in- 
vestigated by social scientists exhibit equilibrating 
or self-maintaining processes. Formal functional- 
ism is based on the most valuable part of the re- 
search strategy of functional analysis: an attempt 
to view behavior patterns as mutually reinforcing 
parts of an equilibrating system. Whether or not 
this strategy will succeed in yielding general and 
verified theories is an empirical question that will 
not be settled until this strategy is given a thorough 
trial. 

FRANCESCA M. CANCIAN 


[Directly related are the entries INTEGRATION; INTER- 
ACTION, article on INTERACTION PROCESS ANALYSIS; 
SYSTEMS ANALYSIS. Other relevant material may be 
found in MODELS, MATHEMATICAL; SIMULATION; 
SOCIAL STRUCTURE; SOCIOLOGY, article on THE DE- 
VELOPMENT OF SOCIOLOGICAL THOUGHT; and in the 
biographies of DURKHEIM; HENDERSON; MALINOWSKI; 
RADCLIFFE-BROWN; WEBER, MAX.] 
FUSTEL DE COULANGES, NUMA DENIS 


Numa Denis Fustel de Coulanges (1830-1889) 
devoted his life entirely to the study of the origin, 
evolution, and transformation of societies. Fustel 
was born in Paris. He never knew his father, a 
retired naval lieutenant, who died a few months 
after his birth. After secondary studies at the 
Lycée Charlemagne in Paris, he entered the Ecole 
Normale Supérieure. The reactionary atmosphere 
then predominant in France pervaded the school 
and modified Fustel’s previous liberalism. Graduat- 
ing from that school in 1853, he spent two years 
at the Ecole Française d’Athènes, where he began 
the study of ancient Greece. Upon his return, he 
was appointed professor at the lycée at Amiens; 
he defended his theses for the doctorat ès lettres, 
the second of which, in Latin, concerned the cult 
of Vesta and contained in embryo the substance 
of The Ancient City, his first great historical work, 
published in 1864. 

His tastes and his ability destined him for a 
university career, and in 1860 he was appointed to 
the chair of history in the Faculté des Lettres at 
Strasbourg. His brilliant teaching attracted many 
students, but despite this success he felt alone. 


When a vacancy occurred at the Ecole Normale 
Supérieure, he applied for the post and for five 
years taught history there. His reputation as a 
teacher led Victor Duruy, the minister of public 
instruction, to entrust him with teaching history 
to the Empress Eugénie, the wife of Napoléon III, 
but the lessons were interrupted after only a few 
months by the outbreak of the war with Prussia. 

The war had a profound effect on the patriotism 
of Fustel de Coulanges; it aroused his resentment 
against the German nation and helped to change 
the orientation of his investigations. He now 
plunged into the study of the political institutions 
of early medieval France and devoted the major 
part of his scholarly efforts to them from that time 
on. In December 1875 he was called to the Sor- 
bonne to take the chair of ancient history, and in 
December 1878 he was appointed to the chair in 
medieval history, a position he kept until his death 
in 1889—except for three years, 1880-1883, dur- 
ing which he served, somewhat against his will 
because his health was poor, as director of the 
Ecole Normale. 

The first of the two principal works of Fustel de 
Coulanges was The Ancient City: A Study on the 
Religion, Laws, and Institutions of Greece and 
Rome. In the introduction the author stated that 
he hoped to reveal the principles and rules by 
which the societies of Greece and Rome were 
governed. Fustel’s selection of a subject was not 
determined exclusively by the search for knowl- 
edge: what he wanted as well was to prove that 
the conception of Greece and Rome that people 
had had ever since the beginning of the French 
Revolution was false and that the consequences of 
this erroneous conception had been unfortunate. 
“They have deceived themselves,” he said, “about 
the liberty of the ancients, and on this very account 
liberty among the moderns has been put in peril” 
([1864] 1956, p. 11). And he added that such 
errors could be corrected only by an objective study 
of the history of the Greco-Roman world that would 
show that modern political conditions were not 
comparable to those of ancient societies. 

According to Fustel, the formation of these 
societies was, in fact, based on a belief common to 
all the Aryan races, namely, that after death the 
soul continues to live, associated with the body, 
in the tomb. The earliest religion was ancestor 
worship, and the family unit that tended the sacred 
fire in the home became the basic unit of the 
ancient societies. This primitive social organization 
expanded by gradual stages: the gens, the Greek 
phratry, and the Roman tribe. The end point of 
the development was the city, which Fustel defined 
as “a religious association” open only to its citizens, 
that is, only to the members of patrician families. 
Over the centuries these primitive institutions lost 
their simplicity. The priest-kings who had governed 
the cities lost their political authority. The gens 
lost its cohesion; the plebs, which had been outside 
the city, entered into it. Then the Roman conquest 
transformed the character of the old cities bit by 
bit, by destroying their traditional municipal re- 
gimes. The triumph of Christianity was the final 
blow to traditional city government. 

What Fustel de Coulanges argued with great 
logic, but perhaps at the expense of considering 
other relevant factors in social development, was 
that ancient society was founded on a particular 
belief and that it persisted insofar as that belief 
prevailed: it changed gradually as the belief weak- 
ened, and it did not survive its disappearance. 

The Histoire des institutions politiques de l’ancienne 
France (1875-1889) was Fustel’s second 
great historical work. After 1870 he devoted al- 
most all his scholarly activity to it. The many 
articles that he published after 1872 in the Revue 
des deux mondes, the proceedings of the Académie 
des Sciences Morales et Politiques (he was elected 
to membership in 1865), and a number of other 
journals dealt chiefly with specific problems related 
to his major work. 

The work focuses on the impact of the Germanic 
invasions. Fustel demonstrated the way in which 
the political institutions and social organization of 
Roman Gaul changed gradually, becoming first 
those of Frankish Gaul and then those of feudal 
France. According to the prevailing “Germanistic” 
conception of the great invasions, they were an 
avalanche that buried the Roman world. Fustel de
Coulanges’ conception was more sophisticated. He
saw the invasions as a phenomenon occurring over 
a long period, a slow infiltration of the empire by 
barbarian nations. These peoples, whether Visi- 
goths, Burgundians, Franks, or others, were not 
hostile to Rome; but their invasion, combined with 
the effects of internal causes, produced a gradual, 
imperceptible, and nonrevolutionary transformation
in the political and social institutions of Romanized
Gaul. 

This conception of Fustel’s has been accepted 
by most subsequent historians, although they have 
tried to make the theory somewhat more flexible; 
Fustel’s logician’s temperament had left it too rigid. 

Fustel de Coulanges has been reproached for 
having attached exclusive importance to written 
documents, in particular to charters, at the expense 
of other historical sources, such as archeological 
material. Nonetheless his faculty for interpreting 
texts enabled him to extract the maximum of their 
historical significance. As the great medievalist 
Charles Victor Langlois justly observed, Fustel 
needed only to apply his critical approach to a 
hundred words such as villa, marca, allodis, to 
revise radically the interpretation of Merovingian 
times. 

Fustel’s analysis of texts and vocabulary pro- 
duced results of considerable value to sociologists. 
To take one famous example, his method permitted 
him to show that agrarian collectivism had never 
really existed, contrary to the theory of Markgenos- 
senschaft that was developed by German econ- 
omists and legal historians and diffused also in 
Romance countries. Fustel de Coulanges never 
doubted that the institutions of the family and of 
individual property were universal and went back 
to earliest times. 

ROBERT LATOUCHE 


[Other relevant material may be found in FEUDALISM 
and in the biography of GIERKE.] 
GALIANI, FERDINANDO


Ferdinando Galiani (1728-1787), Italian econ- 
omist, was born in Chieti. He received a classical 
education and took religious orders. Entering gov- 
ernment service, he became secretary of the Nea- 
politan embassy in Paris in 1759 and served there 
for a decade. During this period he also visited Eng- 
land. The remainder of his life was spent in the 
service of the kingdom of Naples, where he directed 
a variety of government offices, for the most part 
engaging in the formulation and administration 
of national economic policy. 

His published works embrace the humanities as 
well as the social sciences (a bibliography may be 
found in Nicolini 1909). He also left a large num- 
ber of letters that are not only of biographical in- 
terest but are also important for the light they cast 
on the social, economic, and political characteris-
tics of eighteenth-century Europe. Galiani’s wide
circle of personal contacts (his intellectual cor-
respondents and associates included Voltaire,
Diderot, Genovesi, Turgot, Vico, Friedrich M. von
Grimm, and others, as well as many highly placed
diplomatic colleagues) makes his letters invaluable
documents.

His ideas as an economist had considerable orig- 
inality. His economic thinking was influenced by 
Aristotle as well as by more modern theorists such 
as Bernardo Davanzati (1529-1606), John Locke 
(1632-1704), and Geminiano Montanari (1633- 
1687). He read Vico at an early age, and it was 
because of Vico’s influence that Galiani always 
placed his economic ideas in a social context. De-
spite his basic rejection of most of physiocracy, his
ideas were close to those current in France, and for
a time he had a close intellectual relationship with 
Turgot and André Morellet. Like the works of other 
eighteenth-century economists (for example, Can- 
tillon) Galiani’s reveal a body of thought that is 
thoroughly eclectic. It is systematically related to 
no particular body of ideas, showing traces of scho- 
lasticism, of qualified mercantilism (especially 
where money and wealth are concerned), as well 
as of the natural-law ideas of the time. A free-think- 
ing rationalist, Galiani appears nevertheless to have 
retained from mercantilism a protectionist inter- 
national trade policy, if only with respect to grain. 

The theoretical core of Galiani’s work is found 
in Della moneta (1750); his policy prescriptions 
(clothed, incidentally, in a polished polemical lit- 
erary style) are in Dialogues sur le commerce des 
blés (1770). His originality lies primarily in the 
fields of value theory, interest theory, and economic 
policy. 

Value theory. Galiani’s value theory was based 
on the dichotomy between utility and scarcity that 
had been at the core of value theory from the time 
of Aristotle. This dichotomy was fundamental to 
the theories of his predecessors, Davanzati and 
Montanari (Gordon 1964, pp. 115-116), and of his 
contemporaries, Quesnay and Adam Smith. But 
Galiani’s value theory departed from that of his 
contemporaries: it foreshadowed the subjective de- 
mand theory that was so highly developed in the 
1870s by Menger, Jevons, and Walras (Einaudi 
1953, pp. 272-273; Kauder 1953, p. 638; Schum- 
peter 1954, p. 300 and passim). Moreover, his con- 
cept of relative scarcity foreshadowed the theory of 
diminishing marginal utility, another advance of 
economics in the last quarter of the nineteenth cen-
tury. (The theory had, in fact, been clearly stated
by Hermann Gossen in 1854 but did not become
widely known until its rediscovery by Jevons.)

Galiani understood the quantity of a commodity 
demanded as varying inversely with price, prob- 
ably in a schedule sense rather than in the stock 
sense common in his day. The idea of price elas- 
ticity of demand is present in Della moneta (Einaudi 
1953), although not in the mathematical terms de- 
veloped by Alfred Marshall in 1890. 

He did not discuss cost in general, but only the 
factor of labor in cost. For although he believed 
value to be subjectively determined and to vary 
from individual to individual, he did not believe 
that its source or its measurement is subjective. 
Rather, he believed that labor is not only the source 
of value but part of its objective measurement. 

Interest theory. In economic writing before the 
seventeenth century, interest theory is rarely dis- 
cussed, and when it is, the discussion is largely con- 
fined to theories of loan interest. This is also true 
of Galiani’s interest theory. Nevertheless, his is one 
of the first discussions that considers why interest 
is paid instead of whether interest payments are 
legal or moral. Using a probability model, Galiani 
explained that interest is in principle paid to com- 
pensate the lender for the risk entailed in parting 
with his money (here and now), money that is to
be recovered from the borrower sometime in the
future (or from a distant shore). Like his subjective
value theory, Galiani’s interest theory is based on a 
psychological assumption: men in general prefer 
present money over money at a future time or at 
some distant place. This theory has come to be 
known as the “time preference theory of interest,” 
and Galiani’s formulation of it anticipates that of 
Böhm-Bawerk by more than a century.
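
The risk element of this argument can be illustrated by a small calculation. The sketch below is a modern illustration under stated assumptions, not Galiani's own notation: it supposes a one-period loan that is repaid in full with probability p and lost entirely otherwise, and it asks what rate of interest leaves the lender's expected repayment equal to the sum lent. The function name and the sample probabilities are hypothetical.

```python
def break_even_rate(p_repay: float) -> float:
    """Rate r at which the expected repayment equals the sum lent.

    Assumes a one-period loan repaid in full with probability p_repay
    and lost entirely otherwise, so that p_repay * (1 + r) == 1.
    """
    if not 0.0 < p_repay <= 1.0:
        raise ValueError("p_repay must lie in (0, 1]")
    return 1.0 / p_repay - 1.0


# Hypothetical repayment probabilities, chosen only for illustration.
for p in (1.0, 0.95, 0.8):
    print(f"p = {p:.2f}  break-even rate = {break_even_rate(p):.4f}")
```

On these assumptions a certain repayment calls for no risk premium at all, and the premium rises as repayment becomes less likely.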

Economic policy. In matters of policy, Galiani 
adopted a position of historicogeographical rela- 
tivism. Even the best-constructed economic models 
must be adjusted for circumstances of time and 
place before being applied to specific problems. 
This became a basic principle of the German histor- 
ical school of Friedrich List and Wilhelm Roscher, 
but unlike that school, Galiani did not reject ab- 
stract theory. 

It is surely remarkable that Galiani’s early ideas 
in the areas of value and monetary theory antici- 
pated corresponding notions of the neoclassical and 
Austrian schools, respectively. His later writings 
clearly foreshadow the views of the German histor- 
ical school on economic policy. 

PETER R. TOSCANO


[For the historical context of Galiani’s work, see
ECONOMIC THOUGHT; and the biographies of LOCKE;
QUESNAY; SMITH, ADAM. For discussion of the sub-
sequent development of his ideas, see UTILITY and
the biographies of BÖHM-BAWERK; LIST; ROSCHER.]
GALL, FRANZ JOSEPH


Franz Joseph Gall (1758-1828), German anat- 
omist, founder of cranioscopy (later known as 
phrenology), and physician, was born in the grand
duchy of Baden and died at Montrouge near Paris. 
He was the sixth child of a merchant. After the 
usual schooling, he studied medicine first at 
Strassburg and later at Vienna, where he was 
graduated in 1785. 

He began his medical practice in Vienna and at 
the same time studied the healing powers of na- 
ture, the subject of his first book, Philosophisch- 
medicinische Untersuchungen über Natur und 
Kunst im kranken und gesunden Zustande des 
Menschen (1791). In it, Gall discussed the rela- 
tionship between body and soul; the latter was 
located in the brain and was the seat of ideas. This 
concept led him to the notion of the cerebral local- 
ization of mental faculties, and he devoted the re- 
mainder of his life to exploring this idea. 

There was at the time great interest in methods 
of determining character and temperament from 
external bodily configurations, especially the facial 
(physiognomy), and Gall gradually evolved a new 
method called “cranioscopy,” combining the con- 
cepts of cerebral localization with the analysis of 
bodily configurations. 

He argued that faculties or talents are inborn 
and dependent upon cerebral structures; therefore 
the brain is made up of as many “organs,” or areas, 
as there are moral and intellectual qualities. These 
“organs” are on the cerebral surface, and as their 
size is reflected accurately by the overlying skull, 
they are palpable. He selected and located the 
“organs” by his empirical observations: having ob- 
served that a good memory is associated with pro- 
truding eyes, he therefore postulated that the 
memory organ is on the cerebral orbital surface 
and its hypertrophy pushes out the eyes. Character 
and intellect are represented by 27 organs, or
psychic qualities, sharply delineated on the cranial 
surface. 

About 1796 Gall began lecturing on cranioscopy 
in Vienna. Although the theory won great popu- 
larity, its materialism, manifested by an attempt 
to relate mind to matter, also incited opposition, so
that in 1802 Gall’s teaching was banned for re- 
ligious reasons. Gall was joined in 1804 by Johann 
Gaspar Spurzheim, who had recently received his 
medical degree and who helped with the dissec- 
tions of the nervous system that Gall undertook. 
Unlike other physiognomists, Gall attempted to cre- 
ate a scientific basis for his theory. 

Seeking a more congenial atmosphere, Gall and 
Spurzheim left Vienna in 1805, and after an ex- 
tended tour of Germany and neighboring countries, 
where their lectures excited widespread contro- 
versy, they settled in Paris in 1807. As in Vienna, 
Gall established an extensive medical practice with 
many famous patients, and he also resumed his 
lecturing and writing. In 1808 he published his 
Discours d'ouverture au cours de physiologie du 
cerveau (1808) and in the same year prepared 
with the help of Spurzheim a neuroanatomical 
memoir which was assessed by a group of dis- 
tinguished French scientists. On the whole the 
judges were favorably impressed, and in the fol- 
lowing year the report, with replies to the criticisms 
made by the assessors, was published (Gall & 
Spurzheim 1809). In 1810 Gall’s most important 
and comprehensive work on neuroanatomy and 
cranioscopy began to appear, entitled Anatomie et 
physiologie du système nerveux en général . . . 
(1810-1819). Spurzheim helped with the first and 
second volumes only, for in 1813 he left France to 
popularize “phrenology,” as he, but not Gall, came 
to call cranioscopy. Gall’s last publication was a 
revised edition of Anatomie et physiologie. . . . He 
died of a cerebrovascular accident and was buried 
in Père-Lachaise cemetery in Paris; his skull is
preserved in the Musée de l’Homme.

Gall had an insatiable desire to learn and claimed 
that he was not guided by a desire for power, hon- 
ors, or riches, although some of each came his way. 
He was perfectly sincere, if somewhat uncritical, 
in his ideas of brain function and was by no means 
a quack as is often thought; the stigma earned by 
phrenology was largely due to its popularizers, who 
claimed that phrenological examination could be 
used in career selection, for all kinds of prophe- 
cies, in choosing members of parliament and other 
professional men, etc. Gall’s own intellectual abili- 
ties, his powers of observation, and his honesty, 
integrity, and independence in matters of principle 
were outstanding. As a product of the eighteenth- 
century Enlightenment, he was a social reformer, 
and he emphasized the need for medical ethics. He 
deduced moral rules from his phrenology and 
called for state educational systems and reform 
of current methods of handling criminals and
lunatics. 

Gall’s influence was widespread. His anatomical 
investigations and cranioscopy turned attention to 
the brain, and he had an important influence on 
brain dissection and neuroanatomy in general in 
the nineteenth century. Moreover, phrenology first 
popularized the modern concept of cerebral local- 
ization of function, although it made no specific 
contribution to the concept. Gall’s basic idea was 
right, but for the wrong reasons. Phrenology was 
also important in the development of scientific 
psychology, of anthropological investigations of the 
skull, and of criminology. 

Phrenology retained popularity throughout the 
nineteenth century, and societies furthering its 
practice still exist today. Apart from its general 
effects upon the study of anatomy and physiology 
at the beginning of the nineteenth century, it has 
played no further role in science. However, its in- 
fluence on social reform, political groups, religion, 
philosophy, and literature has been extensive, 
especially in the United States and Britain. 


EDWIN CLARKE 


[See also NERVOUS SYSTEM, article on STRUCTURE AND 
FUNCTION OF THE BRAIN; and the biographies of 
Broca; FLOURENS; LASHLEY.] 
GALTON, FRANCIS


Francis Galton was born in 1822 and died in
1911. He was educated successively at home, at a 
dame school, at Boulogne, and at Kenilworth. In 
1835, at the age of 13, he entered King Edward’s 
School at Birmingham, where he stayed for two 
years. He spent two years as a medical student, 
the first at the General Hospital, Birmingham, and
the second at King’s College, London. In 1840 he 
entered Trinity College, Cambridge, as a mathe- 
matics student, but was content to take a poll 
degree in 1843, when his health broke down. 

Galton’s father, Samuel Tertius Galton, was a 
banker. His mother, Violetta Darwin, was the 
daughter of Erasmus Darwin by his second wife. 
One of Erasmus Darwin’s grandsons through his 
first wife was Charles Robert Darwin; there was 
a certain physical resemblance between the two 
cousins. Galton’s mental development is interesting:
he is credibly reported to have read a simple book 
before he reached the age of three, and his rest- 
less ingenuity with regard to machinery dates from 
his early youth. He did not enjoy school, however, 
nor did he find the profession of medicine, which 
was chosen for him, congenial. In spite of his in- 
terest in mechanics and mathematics, he was not 
successful in his Cambridge studies. 

When his father died in 1844, Galton immedi- 
ately forsook any idea of continuing his medical 
career. He found himself the possessor of a more 
than adequate income and proceeded to spend his 
time and energy “hunting with a set chiefly note-
worthy for their extravagance and recklessness
. . . the strange thing [being] that it [shooting]
seemed to absorb his whole nature, and to be done
not for the sake of the experience, but in the pure
pursuit of occupation” (Pearson 1914-1930, vol. 1,
pp. 208-209).

It was after these fallow years, as Pearson called
them, that Galton carried out the explorations
for which he was later awarded the gold medal 
of the Royal Geographical Society in 1853. Even 
before going to Cambridge, Galton had taken an 
extended trip down the Danube and on to Smyrna, 
which had perhaps awakened the young man to 
the delights of foreign scenes and strange peoples. 
After his father’s death he set off again for Egypt, 
Khartoum, and Syria, but he “was still touring for 
the boyish fun of movement and of new scenes. He 
had not yet thoughts of the language, habits, or 
archaeology of the people he mingled with” (ibid., 
p. 205). It was not until after four years of idle- 
ness in England that he set out on a trip to tropical 
Africa, the results of which showed that he had 
come to terms with life and with himself. 

In 1850 Galton set off for the Cape and spent 
two years upcountry exploring from Walvis Bay to 
Lake Ngami, territory of which little was known. 
He composed 15 brief laws for the Hottentot chiefs 
who governed the Damaras of the plain and com- 
piled a rudimentary dictionary for the English who 
wished to use the local tongue. He returned to 
England early in 1852 and read a paper to the 
Royal Geographical Society, which awarded him 
its gold medal the following year. This award was 
followed in 1854 by a silver medal from the French 
Geographical Society. Early in 1853 he met Louisa 
Butler and married her in August. After an ex- 
tended honeymoon tour of Europe, punctuated by 
visits to England, the Galtons finally settled in 
London, and in 1855 Galton really began to work. 

Early publications. As might be anticipated, 
Galton’s first publication was on exploration, and in 
1855 The Art of Travel was published. There were 
signs that his scientific curiosity was developing 
in new directions, since in Vacation Tourists and 
Notes of Travel (1861-1864), which he meant 
to be an annual magazine, there is a description of 
the eclipse of the sun in 1860, with a drawing of 
the curved rays of the corona that he had observed. 
Galton’s first piece of fruitful research was on the 
weather. He started to plot wind and pressure maps 
and noted, from very scanty data, that centers of 
high pressure are associated with clockwise direc- 
tions of winds around the calm center. He coined 
the name “anticyclone” for such systems in 1863. 
Several other papers followed, in which he was 
clearly feeling his way toward the concepts of cor- 
relation and regression. 

He tried to determine a linear prediction formula 
for the velocity of the wind, given the pressure, 
temperature, and humidity. He did not succeed, 
possibly because of his failure to realize that the 
prediction formula for pressure from velocity was 
not the same as the prediction formula for velocity
from pressure. The realization that there are two 
regression lines was still in the future, as was the 
concept of correlation. In 1870 he read a paper at 
the British Association entitled “Barometric Predic- 
tions of Weather,” in which he was fumbling 
toward a multiple regression, trying to predict the 
wind from pressure, temperature, and humidity. He 
failed in his objective at the time, but he posed the 
problem for others who were to succeed. 
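
The distinction between the two prediction formulas can be made concrete with a short calculation. The following is a minimal sketch using least squares, a method that belongs to later statistical practice and is used here only to show the asymmetry; the paired readings are hypothetical and are not Galton's data.

```python
from statistics import mean


def ols_slope(x, y):
    """Least-squares slope of the line that predicts y from x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


# Hypothetical paired readings (pressure, wind velocity); not Galton's data.
pressure = [29.2, 29.5, 29.7, 29.9, 30.1, 30.3, 30.4, 30.6]
velocity = [31.0, 24.0, 27.0, 18.0, 20.0, 12.0, 15.0, 9.0]

b_vel_on_pres = ols_slope(pressure, velocity)  # predicts velocity from pressure
b_pres_on_vel = ols_slope(velocity, pressure)  # predicts pressure from velocity

# Inverting one line does not give the other unless the fit is perfect:
# the product of the two slopes equals r**2, which is below 1 here.
r_squared = b_vel_on_pres * b_pres_on_vel

print(f"velocity on pressure: {b_vel_on_pres:.3f}")
print(f"pressure on velocity: {b_pres_on_vel:.5f}")
print(f"reciprocal of the second slope: {1 / b_pres_on_vel:.3f}")
print(f"product of slopes (r squared): {r_squared:.3f}")
```

The reciprocal of the second slope differs from the first slope whenever the points do not lie exactly on a line, which is the sense in which there are two regression lines.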

Intellectual influences. In assessing the intel- 
lectual influences on Galton, continuing uncer- 
tainty exists as to the extent of Quetelet’s influ- 
ence. Pearson tended to minimize the significance 
of Quetelet for Galton; he wrote, “I am very doubt- 
ful how far [Galton] owed much to a close reading 
of the great Belgian statistician” (Pearson
1914-1930, vol. 2, p. 12), and he placed perhaps undue
weight on the fact that Galton possessed no copy 
of Quetelet’s Letters . . . on the Theory of Probabil- 
ities (1846). Pearson further remarked that Galton 
“was never a great student of other men’s writings: 
he was never an accumulator like his cousin 
Charles Darwin” (Pearson 1914-1930, vol. 1, 
p. 209). Now Pearson was closer to Galton’s time and 
actually knew him, so that some weight must be 
given to his opinions. Nevertheless, Pearson would 
appear to have underestimated the influence of 
Quetelet; he himself pointed out that Galton’s work 
seemed to flow naturally out of that of Quetelet. 
Further, Galton’s obsession with the normal curve 
of error which, to a certain extent, has unduly influ- 
enced the development of statistical method, can 
only have stemmed from Quetelet. One of Quetelet’s 
great achievements was to consider all human ex- 
perience as ultimately capable of being described 
numerically, which was fundamentally Galton’s 
attitude also. 

The other great influence on Galton during the 
period in which he was establishing himself as a 
research worker affected the whole of the scientific 
world in the second half of the nineteenth century 
—the publication in 1859 of Charles Darwin's The 
Origin of Species. The effect of this work on Galton 
was not immediately apparent in his writings, but 
there can be no doubt that the book was responsible 
for transforming him from a geographer into an 
anthropologist and eugenist. He began with the 
article “Hereditary Talent and Character” in 1865 
and proceeded through Hereditary Genius (1869); 
English Men of Science: Their Nature and Nurture 
(1874); Inquiries Into Human Faculty (1883); 
and Natural Inheritance (1889), by which time he 
was 67 years of age. As Pearson said, “We see that 
his researches in heredity, in anthropometry, in 
psychometry and statistics, were not independent
studies; they were all auxiliary to his main object, 
the improvement in the race of man.” 

Application of statistics. In Hereditary Genius, 
Galton claimed that his discussion of heredity was 
the “first to treat the subject in a statistical man- 
ner” ([1869] 1952, p. vi). He clearly owed much 
to Quetelet and paralleled Quetelet’s use of the 
normal curve for anthropometric measurements by 
using it to grade intellectual ability. He was quite 
explicit about this: “The law is an exceedingly gen- 
eral one. M. Quetelet, the Astronomer-Royal of Bel- 
gium, and the greatest authority on vital and social 
statistics, has largely used it in his inquiries. He
has also constructed numerical tables, by which the 
necessary calculations can be easily made, when- 
ever it is desired to have recourse to the law” (ibid., 
p. 23). 

Galton supplemented Quetelet’s tables by a short 
table of the abscissas of the unit normal curve cor- 
responding to percentiles of area (1889). He ex- 
amined the abilities of the kin of persons who had 
achieved eminence of some kind—judges, generals, 
scientists, statesmen, painters, poets, and clerics. 
He was concerned with distinguishing between gen- 
eral ability and special ability and regarded each 
individual personality as a combination of natural 
ability and the advantages accruing from early en- 
vironment, i.e., nature and nurture. 
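
A table of the kind described, giving the abscissa of the unit normal curve for a chosen percentile of area, can be reproduced in a few lines. This is a sketch only: the percentiles shown are an arbitrary selection, and the abscissas are expressed in standard deviations, which is an assumption about scale and may not match the unit used in Galton's own table.

```python
from statistics import NormalDist

# Unit normal curve: mean 0, standard deviation 1 (assumed scale).
unit_normal = NormalDist(mu=0.0, sigma=1.0)

# Arbitrary selection of percentiles, for illustration only.
percentiles = [10, 25, 50, 75, 90]
table = {p: unit_normal.inv_cdf(p / 100) for p in percentiles}

for p, z in table.items():
    print(f"{p:3d}%  {z:+.3f}")
```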

This idea of nature and nurture recurs in his 
writings. Thus we find in English Men of Science 
(1874), “It is, I believe, owing to the favourable 
conditions of their early training that an unusually 
large proportion of the sons of the most gifted men 
of science become distinguished in the same career. 
They have been nurtured in an atmosphere of 
free enquiry. . . .” The thesis is that heredity tends 
to produce eminence in some area and that environ- 
ment tends to be the deciding factor in specifying 
what this area shall be. Galton tried to go beyond 
this in Inquiries Into Human Faculty and Its Devel- 
opment (1883), the book that possibly holds most 
interest for students of the history of psychology, 
in which he discussed preliminary results that he 
had obtained in the psychometric field. 

In 1876, at the exhibition of scientific instru- 
ments at South Kensington, Galton exhibited his 
“Whistles for Determining the Upper Limits of 
Audible Sounds in Different Persons.” Both before 
and after this time he was active in proposing tests 
for the measurement of muscular sensitivity by 
weight discrimination, for the perception of differ- 
ences of tint, for reaction time, for acuteness of 
hearing, for keenness of vision and judgment of 
length by the eye, and for the senses of smell and
touch. In an attempt to describe the skewed distri-
butions that often resulted from the application of
his tests, Galton hypothesized that in some fre- 
quency distributions, such as, for example, judg- 
ment of length, the geometric mean, rather than 
the arithmetic mean, is the best “medium” for the 
distribution, and he wrote a paper on “The Geomet- 
ric Mean in Vital and Social Statistics” (1879). As 
usual the mathematical conceptualization was be- 
yond him, and he took the problem to Sir Donald 
Macalister, who derived what is now known as the 
log-normal distribution. 
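
The contrast between the two means can be seen in a small example. The sketch below assumes hypothetical estimates of a length whose errors are proportional, so that an overestimate by a given factor is matched by an underestimate by the same factor; the figures are invented for illustration and are not Galton's.

```python
import math
from statistics import mean, median


def geometric_mean(values):
    """Exponential of the arithmetic mean of the logarithms."""
    return math.exp(mean(math.log(v) for v in values))


# Hypothetical estimates of a true length of 100 units, off by the
# factors 1/2, 1/1.25, 1, 1.25 and 2; not Galton's data.
estimates = [50.0, 80.0, 100.0, 125.0, 200.0]

gm = geometric_mean(estimates)
am = mean(estimates)
md = median(estimates)

print(f"geometric mean:  {gm:.2f}")
print(f"arithmetic mean: {am:.2f}")
print(f"median:          {md:.2f}")
```

When the logarithms of the observations are symmetrically distributed, as they are in a log-normal distribution, the geometric mean coincides with the median while the arithmetic mean is pulled upward by the long right-hand tail.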

At this stage of his work, he was associated with 
the American psychologist James McKeen Cattell,
who, on his return to the University of Pennsyl-
vania (and later at Columbia University), began
to teach statistical psychology, giving his first
course in 1887. Through Cattell, Galton’s ideas and 
experiments exerted possibly the greatest single 
influence upon American psychology during the 
last years of the nineteenth century. 

From the statistical point of view, Natural In- 
heritance is probably the most important of Gal- 
ton’s writings. As can be seen from his earlier 
works, the ideas in it had been fermenting in his 
mind for some time, but it was their expression in 
Natural Inheritance that excited the interest of 
those whom today we might call the practitioners 
of applied mathematics. Again he was influenced 
by the fact that Quetelet was using the normal 
curve to describe anthropometric data and by the 
interest in the problems of inheritance aroused in 
him by The Origin of Species. 

He began the book with a summary of those prop- 
erties of the normal curve that appealed to him. 
He had previously suggested representing a fre- 
quency distribution by using grades or percentiles, 
and he elaborated on this suggestion here, pointing 
out that the normal distribution is completely de- 
termined from a knowledge of the median and one 
other quantile. Galton had observed that many 
measured characteristics can be closely described 
by a normal curve. He used the “quincunx,” first 
shown in print with the publication of his lecture 
“Typical Laws of Heredity,” delivered at the Royal 
Institution (1877), to illustrate the build-up of the 
normal curve: He had noticed that a normal curve 
is reproduced by lead shot falling vertically through 
a harrow of pins and he tried to explain the sta- 
bility of measured characteristics by this mechan- 
ical device. In this paper he had almost reached 
the concepts of both regression and correlation but 
must have felt the need for further thought, since 
it was at this time that he began to collect data 
bearing on inheritance in man. Galton published
nothing further on heredity for eight years. The
foundation of his ideas on regression and correla- 
tion did not perhaps become clear to him until a 
short time before the publication of Natural Inher- 
itance. 
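
The behavior of the quincunx is easily imitated by simulation. The following is a minimal sketch, assuming that each pin deflects the shot to left or right with equal chance and that successive deflections are independent; the number of rows, the number of shot, and the function name are arbitrary choices made for illustration.

```python
import random
from collections import Counter


def quincunx(n_balls: int, n_rows: int, seed: int = 0) -> Counter:
    """Drop n_balls through n_rows of pins.

    Each pin sends the ball left or right with equal chance (an assumed
    idealization). Returns the count of balls per final compartment,
    indexed by the number of rightward deflections.
    """
    rng = random.Random(seed)
    bins = Counter()
    for _ in range(n_balls):
        rights = sum(rng.random() < 0.5 for _ in range(n_rows))
        bins[rights] += 1
    return bins


counts = quincunx(n_balls=5000, n_rows=12)

# Crude text histogram: one mark per 25 balls in each compartment.
for k in range(13):
    print(f"{k:2d} {'#' * (counts[k] // 25)}")
```

The compartments fill in the familiar bell shape, heaped in the middle and thinning toward either side, as the accumulated small deflections approximate the normal curve.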

The regression line arose naturally out of meas- 
urements of the sizes of the seeds of mother and 
daughter sweet pea plants. The sizes of the seeds 
of daughter plants appeared to “revert” to the 
mean (the word “revert” was soon replaced by 
“regress”). This inspired him to look at a bivariate
frequency table of the heights of fathers and sons, 
in which he found a regression to “mediocrity.” 
The arguments he used became familiar ones with 
the analysis of variance put forward by R. A. 
Fisher some forty to fifty years later. Suppose, 
Galton said, that we want to predict the height of 
brother A, given the height of brother B. We take, 
therefore, all the individuals who have heights the 
same as B and form a collection of the heights of 
all of their brothers. These brothers as a group 
Galton called a cofraternity, and he proceeded to 
discuss the variation in height of all individuals 
about the grand mean, the variation of the cofra- 
ternity means about the grand mean, and the varia- 
tion of the individuals of the cofraternities about 
their respective cofraternity means. This splitting 
up of variation had been done previously by Lexis 
in Germany and Dormoy in France, but Galton was 
possibly the first to carry out this type of analysis 
with the idea of assigning the variation. 
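
The splitting up of variation described here corresponds to an identity that can be checked numerically. The sketch below uses the sum-of-squares form familiar from the later analysis of variance, which is an assumption about how the three kinds of variation are to be measured; the groups and heights are hypothetical and are not Galton's data.

```python
from statistics import mean

# Hypothetical heights in inches, grouped by cofraternity; not Galton's data.
cofraternities = {
    "A": [66.0, 67.5, 68.0],
    "B": [69.0, 70.5, 70.0, 71.0],
    "C": [72.0, 73.5],
}

all_heights = [h for group in cofraternities.values() for h in group]
grand_mean = mean(all_heights)

# Variation of all individuals about the grand mean.
total_ss = sum((h - grand_mean) ** 2 for h in all_heights)

# Variation of the cofraternity means about the grand mean,
# each mean weighted by the size of its group.
between_ss = sum(
    len(g) * (mean(g) - grand_mean) ** 2 for g in cofraternities.values()
)

# Variation of individuals about their own cofraternity mean.
within_ss = sum(
    (h - mean(g)) ** 2 for g in cofraternities.values() for h in g
)

print(f"total:   {total_ss:.3f}")
print(f"between: {between_ss:.3f}")
print(f"within:  {within_ss:.3f}")
```

The total variation equals the sum of the other two, so that a share of it can be assigned to differences between cofraternities and the remainder to differences within them.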

While studying the bivariate frequency table of 
heights of fathers and sons, Galton was struck by 
the observation that the contours of equal fre- 
quency in the table were similar and similarly sit- 
uated ellipses. He also found the lines that fitted 
the medians of the arrays (possibly drawing them 
by eye) and the slopes of these lines eventually 
became his regression coefficients. This early work, 
as is inevitable with a pioneering effort, is confused 
and difficult to evaluate, not least because Galton 
himself was not explicit. When, however, he had 
determined that he had what would now be termed 
linearity of regression and homoscedasticity in the 
arrays of the table, his mathematical powers were 
not sufficient to enable him to form a mathematical 
model for his surface, and he took the problem to 
Hamilton Dickson, a Cambridge mathematician. 
Dickson’s mathematical formulation was published 
in an appendix to Galton’s paper “Family Likeness 
in Stature,” presented to the Royal Society in 1886. 
Galton was troubled by the fact that the slope of 
the regression line depends on the variability of 
the margins, and this concern led to his search for 
a unit-free measure of association. 
Some time earlier, in 1882, Alphonse Bertillon
had put forward a scheme for classifying criminals 
according to 12 physical measurements that was 
adopted by the prefecture of police in Paris. Gal- 
ton became interested in this scheme and pondered 
for some time over which measurements would be 
the most descriptive—that is, which would discrim- 
inate one man most effectively from his fellows. 
It was from these considerations that he was led 
to the realization that some measurements might 
be so highly correlated with other measurements 
as to be useless for the prescribed purposes and 
finally to the necessity for describing how any two 
measurements are related. The slope of the regres- 
sion line is not adequate for this, since it depends 
on both the scales of measurement and the choice 
of dependent variables. However, the regression line 
fitted between the variables that Galton used 
(1888) after dividing the heights (reduced by their 
median) by a measure of their variability (their 
semi-interquartile range) and similarly dealing 
with forearm length provides a unit-free measure 
of association. Given the problem and 65 years of 
subsequent statistical development, the correlation 
coefficient may now appear to have been inevitable. 
There can be no question, however, that at the 
time at which Galton wrote, 1888-1889, the pro- 
duction of a measure of association that was inde- 
pendent of location and scale was an immense 
contribution to statistical methodology. 
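
The effect of this rescaling can be demonstrated directly. The sketch below follows the description above in measuring each variable from its median in units of its semi-interquartile range, but it fits the line by least squares, which is a substitution made for convenience and not Galton's own procedure; the measurements are hypothetical. The resulting index is close in spirit to, but not identical with, the modern product-moment correlation coefficient.

```python
from statistics import mean, median, quantiles


def galton_scale(values):
    """Deviation from the median in units of the semi-interquartile range."""
    q1, _, q3 = quantiles(values, n=4)
    med = median(values)
    semi_iqr = (q3 - q1) / 2
    return [(v - med) / semi_iqr for v in values]


def slope(x, y):
    """Least-squares slope of y on x (a stand-in for Galton's fitting)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


# Hypothetical statures and forearm lengths in inches; not Galton's data.
stature = [63.0, 64.5, 66.0, 67.0, 67.5, 68.0, 69.0, 70.5, 71.0, 73.0]
forearm = [16.9, 17.6, 17.8, 18.0, 18.4, 18.1, 18.6, 19.0, 18.8, 19.7]

# Re-express stature in centimetres to test dependence on the unit.
stature_cm = [2.54 * s for s in stature]

raw_inches = slope(stature, forearm)
raw_cm = slope(stature_cm, forearm)
index_inches = slope(galton_scale(stature), galton_scale(forearm))
index_cm = slope(galton_scale(stature_cm), galton_scale(forearm))

print(f"raw slope, inches:      {raw_inches:.4f}")
print(f"raw slope, centimetres: {raw_cm:.4f}")
print(f"scaled index, inches:      {index_inches:.4f}")
print(f"scaled index, centimetres: {index_cm:.4f}")
```

The raw slope changes with the unit of measurement, whereas the index computed from the rescaled variables does not.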

The Bertillon system of measurement also started 
Galton wondering about the whole procedure of 
personal identification. In the paper for the Royal 
Institution in which he discussed bertillonage, he 
also drew incidental attention to fingerprints. In 
his book Finger Prints (1892), he referred to the 
work of Jan Purkinje, Kollman, William Herschel, 
and Henry Faulds, who had preceded him in this 
study, but it is clear that at the time he wrote little 
was known. As he himself said: 


It became gradually clear that three facts had to be 
established before it would be possible to advocate the 
use of finger prints for criminal or other investigations. 
First it must be proved, not assumed, that the pattern 
of a finger print is constant throughout life. Secondly 
that the variety of patterns is really very great. Thirdly, 
that they admit of being so classified, or lexiconised, 
that when a set of them is submitted to an expert, it 
would be possible for him to tell, by reference to a 
suitable dictionary or its equivalent, whether a similar 
set had already been registered. These things I did, but 
they required much labor. 


As a result of Galton’s book and his evidence to 
a committee set up by the Home Office in 1893, a 
fingerprint department was established, the 
forerunner of many such throughout the world. Galton 
himself, as might be expected from his previous 
work and interest, turned to studying the inheri- 
tance of fingerprints, a study which was carried on 
for many years in the laboratory that he founded 
and that was named after him. 

Eugenics. The term “eugenics” was introduced by 
Galton in his book Inquiries Into Human Faculty 
(1883) and soon won general acceptance. The study 
of human inheritance and the possibility of improv- 
ing human stock were undoubtedly linked in his 
mind, as his public lectures and papers witness. He 
did more than lecture, however. In 1904 he founded 
a research fellowship in national eugenics at the 
University of London which was to develop in a few 
years into the Galton Laboratory of National Eu- 
genics, with Karl Pearson as its first director. Pear- 
son was succeeded by R. A. Fisher, and the now 
vast complex of statistical theory and method de- 
veloped there thus owes its origin to Galton. 


It was inevitable that Galton’s work should at- 
tract the interest of young men able in the mathe- 
matical and in the biological fields, and the late 
1880s saw Karl Pearson, and W. F. R. Weldon— 
the one a professor of applied mathematics and 
the other a professor of zoology and both at Uni- 
versity College, London—working in the field of 
“biometry,” i.e., the application of mathematics to 
problems of biological inheritance. Galton himself 
said, “The primary object of Biometry is to afford 
material that shall be exact enough for the discov- 
ery of incipient changes in evolution which are too 
small to be otherwise apparent.” Pearson and 
Weldon met difficulties in their attempts to publish 
papers relating to biometry in existing journals and 
determined to start their own. A guarantee was 
required. Galton, on being asked to help, not only 
guaranteed the whole amount but followed it up 
with an additional gift that enabled his admirers to 
go their way in freedom; the journal Biometrika, 
the first to be devoted to both the theory and prac- 
tice of statistics, was established on a firm footing. 
In the last decade of his life, Galton played the part 
of counselor and adviser to the younger men, but 
he still worked away at his own problems, as his 
continued output of letters and papers indicates. 

During his last years many honors came his way. 
He had been elected a fellow of the Royal Society 
in 1856, receiving a gold medal in 1886, the Dar- 
win medal in 1902, and the much-prized Copley 
medal in 1910, the year before his death. He was 
awarded the Huxley medal by the Anthropological 
Institute in 1901 and the Darwin-Wallace medal 
by the Linnean Society in 1908. He received honorary 
degrees from both Oxford and Cambridge universities 
and became an honorary fellow of Trinity 
College, Cambridge, his old college, in 1902. The 
citation for the Darwin medal said, in part, “It may 
safely be declared that no one living has contrib- 
uted more definitely to the progress of evolutionary 
study, whether by actual discovery or by the fruitful 
direction of thought, than Mr. Galton.” Mr. 
Galton’s private comment was, typically, “Well, I 
am very pleased except that I stand in the way of 
younger men” (quoted in Pearson 1914-1930, vol. 
3A, p. 237). 

F. N. David 


[For the historical context of Galton’s work, see the 
biographies of DARWIN and QUETELET. For discussion 
of the subsequent development of Galton’s 
ideas, see EUGENICS; LINEAR HYPOTHESIS, article on 
REGRESSION; MULTIVARIATE ANALYSIS, articles on 
CORRELATION; and the biographies of CATTELL; 
FISHER, R. A.; PEARSON.] 
GAMBLING 


Gambling may be defined as a form of activity 
in which the parties involved, who are known as 
bettors or players, voluntarily engage to make the 
transfer of money or something else of value among 
themselves contingent upon the outcome of some 
future and uncertain event. 

While the origins of gambling are lost to re- 
corded history, it appears probable that games of 
chance developed out of various magical and re- 
ligious practices employed by man to cope with 
problems of uncertainty and fate. In contemporary 
societies gambling appears most frequently in rec- 
reational contexts and in association with various 
sports and games. Among the major classes of 
games, gambling is especially common in those in 
which chance plays a prominent role, but it also 
appears in connection with games of skill or strat- 
egy and with games of physical prowess. In non- 
recreational contexts gambling occurs in the form 
of wagers on the outcome of future events about 
which the bettors have strong convictions, as in the 
case of election outcomes. Elements closely parallel 
to gambling appear in conjunction with economic 
activities, especially those in which risk and un- 
certainty are prominent. In effect, speculators on 
the commodity markets bet against each other 
about the rise or fall of commodity prices. 

In its various aspects gambling is at once a major 
recreational institution, a minor vice, a large-scale 
industry, a powerful source of crime and political 
corruption, a perennial social problem, a fascinat- 
ing psychological puzzle, and an intriguing pastime. 
Like prostitution, it is ancient, widespread, and 
widely disapproved. It flourishes, in spite of ethical 
taboo and legal sanction, as an institutionalized 
deviant pattern and as a form of crime in which 
the victims are willing accomplices. 

Social science perspectives on gambling. Be- 
cause of gambling’s complex and paradoxical 
nature, it is not surprising that the responses to it 
by social scientists and others have been extraor- 
dinarily diverse. Philosophers and theologians have 
struggled with the ethical and teleological impli- 
cations of gambling, viewing it at times as a pro- 
fane and frivolous stepchild of religion, to which it 
bears certain disquieting similarities. Mathemati- 
cians have exploited gambling to the hilt in the 
development of probability theory, some of them 
becoming gamesters in the process. Economists 
have turned to gambling to clarify the distinction 
between the functional and dysfunctional aspects 
of speculation and, more recently, for some sophis- 
ticated reformulations of utility theory. In the 
“theory of games” they have found in gambling a 
model for analyzing strategies in competitive situ- 
ations involving risk and uncertainty. 

Experimental psychologists have employed gam- 
bling situations in studies of probability learning 
and probability preferences, of levels of aspiration, 
and of intermittent reinforcement and conflict-drive 
motivation. Social psychologists have utilized 
gambling games to study social competition, ag- 
gression, and coalition formation. Psychoanalysts 
and clinical psychologists have concerned them- 
selves with the unconscious motivations and person- 
ality structures of gamblers and with the problems 
of addiction. Sociologists have been interested in 
the incidence of gambling, its organization in rela- 
tion to the underworld and the police, its functions 
for individuals and society, and its control. Finally, 
ethnographers and social historians have provided 
some fascinating descriptions of the cultural pat- 
terning of gambling in various times and places. 

In these many different approaches that have 
touched on the subject, the interest in gambling 
has often been peripheral to some other, central 
interest, with the result that the coverage is piece- 
meal and fragmentary. While there are scattered 
bits of useful knowledge and theory, systematic 
treatises on gambling are sadly lacking. Moreover, 
many of the treatments that do exist are specula- 
tive, impressionistic, and moralistic, and many are 
also without adequate data. 


The psychology of gambling 

From a psychological viewpoint, gambling is at 
once an instrumental activity, directed toward a 
consciously recognized economic end, and an 
expressive activity, enjoyed as an end in itself. 
Interpretations of gambling motivations have varied 
widely, depending upon which of these facets has 
commanded the central focus of attention. 

Gambling as instrumental behavior. In making 
their betting decisions, economically oriented gam- 
blers must take two principal factors into account: 
the odds and the probabilities. In gambling termi- 
nology “odds” designates the ratio between the 
amount staked and the amount the player stands 
to win if successful. “Probabilities” refers to expec- 
tations regarding the outcome of the event bet 
upon, expressed as a ratio of favorable to unfavor- 
able outcomes or as the percentage of favorable 
outcomes out of all possible outcomes. The actu- 
arial value of a gambling risk depends upon the 
relationship between the odds and the probabilities. 
In gambling situations generally, there is a tend- 
ency for these two ratios to approach an inverted 
balance: as the probabilities of a favorable outcome 
become smaller, the odds become longer. For ex- 
ample, in horse racing, long shots fetch better prices 
than favorites. That this tendency exists at all may 
be taken as evidence for the operation of an ele- 
ment of economic rationality among gamblers. 
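
The relationship between odds and probabilities can be made concrete with a short calculation. The following is a minimal sketch, not a standard formula from the literature cited here: the function names are hypothetical, and the figures for the long shot are invented for illustration. It treats the actuarial value as the expected net gain per bet.

```python
from fractions import Fraction

def actuarial_value(stake, payout, p_win):
    """Expected net gain of one bet: win `payout` with probability
    `p_win`, otherwise lose `stake`."""
    return p_win * payout - (1 - p_win) * stake

def fair_payout(stake, p_win):
    """Payout at which odds and probabilities are exactly balanced,
    so that the actuarial value is zero."""
    return stake * (1 - p_win) / p_win

# Tossing a coin for even money: odds and probabilities balance.
coin = actuarial_value(Fraction(1), Fraction(1), Fraction(1, 2))

# An assumed long shot with a 1-in-20 chance of winning.
p_long_shot = Fraction(1, 20)
balanced = fair_payout(Fraction(1), p_long_shot)             # 19 per unit staked
offered = actuarial_value(Fraction(1), Fraction(15), p_long_shot)
```

Here `coin` is zero, `balanced` is 19, and a long shot that pays only 15 to 1 has an actuarial value of minus one-fifth of the stake. The odds lengthen as the probability falls, but in this example not by enough to make the risk a fair one.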

It is clear, however, that no general theory of 
gambling behavior can be constructed from the con- 
ventional notions of economic rationality alone. In 
every gambling situation either the odds and prob- 
abilities are exactly balanced or they are not. As- 
sume, first, that they are balanced, as in the case 
of tossing coins for even money. In the gambling 
world such risks are known as fair risks because 
neither side has any clear advantage. While it 
might appear that the rational gambler would have 
no particular reason to avoid such risks, there is 
also no apparent reason for him to accept them. 
Indeed, according to orthodox economic theory, the 
prudent gambler would presumably assign some 
element of disutility or cost to risk assumption and 
to the activity of gambling itself, regarding it as a 
form of “work.” Moreover, if he assigns some func- 
tion of diminishing utility to successive increments 
of money income, as economic theory assumes, the 
utility of each unit of money he might win in a 
fair-risk situation would be less than that of the 
money he might lose. Hence, the economically 
rational gambler would presumably avoid such 
risks as in this sense “not fair.” 
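
The diminishing-utility step of this argument can be written out as a short worked inequality. The notation is an assumption introduced here for illustration: W is the gambler's present wealth, s the stake in an even-money bet on a fair coin, and u a strictly concave, increasing utility function of money.

```latex
% Expected utility of accepting the fair bet, compared with declining it:
\tfrac{1}{2}\,u(W+s) + \tfrac{1}{2}\,u(W-s)
  \;<\; u\!\left(\tfrac{1}{2}(W+s) + \tfrac{1}{2}(W-s)\right)
  \;=\; u(W).

% Equivalently, the utility that may be lost exceeds the utility that may be won:
u(W) - u(W-s) \;>\; u(W+s) - u(W).
```

The first line is the defining property of a strictly concave function applied to the two equally likely outcomes; the second rearranges it into the statement that each unit of money won is worth less than each unit lost.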

In all other gambling situations the odds and 
probabilities are not in balance. In situations of this 
sort, since all gambling transactions are between 
persons, it follows that whereas one player has 
backed a good risk, his opponent has by definition 
accepted a poor one. It is evident, therefore, that at 
least half of all gamblers have lost their rational 
economic bearings. In fact, in the gambling world 
the majority of good risks are monopolized by the 
professional gamblers who operate the various 
games and devices, always with a comfortable mar- 
gin of safety. From the lay player's point of view 
there are no “good risks” at all in any professionally 
operated gambling house. Yet the market behavior 
of such gamblers makes it clear that there is al- 
ways an easy market for poor gambling risks, espe- 
cially those in which the odds are intriguingly long 
but offset by disastrously short probabilities. 

The “utility of money.” Economists and psychol- 
ogists alike have advanced many arguments and 
theories designed to show how such apparently 
nonrational behavior may yet be motivationally in- 
telligible. One line of argument has called for a 
reappraisal of traditional assumptions about the 
diminishing utility of money. Thus, Vickrey (1945) 
reasoned that the behavior of lottery players clearly 
implies that in this situation the utility of money 
is an increasing rather than a decreasing function 
of income. Following this same theme, Friedman 
and Savage (1948) demonstrated that for the gam- 
bler even a small probability of a large reward may 
have more utility than either a much larger prob- 
ability of small loss or the certainty, if the risk be 
rejected, of staying at the same income level. Along 
similar lines, Mosteller and Nogee (1951) recorded 
the actual market behavior of experimental sub- 
jects in a situation that permitted varying amounts 
to be won, with varying probabilities of success or 
failure. Observing that the subjects do not auto- 
matically accept the bets with the highest actuarial 
values, they attempted to construct a series of 
curves showing the actual utility for their subjects 
of varying amounts of money (see also Coombs & 
Komorita 1958). 
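
The direction of this argument can be checked numerically. The sketch below is a deliberately simplified illustration, not the utility curve proposed by Friedman and Savage: the wealth, ticket price, prize, and probability are invented, and the squared-wealth function merely stands in for any utility function that increases at an increasing rate.

```python
import math

def expected_utility(outcomes, utility):
    """Probability-weighted utility over (probability, final wealth) pairs."""
    return sum(p * utility(w) for p, w in outcomes)

def identity(w):
    """Utility equal to money itself; gives the expected final wealth."""
    return w

def increasing_marginal_utility(w):
    """Assumed convex utility: each extra unit of money is worth more."""
    return w ** 2

# Hypothetical lottery: small chance of a large prize, poor actuarial value.
wealth, ticket, prize, p_win = 100.0, 1.0, 1000.0, 0.0005
lottery = [(p_win, wealth - ticket + prize), (1 - p_win, wealth - ticket)]
decline = [(1.0, wealth)]

# In money terms the ticket is a losing proposition.
expected_wealth = expected_utility(lottery, identity)

# With diminishing utility (logarithm) the ticket is rejected ...
log_buy = expected_utility(lottery, math.log)
log_keep = expected_utility(decline, math.log)

# ... but with increasing utility the same ticket is preferred.
convex_buy = expected_utility(lottery, increasing_marginal_utility)
convex_keep = expected_utility(decline, increasing_marginal_utility)
```

Under these assumptions the expected final wealth from buying the ticket is below 100, `log_buy` is below `log_keep`, and `convex_buy` is above `convex_keep`. The same poor risk is thus rejected or accepted depending solely on the assumed shape of the utility function.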

The “utility of gambling.” A second line of 
analysis has challenged the assumption that the 
activity of gambling should be reckoned on the cost, 
or disutility, side of the calculus. In effect, the 
gambler may justify his losses as a fair payment 
for the pleasure he has obtained from the activity 
itself. Royden, Suppes, and Walsh (1959) have 
proposed a carefully reasoned model for the ex- 
perimental measurement of the “utility of gambling” 
itself, which they argue must be kept independent 
of notions about the “utility of money.” This line 
of analysis, however, is not much help in explain- 
ing why gamblers find pleasure in this activity, 
while nongamblers presumably do not. 

Subjective probabilities. A third line of anal- 
ysis, focusing on the cognitive aspects of gambling 
orientations, raises the question whether gambling 
behavior is a function of simple ignorance or error 
or whether the “subjective probabilities” upon which 
the gambler premises his behavior are systemati- 
cally distorted or biased by various motivational 
factors. 

Ignorance or error. Studies of probability learn- 
ing provide evidence that there is ample scope for 
error in appraising the probabilities in complex 
gambling situations. Komorita (1959) has shown 
that experimental subjects are least accurate in 
estimating probabilities when the number of events 
or possible outcomes is large and when the probabilities 
for unit events depart from .50. Brim and 
Koenig (1959) found, in a sample of 143 college 
students, that none knew the correct way to com- 
bine the probabilities of independent events. And 
Cohen and Hansel (1958) found that even highly 
intelligent subjects tended to interpret multiplica- 
tive probabilities as if they were additive. Profes- 
sional gamblers are less prone to such errors and 
employ their extra knowledge to design gambling 
situations in which the true probabilities are subtly 
concealed. 
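
The error of treating multiplicative probabilities as additive is easy to exhibit with a pair of dice. The example below is an illustration supplied here, not a reconstruction of the tasks used in the studies cited; the function names are hypothetical.

```python
from fractions import Fraction

def both(p, q):
    """Probability that two independent events both occur."""
    return p * q

def at_least_one(p, q):
    """Probability that at least one of two independent events occurs."""
    return 1 - (1 - p) * (1 - q)

six = Fraction(1, 6)                 # a six on one throw of a fair die

double_six = both(six, six)          # 1/36
some_six = at_least_one(six, six)    # 11/36
naive_sum = six + six                # 12/36: the additive error

# The additive rule eventually yields an impossible "probability".
seven_throws_naive = 7 * six         # 7/6, greater than 1
```

The additive estimate overstates the chance of at least one six in two throws (12/36 rather than 11/36), and after seven throws it exceeds certainty.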

Faulty reasoning may also serve to distort sub- 
jective probability estimates, as, for example, in 
the widespread belief among gamblers in the 
“maturity-of-chances” doctrine (Jarvik 1951). Since 
Dame Fortune must keep her books in balance, 
reason such gamblers, at roulette, for example, 
after any unduly long run of black the probabilities 
of red appearing on the next few plays are greatly 
increased. In fact, of course, the probabilities on 
unit events are not affected by preceding sequences. 
Ironically, perhaps, while the ability to estimate 
true probabilities generally increases with maturity 
(Cohen 1957), more sophisticated reasoning er- 
rors, such as the maturity-of-chances theory, do 
not occur among young children and become in- 
creasingly common with advancing age (Ross & 
Levy 1958). 
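
That preceding sequences do not affect the probabilities on unit events can be verified by exhaustive enumeration. The sketch below assumes a simplified wheel with only two equally likely colors, ignoring the zero of an actual roulette wheel; the function name is hypothetical.

```python
from fractions import Fraction
from itertools import product

def prob_red_after_black_run(run_length):
    """Among all equally likely red/black sequences that begin with
    `run_length` blacks, the fraction whose next play is red."""
    sequences = product("RB", repeat=run_length + 1)
    after_run = [s for s in sequences
                 if all(color == "B" for color in s[:run_length])]
    red_next = [s for s in after_run if s[-1] == "R"]
    return Fraction(len(red_next), len(after_run))

# However long the run of black, red remains an even chance on the next play.
results = {n: prob_red_after_black_run(n) for n in (1, 4, 8)}
```

Every value in `results` is one-half, whatever the length of the preceding run.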

Distorted estimates of the true probabilities also 
result from erroneous information—a principle 
much utilized by shrewd professionals. Thus, race 
tracks abound with false tips and spurious “inside 
information,” much of which is deliberately circu- 
lated by touts to mislead the fans and thus to skew 
the betting odds in some desired direction. 

Motivated bias. While there is ample scope for 
simple ignorance or error in gambling situations, 
there is also abundant evidence that such errors 
are not random or merely cognitive but reflect con- 
sistent patterns of motivated bias. Very simply, the 
“errors” are almost invariably such as to distort 
the subjective probabilities in the gambler’s favor. 
It has been observed again and again that gamblers 
consistently overestimate their own skill or luck, 
and it has been demonstrated experimentally that 
subjects consistently overestimate low probabilities 
(cf. Preston & Baratta 1948; Nogee & Lieberman 
1960). 

A variety of possible explanations of this phe- 
nomenon have been advanced. Atkinson (1957) 
demonstrated experimentally that the tendency to 
overestimate chances to win is especially likely to 
be associated with a high need for achievement. 
Another recent study of probability learning has 
shown that positive events are learned more rapidly 
and extinguished more slowly than negative events 
(Crandall et al. 1958). This might help to explain 
the tendency among gamblers to think they are 
ahead—to remember the exciting winning play and 
forget the losses that preceded it. But, of course, 
most gamblers do know in a cognitive sense that 
they might lose and that they have lost in the past. 
Leon Festinger and his associates would regard this 
as an example of dissonant information that must 
somehow be suppressed if the gambler wishes to 
continue, which of course he does; indeed, this 
circumstance may cause the actor to develop some 
extra attraction to the activity, harnessing various 
secondary drives to it to justify his behavior and 
compensate for the dissonant information (Law- 
rence & Festinger 1962). 

In terms of learning theory more generally, gambling 
represents an ideal-typical situation of “intermittent 
reinforcement,” or partial reward, and the 
evidence is overwhelming that activities reinforced 
in this way are peculiarly resistant to extinction. 
One plausible explanation of this is advanced in 
the conflict-drive theory developed by J. M. Whiting 
and his associates. Basically, this theory argues 
that when the same activity is sometimes rewarded 
and at other times nonrewarded or punished, con- 
flict between the contradictory expectations of re- 
ward and punishment has the effect of adding drive 
strength to the originally reinforced action (Sears 
et al. 1953). Very simply, if the gambler always 
won or always lost, he would presumably lose in- 
terest; however, the conflict between fear and hope 
helps keep him going. 

Gambling as expressive behavior. In consid- 
ering the purely psychological rewards of gambling, 
we are clearly getting away from a concern with 
cognitive orientations and the interpretation of gam- 
bling as an instrumental activity directed toward 
an economic goal and are moving toward interpre- 
tations of gambling as an end in itself. It has been 
shown that even mathematically sophisticated sub- 
jects, possessing full information regarding the 
odds and the probabilities, opt for poor risks most 
of the time (Scodel et al. 1959). Curiously, it is 
also clear that there are some kinds of people who 
would not even bet on a sure thing. Moreover, 
there are evidently important differences among 
gamblers themselves: between those who prefer 
games of pure chance and those who prefer games 
involving skill or strategy; between system players 
and long-shot players; or between petty gamblers 
and addicts, for example. Such differentials as these 
yield to analysis only in terms of the differential 
noneconomic needs and motives of different classes 
of people. 

Games as expressive models. Games typically 
occur in times, places, and contexts that are re- 
moved from the workaday utilitarian sectors of 
social structure and hence from the constraints and 
disciplines that the reality principle imposes on 
task-oriented activities. Thus, they tend to become 
intimately involved with the expressive and 
social-emotional sectors and to be concerned with 
problems of tension release and integration. Indeed, 
because they operate in what Kurt Lewin called a 
“plane of unreality,” games are well suited to func- 
tion as expressive models, onto which a variety of 
psychological conflicts and problems can be harm- 
lessly projected. As Menninger noted with respect 
to games of strategy, they may enable us “to ex- 
press aggression without reality consequences; one 
can hurt people without really hurting them; we 
can even kill them without really killing them” 
(1942, p. 172). In a similar vein, Phillips (1960) 
has interpreted certain children’s games as exer- 
cises in the mastery of anxiety, in which psycho- 
logical problems can be worked through in a 
miniature and relatively safe context. 

Observing that games are not free and spon- 
taneous expressive activities of individuals but are 
embedded in culture, Roberts and Sutton-Smith 
(1962) hypothesize that games will have special 
relevance for psychological problems that are en- 
demic and widespread in the cultures or subgroups 
in which they are played. While games provide a 
buffered learning experience for children, game 
involvement typically diminishes with maturity as 
individuals become integrated into the mainstream 
of their culture. Games among adults are thus pre- 
sumed to represent unresolved areas of conflict, 
and addicted players are presumed to be persons 
in a high state of unresolved inner conflict. 

We may go on to inquire, for what kinds of needs 
or conflicts, for what kinds of persons, in what 
kinds of groups or societies, are gambling games 
appropriate expressive models? A number of the- 
ories have been proposed, each with at least some 
evidence in its support. 


Teleological motivations in gambling. Since 
games of chance apparently originated out of re- 
ligious and magical practices of divination, it has 
frequently been proposed that gambling may still 
perform an important teleological function in help- 
ing people orient themselves to the problems and 
conflicts invoked by the intrusions of chance, risk, 
and uncertainty in a world presumed to be causally 
and morally ordered. In principle, chance is mean- 
ingless and unintelligible, both causally and eth- 
ically. Yet, because of its capacity to violate 
legitimate expectations in important ways, many 
people feel that it must mean something: why do 
these things happen? why do they happen to me? 
For people who find these problems salient, gam- 
bling may assume a cosmic significance as a device 
for probing after the ground of things and of one’s 
personal relationships to fate (am I lucky today?). 
In one of the earliest psychological studies of gam- 
bling, France (1902) argued that in an environ- 
ment of uncertainty a belief in luck is functional 
in encouraging a necessary element of risk assump- 
tion but that too much reliance on luck would 
obviate action and lead to a lack of effort. Relevant 
empirical evidence is provided in a recent 
cross-cultural study (Roberts et al. 1959) which 
demonstrates that games of chance are especially likely to 
be found in preliterate societies in which the deities 
are regarded as benevolent and nonaggressive and 
readily subject to compulsion by humans. In con- 
temporary, rationally oriented societies, gambling 
appeals particularly to superstitious persons, and 
it is one of the few areas in which permissive atti- 
tudes toward superstition are tolerated. 

Economic conflicts and gambling. Because of 
gambling’s quasi-instrumental, economic character, 
it is also peculiarly suitable for the working out of 
conflicts engendered by the discipline, frustrations, 
and constraints of the capitalist economic system. 
Since its rewards are distributed on the basis of 
chance, gambling would appear to make a mockery 
of the legitimate economy, with its stress on ration- 
ality, discipline, and hard work and its assumed 
correlations of effort, merit, and reward. Evidence 
that conflicts in this area are relevant to gambling 
motivation is provided in another cross-cultural 
study, which demonstrated that games of chance 
are most frequently found in preliterate societies 
in which child-training practices place special 
stress on responsibility training and arouse high 
anxiety about achievement performance (Roberts 
& Sutton-Smith 1962). The authors of this latter 
study have also demonstrated that in the United 
States games of chance tend to be preferred by 
women and low-status economic groups, categories 
especially involved with positions of frustrating 
drudgery and with routine responsibilities 
(Sutton-Smith et al. 1963). A similar hypothesis is 
tested by Tec (1964) in her study of gambling in 
Sweden, in which she demonstrated that habitual 
gambling is especially common in groups that 
find conventional channels of social advancement 
blocked (see also Devereux 1949; Caillois 1958). 

Competition and aggression. Where gambling 
occurs in conjunction with games of strategy, as 
in poker (see Riddle 1925), motivations of per- 
sonal competition and aggression may also play a 
prominent role. In a classic analysis of this theme, 
W. I. Thomas (1901) viewed gambling as a form 
of sublimated combat and suggested that such 
games may help to keep this vital “instinct” alive 
in our civilized, bureaucratic world. Among pre- 
literate societies it has been shown that games of 
strategy tend to occur most frequently in the rela- 
tively complex societies with developed systems of 
stratification, in which social competition becomes 
problematic and a source of conflict (Roberts et al. 
1959). In the contemporary United States a prefer- 
ence for games of strategy and also games of phys- 
ical prowess is more common among higher-status 
persons (Sutton-Smith et al. 1963). 

Anxiety and guilt in gambling. Finally, there 
are theories that have focused upon thrill-seeking 
motivations in gambling behavior and their rela- 
tionship to anxiety. There is broad consensus 
among students of habitual gamblers that, for all 
their apparent external calm, gamblers are in fact 
highly anxious persons. The psychological litera- 
ture on levels of aspiration provides abundant evi- 
dence that persons known to be high in anxiety are 
particularly prone to set goals for themselves that 
are grossly unrealistic on the basis of past per- 
formance, being either much too high or much too 
low (cf. Lewin et al. 1944). It has also been demonstrated 
that unrealistic aspirations are more 
likely to be set in gaming situations, perhaps be- 
cause of their miniature and fictitious character, 
than in real life situations (Frank 1935). Address- 
ing himself to such findings, Atkinson (1957) 
argues that unrealistic aspirations in fact serve the 
function of minimizing anxiety about failure, for 
if one does not really expect to succeed, failure has 
very little sting. In his studies of probability prefer- 
ences, he was able to demonstrate that persons 
high in “need achievement” and presumably high 
in success drives typically prefer risks at interme- 
diate probability levels, in which success or failure 
are equally probable and, hence, which generate a 
maximum of tension and anxiety. Subjects low in 
“need achievement” and presumably more concerned 
with fear of failure typically preferred risks 
at extreme probability levels (either sure things or 
long shots), in which the success-failure tension 
is greatly reduced. Persons with high success need 
would presumably prefer games of skill or strategy, 
while persons with high fear of failure would prefer 
games of pure chance, in which failure is peculiarly 
noninvidious [see ACHIEVEMENT MOTIVATION]. 

Why do people who are high in anxiety and in 
fear of failure elect to gamble at all? Edmund 
Bergler, the only psychoanalyst who has dealt ex- 
tensively with the problem of addicted gamblers, 
has argued that such gamblers are genuine neurot- 
ics, driven by unconscious aggression and latent 
rebellion “against logic, cleverness, moderation, 
morals and renunciation. That latent rebellion, 
based on the inwardly never-relinquished ‘pleasure 
principle,’ scoffs ironically at all the rules of educa- 
tion. Heavy inner retaliation is the result . . . rebel- 
lion activates a deep unconscious feeling of guilt” 
(1943, pp. 385-386). This guilt, Bergler argues, 
becomes in turn a source of anxiety and creates a 
need for self-punishment. These unconscious feelings 
are then neatly displaced into the segregated, 
miniature, and toylike setting of gambling, where 
they may be more or less harmlessly worked out. 
The gambler persuades himself that the real source 
of his guilt and anxiety is the tension of the game 
itself, and he achieves the needed self-punishment 
by keeping going until he loses. Thus, uncon- 
sciously, the neurotic gambler wants to lose, and 
he needs to lose in order to keep his psychological 
books in balance (see also Bergler 1957; Olmsted 
1962). 

Because gambling provides such a neat, ready- 
made, institutionalized, and culturally sanctioned 
mechanism for the handling of neurotic problems 
of this sort, the neurotic gambler does not feel neu- 
rotic and rarely appears voluntarily for treatment. 
Hence, to date there has been very little empirical 
research on gambling addiction, and academic 
psychologists have largely ignored the problem. 
There is, however, at least some empirical evidence 
that gambling may also serve somewhat parallel 
conflict-resolving functions for nonaddicted petty 
gamblers, along the lines indicated earlier in this 
article. 


The sociology of gambling 

From a sociological point of view gambling is of 
interest primarily as an institutionalized deviant 
pattern. Although it is widely disapproved of, gam- 
bling is nevertheless also widely practiced, and it 
has given rise to an extensive sub rosa organization 
that is elaborately articulated with the underworld, 
on the one hand, and with the legitimate world of 
its clientele, on the other. 

Attitudes toward gambling. The disapproval of 
gambling is ancient and extremely widespread, al- 
though varying greatly in intensity, content, and 
rationale. At different times and places gambling 
has been treated as a capital offense or merely as a 
misdemeanor. Legal restrictions have ranged from 
total prohibition to selective permissiveness, in 
which certain types of gambling activities have 
been permitted or even encouraged while others 
were forbidden or in which the rules against gam- 
ing were lifted during stated holidays or holy days. 
The laws have often made a distinction between 
games of pure chance and games of skill or be- 
tween professional gamblers and their clients. 
Similarly, the grounds for the disapproval of gam- 
bling have varied widely, ranging from views that 
hold that gambling is fundamentally wrong in 
principle to views that hold that gambling is wrong 
only if it produces manifestly evil consequences. 
Essentially the same conclusions, moreover, have 
been derived at times from theological and ethical 
arguments and at other times from rationalistic, 
scientific, and pragmatic grounds. 

In Western society attitudes toward gambling 
have varied significantly among different religious 
groups. While the Bible is silent on the subject of 
gambling, there are numerous references to the use 
of lots for serious purposes, as when Moses was 
instructed to allocate the lands of Canaan among 
the Israelites by lot (Numbers 26.55). Since chance 
events were considered “acts of God,” the use of the 
lot, with appropriate ritual and respect, was re- 
garded as justified for discerning the divine will in 
serious matters. However, the use of the lot for 
frivolous purposes, as in gaming, was regarded as 
a sacrilege and profanation. While formal opposition
to gambling has persisted among Jewish moralists,
prevailing attitudes have softened considerably,
and they now view gaming as essentially a
useless waste of time. Moreover, since the Middle 
Ages gambling has been widespread within the 
Jewish community. Roman Catholics have also 
come to take a liberal attitude toward gambling, 
holding that there is nothing wrong in principle 
with gambling, providing only that certain condi- 
tions be met: that the game be honest, that the 
stakes be moderate and within the means of the 
players, and that the money staked be one’s own, 
for example.

Gambling and the Protestant ethic. In Western 
society fundamental opposition to gambling, as a 
matter of basic ethical principle, is centered 
squarely in Protestantism and in the cultures where 
these denominations prevail. These are also, as
Weber (1904-1905) observed, the same cultures
in which modern bourgeois capitalism has achieved 
its greatest development. This fact suggests the 
hypothesis that gambling is somehow peculiarly 
antithetical, in principle at least, to the core of 
values embraced in this dominant economic sys- 
tem and in its supporting Protestant ethic. 

On the surface, at any rate, the antithesis would 
seem to be clear enough. Among the core values of 
bourgeois capitalism are its special emphasis on 
rationality, disciplined work habits, prudence, 
thrift, methodical adherence to routine, and the 
assumed correlation of effort, ethical merit, and 
reward. The values symbolically embodied in gam- 
bling are diametrically opposed to these core vir- 
tues. Since its rewards are based on chance, 
gambling is explicitly and spitefully nonethical and 
makes a mockery of the required correlation of 
merit and reward. Thus, it tends to undermine 
disciplined work habits, prudence, and thrift; and 
in place of the needed rational-empirical orientations,
it tends to foster superstitious beliefs and
magical practices. If the values fostered by gam- 
bling were to become general in the population, the 
whole system of ethical sentiments that functions 
to sustain this dominant economic system would 
simply wither away. So argue the Protestant moral- 
ists. 

In fact, of course, even in the dominant eco- 
nomic system the alternative values of initiative, 
daring, boldness, shrewdness, aggressive competi- 
tiveness, and willingness to assume risks also play 
a prominent role, and sheer luck is not always 
irrelevant. Because these values do not fit so neatly 
with those of the ethically sanctioned core, they 
have been the focus of considerable cultural am- 
bivalence and guilt. The American public is uncom- 
fortably aware, for example, of certain disquieting 
similarities between gambling and transactions on 
the stock markets; but open discussion of these 
similarities is strongly tabooed, and standard eco- 
nomic texts make a frantic effort to focus on the 
differences. In effect, gambling becomes a whipping 
boy to serve the precarious distinction between 
forms of speculation that are functionally useful 
and hence “legitimate” and those that are functionless
or dysfunctional. Thus, when speculation gets
out of hand, the obviously dysfunctional conse- 
quences for the economy are blamed, not on legiti- 
mate businessmen and still less on the stock mar- 
ket system, but on “gamblers,” who have somehow 
invaded the market and should be driven from it. 
More generally, it is quite possible that the disap- 
proval of gambling, in addition to reinforcing cer- 
tain functionally appropriate values and attitudes, 
functions as a mechanism through which society
seeks to allay its fears and misgivings about the
ethical integrity of the dominant system—a fact 
that may account for the persistence and intensity 
of this disapproval in Protestant societies (Dever- 
eux 1949, chapter 18). 

Functions of gambling for society. The fact that 
gambling persists in spite of the powerful legal and 
ethical taboos against it may be taken as evidence 
that it serves important psychological functions for 
the gamblers; the preceding discussion of the psy- 
chology of gambling has called attention to at least 
some of these. But what about functions or dys- 
functions of gambling for society? This, of course, 
is an empirical question that cannot be answered 
on the basis of a priori principle. The fact that the 
disapproval of gambling is functional for society 
does not in itself establish that the practice of gam- 
bling is therefore dysfunctional, for the questions 
of scale, contexts, and side effects must also be 
considered. Most observers would probably agree 
that the addicted gambler, like the alcoholic, is a 
waste for society. Moreover, it is probably true, as 
the moralists have argued, that if gambling became 
a major preoccupation for the whole population 
and if the attitudes and practices of gambling were 
to permeate the sphere of the dominant economic 
system, the consequences for society would be 
seriously dysfunctional. 

However, there is no evidence that petty gam- 
bling is in any way damaging to character or that 
petty gamblers differ in significant ways from non- 
gamblers (Tec 1964). On the contrary, petty gam- 
bling may function as a kind of institutionalized 
“solution” for many of the specific psychological 
problems generated by the conflicts, strains, and 
ambivalences embedded in the economic system. It 
may serve to revitalize certain relevant patterns of 
motivation that are given little scope in routine 
economic pursuits, such as motives relating to 
themes of daring, combat, faith, and willingness 
to take chances. It has also been argued that the 
existence of institutionalized petty gambling is 
functional for society in providing a channel into 
which potentially disruptive speculative tendencies 
may be safely deflected from the legitimate market 
place. To these should be added the positive (and 
perhaps somewhat perverse) value-reinforcing and
scapegoating societal function of the disapproval of 
gambling, for which institutionalized gambling pro- 
vides a convenient target. 


The legalization issue 

If recreational petty gambling is harmless enough 
and may even perform useful functions for per- 
sonality and society, the question naturally arises, 
should gambling be legalized? In fact, several forms 
of gambling have already been legalized during the
present century. Gambling casinos flourish in many 
European countries and in Latin America; many 
nations, including even the Soviet Union and 
China, have adopted state lotteries; and legalized 
football betting pools have captured enormous fol- 
lowings in England and Sweden. 

Gambling and law in the United States. In the 
United States pari-mutuel betting at racetracks has 
been legalized in approximately half the states. 
Several states allow the playing of bingo for char- 
itable purposes; two states (New Hampshire and 
New York) operate a lottery, and only one state 
(Nevada) permits all forms of gambling. Although 
there is unmistakable evidence that moral resist- 
ance to legalized gambling is weakening rapidly in 
the United States, until November 1966, with the 
above exceptions, proposals to legalize any other 
form of gambling in the United States had been 
soundly defeated. 

Although legal sanctions may have some damp- 
ening effect on the amount of gambling that 
occurs, they have never been able to stop gambling 
altogether. Sample polls in the United States have 
indicated over and over again that a majority of 
American adults do in fact gamble at least occa- 
sionally, in spite of moral taboos and legal restric- 
tions. In 1951 the Kefauver committee estimated 
the volume of illegal betting in the United States at 
$20,000 million per year (Kefauver 1951), but that 
estimate is almost certainly too low. In 1963 an 
officially recorded $2,700 million was wagered 
legally on horse races alone, through the pari- 
mutuel machines, and experts are generally agreed 
that at least ten times as much was wagered 
illegally with bookmakers. Scarne (1961, p. 1) 
places the probable total volume of betting, for all 
types of gambling in the United States, at closer 
to $500,000 million a year. While this estimate is
probably too large, there is ample evidence that the 
volume of illegal betting in the United States is suf- 
ficient to support a major industry. Indeed, since 
the legalization of the liquor industry in 1933, gam- 
bling has become the major source of support of 
the organized underworld in the United States. 

The antigambling laws are unenforced and un- 
enforceable in the United States for two principal 
reasons. First, the enormous financial resources 
controlled by the professional gamblers have en- 
abled them to buy protection from excessive police 
or political harassment. Indeed, gambling has be- 
come one of the principal sources of political cor- 
ruption and graft in America, especially at munici- 
pal levels (Devereux 1949; Kefauver 1951). Second, 
the general public, although sufficiently ambivalent 
to insist that antigambling statutes remain on the 
books, does not really want the gambling laws enforced
and hence provides grossly insufficient support
to reform-administration and routine enforcement
efforts. As noted above, gambling is a peculiar
form of crime, which is carried on with the willing 
consent of its victims; even when the victims have 
been clearly duped or cheated, they are usually 
loath to complain to the police, because of feelings 
of embarrassment and shame. 

Arguments for legalization. The proponents of 
legalization argue that gambling—at least regu- 
lated petty gambling—is probably harmless, pos- 
sibly beneficial, and in any case ineradicable. Anti- 
gambling statutes can never be effectively enforced. 
By keeping them on the books, we throw the entire 
operation into the hands of the underworld, create 
thereby an enormous source of revenue and power 
for organized crime, and keep alive a major source 
of political graft and corruption in America. More- 
over, we place an unnecessary burden of guilt and 
hypocrisy upon the lay public, which must patron- 
ize these illegal and frequently dishonest establish- 
ments to indulge their gambling propensities. Par- 
tial legalization, as it currently exists in the United 
States, is doubly unsatisfactory, the argument con- 
tinues, for it is discriminatory, hypocritical, and 
sabotages the moral convictions needed for effec- 
tive law enforcement. Legalization will effectively 
end this sort of hypocrisy, get gambling into the 
open where it can be suitably regulated and con- 
trolled, dry up a major source of underworld in- 
come and power, eliminate the occasion and re- 
source for police graft and political corruption, and 
make available to the state a highly lucrative source 
of additional revenue, achieved through the most 
painless known form of taxation. These arguments 
were ably stated some thirty years ago by a leading 
American sociologist, E. W. Burgess (1935), and 
have been repeated ever since. Until now they have 
not prevailed, even though recent public opinion 
polls show unmistakable trends in this direction. 

Arguments against legalization. The opposition
to legalization stems from several sources and 
draws on a variety of arguments. The core of 
resistance in the United States is still firmly rooted 
in the residual Puritan culture, which regards gam- 
bling as inherently sinful and placates its restive 
conscience by keeping the official facade of culture 
officially against it. Whatever the merits of the 
moralists’ theological or ethical grounds, their argu- 
ments tend to be sociologically naive; in the em- 
pirical world one drink does not necessarily make 
an alcoholic or one lottery ticket an addicted 
gambler. Although a generalized gambling mania
might well have disastrous consequences for society,
it is an empirical question whether
legalization would have this consequence. However,
the moralists undoubtedly score a strong empirical 
point in their argument that selective legalization 
weakens the public will to enforce the statutes 
which remain. 

Arguments from other sources have attempted 
to grapple more directly with empirical conse- 
quences. Would legalization in fact rid gambling 
of gangster influence? No, answers Virgil Peterson 
(1945), who was for many years chairman of the 
Chicago Crime Commission, citing the continued 
influence of the underworld in the liquor business 
even after the repeal of prohibition laws, and the 
unsavory history of graft and corruption which 
accompanied the later days of legalized lotteries 
in nineteenth-century America. In rebuttal propo- 
nents point out that most European state lotteries 
have operated successfully for years without major 
scandal. But perhaps the United States is different. 

Would legalization set off a wholesale gambling 
mania, create a population of gambling addicts, 
and sabotage work disciplines and the ethical atti- 
tudes which maintain them? Undoubtedly the in- 
cidence of gambling would increase, perhaps quite 
substantially, if the taboos were lifted and the 
facilities made more visible, guilt-free, and accessi- 
ble, but nobody knows for sure how much it would 
increase. So far, the legalization of football pools 
and off-track betting shops in England has not pro- 
duced any runaway mania. But again, perhaps the 
United States is different. There is at least some 
evidence that by the 1820s American lotteries had 
reached such craze proportions that they had seri- 
ously disruptive consequences for some communi- 
ties (Spofford 1892) and some evidence that during 
the racing season at local tracks many local busi- 
nesses suffer, absenteeism increases, installment 
payments fall off, and petty crimes increase (New 
Jersey State Chamber of Commerce 1939). Regard- 
ing the possible long-run effects of wholesale legal- 
ization upon the ethical attitudes and beliefs which 
underpin the system of bourgeois capitalism, there 
is no relevant empirical evidence. Opponents of
legalization point to Latin America, where permissive
attitudes to gambling and generally more fatalistic
value systems prevail and where bourgeois
capitalism has been slow to develop. Who can say 
whether gambling is cause or consequence in this 
relationship? Recalling the functions attributed 
above to the disapproval of gambling, could the 
United States afford an attitude of moral indif- 
ference? 


The solution, of course, does not have to be of
an either-or nature. Although few are willing to
admit it and still fewer to recommend it, since it
violates all the principles of logic and common
sense, the United States has again and again shown 
by its behavior that it still covertly prefers the 
present type of compromise solution, in which a 
formal façade of disapprobation and legal taboo is 
combined with halfhearted enforcement and wide- 
spread practice. Through this arrangement it does 
achieve at least some measure of regulation and 
constraint, keeps the public conscience appeased, 
and yet provides generous opportunities for those 
who would gamble to do so (for a carefully rea- 
soned defense of this arrangement, see Dos Passos 
1904). But these gains, if they be that, are not 
achieved without serious cost; and in any event, 
in this age of increasing secularization it is highly 
probable that the proponents of further legalization 
will have won their battle in the United States 
before another edition of this encyclopedia appears. 


EDWARD C. DEVEREUX, JR. 


[See also GAME THEORY; LEISURE; SOCIAL CONTROL; 
SPECULATION, HEDGING, AND ARBITRAGE; UTILITY.] 
I
THEORETICAL ASPECTS 


The theory of games is a mathematical discipline 
designed to treat rigorously the question of optimal 
behavior of participants in games of strategy and 
to determine the resulting equilibria. In such games 
each participant is striving for his greatest advantage
in situations where the outcome depends not
on his actions alone, nor solely on those of
nature, but also on those of other participants
whose interests are sometimes opposed, sometimes 
parallel, to his own. Thus, in games of strategy 
there is conflict of interest as well as possible cooperation
among the participants. There may be
uncertainty for each participant because the ac- 
tions of others may not be known with certainty. 
Such situations, often of extreme complexity, are 
found not only in games but also in business, pol- 
itics, war, and other social activities. Therefore, the 
theory serves to interpret both games themselves 
and social phenomena with which certain games 
are strictly identical. The theory is normative in
that it aims at giving advice to each player about
his optimal behavior; it is descriptive when viewed 
as a model for analyzing empirically given occur- 
rences. In analyzing games the theory does not 
assume rational behavior; rather, it attempts to 
determine what “rational” can mean when an indi- 
vidual is confronted with the problem of optimal 
behavior in games and equivalent situations. 

The results of the interlocking individual actions 
are expressed by numbers, such as money or a 
numerically defined utility for each player trans- 
ferable among all. Games of strategy include games 
of chance as a subcase; in games of chance the 
problem for the player is merely to determine and 
evaluate the probability of each possible outcome. 
In games of strategy the outcome for a player cannot
be determined by mere probability calculations.
Specifically, no player can make mere statistical
assumptions about the behavior of the other players 
in order to decide on his own optimal strategy. 

But nature, when interfering in a game through 
chance events, is assumed to be indifferent with 
regard to the player or players affected by chance 
events. Since the study of games of chance has 
given rise to the theory of probability, without which 
modern natural science could not exist, the expec- 
tation is that the understanding of the far more 
complicated games of strategy may gradually pro- 
duce similar consequences for the social sciences. 

History. In 1710 the German mathematician- 
philosopher Leibniz foresaw the need and possi- 
bility of a theory of games of strategy, and the 
notion of a minimax strategy (see section on “Two- 
person, zero-sum games,” below) was first formu- 
lated two years later by James Waldegrave. (See 
the letter from Waldegrave in the 1713 edition of 
Montmort 1708; see also Baumol & Goldfeld 1967.) 
The similarity between games of strategy and eco- 
nomic processes was occasionally mentioned, for ex- 
ample, by Edgeworth in his Mathematical Psychics 
(1881). Specialized theorems, such as Ernst Zer- 
melo’s on chess, were stated for some games; and 
Emile Borel developed a limited minimax strategy, 
but he denied the possibility of a general theorem. 
It was not until John von Neumann (1928) proved 
the fundamental theorem that a true theory of 
games emerged (see section on “Two-person, zero- 
sum games,” below). In their Theory of Games and 
Economic Behavior, von Neumann and Morgen- 
stern (1944) extended the theory, especially to 
games involving more than two players, and gave 
applications of the theory in economics. Since then, 
throughout the world a vast literature has arisen 
in which the main tenets of the theory have been 
widened and deepened and many new concepts
and ideas introduced. The four-volume Contributions
to the Theory of Games (Kuhn & Tucker
1950-1959) and Advances in Game Theory 
(Dresher, Shapley, & Tucker 1964) give evi- 
dence of this continuing movement. These works 
contain extensive bibliographies, but see especially 
Volume 4 of Contributions to the Theory of Games. 


Game theory concepts 


Games are described by specifying possible be- 
havior within the rules of the game. The rules are 
in each case unambiguous; for example, certain 
moves are allowed for specific pieces in chess but 
are forbidden for others. The rules are also in- 
violate. When a social situation is viewed as a 
game, the rules are given by the physical and legal 
environment within which an individual's actions 
may take place. (For example, in a market individ- 
uals are permitted to bargain, to threaten with boy- 
cotts, etc., but they are not permitted to use physi- 
cal force to acquire an article or to attempt to 
change its price.) The concrete occasion of a game 
is called a play, which is described by specifying, 
out of all possible, allowable moves, the sequence 
of choices actually made by the players or partici- 
pants. After the final move, the umpire determines 
the payments to each player. The players may act 
singly, or, if the rules of the game permit it and 
if it is advantageous, they may form coalitions. 
When a coalition forms, the distribution of the 
payments to the coalition among its members has 
to be established. All payments are stated in terms 
of money or a numerically defined utility that is 
transferable from one player to another. The pay- 
ment function is generally assumed to be known 
to the players, although modifications of this as- 
sumption have been introduced, as have other 
modifications—for example, about the character 
of the utilities and even about the transferability 
of payments. 

The “extensive” form of a game, given in terms 
of successive moves and countermoves, can be rep- 
resented mathematically by a game tree, which 
describes the unfolding of the moves, the state of 
information of the players at the moment of each 
choice, and the alternatives for choices available 
to each player at each occasion. This description 
can, in a strict mathematical sense, be given equiv- 
alently in a “normalized” form: each player, unin- 
formed about the choices made by any other player, 
chooses a single number that identifies a “strategy” 
from his given finite or infinite set of strategies. 
When all personal choices and a possible random 
choice are made (simultaneously), the umpire de- 
termines the payments. Each strategy is a complete 
plan of playing, allowing for all contingencies as
represented by the choices and moves of all other 
players and of nature. The payoff for each player 
is then represented by his mathematical expecta- 
tion of the outcome for himself. The final descrip- 
tion of the game therefore involves only the 
players’ strategies and no further chance elements. 
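
The passage from the extensive to the normalized form can be illustrated with a minimal sketch. Everything in it is a hypothetical toy game, not an example from the article: A moves first, B observes A's move and replies, and nature then tosses a fair coin that adjusts the payment. The move names, the payments, and the coin are all assumptions. The point is only that a strategy for B is a complete plan (one reply for every move A might make) and that each cell of the normalized form is an expectation over nature's choice.

```python
from fractions import Fraction
from itertools import product

# Hypothetical toy game (all names and numbers are assumptions).
A_MOVES = ("left", "right")
B_MOVES = ("up", "down")
NATURE = {"heads": Fraction(1, 2), "tails": Fraction(1, 2)}  # chance move

# Payment from B to A before nature's adjustment.
BASE_PAYMENT = {
    ("left", "up"): 2, ("left", "down"): -1,
    ("right", "up"): 0, ("right", "down"): 3,
}

def payment(a_move, b_move, chance):
    """Payment to A at the end of one play: heads adds 3, tails subtracts 1."""
    return BASE_PAYMENT[(a_move, b_move)] + (3 if chance == "heads" else -1)

# A pure strategy for B is a complete plan: a reply to every possible A move.
B_STRATEGIES = [dict(zip(A_MOVES, replies))
                for replies in product(B_MOVES, repeat=len(A_MOVES))]

def expected_payoff(a_move, b_plan):
    """Mathematical expectation of A's payment over nature's chance move."""
    b_move = b_plan[a_move]
    return sum(prob * payment(a_move, b_move, chance)
               for chance, prob in NATURE.items())

# Normalized form: rows are A's strategies, columns are B's complete plans.
normal_form = [[expected_payoff(a, plan) for plan in B_STRATEGIES]
               for a in A_MOVES]
```

In this sketch B has four pure strategies although he has only two moves, and the resulting matrix contains no chance element: the coin has been absorbed into the expected payments.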

The theory explicitly assumes that each player, 
besides being completely informed about the alter- 
native payoffs due to all moves made or strategies 
chosen, can perform all necessary computations 
needed to determine his optimal behavior. (This 
assumption of complete information is also com- 
monplace in current economic theory, although 
seldom stated explicitly.)

The payments made by all players may add up 
to zero, as in games played for entertainment. In 
this case the gains of some are exactly balanced by 
the losses of others. Such games are called zero- 
sum games. In other instances the sum of all pay- 
ments may be a constant (different from zero) or 
may be a variable; in these cases all players may 
gain or lose. Applications of game theory to eco- 
nomic or political problems require the study of 
these games, since in a purchase, for example, both 
sides gain. An economy is normally productive so 
that the gains outweigh any losses, whereas in a 
war both sides may lose. 
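
The distinction among zero-sum, constant-sum, and variable-sum games can be stated as a simple check on the payments. The sketch below is only illustrative: the function name and the sample payments are assumptions, and each outcome is represented as a tuple with one payment per player.

```python
def classify_game(outcomes):
    """Classify a game by the sum of payments in each possible outcome.

    outcomes: iterable of tuples, one payment per player.
    """
    totals = {sum(payments) for payments in outcomes}
    if totals == {0}:
        return "zero-sum"        # gains exactly balance losses
    if len(totals) == 1:
        return "constant-sum"    # same nonzero total in every outcome
    return "variable-sum"        # all players may gain, or all may lose

# Hypothetical examples.
parlor_game = [(5, -5), (-2, 2), (0, 0)]
fixed_prize = [(3, 1), (2, 2), (0, 4)]
purchase_or_war = [(3, 2), (0, 0), (-4, -6)]
```

Under these assumed payments the three examples are classified as zero-sum, constant-sum, and variable-sum respectively.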

If a player chooses a particular strategy as iden- 
tified by its number, he selects a pure strategy; if 
he allows a chance mechanism, specified by him- 
self, to make this selection for him, he chooses a 
mixed or statistical strategy. The number of pure 
strategies for a player normally is finite, partly be- 
cause the rules of games bring the play to an end 
after a finite number of moves, partly because the 
player is confronted with only a finite number of 
alternatives. However, it is possible to treat cases 
with infinitely many strategies as well as to con- 
sider even the borderline case of games with in- 
finitely many players. These serve essentially to 
study pathological examples or to explore certain 
mathematical characteristics. 
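
A mixed strategy in this sense is simply a probability distribution over the pure strategies, with a chance mechanism making the actual selection. A minimal sketch follows; the strategy labels and probabilities are hypothetical, and Python's standard pseudo-random generator stands in for the chance mechanism.

```python
import random
from collections import Counter

def choose(mixed_strategy, rng):
    """Let a chance mechanism select one pure strategy.

    mixed_strategy maps each pure strategy to the probability with
    which the player wants it to be played.
    """
    names = list(mixed_strategy)
    weights = [mixed_strategy[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]

rng = random.Random(0)  # seeded only to make the sketch repeatable

# A pure strategy is the degenerate case: one probability equals 1.
pure = {"S1": 1.0, "S2": 0.0, "S3": 0.0}
mixed = {"S1": 0.5, "S2": 0.3, "S3": 0.2}

pure_counts = Counter(choose(pure, rng) for _ in range(1000))
mixed_counts = Counter(choose(mixed, rng) for _ in range(1000))
```

The player's decision is the choice of the probabilities; which pure strategy is actually played in a given play is left to chance.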

Game theory uses essentially combinatorial and 
set-theoretical concepts and tools, since no specific 
calculus has as yet evolved—as happened when 
differential and integral calculus were invented 
simultaneously with the establishment of classical 
mechanics. Differential calculus is designed to de- 
termine maxima and minima, but in games, as well 
as in politics, these are not defined, because the out- 
come of a player’s actions does not depend on his 
actions alone (plus nature). This applies to all 
players simultaneously. A maximum (or mini- 
mum) of a function can be achieved only when all 
variables on which the maximum (minimum) depends
are under the complete control of the would-be
maximizer. This is never the case in games of
strategy. Therefore, in the equivalent business, po- 
litical, or military operations there obtains no max- 
imum (minimum) problem, whether with or with- 
out side conditions, as assumed in the classical 
literature of these fields; rather one is confronted 
there with an entirely different conceptual struc- 
ture, which the theory of games analyzes. 
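
The contrast can be put in symbols. The notation below is an assumption introduced for illustration and is not the article's own: f is an objective depending only on a variable x controlled by the maximizer, while u_1 and u_2 are the payoffs of two players who control x_1 and x_2 respectively. The fragment assumes the amsmath package.

```latex
\[
\max_{x}\; f(x)
\qquad\text{versus}\qquad
\begin{cases}
\text{player 1 chooses } x_1 \text{, seeking a large } u_1(x_1, x_2),\\[2pt]
\text{player 2 chooses } x_2 \text{, seeking a large } u_2(x_1, x_2).
\end{cases}
\]
```

On the left every variable is in the maximizer's hands, so the maximum is well defined. On the right each payoff depends on a variable chosen by the other player, so neither player faces a maximum problem in the ordinary sense.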


Two-person, zero-sum games 

The simplest game of strategy is a two-person, 
zero-sum game, in which players A and B each have 
a finite number of strategies and make their choices 
unknown to each other. Let P be the payoff to the 
first player, and let -P be the payoff to the second
player. Then P is greater than, equal to, or less than 
0, depending on whether A wins, draws, or loses. 
Let $A_1, A_2, \ldots, A_n$ be the strategies available to
player A and $B_1, B_2, \ldots, B_m$ be the strategies available
to player B. In the resulting n × m array of
numbers, each row represents a pure strategy of A, 
each column a pure strategy of B. The intersections 
of the rows and columns show the payoffs to player 
A from player B. The first player wishes to maxi- 
mize this payoff, while the second wishes to min- 
imize it. This array of numbers is called the payoff 
matrix, an example of which is presented in Table 1, 
where payments go from B to A. Player A’s most 
desirable payoff is 8; B’s is -10. Should player A
pick strategy $A_1$, either of these two events may
happen depending on B’s action. But if A picks $A_1$,
B in his own interest would want to pick $B_3$,
which would mean that A would have to pay 10
units to B instead of receiving 8. The row minima
represent the worst that could happen to A for each
of his strategies, and it is natural that he would
want to make as great as possible the least gain he
can expect from each; that is, he seeks the maximum
of the row minima, or the maximin, which in
Table 1 is -1 (strategy $A_3$). Conversely, B will wish
to minimize the column maxima, that is, seek the


Table 1 - Payoff matrix for a two-person, zero-sum game


                            B's strategy
A's strategy       $B_1$    $B_2$          $B_3$    Row minima
$A_1$                  8    [illegible]      -10       -10
$A_2$                  0    -2                 6        -2
$A_3$                  4    -1                 5        -1
Column maxima          8    -1                 6


minimax, which is also -1 (strategy $B_2$). We
would say that each player is using a minimax
strategy; that is, each player selects the strategy
that minimizes his maximum loss. Any deviation
from the optimal strategies $A_3$ and $B_2$ is fraught
with danger for the deviating player, so that each 
will choose the strategy that contains the so-called 
saddle point of the payoff function. The saddle 
point is defined as the point at which the maximin 
equals the minimax. At this point the least that A 
can secure for himself is equal to the most that B 
may have to part with. (In the above example A 
has to pay one unit to B.) If there is more than one 
saddle point in the payoff matrix, then they are all 
equal to each other. Games possessing saddle points 
in pure strategies are called specially strictly de- 
termined. In these games it is immaterial whether 
the choice of the pure strategy by either player is 
made openly before the other makes his choice. 
Games of perfect information—that is, games in 
which each player at each move is always informed 
about the entire previous history of the play, so that 
what is preliminary to his choice is also anterior to 
it—are always specially strictly determined. Chess 
belongs in this class; bridge does not, since each
of the two players (one “player” being the north- 
south team, the other the east-west team) is not 
even completely informed about himself—for ex- 
ample, north does not know precisely what cards 
south holds. 
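
The search for a saddle point described above is mechanical and can be sketched in a few lines. The matrix is meant to follow Table 1, but two entries in the second column (rows 1 and 2) are not clearly legible in this copy; the values -3 and -2 used here are assumptions, chosen only to agree with the printed row minima and column maxima. The function names are likewise illustrative.

```python
def row_minima(matrix):
    """Worst payoff A can receive for each of his pure strategies."""
    return [min(row) for row in matrix]

def column_maxima(matrix):
    """Most B may have to pay for each of his pure strategies."""
    return [max(column) for column in zip(*matrix)]

def saddle_points(matrix):
    """(row, column) index pairs whose entry is both the minimum of
    its row and the maximum of its column."""
    r_min = row_minima(matrix)
    c_max = column_maxima(matrix)
    return [(i, j)
            for i, row in enumerate(matrix)
            for j, value in enumerate(row)
            if value == r_min[i] and value == c_max[j]]

# Payments from B to A; rows are A's strategies, columns are B's.
# The entries -3 and -2 in the middle column are assumed (see above).
TABLE_1 = [
    [8, -3, -10],
    [0, -2, 6],
    [4, -1, 5],
]

maximin = max(row_minima(TABLE_1))     # best of A's worst cases
minimax = min(column_maxima(TABLE_1))  # best of B's worst cases
```

With these entries the maximin and the minimax coincide at -1, and the only saddle point is the cell in the third row and second column (index pair (2, 1) when counting from zero), so the game is specially strictly determined.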

Most games will have no saddle points in pure 
strategies; they are then said to be not strictly determined.
The simplest case is matching pennies.
The payoff matrix for this game is presented in 
Table 2. Here, if one player has to choose openly 
before the other does, he is sure to lose. Each player 
will therefore strive to prevent information about 
his choice from flowing to the other. This is accom- 
plished by the player’s choice of a chance mecha- 
nism, which selects from among the available pure 
strategies with probabilities determined by the
player. In matching pennies, the chance mechanism
should select “heads” with probability 1/2 and
“tails” with probability 1/2. This randomization may
be achieved by tossing the coin before showing it. 
If there is a premium, say on matching heads over 
matching tails, the payoff matrix would reflect this, 
and the probabilities with which the two sides of 
the coin have to be played in order to prevent dis- 
closure of a pattern of playing to the benefit of the 
opponent would no longer be 1/2 for heads and 1/2 for
tails. Thus, when there is no saddle point in pure 
strategies a randomization by a chance mechanism 
is called for. The players are then said to be using 
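mixed, or statistical, strategies.


Table 2 - Payoff matrix for matching pennies
[Layout: rows give A's penny, columns give B's penny (Heads, Tails) and the row minima, with the column maxima in the last row; the numerical entries are missing.]


This does not transform a game of strategy into a game of chance: the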
strategic decision is the specification of the random- 
ization device and the assignment of the proper 
probabilities to each available pure strategy. 
Whether pure or mixed strategies are needed to 
assure a saddle point, the theory at no point re- 
quires that the players make assumptions about 
each other's intelligence, guesses, and the like. The 
choice of the optimal strategy is independent of all 
such considerations. Strategies selected in this way 
are perfect from the defensive point of view. A 
theory of true offensive strategies requires new 
ideas and has not yet been developed. 
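
The randomization described above can be made concrete for any two-by-two game that has no saddle point in pure strategies. The following is a minimal sketch in Python and is not part of the original article. The function names are illustrative, the payoffs of +1 and -1 for matching pennies follow the usual convention, and the premium of 2 on matching heads is an assumed figure chosen only for illustration.

```python
from fractions import Fraction


def has_pure_saddle_point(matrix):
    """True if the maximum of the row minima equals the minimum of the column maxima."""
    row_minima = [min(row) for row in matrix]
    column_maxima = [max(column) for column in zip(*matrix)]
    return max(row_minima) == min(column_maxima)


def solve_2x2(matrix):
    """Optimal mixed strategies for a 2x2 zero-sum game with no pure saddle point.

    Entries are payments from B (columns) to A (rows). Returns (p, q, value):
    p is A's probability of playing the first row, q is B's probability of
    playing the first column, and value is the expected payment to A.
    """
    if has_pure_saddle_point(matrix):
        raise ValueError("game is strictly determined; no randomization is needed")
    (a, b), (c, d) = matrix
    denominator = Fraction(a - b - c + d)
    p = (d - c) / denominator  # makes A's expected gain the same against either column
    q = (d - b) / denominator  # makes B's expected loss the same against either row
    value = (a * d - b * c) / denominator
    return p, q, value


matching_pennies = [[1, -1], [-1, 1]]  # A wins 1 on a match and loses 1 otherwise
premium_on_heads = [[2, -1], [-1, 1]]  # assumed: matching heads pays 2 instead of 1

print(solve_2x2(matching_pennies))
print(solve_2x2(premium_on_heads))
```

Under the assumed premium, each player should show heads with probability 2/5 rather than 1/2, and the game is then worth 1/5 per play to A.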

Von Neumann proved that each matrix game 
can be made strictly determined by introducing 
mixed strategies. This is the fundamental theorem 
of game theory. It shows that each zero-sum, two- 
person game has a saddle point in mixed strategies 
and that optimal mixed strategies exist for each of 
the two players. The original proof of this theorem 
made use of rather complex properties of set theory, 
functional calculus, and combinatorics. Since the 
original proof was given, a number of alternative, 
simplified versions have been given by various 
authors. The numerical solution of a matrix game 
with m columns and n rows demands the solution 
of a system of linear inequalities of m + n + 1 unknowns,
the m + n probabilities for the strategies
of players A and B and the minimax value. There 
exist many techniques for solving such systems; 
notably, an equivalence with solving dual linear 
programs has proved to be of great importance [see 
PROGRAMMING]. High-speed computers are needed
to cope with the rapid rise of the required arith- 
metical operations. A more modest view of mixed 
strategies is the notion of behavioral strategies, 
which are the probability distributions over each 
player’s information sets in the extensive form of 
the game. For games such as chess, even the opti- 
mal pure strategy cannot be computed, although 
the existence of a saddle point in pure strategies 
can be proved and either white or black has a 
winning pure strategy no matter what the other 
does (or both have pure strategies that enforce a
draw). The problems of finding further computational
techniques are actively being investigated.
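
The system of linear inequalities mentioned above can be written out and checked directly for a proposed solution. The sketch below does not solve the system (a linear-programming routine would be needed for that); it only verifies that given mixed strategies and a given value satisfy it. The function name and the rock-paper-scissors matrix are illustrative assumptions, not taken from the article.

```python
from fractions import Fraction


def satisfies_minimax_system(matrix, x, y, v):
    """Check the linear inequalities that characterize a solution of a matrix game.

    matrix[i][j] is the payment to A when A plays row i and B plays column j.
    x holds A's n row probabilities, y holds B's m column probabilities,
    and v is the proposed value: m + n + 1 unknowns in all.
    """
    n_rows, m_cols = len(matrix), len(matrix[0])
    if len(x) != n_rows or len(y) != m_cols:
        return False
    for probabilities in (x, y):
        if any(p < 0 for p in probabilities) or sum(probabilities) != 1:
            return False
    # A's mixture must secure at least v against every column of B.
    a_secures_v = all(
        sum(x[i] * matrix[i][j] for i in range(n_rows)) >= v for j in range(m_cols)
    )
    # B's mixture must hold A to at most v in every row.
    b_concedes_at_most_v = all(
        sum(matrix[i][j] * y[j] for j in range(m_cols)) <= v for i in range(n_rows)
    )
    return a_secures_v and b_concedes_at_most_v


# Rock-paper-scissors, used here only as a familiar 3x3 illustration.
game = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
third = Fraction(1, 3)
uniform = [third, third, third]

print(satisfies_minimax_system(game, uniform, uniform, 0))
print(satisfies_minimax_system(game, [1, 0, 0], uniform, 0))
```

Exact fractions are used so that the test that the probabilities sum to one is not disturbed by rounding.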


n-Person, zero-sum games 


When the number of players increases to n ≥ 3,
new phenomena arise even when the zero-sum re- 
striction remains. It is now possible that coopera- 
tion will benefit the players. If this is not the case, 
the game is called inessential. In an essential game 
the players will try to form coalitions and act 
through these in order to secure their advantage. 
Different coalitions may have different strength. 
A winning coalition will have to divide its proceeds 
among its members, and each member must be 
satisfied with the division in order that a stable so- 
lution obtains [see COALITIONS]. 

Any possible division of payments among all 
players is called an imputation, but only some of 
all possible imputations will be contained in a solu- 
tion. An inessential game has precisely one imputa- 
tion that is better than any other, that is, one that 
dominates all others. This unique imputation forms 
the solution, but this uniqueness is trivial and ap- 
plies only to inessential games. There is no coopera- 
tion in inessential games. 

A solution of an essential game is characteristi- 
cally a nonempty set of several imputations with 
the following properties: (1) No imputation in the 
set is dominated by another imputation in the set. 
(2) All imputations not in the set are dominated 
by an imputation contained in the set. There may 
be an infinite number of imputations in a solution 
set, and there may be several solution sets, each of 
which has the above properties. Furthermore, it 
should be noted that every imputation in a solution 
set is dominated by some imputation not in that 
set, but property (2) assures that such a dominat- 
ing imputation is, in turn, dominated by an impu- 
tation in the solution set. 
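
The two properties can be checked mechanically in the smallest essential case. The sketch below assumes the normalized three-person, zero-sum game in which a single player is worth -1, any pair is worth 1, and all three together are worth 0, and it tests the familiar symmetric solution of three imputations. External stability is checked only on a grid of imputations, so the result illustrates property (2) and does not prove it. All names are illustrative.

```python
from fractions import Fraction
from itertools import combinations

PLAYERS = (0, 1, 2)
WORTH_BY_SIZE = {1: -1, 2: 1, 3: 0}  # assumed normalization of the characteristic function


def v(coalition):
    return WORTH_BY_SIZE[len(coalition)]


def dominates(a, b):
    """True if imputation a dominates imputation b through some coalition."""
    for size in (1, 2, 3):
        for coalition in combinations(PLAYERS, size):
            can_enforce = sum(a[i] for i in coalition) <= v(coalition)
            all_prefer = all(a[i] > b[i] for i in coalition)
            if can_enforce and all_prefer:
                return True
    return False


def grid_imputations(step=Fraction(1, 4)):
    """All imputations (each x_i >= -1, sum zero) whose entries lie on a grid."""
    cells = int(3 / step)
    points = []
    for i in range(cells + 1):
        for j in range(cells + 1 - i):
            x0, x1 = -1 + i * step, -1 + j * step
            points.append((x0, x1, -x0 - x1))
    return points


half = Fraction(1, 2)
SOLUTION = [(half, half, -1), (half, -1, half), (-1, half, half)]

# Property (1): no imputation in the set dominates another in the set.
internally_stable = not any(dominates(a, b) for a in SOLUTION for b in SOLUTION)
# Property (2), on the grid: every outside imputation is dominated from inside.
externally_stable = all(
    any(dominates(a, x) for a in SOLUTION)
    for x in grid_imputations()
    if x not in SOLUTION
)

# An outside imputation that dominates a member and is itself dominated by a member.
outsider = (Fraction(3, 4), Fraction(-3, 4), Fraction(0))
outsider_dominates_member = dominates(outsider, SOLUTION[0])
outsider_is_dominated = any(dominates(a, outsider) for a in SOLUTION)

print(internally_stable, externally_stable)
print(outsider_dominates_member, outsider_is_dominated)
```

The last two checks illustrate the closing remark of the paragraph: a member of the solution set can be dominated from outside, but the dominating imputation is in turn dominated from within the set.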

To be considered as a member of a coalition, a 
player may have to offer compensations or side 
payments to other prospective members. A compen- 
sation or side payment may even take the form of 
giving up privileges that the rules of the game may 
attribute to a player. A player may be admitted to 
a coalition under terms less favorable than those 
obtained by the players who form the initial core 
of a coalition (this happens first when n = 4). Also, 
coalitions of different strength can be distin- 
guished. Discrimination may occur; for example, 
some players may consider others “taboo”—that is, 
unworthy as coalition partners. This leads to the 
types of discriminatory solutions that already occur
when n = 3. Yet discrimination is not necessarily
as bad for the affected player as defeat is for
a nondiscriminated player, because cooperation
against the discriminated player may not be per- 
fect. A player who by joining a coalition does not 
contribute more to it than what he can get by play- 
ing for himself merely has the role of a dummy. 

The fundamental fact of cooperation is that the 
players in a coalition can each obtain more than 
they could obtain by playing alone. This expresses 
the nonadditivity—specifically, the superadditivity 
—of value, the explanation of which has long been 
recognized as a basic problem in economics and 
sociology. In spite of many efforts, no solution was 
found, but it is now adequately described by the 
characteristic function v(S), a numerical set func- 
tion that states for any cooperative n-person game 
the proceeds of the coalition S, and an imputation 
that describes the distribution of all payments 
among all players (von Neumann & Morgenstern 
1944, chapter 6). 
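
Superadditivity, essentiality, and the role of a dummy can all be read off the characteristic function. The sketch below stores v(S) as a dictionary keyed by coalitions; the two three-player games and their worths are assumed purely for illustration, and the function names are not drawn from the literature.

```python
from itertools import combinations


def all_coalitions(players):
    """Every subset of the players, including the empty set, as frozensets."""
    for size in range(len(players) + 1):
        for members in combinations(players, size):
            yield frozenset(members)


def is_superadditive(v, players):
    """v(S | T) >= v(S) + v(T) for all disjoint coalitions S and T."""
    subsets = list(all_coalitions(players))
    return all(
        v[s | t] >= v[s] + v[t] for s in subsets for t in subsets if not (s & t)
    )


def is_essential(v, players):
    """True if the grand coalition obtains more than the players do separately."""
    return v[frozenset(players)] > sum(v[frozenset([p])] for p in players)


def is_dummy(v, players, player):
    """True if the player adds to every coalition only what he gets alone."""
    alone = v[frozenset([player])]
    others = [p for p in players if p != player]
    return all(v[s | {player}] - v[s] == alone for s in all_coalitions(others))


PLAYERS = ("a", "b", "c")
# Assumed worths: a and b together earn 100; c adds nothing to anyone.
partnership = {s: 100 if {"a", "b"} <= s else 0 for s in all_coalitions(PLAYERS)}
# Assumed worths: every player is worth 10 wherever he is, so value is additive.
additive = {s: 10 * len(s) for s in all_coalitions(PLAYERS)}

print(is_superadditive(partnership, PLAYERS), is_essential(partnership, PLAYERS))
print(is_dummy(partnership, PLAYERS, "c"), is_dummy(partnership, PLAYERS, "a"))
print(is_essential(additive, PLAYERS))
```

In the additive game nothing is gained by combining, so the game is inessential and every player is a dummy in the sense of the text.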

Since there may be many solutions to a coopera- 
tive (essential) n-person game, the question arises 
as to which of them will in fact prevail. Each 
solution may correspond to a specific mode of be- 
havior of the players or a specific form of social 
organization. This expresses the fact that in the 
same physical setting different types of social or- 
ganization can be established, each one consistent 
in itself but in contradiction with other organiza- 
tions. For example, we observe that the same tech- 
nology allows the maintenance of varying economic 
systems, income distributions, and so on. If a stable 
standard of behavior exists (a mode of behavior 
accepted by society), then it can be argued that the 
only relevant solution is the one corresponding to 
this standard. 

The choice of an imputation not in the solution 
set, while advantageous to each of those in the 
particular coalition that is able to enforce this im- 
putation, cannot be maintained because another 
coalition can enforce another imputation, belong- 
ing to the solution set, that dominates the first one. 
Hence, a standard is set and proposals for imputa- 
tions that are not in the solution will be rejected. 
The theory cannot state which imputation of all 
those belonging to the standard of behavior actu- 
ally will be chosen—that is, which coalition will 
form. Work has been done to introduce new as- 
sumptions under which this may become feasible. 
No imputation contained in the solution set guar- 
antees stability by itself, since each is necessarily 
dominated from the outside. But in turn each im- 
putation is always protected against threats by an- 
other one within the solution set that dominates 
the imputation not in the solution set. 

Since an imputation is a division of proceeds 
among the players, these conditions define a certain
fairness, such that the classical problems of
fair division (for example, cutting a cake) become 
amenable to game-theoretic analysis. 

This conceptual structure is more complicated 
than the conventional view that society could be 
organized according to some simple principle of 
maximization. The conventional view would be 
valid only if there were inessentiality—that is, if 
there were no advantage in cooperation, or if co- 
operation were forbidden, or, finally, if a supreme 
authority were to do away with the entire imputa- 
tion problem by simply assigning shares of income 
to the members of the society. Inessentiality would 
be the case for a strictly communistic society, 
which is formally equivalent to a Robinson Crusoe 
economy. This, in turn, is the only formal setup 
under which the classical notion of marginal utility 
is logically valid. Whether cooperation through for- 
mation of coalitions is advantageous to participants 
in a society, whether such cooperation, although 
advantageous, is forbidden, or whether compensa- 
tions or side payments are ruled out by some au- 
thority although coalitions may be entered—these 
are clearly empirical questions. The theory should 
take care of all eventualities, and current investiga- 
tions explore the different avenues. In economic 
life, mergers, labor unions, trade associations, car- 
tels, etc., express the powerful tendencies toward 
cooperation. The cooperative case with side payments
is the most comprehensive, and the theory
was originally designed to deal with this case. Im- 
portant results have been obtained for cooperative 
games without side payments (Aumann & Peleg 
1961), and the fruitful idea of “bargaining sets” 
has been introduced (Aumann & Maschler 1964). 

All indications point overwhelmingly to the bene- 
fits of cooperation of various forms and hence to 
the empirical irrelevance of those noncooperative, 
inessential games with uniquely determined solu- 
tions consisting only of one single imputation dom- 
inating all others (as described in the Lausanne 
school’s general economic equilibrium).

Cooperation may depend on a particular flow of 
information among the players. Since the required 
level may not in fact be attainable, noncooperative 
solutions become important. Economic markets in 
which players act independently and have no in- 
centive to deviate from a given state have been 
studied (Nash 1950). Equilibrium points can be 
determined as those points for which unilateral 
changes in strategy are unprofitable to everyone. 
As Nash has shown, every finite game, on the domain
of mixed strategies, has at least one equilibrium
point. If there is more than one equilibrium
point, an intermixture of strategy choices need not
give another equilibrium point, nor is the payoff
to players the same if the points differ from each
other.
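
The defining test for an equilibrium point, that no unilateral change of strategy is profitable, is easy to apply when only pure strategies are considered. The sketch below is restricted to pure strategies for brevity, whereas Nash's theorem concerns mixed strategies. The two-player game and its payoffs are assumed for illustration only.

```python
from itertools import product


def equilibrium_points(payoff_a, payoff_b):
    """Pure-strategy pairs (row, column) from which neither player gains by deviating alone."""
    rows = range(len(payoff_a))
    cols = range(len(payoff_a[0]))
    points = []
    for i, j in product(rows, cols):
        a_cannot_gain = all(payoff_a[k][j] <= payoff_a[i][j] for k in rows)
        b_cannot_gain = all(payoff_b[i][l] <= payoff_b[i][j] for l in cols)
        if a_cannot_gain and b_cannot_gain:
            points.append((i, j))
    return points


# Assumed payoffs: both players gain only by coordinating, but each
# prefers a different point of coordination.
payoff_a = [[2, 0], [0, 1]]
payoff_b = [[1, 0], [0, 2]]

points = equilibrium_points(payoff_a, payoff_b)
print(points)

# Intermix the two equilibria: A's strategy from the first, B's from the second.
mixed_pair = (points[0][0], points[1][1])
print(mixed_pair in points)
```

Here the two equilibrium points give different payoffs to the players, and the pair obtained by taking A's strategy from one point and B's from the other is not an equilibrium point.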

There is no proof, as yet, that every cooperative 
n-person, zero-sum game for any n ≥ 4 has a solution
of the specified kind. However, every individual
game investigated, even with arbitrarily large
n, has been found to possess a solution. The indi- 
cations are that the proof for the general case will 
eventually be given. Other definitions of solutions— 
still differing from that of the Lausanne—Robinson 
Crusoe convention—are possible and somewhat 
narrow the field of choices. They are inevitably 
based on further assumptions about the behavior of 
the participants in the game, which have to be 
justified from case to case. 


Simple games 

In certain n-person games the sole purpose is to 
form a majority coalition. These games are the 
“simple” games in which voting takes place. Ties 
in voting may occur, and weights may differ from 
one player to another; for example, the chairman 
of a committee may have more than one vote. A 
player’s presence may therefore mean the difference
between victory and defeat. Games of this nature
can be identified with classical cases of production,
where the players represent factors of
production. It has been proven that even in rela- 
tively simple cases, although complete substitu- 
tability among players may exist, substitution rates 
may be undetermined and values are attributed to 
the players (factors) only by virtue of their relation 
to each other and not by virtue of their individual 
contribution. Thus, contrary to current economic 
doctrine, substitutability does not necessarily guar- 
antee equality as far as value is concerned. 

Simple games are suited for interpretation of 
many political situations in that they allow the 
determination of the weights, or power, of partici- 
pants in decision processes. A particular power in- 
dex has been proposed by Shapley. It is based on 
the notion of the average contribution a player can 
make to the coalitions to which he may belong, 
even considering, where necessary, the order in 
which he joins them. The weight of a senator, a 
congressman, and the president in the legislative 
process has been calculated for the United States. 
The procedure is applicable to other political sys- 
tems—for example, the Security Council of the 
United Nations (Shapley 1953). 
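
For a small weighted voting body the index can be computed by brute force: every order in which the members might join is listed, and the member whose vote first brings the total up to the quota is counted as pivotal. The sketch below assumes a hypothetical committee in which the chairman casts two votes, two ordinary members cast one each, and three votes are needed to carry a motion. It is practical only for small bodies, since the number of orderings grows factorially.

```python
from fractions import Fraction
from itertools import permutations


def power_index(weights, quota):
    """Share of orderings in which each player is pivotal.

    weights maps each player to his number of votes; a coalition wins
    when its votes reach the quota.
    """
    players = list(weights)
    pivots = dict.fromkeys(players, 0)
    orderings = 0
    for order in permutations(players):
        orderings += 1
        votes = 0
        for player in order:
            votes += weights[player]
            if votes >= quota:
                pivots[player] += 1  # this player turns a losing group into a winning one
                break
    return {player: Fraction(pivots[player], orderings) for player in players}


# Hypothetical committee: the chairman has two votes, the others one each.
committee = {"chairman": 2, "member_1": 1, "member_2": 1}
index = power_index(committee, quota=3)
print(index)
```

In this assumed committee the chairman holds half of the votes but two-thirds of the power as measured by the index, and each ordinary member holds one-sixth.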


Composition of games 

Every increase in the number of players brings 
new phenomena: with the increase from two to 
three players, coalitions become possible, from 
three to four, ties may occur among coalitions, etc. 
There is no guarantee that for very large n an 
asymptotic convergence of solutions will occur, 
since coalition formation always reduces large 
numbers of individual players to small numbers 
of coalitions acting upon each other. Thus, the 
increase in the number of players does not neces- 
sarily lead to a simplification, as in the case of an 
enlargement of the numbers of bodies in a physical 
system, which then allows the introduction of 
classical methods of statistical averages as a sim- 
plification. (When the game is inessential, the 
number of participants is irrelevant in any case.) 

An effective extension of the theory by the en- 
largement of numbers can be achieved by viewing 
games played separately as one composite game and 
by introducing contributions to, or withdrawals 
from, the proceeds of a given game by a group of 
players outside the game under consideration. 
These more complicated notions involve constant- 
sum games and demonstrate, among other things, 
how the coalition formation, the degree of coopera- 
tion among players, and consequently the distribu- 
tion of the proceeds among them are affected by 
the availability of amounts in excess of those due 
to their own strategies alone. Strategy is clearly 
greatly influenced by the availability of greater pay- 
ments than those that can be made by only the 
other players. Thus, coalitions—namely, social 
structures—cannot be maintained if outside con- 
tributions become larger than specified amounts, 
such that as a consequence no coalition can ex- 
haust the amounts offered. It can also be shown 
that the outside source, making contributions or 
withdrawals, can never be less than a group of 
three players. 

These concepts and results are obviously of a 
rather complicated nature; they are not always 
directly accessible to intuition, as corresponds to a 
truly mathematical theory. When that level is 
reached, confidence in the mathematical results 
must override intuition, as the experience in the 
natural sciences shows. The fact that solutions of 
n-person games are not single numbers or single 
sets of numbers—but that the above-mentioned, 
more complicated structures emerge—is not an im- 
perfection of the theory: it is a fundamental prop- 
erty of social organization that can be described 
only by game-theoretic methods. 


Nonzero-sum games 

Nonzero-sum games can be reduced to zero-sum 
games—which makes that entire theory applica- 
ble—by the introduction of a fictitious player, so 
that an n-person, nonzero-sum game becomes equiv- 
alent to an (n + 1)-person, zero-sum game. The 
fictitious player is either winning or losing, but 
since he is fictitious he can never become a member 
of a coalition. Yet he can be construed as proposing 
alternative imputations, thereby influencing the 
players’ strategies and thus the course of the play. 
He will lose according to the degree of cooperation 
among the players. If the players cooperate per- 
fectly, the maximum social benefit will be attained. 
In these games there is an increased role of threats, 
and their costs to the threatening player, although 
threats already occur in the zero-sum case. 
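
At the level of the characteristic function, the reduction can be sketched as follows. The construction assumed here gives any coalition of real players its original worth and gives the same coalition joined by the fictitious player the negative of what the remaining real players could obtain; this is one standard way of making the extended game zero-sum. The two-person example and its worths are hypothetical.

```python
from itertools import combinations


def all_coalitions(players):
    """Every subset of the players, including the empty set, as frozensets."""
    for size in range(len(players) + 1):
        for members in combinations(players, size):
            yield frozenset(members)


def zero_sum_extension(v, players, fictitious="fictitious"):
    """Extend an n-person game v to an (n + 1)-person zero-sum game."""
    everyone = frozenset(players)
    extended = {}
    for s in all_coalitions(players):
        extended[s] = v[s]  # real coalitions keep their worth
        extended[s | {fictitious}] = -v[everyone - s]  # the rest's gain is this side's loss
    return extended


# Hypothetical productive game: neither trader earns anything alone,
# and together they create a surplus of 10.
PLAYERS = ("buyer", "seller")
v = {s: 0 for s in all_coalitions(PLAYERS)}
v[frozenset(PLAYERS)] = 10

extended = zero_sum_extension(v, PLAYERS)
everyone = frozenset(PLAYERS) | {"fictitious"}
is_zero_sum = all(extended[s] + extended[everyone - s] == 0 for s in extended)

print(is_zero_sum)
print(extended[frozenset({"fictitious"})])
```

When the two real players cooperate fully, the fictitious player bears the whole of the surplus as a loss, which corresponds to the remark that he loses according to the degree of cooperation among the players.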

The discriminatory solutions, first encountered 
for the three-person, zero-sum game, serve as in- 
struments to approach these problems. Most ap- 
plications to economics involve gains by the com- 
munity—an economy being productive and there 
being no voluntary exchange unless both sides 
profit—while many other social phenomena fall 
under the domain of zero-sum games. The non- 
zero-sum theory is so far the part of game theory 
least developed in detail, although its foundations 
seem to be firmly established by the above proce- 
dure. 


Applications 


Game theory is applicable to the study of those 
social phenomena in which there are agents striv- 
ing for their own advantage but not in control of 
all the variables on which the outcome depends. 
The wide range of situations of which this is true 
is obvious: they are economic, political, military, 
and strictly social in nature. Applications have 
been made in varying degree to all areas; some 
have led to experiments that have yielded important 
new insights into the theory itself and into special 
processes such as bargaining. Finally, the possi- 
bility of viewing the basic problem of statistics as 
a game against nature has given rise to modern 
statistical decision theory (Wald 1950). The influ- 
ence of game theory is also evident in philosophy, 
information theory, cybernetics, and even biology. 


Oskar MORGENSTERN 


[See also the biography of von NEUMANN.]
II
ECONOMIC APPLICATIONS 


The major economic applications of game theory 
have been in oligopoly theory, bargaining theory, 
and general equilibrium theory. Several distinct 
branches of game theory exist and need to be iden- 
tified before our attention is limited to economic 
behavior. John von Neumann and Oskar Morgen- 
stern, who first explored in depth the role of game 
theory in economic analysis (1944), presented 
three aspects of game theory which are so funda- 
mentally independent of one another that with 
a small amount of editing their opus could have 
been published as three independent books. 

The first topic was the description of a game, or 
interdependent decision process, in extensive form. 
This provided a phraseology (“choice,” “decision 
tree,” “move,” “information,” “strategy,” and “pay- 
off”) for the precise definition of terms, which has 
served as a basis for studying artificial intelligence, 
for developing the behavioral theory of the firm 
(Cyert & March 1963), and for considering statis- 
tical decision making [see DECISION THEORY]. The 
definition of “payoff” has been closely associated 
with developments in utility theory [see UTILITY]. 

The second topic was the description of the two- 
person, zero-sum game and the development of the 
mathematical theory based upon the concept of the 
minimax solution. This theory has formal mathe- 
matical connections with linear programming and 
has been applied successfully to the analysis of 
problems of pure conflict; however, its application 
to the social sciences has been limited because pure 
conflict of interests is the exception rather than the 
rule in social situations [see PROGRAMMING]. 

The third subject to which von Neumann and 
Morgenstern directed their attention was the development
of a static theory for the n-person
(n ≥ 3), constant-sum game. They suggested a set
of stability and domination conditions which should 
hold for a cooperative solution to an n-person game. 
It must be noted that the implications of this solu- 
tion concept were developed on the assumption of 
the existence of a transferable, interpersonally 
comparable linear utility which provides a mecha- 
nism for side payments. Since the original work of 
von Neumann and Morgenstern, twenty to thirty 
alternative solution concepts for the n-person, nonconstant-sum
game have been suggested. Some
have been of purely mathematical interest, but 
most have been based on considerations of bargain- 
ing, fair division, social stability, and other aspects 
of human affairs. Many of the solution concepts 
do not use the assumption of transferable utility. 


Oligopoly and bargaining 

Markets in which there are only a few sellers 
(oligopoly), two sellers (duopoly, a special case of 
oligopoly), one seller and one buyer (bilateral monopoly),
and so on, lend themselves to game-theoretic
analyses because the fate of each participant
depends on the actions taken by the other partici- 
pant or participants. The theory of games has pro- 
vided a unifying basis for the mathematical and 
semimathematical works dealing with such situa- 
tions and has also provided some new results. The 
methodology of game theory requires explicit and 
detailed definition of the strategies available to the 
players and of the payoffs associated with the 
strategies. This methodology has helped to clarify 
the different aspects of intent, behavior, and mar- 
ket structure in oligopolistic markets (Shubik 
1957). So-called conjectural variations and lengthy 
statements regarding an oligopolist’s (or duopolist’s 
or bargainer’s) moves and countermoves can be 
investigated in a unified way when expressed in 
terms of strategies. 

Oligopoly. Perhaps the most pervasive concept 
underlying the writings on oligopoly is that of a non- 
cooperative equilibrium. A group of individuals is 
in a state of noncooperative equilibrium if, in the 
individual pursuit of his own self-interest, no one 
in the group is motivated to change his strategy. 
This concept is basic in the works of Cournot, Ber- 
trand, Edgeworth, Chamberlin, von Stackelberg, 
and many others. Nash (1951) has presented a 
general theory of noncooperative games, based on 
the equilibrium-point solution. This theory is di- 
rectly related to Chamberlin’s theory of monopo- 
listic competition, among others. 
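
The noncooperative equilibrium can be illustrated with the simplest quantity-setting duopoly in the manner of Cournot. The sketch below assumes a linear market price P = a - (q1 + q2) and a constant unit cost c for both firms; these functional forms, the numbers used, and the function names are assumptions made for illustration. Each firm repeatedly adopts its most profitable output against the rival's last output until neither wishes to change.

```python
def profit(own_quantity, other_quantity, a, c):
    """Profit of one firm when price is a - (q1 + q2) and unit cost is c."""
    price = a - own_quantity - other_quantity
    return (price - c) * own_quantity


def best_response(other_quantity, a, c):
    """Output that maximizes a firm's profit against a given rival output."""
    return max(0.0, (a - c - other_quantity) / 2)


def cournot_equilibrium(a, c, rounds=100):
    """Iterate simultaneous best responses from zero output."""
    q1 = q2 = 0.0
    for _ in range(rounds):
        q1, q2 = best_response(q2, a, c), best_response(q1, a, c)
    return q1, q2


# Assumed demand intercept and unit cost.
a, c = 100, 10
q1, q2 = cournot_equilibrium(a, c)
print(round(q1, 6), round(q2, 6))

# Neither firm gains by changing its own output alone.
at_equilibrium = profit(q1, q2, a, c)
deviations = [profit(q, q2, a, c) for q in (20, 25, 35, 40)]
print(all(d < at_equilibrium for d in deviations))
```

With these assumed figures each firm settles on an output of (a - c) / 3 = 30, and any unilateral departure from that output lowers the departing firm's profit.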

The outcome given by a solution is called Pareto 
optimal if no participant can be made better off 
without some other participant’s being made worse 
off. Noncooperative solutions, whose outcomes need 
not be Pareto optimal, have been distinguished 
from cooperative solutions, whose outcomes must 
be Pareto optimal. Also, equilibrium points are dis- 
tinguished on the basis of whether the oligopoly 
model studied is static or dynamic. In much of the 
literature on oligopoly, quasi-cooperative solutions 
have been advanced and quasi-dynamic models 
have been suggested. Thus, while the Chamberlin 
large-group equilibrium can be interpreted as the 
outcome of a static noncooperative game, the small- 
group equilibrium and the market resolution sug- 
gested by Fellner (1949) are cast in a quasi-dy- 
namic, quasi-cooperative framework. A limited 
amount of development of games of survival 
(Milnor & Shapley 1957) and games of economic 
survival (Shubik & Thompson 1959) has provided 
a basis for the study of multiperiod situations and 
for an extension of the noncooperative equilibrium 
concept to include quasi-cooperative outcomes. 
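
The distinction between an equilibrium outcome and a Pareto-optimal outcome can be seen in a two-firm pricing game. The payoffs below are hypothetical and are chosen so that undercutting is individually profitable; the function names are likewise illustrative.

```python
STRATEGIES = ("high", "low")

# Hypothetical profits (firm 1, firm 2) for each pair of prices.
PAYOFFS = {
    ("high", "high"): (3, 3),
    ("high", "low"): (0, 4),
    ("low", "high"): (4, 0),
    ("low", "low"): (1, 1),
}


def is_equilibrium(profile):
    """True if neither firm can raise its own profit by changing price alone."""
    s1, s2 = profile
    u1, u2 = PAYOFFS[profile]
    firm_1_stays = all(PAYOFFS[(d, s2)][0] <= u1 for d in STRATEGIES)
    firm_2_stays = all(PAYOFFS[(s1, d)][1] <= u2 for d in STRATEGIES)
    return firm_1_stays and firm_2_stays


def is_pareto_optimal(profile):
    """True if no other outcome helps one firm without hurting the other."""
    mine = PAYOFFS[profile]
    for other in PAYOFFS.values():
        no_one_worse = all(o >= m for o, m in zip(other, mine))
        someone_better = any(o > m for o, m in zip(other, mine))
        if no_one_worse and someone_better:
            return False
    return True


equilibria = [p for p in PAYOFFS if is_equilibrium(p)]
pareto_optimal = [p for p in PAYOFFS if is_pareto_optimal(p)]
print(equilibria)
print(pareto_optimal)
```

In this assumed game the only noncooperative equilibrium is for both firms to charge the low price, and it is the one outcome that is not Pareto optimal.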

New results. The recasting of oligopoly situations 
into a game-theory context has produced some new 
results in oligopoly theory (see, for example, May- 
berry, Nash, & Shubik 1953; Shubik 1959a). Nash 
(1953) and Shubik (1959a) have developed the 
definition of “optimum threat” in economic war- 
fare. The kinky oligopoly demand curve and the 
more general problem of oligopolistic demand have 
been re-examined and interpreted. Other results 
concern stability and the Edgeworth cycle in price- 
variation oligopoly; duopoly with both price and 
quantity as independent variables; and the develop- 
ment of diverse concepts applicable to cartel be- 
havior, such as blocking coalitions (Scarf 1965), 
discriminatory solutions, and decomposable games. 

Selten (1965) has been concerned with the 
problem of calculating the noncooperative equilibria
for various classes of oligopolistic markets. His
work has focused on both the explicit calculation 
and the uniqueness of equilibrium points. Vickrey 
(1961), Griesmer and Shubik (1963), and others 
have studied a class of game models applicable to 
bidding and auction markets. Working from the 
viewpoint of marketing and operations research, 
Mills (1961) and others have constructed several 
noncooperative game-theoretic models of competition
through advertising. Jacot (1963) has considered
problems involving location and spatial competition.

Behavioristic findings. Game theory can be given 
both a normative and a behavioristic interpretation. 
The meaning of “rational behavior” in situations 
involving elements of conflict and cooperation is 
not well defined. No single set of normative criteria
has been generally accepted, and no universal behavior
has been validated. Closely related to and
partially inspired by the developments in game 
theory, there has been a growth in experimental 
gaming, some of which has been in the context of 
economic bargaining (Siegel & Fouraker 1960) or 
in the simulated environment of an oligopolistic 
market (Hoggatt 1959). Where there is no verbal 
or face-to-face communication, there appears, un- 
der the appropriate circumstances, to be some evi- 
dence in favor of the noncooperative equilibrium. 

Bargaining. The theory of bargaining has been 
of special interest to economists in the context of 
bilateral monopoly, which can involve two firms, 
a labor union and a firm, or two individuals en- 
gaged in barter in the market place or trying to 
settle a joint estate. Any two-person, nonconstant- 
sum situation, be it haggling in the market or in- 
ternational negotiations, can be formally described 
in the same game-theoretic framework. However, 
there are several substantive problems which limit 
application of this framework and which have re- 
sulted in the development of different approaches. 
In nonconstant-sum games communication be- 
tween the players is of considerable importance, 
yet its role is exceedingly hard to define. In games 
such as chess and even in many oligopolistic mar- 
kets, a move is a well-defined physical act—moving 
a pawn in a definite manner or changing a price 
or deciding upon a production rate; in bargaining 
it may be necessary to interpret a statement as a 
move. The problem of interpreting words as moves 
in negotiation is critical to the description and un- 
derstanding of bargaining and negotiation proc- 
esses. This “coding” problem has to be considered 
from the viewpoint of many other disciplines, as 
well as that of game theory. 

A desirable property of a theoretical solution to 
a bargaining problem is that it predicts a unique 
outcome. In the context of economics this would be 
a unique distribution of resources (and unique 
prices, if prices exist at all). Unfortunately, there 
are few concepts of solution pertaining to economic 
affairs which have this property. The price system 
and distribution resulting from a competitive mar- 
ket may in general not be unique; Edgeworth’s so- 
lution to the bargaining problem was the contract 
curve, which merely predicts that the outcome will 
be some point among an infinite set of possibilities. 

The contract curve has the property that any 
point on it is jointly optimal (both bargainers can- 
not improve their position simultaneously from a 
point on this curve) and individually rational (no 
point gives an individual less than he could obtain 
without trading). The Pareto-optimal surface is
larger than the contract curve, for it is restricted
only by the joint optimality condition. If it is as- 
sumed that a transferable comparable utility exists, 
then the Pareto-optimal surface (described in the 
space of the traders’ utilities) is flat; if not, it will 
generally be curved. Any point on the Pareto-opti- 
mal surface that is individually rational is called 
an imputation. In the two-person bargain the Edge- 
worth contract curve coincides with two game- 
theoretic solutions, the core and the stable set. The 
core consists of all undominated imputations (it 
may be empty). A stable set is a set of imputations 
which do not dominate each other but which together
dominate all other imputations. An imputation,
α, is said to dominate another imputation,
β, if (1) there exists a coalition of players who,
acting jointly but independently of the others, could
guarantee for themselves at least the amounts they
would receive if they accepted α, and (2) each
player in that coalition obtains more in α than in β. The core and
stable-set solutions can be defined with or without 
the assumption of transferable utilities. Neither of 
these solution concepts predicts a unique outcome. 
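
With transferable utility, membership in the core is commonly tested by requiring that every coalition receive at least what it could guarantee for itself; the sketch below assumes that this inequality form agrees with the definition by undominated imputations for the games shown. Two hypothetical games are used: a bilateral bargain over a surplus of 1, and a three-person majority game in which any two players can take the whole of 1. Divisions are sampled on a grid of tenths.

```python
from fractions import Fraction
from itertools import combinations


def in_core(x, v, players):
    """x maps each player to a payoff; v maps a frozenset of players to its worth."""
    if sum(x.values()) != v(frozenset(players)):
        return False  # not a division of the whole proceeds
    for size in range(1, len(players) + 1):
        for members in combinations(players, size):
            if sum(x[p] for p in members) < v(frozenset(members)):
                return False  # this coalition could do better on its own
    return True


def bilateral(coalition):
    """Assumed worths: the surplus of 1 arises only if both traders agree."""
    return 1 if len(coalition) == 2 else 0


def majority(coalition):
    """Assumed worths: any two of the three players can take everything."""
    return 1 if len(coalition) >= 2 else 0


tenths = [Fraction(k, 10) for k in range(11)]

bargain_splits = [{"buyer": t, "seller": 1 - t} for t in tenths]
all_splits_in_core = all(
    in_core(x, bilateral, ("buyer", "seller")) for x in bargain_splits
)

three = ("a", "b", "c")
majority_splits = [
    {"a": s, "b": t, "c": 1 - s - t} for s in tenths for t in tenths if s + t <= 1
]
majority_core_is_empty = not any(in_core(x, majority, three) for x in majority_splits)

print(all_splits_in_core, majority_core_is_empty)
```

The bilateral bargain shows a core that is large (every individually rational split belongs to it, as with the contract curve), while the majority game shows a core that is empty.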

One approach to bilateral monopoly has been to 
regard it as a “fair-division” problem, and several 
solution concepts, each one embodying a formaliza- 
tion of concepts of symmetry, justice, and equity, 
have been suggested (Nash 1953; Shapley 1953; 
Harsanyi 1956). These are generally known as 
value solutions, since they specify the amount that 
each participant should obtain. For the two-person 
case, some of the fair-division or arbitration 
schemes do predict unique outcomes. The Nash 
fair-division scheme assumes that utilities of the 
players are measurable, but it does not need as- 
sumptions of either comparability or transferability 
of utilities (Shubik 1966). Shapley’s scheme does 
utilize the last two assumptions. Other schemes 
have been suggested by Raiffa (1953), Braithwaite 
(1955), Kuhn (in Shubik 1967), and others. 
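
A unique outcome of this kind can be illustrated with the Nash scheme taken in its simplest form, assumed here to select the division that maximizes the product of the two players' utility gains over the no-agreement point. The sketch divides one unit of a good by a plain grid search; the utility functions, the zero no-agreement utilities, and the function name are assumptions for illustration.

```python
import math


def nash_division(u1, u2, d1=0.0, d2=0.0, steps=10_000):
    """Share x of one unit given to player 1 that maximizes the product of gains.

    u1 and u2 are the players' utility functions of their own shares;
    d1 and d2 are their utilities if no agreement is reached.
    """
    best_share, best_product = None, -math.inf
    for k in range(steps + 1):
        x = k / steps
        gain_1 = u1(x) - d1
        gain_2 = u2(1 - x) - d2
        if gain_1 < 0 or gain_2 < 0:
            continue  # not individually rational
        if gain_1 * gain_2 > best_product:
            best_share, best_product = x, gain_1 * gain_2
    return best_share


def linear(share):
    return share


def concave(share):
    return math.sqrt(share)  # assumed utility of a more risk-averse player


def rescaled_concave(share):
    return 10 * math.sqrt(share)  # the same preferences in different units


print(nash_division(linear, linear))
print(nash_division(linear, concave))
print(nash_division(linear, rescaled_concave))
```

Two players with identical linear utilities divide the unit equally, while against the more risk-averse player the first player obtains about two-thirds. Rescaling one player's utility leaves the division unchanged, which is the sense in which the scheme needs measurable utilities but no comparison between them.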

Another approach to bargaining is to treat it in 
the extensive form, describing each move explicitly 
and showing the time path taken to the settlement 
point. This involves attempting to parametrize 
qualities such as “toughness,” “flexibility,” etc. Most
of the attempts to apply game theory in this man- 
ner belong to studies in social psychology, political 
science, and experimental gaming. However, it has 
been shown (Harsanyi 1956) that the dynamic 
process suggested by Zeuthen (1930) is equivalent 
to the Nash fair-division scheme. 


General equilibrium 


Game theory methods have provided several new 
insights in general equilibrium economics. Under
the appropriate conditions on preferences and production,
it has been proved that a price system that
clears the market will exist, provided that each 
individual acts as an independent maximizer. This 
result holds true independently of the number of 
participants in the market; hence, it cannot be 
interpreted as a limiting phenomenon as the num- 
ber of participants increases. Yet, in verbal discus- 
sions contrasting the competitive market with bi- 
lateral monopoly, the difference generally stressed 
is that between the market with many participants, 
each with little if any control over price, and the 
market with few participants, where the interac- 
tions of each with all the others are of maximum 
importance. 

The competitive equilibrium best reflects the 
spirit of “the invisible hand” and of decentraliza- 
tion. The use of the word “competitive” is counter 
to both game-theoretic and common-language im- 
plications. It refers to the case in which, if each 
individual considers himself an isolated maximizer 
operating in an environment over which he has no 
control, the results will be jointly optimal. 

Game-theoretic solutions. The power and appeal
of the concept of competitive equilibrium appear
to be far greater than that of mere decentralization.
This is reflected in the finding that under
the appropriate conditions the competitive equilib- 
rium may be regarded as the limit solution for sev- 
eral conceptually extremely different game-theo- 
retic solutions. 

Convergence of the core. It has been noted that 
for bilateral monopoly the Edgeworth contract 
curve is the core. Edgeworth had suggested and pre- 
sented an argument to show that if the number of 
traders is increased on both sides of the market, 
the contract curve would shrink (interpreted appropriately,
given the change in dimensions).
Shubik (1959b) observed the connection between 
the work of Edgeworth and the core; he proved the 
convergence of the core to the competitive equilib- 
rium in the special case of the two-sided market 
with transferable utility and conjectured that the 
result would be generally true for any number of 
markets without transferable utility. This result 
was proved by Scarf (the proof, although achieved 
earlier, is described in Scarf 1965); Debreu and 
Scarf improved upon it (1963). Using the concept 
of a continuum of players (rather than considering 
a limit by replicating the finite number of players 
in each category, as was done by Shubik, Scarf, and 
Debreu), Aumann (1966) proved the convergence 
of the core under somewhat different conditions. 
When transferable utility is assumed, the core con- 
verges to a single point and the competitive equilibrium
is unique. Otherwise it may split and converge
to the set of competitive equilibria. 

The convergence of the core establishes the ex- 
istence of a price system as a result of a theory 
which makes no mention of prices. The theory’s 
prime concern is with the power of coalitions. It 
may be looked upon as a formalization of counter- 
vailing power, inasmuch as it rules out imputations 
which can be dominated by any group in the so- 
ciety. 

Shapley and Shubik (1966) have shown the con- 
vergence of the value in the two-sided market with 
transferable utility. In unpublished work Shapley 
has proved a more general result for any number 
of markets, and Shapley and Aumann have worked 
on the convergence of a nontransferable utility 
value recently defined by Shapley. Harsanyi (1959) 
was able to define a value that generalized the Nash 
two-person fair-division scheme to situations in- 
volving many individuals whose utilities are not 
transferable. This preceded and is related to the 
new value of Shapley, and its convergence has not 
been proved. 

There are several other value concepts (Selten 
1964), all of which make use of symmetry axioms 
and are based upon some type of averaging of the 
contributions of an individual to all coalitions. 

If one is willing to accept the value as reflecting 
certain concepts of symmetry and fairness, then in 
an economy with many individuals in all walks of 
life, and with the conditions which are required for 
the existence of a competitive equilibrium satis- 
fied, the competitive equilibria will also satisfy 
these symmetry and fairness criteria.

Noncooperative equilibrium. One of the impor- 
tant open problems has been the reconciliation of 
the various noncooperative theories of oligopolistic 
competition with general equilibrium theory. The 
major difficulty is that the oligopoly models are 
open in the sense that the customers are usually
not considered as players with strategic freedom, 
while the general equilibrium model considers 
every individual in the same manner, regardless of 
his position in the economy. Since the firms are 
players in the oligopoly models, it is necessary to 
specify the domain of the strategies they control 
and their payoffs under all circumstances. In a gen- 
eral equilibrium model no individual is considered 
a player; all are regarded as individual maximizers. 
Walras’ law is assumed to hold, and supply is as- 
sumed to equal demand. 

When an attempt is made to consider a closed 
economic model as a noncooperative game, con- 
siderable difficulties are encountered in describing 
the strategies of the players. This can be seen immediately
by considering the bilateral monopoly
problem; each individual does not really know what 
he is in a position to buy until he finds out what 
he can sell. In order to model this type of situation 
as a game, it may be necessary to consider strate- 
gies which do not clear the market and which may 
cause a player to become bankrupt—i.e., unable to 
meet his commitments. Shapley and Shubik (in 
Shubik 1967) have successfully modeled the closed 
two-sided two-commodity market without side pay- 
ments and have shown that the noncooperative 
equilibrium point converges from below the Pareto- 
optimal surface to the competitive equilibrium 
point. They also have considered more goods and 
markets on the assumption of the existence of a 
transferable (but not necessarily comparable) 
utility. 

When there are more than two commodities and 
one market, the existence of a unique competitive 
equilibrium point appears to be indispensable in 
defining the strategies and payoffs of players in a 
noncooperative game. No one has succeeded in con- 
structing a satisfactory general market model as a 
noncooperative game without using a side-payment 
mechanism. The important role played by the side- 
payment commodity is that of a strategy decoupler. 
It means that a player with a supply of this type of 
“money” can decide what to buy even though he 
does not know what he will sell. 

In summary, it appears that, in the limit, at 
least three considerably different game-theoretic 
solutions are coincidental with the competitive 
equilibrium solution. This means that by consider- 
ing different solutions we may interpret the com- 
petitive market in terms of decentralization, fair 
division, the power of groups, and the attenuation 
of power of the individual. 

The stable-set solution of von Neumann and
Morgenstern, the bargaining set of Aumann and 
Maschler (1964), the “self-policing” properties of 
certain imputation sets of Vickrey (1959), and 
several other related cooperative solutions appear 
to be more applicable to sociology, and possibly an- 
thropology, than to economics. There has been no 
indication of a limiting behavior for these solutions 
as numbers grow; on the contrary, it is conjectured 
that in general the solutions proliferate. When, 
however, numbers are few, as in cartel arrange- 
ments and in international trade, these other solu- 
tions provide insights, as Nyblén has shown in his 
work dealing with stable sets (1951). 

Nonexistence of competitive equilibrium. When 
conditions other than those needed for the existence 
of a competitive equilibrium hold, such as external 
economies or diseconomies, joint ownership, increasing
returns to scale, and interlinked tastes,
then the different solutions in general do not con- 
verge. There may be no competitive equilibrium; 
the core may be empty; and the definition of a non- 
cooperative game when joint property is at stake 
will call for a statement of the laws concerning 
damages and threats. (Similarly, even though the 
conditions for the existence of a competitive equi- 
librium are satisfied, the various solutions will be 
different if there are few participants.) When the 
competitive equilibrium does not exist, we must 
seek another criterion to solve the problem of dis- 
tribution or, if possible, change the laws to rein- 
troduce the competitive equilibrium. The other so- 
lutions provide different criteria. However, if a 
society desires, for example, to have its distribution 
system satisfy conditions of decentralization and 
fair division, or of fair division and limits on power 
of groups, it may be logically impossible to do so. 

Davis and Whinston (1962), Scarf (1964), and 
Shapley and Shubik (1964) have investigated ap- 
plications of game theory to external economies, 
to increasing returns to scale, and to joint owner- 
ship. In the case of joint ownership the relation 
between economics and politics as mechanisms for 
the distribution of the proceeds from jointly owned 
resources is evident. 

It must be noted that the “many solutions” ap- 
proach to distribution is in contrast to the type of 
welfare economics that considers a community 
welfare function or social preferences, which are 
not necessarily constructed from individual pref- 
erences. 


Other applications 


Leaving aside questions of transferable utility, 
there is a considerable difference between an econ- 
omy in which there is only barter or a passive shad- 
ow price system and one in which the government, 
and possibly others, have important monetary strategies.
Faxén (1957) has considered financial policy
from a game-theoretic viewpoint. 

There have been some diverse applications of 
game theory to budgeting and to management sci- 
ence, as can be seen in the articles by Bennion 
(1956) and Shubik (1955). 

Nyblén (1951) has attempted to apply the von 
Neumann and Morgenstern concept of stable set 
to problems of macroeconomics. He notes that the 
Walrasian system bypasses the problem of individ- 
ual power by assuming it away. He observes that 
in game theory certain simple aggregation proce- 
dures do not hold; thus, the solutions to a four- 
person game obtained by aggregating two players 
in a five-person game may have little in common
with the solutions to the original five-person game.
He outlines an institutional theory of the rate of 
interest based upon a standard of behavior and 
(primarily at a descriptive level) links the concepts 
of discriminatory solution and excess to inflation 
and international trade. 

MARTIN SHUBIK 


[The reader who is not familiar with oligopoly theory
and general equilibrium theory should consult ECONOMIC
EQUILIBRIUM; OLIGOPOLY; WELFARE ECONOMICS.]


GAMING
See SIMULATION. 


GANGS 
See under DELINQUENCY. 


GAUSS, CARL FRIEDRICH 


Carl Friedrich Gauss (1777-1855), greatest of 
German mathematicians, was born in Brunswick 
on April 30, 1777. (He was baptized Johann 
Friedrich Carl, but he later dropped his first name 
and reversed the second and third.) Ranked with 
Archimedes and Newton as one of the three 
greatest mathematicians of all time, he com- 
bined in a most unusual way a pure mathema- 
tician’s interest in abstract ideas and logical rigor, 
a theoretical physicist’s interest in the creation 
of mathematical models of the physical world, an 
astronomer’s talent for keen observation, and an 
experimentalist’s skill in the application and in- 
vention of methods of measurement. He was 
blessed with a stupendous faculty for mental cal- 
culation, which enabled him to explore numerical 
relationships experimentally and to carry out ex- 
tensive or involved routine computations quickly 
and accurately; he also had a gift for learning 
ancient and modern languages, which became his 
hobby. He contributed mightily to every branch of 
pure and applied mathematics that existed in his 
day, some of which he had founded, and he made 
major contributions in astronomy, geodesy, physics, 
and metrology. Each of his many interests had its 
principal, but not exclusive, season: before 1800, 
philology and number theory; 1800-1820, astron- 
omy; 1820-1830, geodesy, differential geometry, 
and conformal mapping; 1830-1840, geomagne- 
tism, electromagnetism, and general theory of in- 
verse-square forces; 1840-1855, topology and the 


geometry of functions of a complex variable. He 
died in Göttingen on February 23, 1855. Of his 
many contributions, those used most widely in the 
physical, biological, and social sciences today re- 
late to the method of least squares, his first formu- 
lation of which dates from 1795 to 1798 and his 
second from 1821 to 1823. 

Gauss was the only child of Gebhard Dietrich 
Gauss, a bricklayer and gardener, by his second 
wife, née Dorothea Benze, daughter of a stonemason.
Gebhard Gauss, being skilled in writing
and calculating, kept accounts for a local insurance 
company; he was esteemed by the townspeople, 
but at home he was harsh and uncouth, which 
repelled his brilliant son. Gauss’s mother had no 
special schooling, could not write, and could barely 
read, but she was cheerful, intelligent, and of 
strong character. She had a genial and extraordi- 
narily intelligent younger brother, Johann Friedrich, 
a skilled weaver of artistic damasks, who quickly 
spotted his nephew's unusual talents and capaci- 
ties and enjoyed sharpening his wits on those of 
his sister’s young genius. Gauss, as a small boy, 
thought highly of his uncle and later, lamenting 
his untimely death in 1809, often declared that 
a born genius had been lost in him. 

Gauss’s precocity is unequaled in the history of 
mathematics. Before he was three, while watching 
his father’s payroll calculations he detected an 
error and announced the correct result. He was 
admitted at age ten to the beginners’ class in arith- 
metic at St. Katherine’s Volksschule in Brunswick, 
where the speed and accuracy of his mental calcula- 
tions so astonished the schoolmaster that he pur- 
chased for the boy the best obtainable textbook on 
arithmetic, which Gauss quickly mastered, con- 
vincing the schoolmaster that Gauss had gone 
beyond him. Luckily the schoolmaster had an as- 
sistant who was interested in mathematics. He 
and Gauss studied algebra and the rudiments of 
calculus together in the evenings, helping each 
other over difficulties and amplifying the textbook 
proofs. Thus in his eleventh year Gauss became 
acquainted with the binomial theorem and, find- 
ing the textbook “proof” unsatisfactory when the 
exponent n is not a positive integer, devised his 
own proof of the convergence of the infinite series 
involved, which established him as one of the first 
of the “rigorists” and served as an inspiration for 
some of his greatest later work. 

Early in 1791 Gauss’s amazing powers came to 
the attention of Carl Wilhelm Ferdinand, duke of 
Brunswick, who became Gauss’s patron, paid the 
expenses of his education at Caroline College from 
1792 to 1795 and at the University of Göttingen
from 1795 to 1798, and until his death in 1806
gave Gauss considerable additional financial support. 
At Caroline College, Gauss devoted himself with 
equal success to classical literature, philosophy, 
and advanced mathematics. He studied carefully 
the original works of Newton, Euler, and Lagrange. 
In March 1795 (Dunnington 1955, p. 391) he 
rediscovered that invaluable principle of number 
theory, the law of quadratic reciprocity, which 
Legendre had published in 1785, and of which 
Gauss was to publish the first rigorous proof in 
1801. Yet, on entering the University of Göttingen 
in October 1795, he was still undecided whether to 
make mathematics or philology his career. 

Mathematics won on March 30, 1796, the day 
Gauss discovered that a regular polygon of 17 sides 
is amenable to straightedge-and-compass construc- 
tion; such a construction had eluded mathemati- 
cians for two thousand years. By June 1, 1796, he 
had discovered much more: a regular polygon with 
an odd number of sides is amenable to such con- 
struction if and only if the number of its sides is 
a prime Fermat number, $F_n = 2^{2^n} + 1$, or a product
of different prime Fermat numbers (five of which 
are known today). In 1796 he also developed the 
first rigorous proof of the fundamental theorem 
of algebra (i.e., that every nonconstant polynomial 
has a root), which became the subject of the doctoral
dissertation for which he was awarded a Ph.D.
in absentia by the University of Helmstedt in 1799. 
In the autumn of 1798 Gauss polished the final 
draft of his greatest masterpiece, Disquisitiones 
arithmeticae, the printing of which began in 1799 
but was not completed until September 1801, 
owing to the sale of the original print shop. Gauss 
brought together in this work his own original 
contributions to the theory of integral numbers 
and rational fractions and all of the principal re- 
lated, but somewhat disconnected, results of his 
predecessors, so enriching the latter by rigorous 
reformulation and blending into a unified whole 
that this great book is regarded as marking the 
beginning of the theory of numbers as a separate, 
systematic branch of mathematics. 
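
The criterion for odd-sided constructible polygons stated above can be illustrated with a short enumeration. The following is a minimal sketch, assuming only the five Fermat primes known today; the function names are hypothetical and chosen for illustration.

```python
from itertools import combinations
from math import prod

# The five Fermat primes known today: F_n = 2**(2**n) + 1 for n = 0, ..., 4.
FERMAT_PRIMES = [2 ** (2 ** n) + 1 for n in range(5)]


def odd_constructible_side_counts():
    """Return every odd side count that is a product of distinct known Fermat primes."""
    counts = []
    for size in range(1, len(FERMAT_PRIMES) + 1):
        for subset in combinations(FERMAT_PRIMES, size):
            counts.append(prod(subset))
    return sorted(counts)


def is_odd_constructible(sides):
    """True if an odd-sided regular polygon with this many sides meets the criterion."""
    return sides in set(odd_constructible_side_counts())
```

Under this assumption the list is finite: each of the five primes is either used once or not at all, and the empty product is excluded, so there are 2^5 - 1 = 31 admissible odd side counts.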

Among Gauss’s early achievements was his re- 
duction of the ecclesiastical calendar’s extremely 
complicated computational procedure for finding 
the date of Easter (see Encyclopaedia Britannica, 
11th ed., vol. 4, pp. 991-999) to a set of simple 
formulas from which the answer for any given 
year can be found in a few minutes. His motiva- 
tion stemmed from his mother’s inability to recall 
the exact date of his birth—only that it was a 
Wednesday, eight days before Ascension Day. As 
first published (1800), his procedure gave an incorrect
date for Easter in 1734 and would have
been incorrect again in 1886. Gauss supplied the 
necessary correction in 1807 and later provided 
an additional correction needed from 4200 on. (A 
semipopular exposition of Gauss’s procedure, with 
a worked example, has been provided by H. Herbert 
Howe [1954].)
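
The formulas themselves are not reproduced in this article. As an illustration only, the sketch below follows the form of the rule commonly attributed to Gauss for the Gregorian calendar, with the later corrections folded in. The function name and the variable names are conventional choices, not taken from the text.

```python
def gauss_easter(year):
    """Return (month, day) of Easter Sunday for a year in the Gregorian calendar."""
    a = year % 19          # position of the year in the 19-year lunar cycle
    b = year % 4           # leap-year cycle
    c = year % 7           # weekday cycle
    k = year // 100        # century
    p = (13 + 8 * k) // 25 # lunar correction
    q = k // 4             # solar (leap-century) correction
    m = (15 - p + k - q) % 30
    n = (4 + k - q) % 7
    d = (19 * a + m) % 30  # days from March 21 to the paschal full moon
    e = (2 * b + 4 * c + 6 * d + n) % 7  # days from that full moon to Sunday

    # Two exceptional cases in which the raw result falls one week too late.
    if d == 29 and e == 6:
        return (4, 19)
    if d == 28 and e == 6 and (11 * m + 11) % 30 < 19:
        return (4, 18)

    day = 22 + d + e       # counted as a day of March
    return (3, day) if day <= 31 else (4, day - 31)
```

For example, `gauss_easter(2024)` evaluates to `(3, 31)`, that is, March 31.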

Gauss’s first formulation of the method of least 
squares dates from his student days. In the au- 
tumn of 1794 he read (Galle 1924, p. 5) Lambert’s 
discussion (1765, pp. 428-488) of the determina- 
tion of the coefficients of a linear relationship 
$y = \alpha + \beta x$ from a set of $n$ ($> 2$) observational
points $(Y_i, x_i)$ by the method of averages. In 1795
Gauss conceived the simpler and more objective
procedure of taking for $\alpha$ and $\beta$ the values $a$ and $b$
that minimize the sum of squared residuals,
$\sum_i (Y_i - a - b x_i)^2$, and worked out the computational
details, except for the weighting of observations
of unequal precision (Galle 1924, p. 7). In
1797 he attempted to justify his minimum-sum-of- 
squared-residuals (mssr) technique by means of 
the theory of probability but “found out soon that 
determination of the most probable value of an 
unknown quantity is impossible unless the prob- 
ability distribution of errors of observation is known 
explicitly” ([1821a] 1880, p. 98). Therefore, “it 
seemed to him most natural to proceed the other 
way around” and to seek the probability distribution 
of errors that “in the simplest case would result 
in the rule, generally accepted as good, that the 
arithmetic mean of several values for the same un- 
known quantity obtained from equally reliable ob- 
servations shall be considered the most probable 
value” ([1821a] 1880, p. 98). 
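
The unweighted procedure of 1795 described above, choosing $a$ and $b$ to minimize $\sum_i (Y_i - a - b x_i)^2$, has a well-known closed-form solution. The sketch below illustrates it in modern terms. It is not a reconstruction of Gauss's own computational layout, and the function names are hypothetical.

```python
def fit_line(xs, ys):
    """Return (a, b) minimizing sum((y - a - b*x)**2) over the given points."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal count")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undetermined")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx            # slope
    a = y_bar - b * x_bar    # intercept: the fitted line passes through the means
    return a, b


def sum_squared_residuals(a, b, xs, ys):
    """Sum of squared residuals of the line y = a + b*x."""
    return sum((y - a - b * x) ** 2 for x, y in zip(xs, ys))
```

Any other choice of intercept or slope gives a larger value of `sum_squared_residuals` on the same data, which is the defining property of the least squares values.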

The concept of a probability distribution, or 
“law,” of errors originated with Thomas Simpson 
(1755): he studied the sampling distribution of 
the arithmetic mean of n independent and identi- 
cally distributed errors subject to a discrete rec- 
tangular or a discrete triangular law of error, and 
concluded that “the more observations or experi- 
ments are made, the less will the conclusion be 
subject to err, provided they admit of being made 
under the same circumstances” (p. 93). In 1757 
he extended his analysis to samples of n from a 
continuous triangular distribution. Studies of 
other continuous laws of error followed: rectangular,
by Lagrange, published in 1774; double-exponential
($Ce^{-k|x|}$, $-\infty < x < +\infty$), by Laplace in
1774; semicircular ($C\sqrt{r^2 - x^2}$, $-r < x < +r$), by
Daniel Bernoulli in 1778; and double-logarithmic
($C \log(a/|x|)$, $-a < x < +a$), by Laplace in 1781
(for further details, see Eisenhart 1964; Todhunter 
1865; Merriman 1877). 


Using inverse probability, Gauss found
$f(v) = (h/\sqrt{\pi}) \exp(-h^2 v^2)$ to be the required probability
distribution of errors (1809, art. 175-177). He noted
that $h$ ($= 1/(\sigma\sqrt{2})$ in modern notation, where
$\sigma$ is the root-mean-square error, or standard deviation,
of the distribution) “can be considered as the
measure of precision of the observations” (art.
178). Although De Moivre in 1733 had adduced
the function $e^{-x^2}$ to approximate sums of successive
terms of the binomial expansion of $(a + b)^n$ when
n is large (for what he actually wrote, see Smith 
[1929] 1959, pp. 566-575), and Laplace, in a series 
of papers published during the period 1774-1786, 
had explored in great detail the use of this func- 
tion and its derivatives to approximate various 
probability distributions arising in games of 
chance, notably the binomial and hypergeometric 
distributions and the corresponding incomplete 
beta-function forms obtained through application 
of Bayes’ theorem, and had suggested tabulation 
of its integral for use in such problems (see 
Todhunter [1865] 1949, art. 890-911), neither
De Moivre nor Laplace seems to have considered
$C \exp(-h^2 x^2)$, or an equivalent expression, as a
law of error or as a probability distribution in its 
own right. 

Gauss then went on to deduce his mssr technique
from $f(v)$, by what we would today call the
method of maximum likelihood, finding that when
the respective observations, $Y_i$, $i = 1, 2, \ldots, n$, are
of unequal precision, each residual should be multiplied
by the corresponding measure of precision,
$h_i$, $i = 1, 2, \ldots, n$, before squaring (art. 179);
derived the formula for the precision, $h(\bar{Y})$, of any
weighted arithmetic mean, $\bar{Y}$, of $n$ independent
observations (art. 181), finding that the precision,
$h(\bar{Y})$, of the unweighted arithmetic mean, $\bar{Y}$, of $n$
equally precise independent observations is proportional
to $\sqrt{n}$ (art. 173); derived for the case of observations
on linear functions of several unknown
quantities $\alpha, \beta, \ldots$, the rule of formation of
(what we call) the normal equations that jointly
determine optimal estimators $A, B, \ldots$ (art. 180);
outlined (art. 182) his famous method of elimination
for solving the normal equations (most widely
known today through a modification published by
M. H. Doolittle [1878]) that provides as a by-product
an easily evaluated expression (art. 182,
sec. 5) for the corresponding minimum sum of
squared residuals in terms of quantities found in
the course of their solution; and gave similar expressions
(art. 183) for evaluating the precisions
$h_A, h_B, \ldots$ of $A, B, \ldots$ as determinations of $\alpha, \beta, \ldots$.
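
The steps just listed, weighting each residual by its measure of precision, forming the normal equations, and solving them by elimination, can be sketched in modern terms as follows. This is an illustration under stated assumptions and not Gauss's own tabular scheme: the observations are taken to be linear in the unknowns, each residual is multiplied by $h_i$ before squaring (so the weight is $h_i^2$), and the elimination uses row pivoting, a modern safeguard. All function names are hypothetical.

```python
def normal_equations(rows, ys, precisions):
    """Form the weighted normal equations for the model y_i ~ sum_j rows[i][j] * beta_j.

    Each residual is multiplied by its precision h_i before squaring,
    so observation i carries weight h_i**2.
    """
    k = len(rows[0])
    lhs = [[0.0] * k for _ in range(k)]
    rhs = [0.0] * k
    for row, y, h in zip(rows, ys, precisions):
        w = h * h
        for p in range(k):
            rhs[p] += w * row[p] * y
            for q in range(k):
                lhs[p][q] += w * row[p] * row[q]
    return lhs, rhs


def solve_by_elimination(matrix, rhs):
    """Solve a square linear system by elimination and back substitution."""
    n = len(rhs)
    aug = [list(matrix[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        # Row pivoting: a modern numerical safeguard, not part of the original scheme.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("normal equations are singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * solution[j] for j in range(i + 1, n))
        solution[i] = (aug[i][n] - tail) / aug[i][i]
    return solution


def weighted_least_squares(rows, ys, precisions):
    """Estimate the unknowns from observations of unequal precision."""
    lhs, rhs = normal_equations(rows, ys, precisions)
    return solve_by_elimination(lhs, rhs)
```

With a single unknown and every row equal to `[1.0]`, the estimate reduces to the weighted arithmetic mean of the observations, with weights $h_i^2$.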

An entry in Gauss’s diary (see Werke, vol. 10,
p. 533) indicates that he completed the foregoing
development of the method of least squares on
June 17, 1798. A fragment of his first application 
of these procedures in the spring of 1799 has been 
found (see Werke, vol. 12, pp. 64-68), and a short 
note ([1799] 1900, p. 136) dated August 24, 
1799, on another application was published in 
October 1799. 

In 1801 Gauss became deeply involved in the 
development and application of new astronomical 
methods which, in conjunction with his least 
squares techniques, resulted in amazingly accurate 
predictions, despite the scanty data available, of 
the orbits of some newly discovered planets; the 
success of his new methods brought about Gauss’s 
immediate recognition as a first-rank astronomer 
(Bell 1937; Dunnington 1955, pp. 49-57). In 
1805 Gauss began preparation (in German) of 
his second masterpiece, Theoria motus corporum 
coelestium . . . (1809), in which he was to give a 
complete system of formulas and procedures for 
computing the motion of a body whose orbit is a 
conic section, and a general method for determin- 
ing the orbit of a comet or a planet from only three 
observed positions. The third section (art. 172- 
189) he devoted to a detailed exposition of the 
theory and application of his least squares methods. 
The German text was completed in 1806 (Galle 
1924, p. 10). When the work was offered for pub- 
lication in 1807, the publisher accepted it only on 
condition that Gauss translate it into Latin, be- 
cause of the very unsettled political situation in 
Germany following the disastrous defeat of the 
Prussian army (under the leadership of Gauss’s 
patron, the duke of Brunswick) by the Napoleonic 
forces at the battle of Auerstedt in October 1806. 
Consequently, full details of Gauss’s first formula- 
tion of the method of least squares were not pub- 
lished until the Latin translation appeared in 1809. 
In the meantime, Legendre’s independent formula- 
tion of his methods of least squares had appeared 
in print (1805); this went little beyond what 
Gauss had developed in 1795—no probability con- 
siderations are involved, and there is no discussion 
of precision or weighting of observations. A con- 
troversy followed, bitter on Legendre’s part, in 
which many persons became involved, and which, 
as Bell explains, “was most unfortunate for the 
future development of mathematics” ([1937] 1956,
p. 331). 

In his 1809 presentation, Gauss regarded the 
measures of precision $h_1, h_2, \ldots$ of the respective
observations involved as known quantities and said 
nothing about how to determine their values in 
practice. F. W. Bessel took the first step in this 
direction: in 1815 he introduced the probable error
as a measure of imprecision (1815, p. 234), which
he later defined (1816, p. 142) as the magnitude, 
r, that an error has an equal chance of exceeding 
(or being less than) in absolute value; and from 
48 determinations of the right ascension of the 
polestar he obtained a value for the probable error 
of such determinations (1815, p. 234) by means 
of the formula $R = 0.8453 \sum_i |Y_i - \bar{Y}|/n$, where
$\bar{Y} = \sum_i Y_i/n$. (The numerical result given conforms to
this formula [cf. Gauss 1816, art. 8], which is not
stated.) The following year, Bessel (1816, pp.
141-142) showed that the probable error for
Gauss’s error distribution, $f(v)$, is given by
$r = 0.476936\,h^{-1} = 0.8453\,\varepsilon = 0.6745\,\sigma$. In this formulation
$\varepsilon = 1/(h\sqrt{\pi})$ represents the mean absolute
error of the distribution and $\sigma = 1/(h\sqrt{2})$ is the
root-mean-square error of the distribution, respectively.
Gauss immediately pointed out (1816,
art. 3) that given $m$ independent errors
$V_1, V_2, \ldots, V_m$ distributed according to his error function,
$f(v)$, then for “m large or small” the “most
probable values” (or, in modern terminology, the
maximum likelihood estimators) of $h$ and $r$ are
$\hat{H} = \sqrt{m/(2\sum_i V_i^2)}$ and $\hat{R} = 0.6744897\sqrt{\sum_i V_i^2/m}$,
respectively. He showed (art. 4) that for large $m$,
$\hat{H} - h$ and $\hat{R} - r$ are distributed approximately as
$f(v)$, with $h_{\hat{H}} = \sqrt{m}/h$ and $h_{\hat{R}} = \sqrt{m}/r$, and gave
(loc. cit.) explicit expressions, in terms of $\hat{H}$ and
$\hat{R}$, for the “probable limits” of (in modern terminology,
50 per cent confidence limits for) “the true
values of h and r.” Next he considered (art. 5) the
estimation of $r$ by $R_k = C_k \sqrt[k]{\sum_i |V_i|^k/m}$, with
$C_k = C_k(m, h)$ chosen so that $r$ is the mean of the
large-sample distribution of $R_k$, and found $R_2$ to be the
most precise, noting (art. 6) that $R_2$ for $m = 100$
is as precise a measure of $r$ as is $R_1$ for $m = 114$,
$R_3$ for $m = 109$, $R_4$ for $m = 133$, $R_5$ for $m = 178$,
and $R_6$ for $m = 251$. Finally, he considered (art. 7)
the estimation of $r$ by the “middlemost” (i.e., the
median) of the absolute errors $|V_i|$ when $m$ is odd
and found that this procedure requires $m = 249$ in
order to achieve precision comparable to $R_2$ for
$m = 100$. Gauss then evaluated (art. 8) $R_1, R_2, R_3$,
and $R_4$ for the 48 determinations considered by
Bessel (1815), taking the residuals, $Y_i - \bar{Y}$, as
measures of the corresponding errors, $V_i$, and concluded
that Bessel had used the formula $R_1$.
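
The numerical constants in the preceding paragraph can be checked directly. The sketch below is an illustration, not Gauss's or Bessel's derivation. It assumes the error law $f(v) = (h/\sqrt{\pi}) \exp(-h^2 v^2)$, for which the probability that an error lies within $\pm r$ is $\mathrm{erf}(hr)$, and it uses a standard large-sample (delta-method) approximation for the variance of the estimators based on the $k$th absolute moment. All function names are hypothetical.

```python
import math


def probable_error_constant():
    """Solve erf(rho) = 1/2 by bisection, so that r = rho / h is the probable error."""
    lo, hi = 0.0, 1.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if math.erf(mid) < 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def ml_estimates(errors):
    """Maximum likelihood estimates (h_hat, r_hat) from errors following f(v)."""
    m = len(errors)
    sum_sq = sum(v * v for v in errors)
    h_hat = math.sqrt(m / (2 * sum_sq))
    return h_hat, probable_error_constant() / h_hat


def abs_moment(k):
    """E|Z|**k for a standard normal variable Z."""
    return 2 ** (k / 2) * math.gamma((k + 1) / 2) / math.sqrt(math.pi)


def relative_variance(k):
    """Large-sample variance of the k-th moment estimator, times m, relative to r**2."""
    mu_k = abs_moment(k)
    mu_2k = abs_moment(2 * k)
    return (mu_2k - mu_k ** 2) / (k ** 2 * mu_k ** 2)


def equivalent_sample_size(k, base=100):
    """Sample size at which the k-th moment estimator matches k = 2 with `base` errors."""
    return round(base * relative_variance(k) / relative_variance(2))
```

Multiplying the constant returned by `probable_error_constant()` by $\sqrt{\pi}$ and by $\sqrt{2}$ gives the factors relating the probable error to the mean absolute error and to the root-mean-square error, and `equivalent_sample_size(k)` for the other values of $k$ from 1 to 6 gives the sample sizes to be compared with $m = 100$ for $k = 2$.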

On February 15, 1821, Gauss presented to the 
Royal Academy of Sciences in Göttingen a com- 
pletely new formulation of the method of least 
squares that was entirely free from dependence on 
any particular probability distribution of errors. 
Entitled “Theoria combinationis observationum 
erroribus minimis obnoxiae. Pars prior” (1821a), 
it stemmed from the earlier work of Laplace: Three
years before Gauss’s birth, Laplace ([1774] 1891,
pp. 41-48) had said that to estimate a parameter $\theta$
one ought to use that function $T = T(Y_1, Y_2, \ldots)$
of the observations, $Y_1, Y_2, \ldots$, for which the
mean (or expected) absolute error of estimation,
$E\{|T - \theta|\}$, is a minimum for the given probability
distribution of errors, and, for the case of independent
identically distributed observations, Laplace
gave an explicit procedure for finding such a function
for estimating the location parameter of their
common distribution, $f(y - \tau)$, when this is completely
specified except for the value of $\tau$. Finally,
Laplace showed that in the case of independent
observations of equal (or unequal) precision,
Gauss’s technique of minimizing the sum of
squared residuals leads to the same estimators as
Laplace’s own procedure for minimizing the mean
absolute error of estimation when and only when
(Laplace [1812] 1820, book 2, chapter 4, sec. 23)
the errors $X_i = Y_i - \tau$ of the respective observations
are distributed in accordance with Gauss’s law of
error,

$$\frac{h_i}{\sqrt{\pi}} \exp(-h_i^2 x_i^2), \qquad i = 1, 2, \ldots.$$

Gauss ([1821a] 1880, art. 6-7) proposed using instead
the mean square error of an observation,
$E\{(Y - \eta)^2\}$, or of an estimator, $E\{(T - \theta)^2\}$,
as a better measure of “uncertainty” (“incertitudo”)
and derived (art. 9) his remarkable inequalities
(see Savage 1961, eq. 9) for the probability
that a random variable $Z$ with a continuous unimodal
probability distribution will differ (positively
or negatively) from its modal value $z_0$ by more
than $\lambda$ times its root-mean-square error measured
from $z_0$. Then he showed (art. 18-23) that when
the mean values $E\{Y_i\} = \eta_i$ of a set of $n$ independent
observations $Y_1, Y_2, \ldots, Y_n$ are linear functions
of $k$ ($< n$) unknown parameters $\theta_j$ ($j = 1, 2, \ldots, k$)
and (in modern terminology) the variances
$E\{(Y_i - \eta_i)^2\} = \sigma_i^2$ of the $Y_i$ are all finite, then the
mssr technique yields estimators $T_j$ of the $\theta_j$ having
minimum mean-square error ($j = 1, 2, \ldots, k$),
whatever the distribution(s) of errors of the observations.
In a second memoir (1823), he extended
the preceding result to estimators $L_r$ of linear
functions $\Lambda_r(\theta_1, \theta_2, \ldots, \theta_k)$ of the $\theta$’s
($r = 1, 2, \ldots, m$; $m \le k$) and showed (art. 37-40) that the
resultant minimum sum of squared residuals is
strictly equivalent to the sum of $n - m$ independent
errors (i.e., has $n - m$ degrees of freedom). These
results, at one time attributed erroneously to A. A.
Markov (for explanation, see Neyman [1938]
1952, p. 228), are derived and discussed by various
recent authors (e.g., Graybill 1961; Scheffé 1959;
Zelen 1962) as the “Gauss-Markov theorem,” and
the method of least squares is thereby endorsed as 
a procedure for obtaining minimum variance linear 
unbiased estimators. 
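
The inequalities of art. 9 mentioned above are not written out in this article. In the form in which they are usually quoted today (an assumption here, not a transcription of the memoir), the probability that $Z$ differs from its mode $z_0$ by more than $\lambda$ times its root-mean-square error about $z_0$ is at most $1 - \lambda/\sqrt{3}$ when $\lambda \le 2/\sqrt{3}$ and at most $4/(9\lambda^2)$ otherwise. The sketch below compares this bound with the exact tail probabilities of two unimodal laws; the function names are hypothetical.

```python
import math


def gauss_bound(lam):
    """Upper bound on P(|Z - z0| > lam * tau) for a unimodal law with mode z0."""
    if lam <= 0:
        raise ValueError("lam must be positive")
    if lam <= 2 / math.sqrt(3):
        return 1 - lam / math.sqrt(3)
    return 4 / (9 * lam * lam)


def normal_tail(lam):
    """Exact P(|Z| > lam) for a standard normal Z (mode 0, root-mean-square error 1)."""
    return math.erfc(lam / math.sqrt(2))


def uniform_tail(lam):
    """Exact P(|Z| > lam * tau) for Z uniform on [-1, 1], where tau = 1/sqrt(3)."""
    return max(0.0, 1 - lam / math.sqrt(3))
```

For the uniform law the bound is attained exactly whenever $\lambda \le 2/\sqrt{3}$, which suggests that the first branch cannot be improved. For larger $\lambda$ the bound $4/(9\lambda^2)$ is smaller than the $1/\lambda^2$ available without the unimodality assumption.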

Gauss definitely preferred his 1821 formulation 
of the method of least squares above all others (for 
his statement to this effect on February 26, 1821, 
see [1821b] 1880, p. 99). In a letter to F. W. Bessel 
(Gauss & Bessel 1880) dated February 28, 1839 
(an excerpt from which is given in his Werke, 
vol. 8, pp. 146-147), he remarked that he had 
never made a public statement of his reasons for 
abandoning the metaphysical approach of his first 
formulation but that a decisive reason was his be- 
lief that maximizing the probability of a zero error 
is less important than minimizing the probability 
of committing large errors. 

In his later years, Gauss took special pride in his 
contributions to the development of the method of 
least squares and, despite a lifelong aversion to 
teaching, gave a course on this subject at the Uni- 
versity of Göttingen each year from 1835 until his 
death. During his lifetime, the method of least 
squares became a basic tool in astronomy and 
geodesy throughout the world and has remained 
one to this day. And when Karl Pearson and G. 
Udny Yule began to develop the mathematical the- 
ory of correlation in the 1890s, they found that 
much of the mathematical machinery that Gauss 
devised for finding “best values” for the parameters 
of empirical formulas by the method of least 
squares was immediately applicable in correlation 
analysis, in spite of the fact that the aims of corre- 
lation analysis are the very antithesis of those of 
the theory of errors. As Galton remarked in his 
Memories of My Life (1908, p. 305), “The primary 
objects of the Gaussian Law of Error were exactly 
opposed, in one sense, to those to which I applied 
them. They were to get rid of, or to provide a just 
allowance for errors. But these errors or deviations 
were the very things I wanted to preserve and to 
know about.” In consequence, Gauss’s contributions 
to the method of least squares embody mathematics 
essential to statistical theory and its applications in 
almost every field of science today. 

Gauss was visited in 1829 by the Belgian
astronomer-physicist-statistician Adolphe Quetelet, who
was engaged in making geomagnetic measurements 
in Holland, Germany, Italy, and Switzerland. Gauss 
had been interested in geomagnetism from around 
the turn of the century and had remarked, in a 
letter to Heinrich Olbers dated March 1, 1803, “I 
believe that this offers a greater field for the appli- 
cation of mathematics than has yet been supposed” 
(Olbers & Gauss, Briefwechsel . . .); his intense
activity in other fields had thus far prevented his
undertaking research in geomagnetism, despite the 
strenuous efforts of Alexander von Humboldt, in 
1804 and again in 1828, to persuade him to do so. 
When Quetelet arrived, he found Gauss studying 
Russian for relaxation: “I have been very fatigued,” 
he said, “from occupying myself with astronomy, 
geodesy, and other subjects that I know fairly well; 
I wanted to turn my attention to a language that I 
did not know at all, and now I am reading Russian” 
(Quetelet 1866, p. 646). Measurement of the inten- 
sity of the earth’s magnetism was new to Gauss, and 
he was eager to know how such measurements 
were made and the precision that could be achieved 
(p. 645). Quetelet set up his apparatus in Gauss’s
yard, and together they conducted a series of experiments,
taking observations simultaneously but
in slightly different manners. The agreement of the 
results obtained astonished Gauss, who exclaimed: 
“But these observations conform to the precision of 
astronomical observations” (p. 646). 

From January 1831 on, geomagnetic measure- 
ments were made regularly at Göttingen. By Febru- 
ary 1832, Gauss was deeply involved in research 
on geomagnetism and had found that he could ex- 
press the intensity of geomagnetism in what he 
called “absolute units,” that is, in terms of units of 
the three fundamental physical quantities: length, 
mass, and time. On December 15, 1832, he presented
his findings to the Royal Academy of Sciences
in Göttingen in a paper entitled “Intensitas
vis magneticae terrestris ad mensuram absolutam 
revocata” (1832), which was promptly recognized 
as one of the most important papers of the century. 

Gauss made electromagnetic measurements for 
the first time in October 1832. By Easter 1833, he 
and his young physicist colleague Wilhelm Weber 
had put into operation an electromagnetic tele- 
graph between Gauss’s observatory and Weber's 
physics laboratory (Dunnington 1955, pp. 147- 
148, 395). They sent only individual words at first, 
and later complete sentences. Plans were drawn up 
in 1835-1836 for its use on the Leipzig-Dresden
railroad but were dropped when the railroad au- 
thorities declared that the wires would have to be 
put underground. A monument showing Gauss and 
Weber discussing their telegraph was erected on 
the campus of the University of Göttingen. 

In 1832 Gauss had begun preparation of his 
“Allgemeine Theorie des Erdmagnetismus” (“Gen- 
eral Theory of Terrestrial Magnetism”); comple- 
tion was delayed by lack of experimental mate- 
rial. At Gauss’s suggestion a magnetic observatory 
(of nonmagnetic construction) was erected at the 
University of Göttingen in 1833. By 1836 Göttingen
had become the principal European center for research
on geomagnetism; and an association of
magnetic observatories, known as the Göttingen 
Magnetic Union, was formed to coordinate simul- 
taneous measurement of geomagnetic phenomena 
throughout Europe. In 1837 Gauss and Weber col- 
laborated in the invention of a galvanometerlike 
device, the bifilar magnetometer, for measuring 
magnetic field intensities; and with it, Gauss veri- 
fied the inverse-square law of magnetic attraction 
to which he had already been led by theory. His 
“Allgemeine Theorie . . .” appeared in 1839 and 
was followed in 1840 by his (with Weber) “Atlas 
des Erdmagnetismus” (1840) and his great treatise 
“Allgemeine Lehrsätze in Beziehung auf die im
verkehrten Verhältnisse . . .” (1840), i.e., what
today we call “potential theory,” which marked
the peak of his work in physics and the close of 
his work on magnetism. Near the end of his life, 
Gauss, like Laplace, Fourier, and Poisson, turned 
his attention to the social sciences and to the help 
that they might derive from the physical sciences. 
In particular, he took an active interest in applica- 
tion of the theory of probability to social laws. 
Thus, in 1847, he corresponded with the Danish 
astronomer Heinrich Christian Schumacher on the 
laws of mortality and on the construction of mor- 
tality tables (Quetelet 1866, pp. 653-655). 


Called “the prince of mathematicians” even in 
his lifetime, Gauss received a steady stream of 
honors (listed chronologically in Dunnington 1955, 
appendix B), beginning with his election in 1802 
as a corresponding member of the Imperial Acad- 
emy of Arts and Sciences in St. Petersburg. He was 
elected a fellow of the Royal Society of London in 
1804 and received the Copley Medal in 1838; he 
became a full member of the Royal Academy of 
Sciences in Berlin in 1810, a fellow of the American
Academy of Arts and Sciences in Boston, Massachusetts,
in 1822, and a member of the American
Philosophical Society in Philadelphia in 1853. In
1842 the highest order conferred by the kingdom
of Prussia, Pour le mérite, was awarded to him. 
On July 16, 1849, exactly 50 years after receipt of 
his doctorate, a Gauss jubilee was held in Göttingen
at which honors were showered upon him,
including honorary citizenship of Brunswick and
Göttingen, which he prized above all the rest.

[For the historical context of Gauss’s work, see the
biographies of LAPLACE; MOIVRE; for discussion of
the subsequent development of his ideas, see ESTIMATION;
LINEAR HYPOTHESES, article on REGRESSION;
MULTIVARIATE ANALYSIS, articles on CORRELATION;
and the biographies of PEARSON; QUETELET; YULE.]

GEDDES, PATRICK


Patrick Geddes (1854-1932), British biologist, 
sociologist, and town planner, was born in Ballater, 
Scotland, and died in Montpellier, France. His 
childhood was spent in Perth. Because he was 
small and frail, Geddes did not go to school until 
he was eight; but through working in the cottage 
garden with his father, a retired professional 
soldier, he acquired his lifelong interest in plants 
and gardens. When he was 15, his father took him
on a walking trip of more than two hundred miles 
through a series of river valleys. This tour sug- 
gested to young Geddes the practice he was to 
introduce later into both British education and 
town planning—the city and regional survey. 
After Geddes graduated, at barely 16, from
the Perth Academy, where he had been a prize-winning
student, his father gave him a free year
to spend both reading and working with a 
local craftsman. During that year, Geddes’ inter- 
ests turned toward the biological sciences; and the 
reading of Thomas Huxley's Lay Sermons led to 
his beginning his career in the study of biology, at 
first under Huxley himself. 

While pursuing zoological studies in Paris, 
Geddes came under the influence of Edmond 
Demolins of the science sociale group, who intro- 
duced him to Frédéric Le Play’s detailed occupa- 
tional and regional studies of famille, travail, 
lieu. These categories, like organism, function, and 
environment in biology, constituted the basic triad 
that Geddes used in his later analyses of society; 
famille was replaced by “folk” or “people.” 

In 1879, on a British Association for the Ad- 
vancement of Science grant, Geddes went to Mexico. 
There his health broke down and he suffered a two- 
month period of blindness. This misfortune had a 
decisive effect on his intellectual life. To overcome
his handicap, he invented a system of graphic notations
by means of paper folded into nine squares.
It was the first of his “thinking machines” and one 
of the earliest applications of graphic methods to 
nonmathematical problems. By the elaboration of 
these graphs, using as many as 36 squares, Geddes 
sought to demonstrate the constant interplay of 
ideas, forces, functions, groups, and institutions, 
all of which are usually treated by the specialized 
sciences as if they were independent and isolated.
His basic graph, in which the central figure of the 
triad represents an activity or a function, and the 
surrounding squares modes of interaction, was an 
early systematization of field theory; and the 
holistic view it presented was much later developed 
independently by J. L. Moreno. 

No longer able to use the microscope, Geddes 
nevertheless produced a steady stream of scientific 
papers and encyclopedia articles during the 1880s; 
he even opened up a new area in biology with his 
lifelong collaborator, J. Arthur Thomson, in their 
pioneer volume on the evolution of sex (1889) 
[see ECOLOGY, article on HUMAN ECOLOGY]. In
that year a special chair in botany was established 
for him at University College, Dundee, a post he 
occupied for three months annually until 1919. 
Meanwhile, his widening social, aesthetic, and
historical explorations turned him toward social
problems. As early as 1882, Geddes introduced the 
new concept of energetics, later developed by 
Wilhelm Ostwald, into the analysis of census sta- 
tistics; in 1884 he applied functional biological 
criteria to the more conventional economic anal- 
yses of the division of labor. 

After his marriage in 1886, Geddes settled in 
Edinburgh and engaged in a series of activities 
that laid the basis for his later career as a town 
planner and as the leading British exponent of 
regionalism. Among these were the founding of 
four badly needed university residence halls, run 
by students; the rehabilitation of sundry sordid 
tenements of the Old Town; and the transforma- 
tion by voluntary labor of neglected patches of 
land into gardens and play spaces. In the 1890s 
he converted the Outlook Tower atop Castle Hill 
into what Charles Zueblin described as “the world’s
first sociological laboratory” (actually a sociological
museum, library, and meeting place). In 1891
he instituted a series of 14 collegiate summer 
meetings at the tower, with scholars like Petr 
Kropotkin and Elisée Reclus as lecturers. In the 
1890s Geddes gathered round him a group of art- 
ists and writers who formed the center of the 
short-lived Celtic renascence. Among them was
the writer of Celtic romances, William Sharp, who 
called himself Fiona Macleod. 

In 1903 Geddes embarked more formally on his 
career as sociologist and planner by assisting his 
colleague Victor Branford in the founding of the 
Sociological Society in London and by entering a 
planning competition held by the Carnegie Trust 
for the civic improvement of Dunfermline. Al- 
though his ambitious designs for Dunfermline 
were rejected, Geddes’ comprehensive report, City 
Development (1904), remains a landmark in city 
planning, for it demonstrates his method of “con- 
servative surgery,” that is, preserving the complex 
historic tissue of a city while boldly introducing 
desirable innovations (see Geddes 1947). 

Geddes’ own survey of Edinburgh, modest in 
scale, established the civic survey as an essential 
first step in town planning, while in a series of city 
exhibitions and sociological lectures, including
numerous extension courses at the University of 
London, he opened up the neglected study of the 
city itself. This rediscovery of the city was his out- 
standing contribution to sociology. Geddes’ leader- 
ship helped pave the way for the British Housing and 
Town Planning Act of 1909 and brought him an 
invitation to put his ideas to work in India. There, 
between 1914 and 1924, he surveyed and planned 
some fifty cities; and from 1919 to 1924 he served 
as professor of civics and sociology at the University
of Bombay. His unpublished plans for the
Hebrew University of Jerusalem and for Tel Aviv 
and his remarkable two-volume report on Indore 
(1918) were also completed between 1914 and 
1925. The report on Indore contains his exhaustive 
critical appraisal of the failings and potentialities 
of the modern university. 

Although Geddes considered his thought a union 
of the traditions of Comte and Le Play, in his effort 
at synthesis he was a continuator of Herbert Spen- 
cer and more remotely of Aristotle. He treated 
sociology as a unifying discipline whose main com- 
ponents are geography, economics, and anthro- 
pology, all taken in their widest human context. 
His aim as a systematic thinker was to break down 
the sterile isolation and impoverished abstraction 
of specialized knowledge, so as to be able to move 
and act freely over the entire range of human ex- 
perience, even that which lay beyond rigorously 
scientific description. He was as fertile as Bentham 
in coining neologisms, and many of his new terms, 
such as “geotechnics,” “biotechnics,” and “conur- 
bation,” have already entered the dictionary, while 
his characterization of specialism, “knowing more
and more about less and less,” has become classic.

The best presentation of Geddes’ method, along 
with his general sociological conspectus, appears 
in Volume 2 of Life (1931, chapters 11-13). These 
pages reveal the detailed wealth of his scholarship 
and first-hand experience, which offsets the some- 
times arbitrary explications of his graphs. Since 
he was essentially an oral teacher, like G. H. Mead, 
his direct influence is best shown in the work of 
his students and colleagues, such as P. Abercrom- 
bie (town planning), V. Branford (sociology), 
H. J. Fleure (geography), and J. Arthur Thomson 
(biology). But more important, the influence of 
his ideas is being felt throughout the world, even 
in cases where their origin has been forgotten. 


Lewis MUMFORD 


[For the historical context of Geddes’ work, see CITY, 
article on FORMS AND FUNCTIONS; and the biog- 
raphies of COMTE; LE PLAY; SPENCER. For discussion 
of the subsequent development of his ideas, see 
HOUSING, article on SOCIAL ASPECTS; REGION; and
the biography of FLEURE.] 

Theodor Geiger (1891-1952) was a German-born
sociologist whose chief interests were the
fields of general social theory, methodology, social 
stratification and mobility, the sociology of law, 
and ideology. Even in Germany, Geiger’s work is
not widely known, although his writings on social 
stratification and mobility and on the sociology of 
law possess current significance. There is practi- 
cally no secondary literature on Geiger. It was not 
until some years after his death that his influence 
was felt at all beyond the confines of Germany and 
Scandinavia. 

Geiger’s life mirrors German history in the 
twentieth century. His numerous publications—a 
bibliography of his works contains some 160 titles 
—constitute a courageous and nonideological dis- 
cussion of the social and methodological problems 
of his time. He is noteworthy as an early critic of 
Nazism. 

Geiger was the son of a teacher in a Munich
Gymnasium; he studied law at the University of
Munich and at Würzburg, where he received his
doctorate in law, and volunteered for military
service in World War I. After the war, until 1932,
he was a member of the German Social Democratic 
party. During this time he began work on some 
problems relevant to the sociology of law. For some 
years he was active as a tutor, translator, and 
writer. He was fluent in the Scandinavian lan- 
guages and also in Finnish. From 1920 on, he lived 
in Berlin, collaborating in the publication of the 
periodical Die Fremde Presse and working in- 
tensively in the field of adult education. He taught 
at the Volkshochschule Gross-Berlin and had be- 
come the principal of that school by the time he 
left, in 1928, to occupy the chair of sociology at 
the Brunswick Institute of Technology. While he 
was in Berlin he also worked at the National Sta- 
tistical Bureau. 

In 1933 Geiger emigrated to Denmark. For some 
time his studies in Copenhagen were supported by 
the Rockefeller Foundation. In 1938 he became 
professor of sociology at Aarhus. With the German 
occupation of Denmark, it became desirable for 
Geiger to leave, and in 1943 he escaped to Sweden. 
There he was stimulated by contact with the jurists 
and social scientists of the “Uppsala school.” In 
1945 he returned to Aarhus and founded the first 
institute of sociological research in Scandinavia. 
In collaboration with T. T. Segerstedt (of Uppsala), 
V. Verkko (of Helsinki), and J. Vogt (of Oslo), 
Geiger founded a series of publications, Nordiske 
studier i sociologi, Volume 1 of which was pub- 
lished in Copenhagen in 1948. He was also busy 
with several large-scale empirical studies on strati- 
fication and mobility and with developing his ideas 
on the sociology of law and the critique of ideology 
—ideas dating back to the stimulation he had re- 
ceived in Sweden. He was one of the founders of 
the International Sociological Association (ISA) 
and, together with D. Glass, was a leading spirit 
in the First International Working Conference on 
Social Stratification and Social Mobility. In 1951 
he went to the University of Toronto as visiting 
professor; on the return voyage he died suddenly 
on shipboard.

General social theory. Geiger made important 
theoretical contributions to the study of social 
groups and to the refinement of Tönnies’ categories
of Gemeinschaft and Gesellschaft. He was
also concerned with the role of secondary groups 
in modern society, notably in two articles, one, 
“Die Gruppe und die Kategorien Gemeinschaft und 
Gesellschaft,” published in 1927 and the other, 
“Die Legende von der Massengesellschaft,” published
in 1951. After briefly outlining a sociological
theory (which he later partly retracted) in Die 
Gestalten der Gesellung (1928), in the 1930s he 
wrote his powerful introductory text Sociologi 
(1939), at the time a unique work in Scandinavia 
and for several decades an important textbook. 
Subsequently, he shifted entirely from the con- 
struction of general theory to the study of specific 
problems. 

Methodology. In the 1920s Geiger was one of 
the few European sociologists doing empirical re- 
search. One of his first studies was an evaluation 
of illegitimacy statistics (1920). His book Die 
soziale Schichtung des deutschen Volkes (1932) 
and several articles, on employees (1933a), on the
self-employed (1933b), and on problems of adult
education, also have an empirical foundation.
After these early works Geiger, influenced by Scan- 
dinavian thought and by American sociology, 
sought to refine the methods of empirical social re- 
search. He developed his own methods of inquiry 
and presentation, notably in his studies of Danish 
intellectuals and of changes in social stratification 
in Aarhus. His theoretical considerations of metho- 
dology (e.g., several articles written in 1948 and 
1949: see Arbeiten zur Soziologie) also raised prob- 
lems of the philosophy of science and of knowledge. 

Stratification and mobility. In his studies of 
social stratification, Geiger initially accepted the
Marxist view that German society had a clear-cut 
class structure, but he departed increasingly from 
this conception. Under the influence of American 
sociology and of his own empirical data, he wrote, 
while he was in Sweden, Klassesamfundet i støbe- 
gryden (“Class Society in the Melting Pot,” 1948). 
By means of large-scale research he developed a 
dynamic analysis of social mobility and a typology 
of social fluctuation, together with a multidimensional
model of stratification. A brief article by
Geiger on his theory of social stratification was
published in the Wörterbuch der Soziologie, edited
by W. Bernsdorf and F. Bülow (1955).

Sociology of law and ideology. Geiger was 
early concerned with the sociology of law, and his 
association with the Uppsala school intensified this 
interest (1946a; 1946b; 1947). In several publications
he took issue with the views of S. Ranulf, A.
Ross, and K. Illum. Geiger linked the sociology of 
law with the analysis of ideology, discussing juris- 
tic concepts—norms, sources of law, consciousness 
of law, etc.—as phenomena of social reality. He 
examined the general nature of civil order and 
broke it down into such partial orders as habit, 
custom, usage, and law. Although this analysis is
based on Continental legal conceptions, it constitutes
a theory of social control that has bearing,
also, on non-European social structures. Indeed, the
Vorstudien (1947) are, with the fundamental 
works of G. Gurvitch and N. S. Timasheff, among 
the most important contributions to the sociology 
of law. 

Geiger’s conception of ideology has frequently 
been misinterpreted. Ideology, to him, is a concept 
in the theory of knowledge: Ideologie ist theoretisch 
gemeintes A-Theoretisches (“Ideology is the atheo- 
retical taken theoretically”). He was led to this 
approach by the Uppsala school, especially by the 
work of A. Hägerström, although he criticized 
Hägerström and by the same token departed from 
the view of Karl Mannheim. Geiger’s “value nihil- 
ism” must be understood in terms of a theory of 
knowledge: his value nihilism permits primary value 
judgments, but not their conversion into theories. 
In his last work, Demokratie ohne Dogma (1960), 
he made a plea for “intellectual humanism,” the 
“enlightenment of the masses,” the “democratiza- 
tion of reason,” the “asceticism of emotion,” and 
“abstinence from value judgment.” Yet he was a 
pragmatist: he considered his last work to be his 
political testament and hoped that his ideas would 
be put into practice. 

Paul TRAPPE


[See also COMMUNITY-SOCIETY CONTINUA; IDEOLOGY;
ILLEGITIMACY; LAW; SOCIAL CONTROL; SOCIAL MOBILITY;
STRATIFICATION, SOCIAL; and the biographies
of MANNHEIM and TÖNNIES.]

GENERAL WILL


The concept of the general will involves the 
moral values and the political aspirations that are 
shared by the members of a community and to 
which the policies of its government must broadly 
conform if that government is to be considered 
legitimate. The term was used in this minimal 
sense by Jean Jacques Rousseau, its originator, 
who also used it, more importantly, to describe the 
will to justice that would characterize his ideal 
democracy and achieve its authoritative expression 
in legislative decisions [see ROUSSEAU]. 

Since Rousseau, different and precise meanings, 
depending upon the political theory in which they 
are embedded, have been attached to the term; 
these have had as their primary and common aim 
its adaptation to the analysis of politics and na- 
tional character. Typically, some theory of the 
general will has been used to explain, to justify, 
and to prescribe for the institutions of constitu- 
tional and liberal democracy, especially by thinkers 
within or influenced by the British idealist school 
of political thought. Often, existence of a general 
will is made the cardinal criterion of community 
and is seen as the essential prerequisite to political 
stability and self-government. 

For Rousseau, the “general will” was the con- 
cept by means of which he summarized his theory 
of political obligation and displayed its logical de- 
pendence upon the psychological, ethical, and in- 
stitutional components of his political philosophy. 
In Rousseau’s ideal society, as presented in The 
Social Contract, the natural right to moral freedom, 
to live in accordance with the dictates of one’s own 
conscience and sense of morality, is psychologi- 
cally and institutionally reconciled with the social 
necessity for political authority, because there law 
is the reflection of the individual’s desire for jus- 
tice. Laws that express the general will are accept- 
able both to reason and to conscience and thus 
are held not so much to restrict freedom as to 
enlarge and to sustain it. 

Rousseau’s general will implies that neither a 
society that lacks a general will nor a government 
that disregards it can have rightful authority over 
the individual. Failure to establish the institutions 
essential to the creation of the general will inevi- 
tably means moral distortion of human personality 
and frustration of man’s capacity for natural goodness.
According to Rousseau’s view of human
dynamics and moral development, only in a small
and equalitarian society can man become an ethi- 
cal being for whom the realization of justice and 
its integration with the claim to moral freedom 
are paramount and compelling objectives—and 
then only if men participate directly and steadily 
in the making of the laws to which they will owe 
obedience. 

The social and institutional requisites of the 
general will as conceived of by Rousseau would 
appear to preclude its use for the purpose of legiti- 
mating constitutional democracy on the scale of 
the national state. The fundamentally moral and 
synthetic nature of the concept, however, opens 
the way for its modification, inspired by the hope 
of adapting it to the justification of representative 
government. This process of reinterpretation began 
with Kant, who derived from Rousseau’s general 
will his conception of the categorical imperative, 
which served for him as the supreme criterion of 
both morality and legality. In contrast to Kant’s 
primarily ethical elucidation of the concept, Hegel’s 
political interpretation of it takes the form of his 
metaphysical and historical conception of reason. 
The Hegelian conception of rationality as cumula- 
tive may be regarded as a historicizing of the gen- 
eral will. The result is that the reconciliation of the 
right to moral freedom with the demands of social 
justice is obtained not in acts of legislation but, 
rather, in reflective acceptance of and willing ad- 
herence to the social and political arrangements 
that have emerged historically in the shape of the 
national and constitutional state. 

The British idealist T. H. Green spoke of a gen- 
eral will with reference to the hopes and aspira- 
tions of a people, on which government depends 
and to which it should be both responsive and 
facilitative (1882). The most elaborate effort to 
employ the idea of a general will in the rationale 
of liberal constitutionalism was made by Bernard 
Bosanquet (1899). He conceived of the state as a 
concrete universal, a dynamic and rationally ar- 
ticulated totality, the fulfillment of whose require- 
ments may be understood in terms of “will,” which 
is the moving system of interlocking attitudes and 
functions that constitutes a politically organized 
and sovereign society. Here the general will be- 
comes more an attribute of the polity than a moral 
characteristic of individuality and, as such, gives 
both direction and significance to the activities of 
individuals, who are self-governing insofar as they 
sense and respond appropriately to the intimations 
of their society [see BOSANQUET and GREEN]. 

Less like Hegel and more like Rousseau in tone 
is the theory of the “neighborhood group” advocated
by Mary Parker Follett (1918). She saw the
neighborhood as the necessary source of what 
Rousseau would have recognized as the general 
will, and she urged the recasting of democratic 
institutions so as to make this social unit an in- 
tegrating force of moral and political importance. 
Her ideas have found effective application in ad- 
ministration and urban planning [see FOLLETT]. 

Still other interpretations of the general will 
seek to locate it in the nature of man in society, in 
the psychological and social foundations of polit- 
ical authority, rather than directly in political 
agreement and legislative performance. These in- 
clude W. Ernest Hocking’s “will to power” (1926), 
which requires the state as its vehicle, and Robert 
MacIver’s redesignation of the general will as the
“will for the state” (1926), which derives social 
unity and political authority from a common root 
in individual and group freedoms. In the field of 
jurisprudence, Hugo Krabbe’s assertion of the 
“sense of right” of a community as the criterion of 
the validity of law (1915) is an attempt to con- 
vert the general will into a dynamic type of natural 
law. 

More complex reformulations of the general 
will, which are more directly in the spirit if not 
the letter of Rousseau’s political theory, are pre- 
sented in the works of Lindsay (1943) and Barker 
(1951). Both regard discussion as the process dis- 
tinctive to democratic society and government by 
which, fittingly differentiated and articulated, the 
general will may be generated and expressed. In 
this perspective, the general will is the formative 
conception in the theory of the deliberative state 
as set forth in Frederick Watkins’ analysis of liber- 
alism (1948) and in J. Roland Pennock’s exposi- 
tion of the principles of liberal democracy (1950). 
In a deliberative state, political participation not 
only is essential to social unity but should also be 
active and substantive, without excessive depend- 
ence upon either leadership or disciplined and 
programmatic parties. Deliberative democracy is the 
type of democracy most closely in accord with 
Rousseau’s ideal. 

Viewed in the light of its origin and develop- 
ment, the general will does not have any single 
meaning or accepted role in political theory. There 
is, however, an agreed core of meaning and im- 
plication. (1) As a legitimating idea the concept 
directs attention to the criterion of popular con- 
sent, expressed through the methods of representa- 
tive and responsible government; to the desirability 
of a diversity of forms of participation and access; 
and to justice and freedom as the proper ends of 
the state. (2) As an analytic concept the general 
will suggests consideration of those conditions of
social unity and common purpose sufficiently 
strong to permit the establishment, acceptance, 
and control of political authority. (3) From a diag- 
nostic standpoint the concept indicates that a so- 
ciety lacking in moral and political unity is un- 
likely to be capable of self-government and is 
liable, therefore, to an imposition of coherence and 
direction by authoritarian techniques and ideolo- 
gies. (4) Prescriptively, the general will continues 
to influence the construction of democratic theory 
in a liberal, as opposed to a majoritarian, mode 
and to guide the design of political, urban, and 
administrative democratic institutions in ways con- 
sistent with the meaning Rousseau gave it. 
Owing to its fundamentally moral nature, the 
concept continues to inspire investigation and ex- 
plication of the ethical purposes of political society 
and political activity. But in keeping with its com- 
posite nature, future research may be conducted 
in a number of directions: historical investigation 
of the development of political culture; psycho- 
logical investigation of the formation of moral and 
political attitudes, especially those basic to per- 
sonal independence and resilience; specification 
and interpretation of the principles of justice; and 
explanation of the social and political processes 
integral to liberal constitutionalism in both more 
mature and less mature industrializing societies. 
Above all, the concept of the general will invites 
attention to the interdependence of psychological 
processes, moral character, and political insti- 
tutions. 
JOHN W. CHAPMAN


[See also AUTHORITY; DEMOCRACY; LEGITIMACY; LIBERALISM;
PUBLIC INTEREST; REPRESENTATION; SOCIAL CONTRACT; STATE.]
GENERATIONS


I. THE CONCEPT 
II. POLITICAL GENERATIONS


Julián Marías 
Marvin Rintala 


I 
THE CONCEPT 


Ever since antiquity, the concept of generation 
has been held in a biological, and consequently in 
a genealogical, sense of regular descent of a group 
of organisms from a progenitor. But since the early 
nineteenth century there has been developed a social 
and historical concept of generations as comprising 
the structure not only of societies but also of his- 
tory itself. Nevertheless, attempts to formulate a 
sociological theory of generations in the biological 
sense of kinship descent have been unproductive 
because the temporal continuity of births makes 


impossible any determination of social generations 
so long as “generation” is understood in a purely 
biological sense. It is therefore necessary to arrive 
at a strictly social and historical interpretation of 
the generation concept in order for it to acquire 
relevance in the field of the social sciences. 

History of the generation concept. Auguste 
Comte, the founder of modern sociology, considered 
the duration of human life a decisive element in 
determining the velocity of human evolution and 
therefore the passing of one generation to another, 
the term of full activity for man being thirty years 
(1830-1842, pp. 635-639 in part 4 of the 1839 
edition). Comte did not deal with the phenomena 
of individual or simply familial life but with social 
life based on “the unanimous adhesion to certain 
fundamental notions” (1851-1854, part 4, p. 679).

John Stuart Mill obtained the concept of social 
generations from Comte and added further refine- 
ments of considerable interest. He argued that in 
each successive age the “principal phenomena” of 
society are different, and that the interval which 
marks these changes most clearly is the generation 
—that is, the period of time in which a “new set” 
of individuals reaches maturity and takes posses- 
sion of society. He believed that each social state 
is generated not only by the preceding one but also 
by the whole previous history of humanity; one of 
the key concepts in understanding this process is 
that of the generation. “History accordingly does, 
when judiciously examined, afford Empirical Laws 
of Society” (Mill [1843] 1961, p. 598). 

After this philosophical beginning of the theory 
of generations, the theory received further develop- 
ment at the hands of statisticians and historians. 
Antoine Augustin Cournot, the French economist 
and mathematician, first explicated the fact that 
epochs succeed themselves in continuity and only 
historical events give evidence of the articulation 
of generations: “Through education, each genera- 
tion transmits to the one immediately following, a 
certain groundwork of ideas; and while this act of 
education or transmission is in operation, the edu- 
cating generation is still present; unexpected, more- 
over, is the influence of the survivors of a preceding 
generation who have not ceased taking a notable 
part in the government of the society, on the move- 
ment of ideas and affairs” (1872, vol. 1, chapter 8). 

Giuseppe Ferrari, Italian historian, politician, 
and editor of Vico, limited himself to the examina- 
tion of political history, in which he believed he 
had discovered that the scene changes every thirty 
years and that the generation is the decisive ele- 
ment. Starting from this basic premise, he formu- 
lated laws of political succession: “Generations 
behave according to these principles, and are in 


turn preparatory, revolutionary, reactionary, and 
conciliatory” (1874, p. 182). 

The German historian Wilhelm Dilthey also 
found the idea of generation useful for studying 
the culture of an epoch, and he applied it in many 
of his writings. In 1875 Dilthey arrived at the 
notion that the generation is at once a space of 
time, an internal metrical concept of human life of 
about thirty years’ duration, and a contemporary 
relation of individuals to each other. 


The relationship between individuals denoted by the 
term generation is therefore one of simultaneity. We 
say that certain people belong to “the same genera- 
tion” when they have, in a certain sense, grown up 
together, passed through childhood and youth at about 
the same time, and enjoyed their period of maturity 
during more or less the same years. It follows, then, 
that such people are bound together in another, deeper 
relationship: they also constitute “the same generation” 
because, in their impressionable years, they have been 
subject to the same leading influences. ([1875] 1924,
vol. 5, p. 37)


The historian Leopold von Ranke and, to a much 
greater extent, his disciple Ottokar Lorenz put the 
theme of generations into a historical context from 
the point of view of periodization. Ranke took the 
concept of generation in the usual sense of the 
word. Lorenz took his point of departure from 
Ranke and from the French psychologist Théodule
Ribot, whose studies of heredity were just begin- 
ning to appear; he put forward the conception of a 
scientific history founded on the study of heredity 
and genealogy. Lorenz was especially interested in 
the century as a method of dividing history into 
periods; thus, he defined a generation as the sum 
of all those who were employed within a given 
cultural area during one-third of a century. 

In the twentieth century an attempt was begun 
to study history in terms of generations; thus, new 
refinements were gradually added to the theory. 
The principal studies are those of François Mentré 
(1920), Karl Mannheim (1928), Engelbert Drerup 
(1933), Pedro Laín Entralgo (1945), and Julián 
Marías (1949a; 1955). Attempts were also made 
to apply the doctrine of generations to particular 
themes: for example, art (Pinder 1926) and liter- 
ature (Petersen 1930; Peyre 1948). But the most 
extensive general contribution, based on a philo- 
sophical theory of social life, was made by José 
Ortega y Gasset. His student Julián Marías has 
continued to write in this tradition. 


The reality of generations 

In varying degrees, the studies of generations 
have left in obscurity the questions of what gen- 
erations are, why they exist, how long they last, 
what their scope is, and how they are determined.
Some attempts have clarified some of these aspects, 
but in general without sufficient justification. Ortega 
y Gasset’s theory of human life and, more con- 
cretely, of historical and social life has the merit 
of dealing with all of these questions. According 
to Ortega, all purely biological (and therefore genealogical)
consideration of human life is insufficient,
since human life does not consist in its psycho- 
physical structures alone, but in what man does 
with them. Human life is a drama with character, 
plot, and scenery—the world; and this world is 
primarily a mass of social interpretations of reality: 
beliefs, ideas, customs, estimations, etc. These have 
a life independent of our individual wills; like laws, 
they stand in force, and we cannot avoid meeting 
them and having to deal with them. Ortega called 
them vigencias (states of being in force), giving 
social value to a juridical term. The world thus is 
a system of vigencias, or reigning norms, that both 
permits and compels man to orient himself and 
live his life (Ferrater Mora [1957] 1963, pp. 46-65;
Marías 1949b).

The world system at all times presents the aspect 
of a determined historical level in which historical- 
social generations appear: 


The changes in vital sensibility which are decisive in 
history, appear under the form of the generation. A
generation is not a handful of outstanding men, nor 
simply a mass of men; it resembles a new integration 
of the social body, with its select minority and its gross 
multitude, launched upon the orbit of existence with 
a pre-established vital trajectory. The generation is a 
dynamic compromise between mass and individual, 
and is the most important conception in history. It is, 
so to speak, the pivot responsible for the movements of 
historical evolution. (Ortega [1923] 1961, pp. 14-15) 


Thus, a generation is a human variation; every 
generation manifests a certain vital attitude. Gen- 
erations are born one after the other, each encoun- 
tering the forms of the previous one. There are 
cumulative epochs and others that are eliminatory 
and polemic. 

A distinction must also be made between the 
contemporary (those who live in the same time) 
and the coeval (those who are the same age and 
are, by turns, young, mature, and old together) 
who constitute a generation (Ortega [1933] 1962, 
pp. 42-43). Ortega was, of course, well aware of 
the criticism that, since men are born every day, 
and since only those born on the same day are of 
the same age, the concept of generation is an illusion.
But he rejected this kind of mathematical
exactitude in dealing with the stuff of human life: 
“Within the human trajectory of life, age is a cer- 
tain way of living. . . . Age, then, is not a date, but 
a ‘zone of dates’; and it is not only those born in
the same year who are the same age in life and in 
history, but those who are born within a zone of 
dates” (ibid., p. 47). 

Having defined the concept of generation, Ortega 
went on to consider the problem of applying it to 
history. Here he made use of the traditional division 
of human life into five “ages”—childhood, youth, 
“initiation” (or early manhood), “dominance” (or 
maturity), and old age. If the complete life-span is 
set at 75 years, the “ages” or generations of the 
life-span can be regarded as equal periods of 15 
years each. Ortega pointed out that, since history 
is made largely by men between the ages of 30 and 
60, it is the third and fourth generations that are 
of crucial interest to historians at any one time. 
But, from the point of view of individual develop- 
ment, these two generations are very different in 
character: “from thirty to forty-five is the period of 
gestation, or creation and conflict; from forty-five 
to sixty, the stage of dominance and command” 
(ibid., p. 59). Because of their very great differ- 
ence in outlook, there is always potential or actual 
conflict between these two generations, and herein 
Ortega detected the locus of historical change. 

There remains the problem of finding a starting 
point for the generational series: why should any 
one division of history into 15-year periods be more 
valid than another? Ortega thought he had solved 
this problem by introducing the notion of the 
“decisive generation”; he argued that since it was 
already clear, from independent historical consider- 
ations, which were the crucial years in which a 
new era came into being, one had only to seek out 
the individuals who were the most decisive innova- 
tors of that period and note when they entered early 
manhood. Thus, if we take the Middle Ages as 
lasting until about 1350, and the modern age as 
beginning at some time between 1600 and 1650, 
the late fourteenth and all of the fifteenth and 
sixteenth centuries being a period of transition, the 
problem, as Ortega saw it, was to find the individ- 
ual who most decisively represented the emergence 
of the modern age. In the present case, Ortega 
decided that that individual was Descartes, who 
ended his thirtieth year in 1626; thus the key date 
of the previous generation was 1611, and that of 
the generation following Descartes was 1641. Not 
that these dates were hard-and-fast divisions; Or- 
tega intended them only as major reference points: 
“the center of the zone of dates which corresponds 
to the decisive generation” (Ortega [1933] 1962, 
p. 63). Nor did all generations have their great 
men, although there was a strong presumption that 
what Ortega called the “tone” of history changed 
more or less decisively every 15 years.
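
The arithmetic behind this schema can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `age_band` and `key_dates` are hypothetical, and treating each "age" as a half-open 15-year band (so that age 30 already counts as "initiation") is an interpretive choice, not something Ortega specifies. The sketch takes the 75-year life-span and the anchor year 1626 from the passage above.

```python
# Illustrative sketch only: the function names and the half-open age bands
# are assumptions made for this example, not part of Ortega's text.

GENERATION_SPAN = 15  # years per "age": a 75-year life-span divided into five
AGES = ["childhood", "youth", "initiation", "dominance", "old age"]


def age_band(age: int) -> str:
    """Return the name of the 15-year "age" that contains a given age in years."""
    if not 0 <= age < GENERATION_SPAN * len(AGES):
        raise ValueError("age outside the assumed 75-year life-span")
    return AGES[age // GENERATION_SPAN]


def key_dates(anchor_year: int, before: int, after: int) -> list[int]:
    """List key dates spaced 15 years apart around an anchor year."""
    return [anchor_year + k * GENERATION_SPAN for k in range(-before, after + 1)]


if __name__ == "__main__":
    # Descartes' key date and the generations immediately before and after it.
    print(key_dates(1626, 1, 1))
    # The two "ages" that, on Ortega's account, make most of history.
    print(age_band(30), age_band(45))
```

Anchored on 1626, the series reproduces the neighboring key dates 1611 and 1641 given in the text; as Ortega stressed, such dates mark only the center of a "zone of dates," not hard boundaries.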


Ortega did not pretend to have solved the prob- 
lem of generations in any final sense; what he 
offered the historian was a series of challenges, of 
demands to rethink his conception of the past in 
the light of the generational approach. Indeed, 
Ortega was not engaged in the writing of history 
as such; his main interest, so far as history was 
concerned, was in isolating and defining those peri- 
ods of crisis and decisive social change which 
redefine the entire meaning of human existence.
Like Wilhelm Dilthey, he continually stressed the 
importance of studying the meaning of historical 
events for those who took part in them. His gen- 
erational schema was thus an indispensable means 
of detecting regularities in the seemingly endless 
variation of human experience. 


Dynamics of generations 


Most of Ortega’s writings were issued as public 
lectures or newspaper articles, in which forms he 
excelled. This gives his work great cogency of style 
but sometimes deters the specialist. A certain 
amount of systematic elaboration and clarification 
of his ideas is therefore necessary in order to 
demonstrate their usefulness to social science and 
to provide further developments of his method. 

The macrostructure of history—epochs—can 
not be determined a priori or cyclically, because 
it is dependent not on any constant structural 
factor of human life, but on its empirical contents. 
The microstructure—generations—is based on the 
mechanism of ages that is constant within certain 
extensive historical limits. Generations are, by 
turns, the “actors” and the “acts” of the historical 
drama, the “who” and the “what” of history whose 
movement is not continual but gradual. The gen- 
eration is the elemental historical present, the term 
of relative stability of a world figure, and the 
rhythm of historical variation. The empirical de- 
termination of generations within a society is diffi- 
cult and requires investigations that are rigorous 
and extremely cautious. Only the reigning norms of 
a society and their variation can be analyzed in 
detail. If a certain number of representative figures 
are taken whose origins fall within the same 15- 
year period (though the boundaries of the genera- 
tions in question are not known, nor are the 
generations to which these figures belong), we can 
at least take each individual as representative of 
his own generation and study its characteristics in 
him. If other persons born in subsequent 15-year 
periods are added to the series, we can continue to 
assume that each group belongs to the same gen- 
eration. But if, at a certain moment, a change in 
the whole series is observed, a “boundary” between 
two generations is indicated, and an entire gen- 


erational series can then be established, at least in 
a provisional manner. This hypothetical series can 
be applied like a framework to historical reality, 
which will then confirm or correct it. Thus the pro- 
posed scale of generations serves as a provisional 
“working hypothesis”; its value is methodological. 
Generations are not small groups of illustrious 
men; the latter are only the representative men of 
a generation that comprises innumerable anony- 
mous men born within a certain “zone of dates.” If 
a society is studied from the point of view of genera- 
tions, it appears to be joined in groups or strata of 
coevals, each of whom occupies a definite position 
by virtue of his experiences, his pretensions, and, 
finally, his social level, since stratification is universal.
The generational perspective introduces discontinuity and
articulation in place of an amorphous and confused whole.
From a historical point
of view—inseparable from sociology because soci- 
ety is intrinsically historical—a date “unfolds” itself 
in several dates that correspond to the different 
generations. In each date there are four major 
human strata or generations, coexistents in inter- 
action, with precise and unsubstitutable functions: 
(a) the “survivors” of a previous epoch who indi- 
cate the origins of the present situation, that is, 
the men “of another time” who nevertheless remain 
in this one; (b) those in power in all areas, whose 
pretensions generally coincide with the actual state 
of the world; (c) the “opposition” or active genera- 
tion that has not yet triumphed and fights with the 
previous generation for the transfer of power and 
the realization of its own innovations; and (d) the 
young, who have new pretensions and look forward 
to a “putsch” or downfall of the status quo. A his- 
torical epoch is therefore defined by a principle or 
form of life that differentiates it from the previous 
one and affects the totality of a society. It is a 
process by which an innovation that begins by 
being individual goes on to permeate a minority 
and finally becomes dominant throughout the en- 
tire society, so that it is the form that individuals 
encounter as the prevailing way of life. Such a 
process requires the intervention of at least four 
successive generations, or about sixty years. 
Empirical determination of generational series 
requires considerable work that has not as yet been 
accomplished anywhere. It is advisable to take a 
clearly defined society (a nation, for example) and 
a not too large space of time, since the normal 
concatenation of generations can be disturbed by 
serious historical upheavals. Between two different 
societies there normally is a time lag in the dates 
of generations. But the fact that different societies 
often have certain key dates in common is an in- 
dication that the same customs, manners, ideas, 
beliefs, etc. dominate over a certain historical area;
in other words, that this constitutes one society, at 
least on a certain level. For example, it is very 
probable that the generations of western Europe 
have coincided since the eighteenth century and, in our time,
perhaps throughout the West. It is certain, on the 
other hand, that the key dates in the United States 
are different from those in India or China. 

The method of generations would permit a “social 
cartography” to be arrived at, the difficulties of 
which would be great but no greater than the re- 
quirements for any other empirical investigation of 
social phenomena. It should be noticed that his- 
torical-social generations affect all the men of a 
society in all the dimensions of their life; properly 
speaking, there is no such thing as (for example) 
a purely “literary” or “political” generation. 

Particular fields can, of course, be investigated 
to obtain a greater facility of comparison or to pur- 
sue a particular interest; it must be borne in mind, 
however, that the results obtained cannot be ex- 
tended to the whole society but must be related to 
a general theory of generations. 

JULIÁN MARÍAS 


[Directly related is the entry COHORT ANALYSIS. Other
relevant material may be found in HISTORY, article
on THE PHILOSOPHY OF HISTORY; PERIODIZATION;
and in the biographies of COMTE; COURNOT; DILTHEY;
MANNHEIM; MILL; ORTEGA Y GASSET; RANKE; VICO.]
II
POLITICAL GENERATIONS 


Group consciousness is universally recognized as 
one of the fundamental elements of political mo- 
tivation. Individual human beings in every political 
system seek the security provided by membership 
in groups. Group consciousness is created by cer- 
tain basic similarities, and this consciousness in 
turn creates more homogeneity within the group, 
a conformity which often leads to common action. 
Individuals think of themselves as members of a 
group and therefore act as members of that group. 
Since the interests of different groups are not al- 
ways mutually compatible, there is social conflict. 
Since all individuals have allegiances to more than 
one group, there is conflict within the individual, 
who must decide which group is most important to 
him in given circumstances. In European politics, 
for instance, there has often been a conflict be- 
tween national consciousness and class conscious- 
ness, which has generally been resolved in favor 
of the former. 

Although political scientists have studied in- 
tensively both national consciousness and class 
consciousness, they have not explored to any con- 
siderable extent another kind of group conscious- 
ness, that of belonging to a distinct generation. 
This omission is surprising, for novelists, cultural 
and political historians, and sociologists have all 
used the concept of generation with considerable 
success. Although Turgenev’s Fathers and Sons is
the outstanding example of a literary work in 
which different generations cannot communicate 
effectively with each other, generational conflict is 
one of the major themes running through most of 
world literature. This fact is perhaps responsible 
for the emphasis which cultural historians have 
given to the study of different literary, artistic, and 
musical generations. Sometimes, indeed, cultural 
historians have seen only the successive alterna- 
tion of “romantic” and “classical” generations. His- 
torians, among them Ranke, have speculated 
about the significance of a generations approach 
to political history. Sociologists, especially Karl 
Mannheim (1928) and Rudolf Heberle (1951), 
have emphasized the importance of generational 
differences in social movements and social change. 
Political scientists, however, have used political 
generation as a conceptual tool only in the study 


of modern totalitarianism. Sigmund Neumann 
(1939; 1942) convincingly stressed the genera- 
tional consciousness of National Socialist leaders 
and followers. Students of Soviet politics have 
found basic differences, not attributable merely to 
life cycle differences, between the responses of 
younger and older generations to the enforcement 
of conformity within the Soviet Union. 

It could be argued that the concept of political 
generation can be of major assistance in under- 
standing the motivation of leaders and followers 
in all political systems, not merely those which are 
totalitarian. Much of the hesitance of political 
scientists in applying this concept undoubtedly has 
arisen from uncertainty as to the precise meaning 
of a generation in politics. A political generation is 
not to be equated with a biological generation. Po- 
litical generations do not suddenly “change” every 
30 or 35 years. The process of social change is con- 
tinuous, and it cannot wait until political power is 
handed on from father to son. There is undoubtedly 
much personal conflict between fathers and sons, 
but most of this conflict has no direct political sig- 
nificance. The history of politics is not primarily 
that of conflict, or even of consensus, between 
fathers and sons—although such conflict may take 
place, as it did between James and John Stuart 
Mill, with important political consequences. There 
are always—assuming a minimal rate of social 
change—more than two generations in politics at 
any one time. The idea of a “young” generation and 
an “old” generation as the sole participants in the 
political process is as oversimplified as Karl Marx’s 
assumption that there are only two classes in mod- 
ern industrial society. The idea of two generations 
in politics is frequently linked with the erroneous 
assumption that the “young” generation is always 
liberal and the “old” generation is always con- 
servative. 

Implicit in a generations approach to politics is 
the assumption that an individual's political atti- 
tudes do not undergo substantial change during 
the course of his adult lifetime. Once a set of polit- 
ical beliefs has been embraced, it is regarded as 
unlikely that the individual will abandon his be- 
liefs. Rather than altering his previous outlook on 
the basis of new facts, the individual either rejects 
or accepts these new facts depending on whether 
or not they are consistent with his previous outlook. 
In this sense, a “liberal” generation will remain 
“liberal” in terms of the formative years throughout 
the physical lifetime of its members. Whether this 
same political attitude will appear liberal under 
radically changed circumstances is another matter. 
Much of the confusion in the political vocabulary 
of any nation may be due to the different meanings
given to the same term by members of different 
generations. In thus explaining the tenacity of 
“outmoded” policies, generations theory removes 
these policies from the realm of free choice, from 
the realm of moral judgments. In this respect, a 
generations approach has the same fundamental 
weaknesses and strengths as any other determinist 
approach. 

Especially in times of rapid social change, it is 
important to know precisely at what period of life 
political attitudes are formed. It is interesting that 
no serious advocate of a generations approach has 
argued that political attitudes are formed during 
childhood. Rather, it is argued that late adolescence 
and early adulthood are the formative years during 
which a distinctive personal outlook on politics 
emerges, which remains essentially unchanged 
through old age. The crucial years are regarded 
as approximately 17 to 25. If these years are in 
fact formative, neither the years preceding nor the 
years following them are decisive in the formation 
of political attitudes. It is during the formative 
years that the youth discovers his own identity. 
When he defines who he is in terms of society, he 
defines his political outlook as well. A political 
generation is seen as a group of individuals who 
have undergone the same basic historical experi- 
ences during their formative years. Such a genera- 
tion would find political communication with 
earlier and later generations difficult, if not im- 
possible. 

The size of a political generation. Not all his- 
torical events are experienced to the same degree 
by the same number of individuals in their form- 
ative years, and political generations therefore vary 
widely in size. The 1825 Decembrist conspiracy, 
for instance, decisively influenced a much smaller 
number of young Russians than did the Russian 
Revolution of 1917. Assuming that the total degree 
of political participation is stable, it is the tem- 
poral and spatial “limits” of a given historic event 
that define the size of the resulting political gen- 
eration, just as the degree of uniqueness of that 
event determines the degree of difficulty that gen- 
eration will have in communicating with earlier 
and later generations. In this sense, for instance, 
the political generation created by World War I
was a general European phenomenon. It was neither 
world-wide nor confined to one or a few European 
nations. The enormous transformation of Euro- 
pean society which the war involved meant that a 
new political generation was created throughout 
Europe. But the limited nature of the war experi- 
ence outside Europe meant that the war had far 
less impact on, say, American or Japanese youth.
Those Europeans whose formative years, in whole 
or in part, occurred during 1914-1918, especially 
those who actually fought in the war, were not all 
influenced in the same way; but they were never- 
theless all decisively influenced by it. Their reac- 
tions to the war were often very different, depend- 
ing upon their national, class, and, especially, 
personality differences. In a real sense, the fact 
that they were all involved in the war in their 
formative years determined their attitude toward 
politics for the rest of their lives. Some never re- 
covered from the war and retreated from politics, 
which had, they felt, led directly to such suffering. 
Others entered politics after 1918 determined above 
all to prevent a recurrence of war. Still others never 
spiritually left the battlefield and as a result en- 
gaged in politics as the continuation of war by 
other means. These three alternatives formed what 
Karl Mannheim termed “generation units,” which 
together constituted a generation precisely because 
they were oriented toward one another, if only to 
fight one another. Politics involves more than 
knowledge of friend and foe, but many of the per- 
sonal friendships and enmities which in fact later 
influence political behavior originate during the 
formative years. Each generation speaks out with
more than one voice—there is conflict within each 
generation as well as among generations. 

The time span of a political generation. If a 
political generation is not the same as a biological 
generation, it becomes all the more important to 
define the specific time span within which all who 
experience their formative years can be said to be 
molded into one distinct political generation. With- 
out such a definition, it is impossible to classify 
individuals as members of one or another political 
generation with any precision, especially since few 
significant historic events “begin” and “end” with 
as much definiteness as World War I. Such classification
cannot be done on an ad hoc basis. If the
ad hoc method is attempted, it may properly be 
suspected that generational differences are being
used as a deus ex machina in much the same way 
that differences in national character are some- 
times invoked when there seems to be no other 
explanation of some political phenomenon. On the 
other hand, since the concept of political genera- 
tion is closely related to the process of social 
change, any arbitrary choice of time span is likely 
to violate the complexity of reality. The actual time 
span involved in a generation will differ substan- 
tially in periods in which the process of social 
change is more rapid or less rapid. Karl Mannheim 
(1928) suggested that whether a new generation 


appears every year, every thirty years, or every 
hundred years, or whether it emerges rhythmically 
at all, depends entirely on the specific social con- 
text. This variability, of course, is merely one as- 
pect of the general problem of applying ideal types 
to concrete situations. All concepts of comparative 
analysis must be defined in time and space if they 
are to be of any use in clarifying reality. This is 
especially true of the concept of political genera- 
tion, since membership in the same generation 
involves common location in time and space. In
twentieth-century Western society, in which social 
change is not only rapid but cataclysmic, the time 
span of a political generation is considerably 
shorter than in more stable societies. The most 
reasonable estimate for the time span of a political 
generation in twentieth-century Western society is 
probably ten to fifteen years. In twentieth-century 
non-Western societies experiencing revolutionary 
social, economic, and political changes, this time 
span may be even shorter. 

The spatial span of a political generation. Al- 
though the spatial span of a political generation is 
difficult to define with precision, it is clear that 
many coevals in the human race are not members 
of the same generation. Those whose formative ex- 
periences are fundamentally different cannot be 
members of the same generation, even though 
they may coexist in time. There was, for instance, 
as Mannheim pointed out, no community of experi- 
ence between youths in China and Germany about 
1800. It could be argued that there is considerable 
community of experience between youths in East 
Germany and Communist China today, since their 
formative years are spent in similar totalitarian 
political environments. Indeed, such phenomena as 
the spread of nationalism and of industrialization 
indicate that the spatial barriers between genera- 
tions may be breaking down, at the same time that 
more rapid social change is increasing the impor- 
tance of the temporal barriers between generations. 
The effect of the latter development is to make 
communication between different political genera- 
tions more difficult, while the effect of the former 
development is to increase the world-wide signifi- 
cance of this decline in ability to communicate. 
The implications of these long-range developments 
for meaningful communication within and among 
political systems are not entirely encouraging. 

Significance of the concept. Generational con- 
sciousness is undoubtedly less significant as a 
source of political motivation than either national 
or class consciousness. National consciousness finds 
its organizational expression in national states, and 
class consciousness finds its organizational expression
in class-based private associations and parties.
There is no analogous organizational structure cre- 
ated by generational consciousness, except for 
occasional loosely knit groups such as “Young 
Turks,” “Young England,” or “Young Conserva- 
tives.” There are many powerful men recognized 
as primarily national or class leaders, but there are 
relatively few such men recognized as primarily 
generational leaders. Political personages nevertheless
write, as did one Canadian leader, autobiographies
with such titles as My Generation of
Politics and Politicians (Preston 1927). Indeed, 
perhaps the most impressive evidence for the exist- 
ence of generational consciousness in politics is 
the frequency with which it is articulated in the 
autobiographical writings of political leaders. These 
writings often demonstrate that, as Léon Blum 
put it, “a man remains essentially what his youth 
has made him” and that there is a special kind of 
communication, in politics as in other aspects of 
life, between members of the same generation. 
This should not be surprising, for the latter are 
simultaneously experiencing the problems and the 
promise of the life cycle itself. 

Problems for further research. Clearly genera- 
tional consciousness is not the single key to under- 
standing political motivation, but equally clearly 
there is enough evidence of its importance to jus- 
tify further research efforts. An exact evaluation 
of this importance waits upon full answers to sev- 
eral questions, some of which have their implica- 
tions for the social sciences in general: 

(1) When do an individual's politically form- 
ative years begin and end? To what extent is this 
period different in different societies? 

(2) How substantial is the continuity between 
political attitudes accepted during the formative 
years and political attitudes in later years? (Care 
should be taken not to concentrate exclusively in 
this research on any one single element of political 
action, such as voting behavior, for the same fun- 
damental attitudes might cause a voter to shift 
party allegiance in a changing party constellation. 
In the United States, for instance, an individual 
who became a Wilsonian liberal in his formative 
years might consider himself consistent in voting 
for Republican candidates in elections after the 
Democratic party was transformed under Franklin 
Roosevelt. A supporter of the Congress party at the 
achievement of Indian independence might in later 
decades feel that his continuing personal beliefs 
were best articulated by some other—or by no— 
party.) 

(3) How long is the time span necessary to 
form a distinct political generation? What is the 
relationship between the rate of social change—
and changes in the rate of social change—and the 
length of this time span? 

(4) What is a generation unit? To what extent 
do generation units see the totality of politics in 
terms of each other? 

MARVIN RINTALA 


[See also IDENTIFICATION, POLITICAL. Other relevant 
material may be found in COHORT ANALYSIS; SO- 
CIALIZATION; and in the biography of MANNHEIM.] 
Behavior genetics is a relatively new cross-disciplinary
specialization between genetics and psy-
chology. It is so new that it hardly knows what to 
call itself. The term “behavior genetics” is gaining 
currency in the United States; but in some quarters 
there, and certainly elsewhere, the term “psycho- 
genetics” is favored. Logically, the best name would 
be genetical psychology, since the emphasis is on 
the use of the techniques of genetics in the analysis 
of behavior rather than vice versa; but the in- 
evitable ambiguity of that term is apparent. Psy- 
chologists generally use the terms “genetic” or 
“genetical” in two senses: in the first and older 
sense of developmental, or ontogenetic; and in the 
second, more recent usage relating to the analysis 
of inheritance. The psychologist G. Stanley Hall 
coined the term “genetic” before the turn of the 
century to denote developmental studies (witness 
the Journal of Genetic Psychology), and Alfred 
Binet even used the term “psychogenetic” in this 
sense. But with the rapid rise of the discipline now 
known as genetics after the rediscovery of the 
Mendelian laws in 1900, William Bateson, one of 
the founders of this new science, pre-empted the 
term “genetic” in naming it, thereby investing 
“genetic” with the double meaning that causes the 
current confusion. Psychological genetics, with its 
obvious abbreviation, psychogenetics, is probably 
the best escape from the dilemma.

Importance of genetics in behavior. The im- 
portance of psychogenetics lies in the fundamental 
nature of the biological processes in our under- 
standing of human social behavior. The social sci- 
ences, and psychology in particular, have long 
concentrated on environmental determinants of be- 
havior and neglected hereditary ones. But it is 
clear that in many psychological functions a sub- 
stantial portion of the observed variation, roughly 
of the order of 50 per cent for many traits, can be 
ascribed to hereditary causation. To ignore this
hereditary contribution is to impede both action
and thought in this area. 

This manifold contribution to behavioral varia- 
tion is not a static affair. Heredity and environ- 
ment interact, and behavior is the product, rather 
than the sum, of their respective contributions. 
The number of sources of variability in both he- 
redity and environment is large, and the consequent 
number of such possible products even larger.
Nevertheless, these outcomes are not incalculable, 
and experimental and other analyses of their limits 
are of immense potential interest to the behavioral 
scientist. The chief theoretical interest lies in the 
analysis of the evolution of behavior; and the chief 
practical significance, so far as can be envisaged 
at present, lies in the possibilities psychogenetics 
has for the optimization of genetic potential by 
manipulation of the environmental expression of it. 
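
The difference between a "sum" and a "product" of heredity and environment can be made concrete with a small numerical sketch. The two model functions and all numerical values below are hypothetical illustrations, not measurements from any study; the sketch only shows that under a multiplicative model the same change of environment has a different effect on different genotypes, whereas under an additive model it does not.

```python
def additive(genotype, environment):
    """Hypothetical model: phenotype is the sum of the two contributions."""
    return genotype + environment


def multiplicative(genotype, environment):
    """Hypothetical model: phenotype is the product of the two contributions."""
    return genotype * environment


def environment_effect(model, genotype, env_low, env_high):
    """Change in phenotype when one genotype moves from env_low to env_high."""
    return model(genotype, env_high) - model(genotype, env_low)


# Two illustrative genotypes and two illustrative environments (arbitrary units).
genotypes = {"genotype_1": 1.0, "genotype_2": 2.0}
env_low, env_high = 1.0, 3.0

additive_effects = {
    name: environment_effect(additive, g, env_low, env_high)
    for name, g in genotypes.items()
}
product_effects = {
    name: environment_effect(multiplicative, g, env_low, env_high)
    for name, g in genotypes.items()
}

print("additive:", additive_effects)
print("product: ", product_effects)
```

Under the additive model the environmental change shifts both genotypes by the same amount; under the product model the shift is larger for the genotype with the larger value, which is what is meant by interaction.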

Major current approaches. The major ap- 
proaches to the study of psychogenetics can be 
characterized as the direct, or experimental, and 
the indirect, or observational. The former derive 
principally from the genetical parent of this hybrid 
discipline and involve the manipulation of the he- 
redity of experimental subjects, usually by restrict- 
ing the choice of mates in some specially defined 
way. Since such techniques are not possible with 
human subjects a second major approach exists, 
the indirect or observational, with its techniques 
derived largely from psychology and sociology. The 
two approaches are largely complementary in the 
case of “natural” genetic experiments in human 
populations, such as twinning or cousin marriages. 
Thus, the distinction between the two is based on 
the practicability of controlling in some way the 
essentially immutable genetic endowment—in a 
word, the genotype—of the individuals subject to 
investigation. With typical experimental animals 
(rats, mice, etc.) and other organisms used by the 
geneticist, such as the fruit fly and many microor- 
ganisms, the genotype can often be specified in 
advance and populations constructed by the hy- 
bridization of suitable strains to meet this specifica- 
tion with a high degree of accuracy. Not so with 
humans, where the genotype must remain as given,
and indeed where its details can rarely be specified 
with any degree of accuracy except for certain 
physical characteristics, such as blood groups. 
Observational, demographic, and similar tech- 
niques are therefore all that are available here. 
The human field has another disadvantage in rig- 
orous psychogenetic work: the impossibility of 
radically manipulating the environment—for ex- 
ample, by rearing humans in experimental environ- 
ments from birth in the way that can easily be
done with animals in the laboratory. Since in psy-
chogenetics, as in all branches of genetics, one 
deals with a phenotype—in this case, behavior— 
and since the phenotype is the end product of the 
action, or better still, interaction of genotype and 
environment, human psychogenetics is fraught 
with double difficulty. Analytical techniques to be 
mentioned later can assist in resolving some of 
these difficulties. 

Definition. To define psychogenetics as the 
study of the inheritance of behavior is to adopt a 
misleadingly narrow definition of the area of study, 
and one which is unduly restrictive in its emphasis 
on the hereditarian point of view. Just as the parent 
discipline of genetics is the analysis not only of the 
similarities between individuals but also of the dif- 
ferences between them, so psychogenetics seeks to 
understand the basis of individual differences in 
behavior. Any psychogenetic analysis must there- 
fore be concerned with the environmental determi- 
nants of behavior (conventionally implicated in the 
genesis of differences) in addition to the hereditary
ones (the classic source of resemblances). But 
manifestly this dichotomy does not always operate, 
so that for this reason alone the analysis of en- 
vironmental effects must go hand in hand with the 
search for genetic causation. This is true even if 
the intention is merely to exclude the influence of 
the one the better to study the other; but the 
approach advocated here is to study the two in 
tandem, as it were, and to determine the extent to 
which the one interacts with the other. Psycho- 
genetics is best viewed as that specialization which 
concerns itself with the interaction of heredity and 
environment, insofar as they affect behavior. To 
attempt greater precision is to become involved in 
subtle semantic problems about the meanings of 
terms. 

At first sight many would tend to restrict en- 
vironmental effects to those operating after the 
birth of the organism, but to do so would be to 
exclude prenatal environmental effects that have 
been shown to be influential in later behavior. On 
the other hand, to broaden the concept of environ- 
ment to include all influences after fertilization— 
the point in time at which the genotype is fixed 
—permits consideration of the reciprocal influence 
of parts of the genotype upon each other. Can environment
include the rest of the genotype, other
than that part which is more or less directly con- 
cerned with the phenotype under consideration? 
This point assumes some importance since there 
are characteristics, not behavioral—at least, none 
that are behavioral have so far been reported— 
whose expression depends on the nature of the 
other genes present in the organism. In the absence
of some of them, or rather certain alleles of the 
gene pairs, the value phenotypically observed 
would be different from what it would be if they 
were present. That is, different components of the 
genotype, in interplay with one another, modify 
phenotypic expression of the characteristic they in- 
fluence. Can such indirect action, which recalls 
that of a chemical catalyst, best be considered as 
environmental or innate? It would be preferable to 
many to regard this mechanism as a genetic effect 
rather than an environmental one in the usually ac- 
cepted sense. Hence, the definition of the area of 
study as one involving the interaction of heredity 
and environment, while apparently adding com- 
plexity, in fact serves to reduce confusion. 

It must be conceded that this view has not as 
yet gained general acceptance. In some of the 
work reviewed in the necessarily brief survey of the 
major findings in this area, attempts have been 
made to retain a rather rigid dichotomy between 
heredity and environment—nature versus nurture 
—in fact, an “either/or” proposition that the facts 
do not warrant. The excesses of both sides in the 
controversies of the 1920s—for example, the fa- 
mous debate between Watson and McDougall over 
the relative importance of learned (environmental) 
and instinctive (genetic) determinants of behavior 
—show the fallacies that extreme protagonists on 
either side can entertain if the importance of the 
interaction effect is ignored. 

Gene action. The nature of gene action as such 
is essentially conducive to interaction with the 
environment, since the behavioral phenotype we 
observe is the end product of a long chain of action, 
principally biochemical, originating in the chro- 
mosome within the individual cell. A chromosome 
has a complex structure, involving DNA (deoxyribo- 
nucleic acid) and the connections of DNA with 
various proteins, and may be influenced in turn by 
another nucleic acid, RNA (ribonucleic acid), also
within the cell but external to the nucleus. There 
are complex structures and sequences of processes, 
anatomical, physiological, and hormonal, which 
underlie normal development and differentiation of 
structure and function in the growth, development, 
and maturation of the organism. Much of this in- 
fluence is determined genetically in the sense that 
the genotype of the organism, fixed at conception, 
determines how it proceeds under normal environ- 
mental circumstances. But it would be a mistake to 
regard any such sequence as rigid or immutable, 
as we shall see. 

The state of affairs that arises when a number 
of genetically determined biochemical abnormalities
affect behavior is illustrative of the argument.
Many of these biochemical deficiencies or inborn 
errors of metabolism in humans are the outcome of 
a chain of causation starting with genic structures, 
some of them having known chromosomal loca- 
tions. Their effects on the total personality—that is, 
the sum total of behavioral variation that makes
the individual unique—can range from the trivial 
to the intense. The facility with which people can 
taste a solution of phenylthiocarbamide (PTC), a 
synthetic substance not found in nature, varies 
in a relatively simple genetical way: people are 
either “tasters” or “nontasters” in certain rather 
well-defined proportions, with a pattern of inheri- 
tance determined probably by one gene of major 
effect. But being “taste blind” or not is a relatively 
unimportant piece of behavior, since one is never 
likely to encounter it outside a genetical experi- 
ment. (It should perhaps be added that there is 
some evidence that the ability to taste PTC may be 
linked with other characteristics of some importance,
such as susceptibility to thyroid disease.)
Nevertheless, this example is insignificant com- 
pared with the psychological effect of the absence 
of a biochemical link in patients suffering from 
phenylketonuria. They are unable to metabolize 
phenylalanine to tyrosine in the liver, with the re- 
sult that the phenylalanine accumulates and the 
patient suffers multiple defects, among which is 
usually gross intellectual defect, with an IQ typi- 
cally on the order of 30. This gross biochemical 
failure is mediated by a single recessive gene that 
may be passed on in a family unnoticed in hetero- 
zygous—single dose—form but becomes painfully 
apparent in the unfortunate individual who hap- 
pens to receive a double dose and consequently is 
homozygous for the defect.
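
The way a single recessive gene can pass unnoticed through heterozygous carriers and then appear in a homozygous child can be sketched by enumerating the possible allele combinations. This is a minimal illustration assuming simple Mendelian segregation for one gene; the allele labels "A" (normal, dominant) and "a" (defective, recessive) and the function name are hypothetical choices made for the example.

```python
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent1, parent2):
    """Return the probability of each offspring genotype for one gene.

    Each parent is a two-letter string such as "Aa". Each of the four
    allele pairings is assumed equally likely (probability 1/4).
    """
    outcomes = {}
    for allele1, allele2 in product(parent1, parent2):
        # "aA" and "Aa" are the same genotype, so sort the letters.
        genotype = "".join(sorted(allele1 + allele2))
        outcomes[genotype] = outcomes.get(genotype, Fraction(0)) + Fraction(1, 4)
    return outcomes


# Two unaffected carriers (heterozygous, "single dose").
carrier_cross = offspring_genotypes("Aa", "Aa")
for genotype, probability in sorted(carrier_cross.items()):
    print(genotype, probability)

# One carrier and one non-carrier: no child receives a double dose.
mixed_cross = offspring_genotypes("AA", "Aa")
print(mixed_cross.get("aa", Fraction(0)))
```

On these assumptions one child in four of two carriers is expected to be homozygous for the defect, while half are themselves unaffected carriers.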

Alternatively, a normal dominant gene may 
mutate to the recessive form and so give rise to 
the trouble. While mutation is a relatively rare 
event individually, the number of genes in each 
individual—probably on the order of ten thousand 
—and the number of individuals in a population 
make it statistically a factor to be reckoned with.
One of the best documented cases of a deleterious 
mutation of this kind giving rise to a major defect 
relates to the hemophilia transmitted, with certain 
important political consequences, to some of the 
descendants of Queen Victoria of England. The de- 
pendence of the last tsarina of Russia on the monk 
Rasputin was said to be based in part on the bene- 
ficial therapeutic effect of his hypnotic techniques 
on the uncontrollable bleeding of the Tsarevitch 
Alexis. Victoria was almost certainly heterozygous 
for hemophilia and, in view of the absence of any
previous record of the defect in the Hanoverian
dynasty, it seems likely that the origin of the 
trouble was a mutation in one of the germ cells 
in a testicle of Victoria’s father, the duke of Kent, 
before Victoria was conceived in August 1818.
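
The arithmetic behind the remark that mutation, though individually rare, is statistically a factor to be reckoned with can be sketched as follows. Only the figure of about ten thousand genes comes from the text; the mutation rate per gene and the population size are hypothetical values chosen purely for illustration, and the calculation ignores the fact that each gene is present in two copies.

```python
ASSUMED_RATE_PER_GENE = 1e-5   # hypothetical chance of mutation per gene per generation
GENES_PER_INDIVIDUAL = 10_000  # order of magnitude given in the text
POPULATION = 1_000_000         # hypothetical number of individuals


def expected_new_mutations(rate_per_gene, genes_per_individual, individuals):
    """Expected number of new mutations, treating every gene as independent."""
    return rate_per_gene * genes_per_individual * individuals


def prob_at_least_one(rate_per_gene, individuals):
    """Chance that one particular gene mutates in at least one individual."""
    return 1.0 - (1.0 - rate_per_gene) ** individuals


per_individual = expected_new_mutations(
    ASSUMED_RATE_PER_GENE, GENES_PER_INDIVIDUAL, 1
)
in_population = expected_new_mutations(
    ASSUMED_RATE_PER_GENE, GENES_PER_INDIVIDUAL, POPULATION
)
one_gene_somewhere = prob_at_least_one(ASSUMED_RATE_PER_GENE, POPULATION)

print(f"expected new mutations per individual: {per_individual:.2f}")
print(f"expected new mutations in the population: {in_population:,.0f}")
print(f"chance a given gene mutates somewhere: {one_gene_somewhere:.4f}")
```

With these assumed figures a mutation at any one gene in any one person is very unlikely, yet a mutation at that gene somewhere in the population is close to certain.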

But however it comes about, a defect such as 
phenylketonuria can be crippling. Fortunately, its 
presence can be diagnosed in very early life by a 
simple urine test for phenyl derivatives. The de- 
pendence of the expression of the genetic defect on 
the environmental circumstances is such that its 
effect can be mitigated by feeding the afflicted in- 
fant with a specially composed diet low in the 
phenylalanine with which the patient’s biochemical 
make-up cannot cope. Here again, therefore, one 
sees the interaction of genotype and environment 
—in this case the type of food eaten. Many of the 
human biochemical defects that have been brought 
to light in recent years are rather simply de- 
termined genetically, in contrast with the prevail- 
ing beliefs about the bases of many behavioral 
characteristics including intelligence, personality, 
and most psychotic and neurotic disorders. This is 
also true of several chromosomal aberrations that 
have been much studied recently and that are now 
known to be implicated in various conditions of 
profound behavioral importance. Prominent among 
these is Down’s syndrome (mongolism) with, 
again, effects including impairment of cognitive 
power. [See INTELLIGENCE AND INTELLIGENCE 
TESTING; MENTAL DISORDERS, articles on BIOLOGICAL
ASPECTS and GENETIC ASPECTS.]

Sex as a genetic characteristic. The sex differ- 
ence is perhaps the most striking genetically de- 
termined difference in behavior and the one that is 
most often ignored in this connection. Primary sex 
is completely determined genetically at the moment 
of fertilization of the ovum; in mammals sex de- 
pends on whether the spermatozoon effecting fer- 
tilization bears an X or a Y chromosome to com- 
bine with the X chromosome inevitably contributed 
by the ovum. The resulting zygote then has the
form of an XX (female) or an XY (male) indi- 
vidual. This difference penetrates every cell of 
every tissue of the resulting individual and in turn 
is responsible for the observable gross differences 
in morphology. These, in turn, subserve differences
of physiological function, metabolism, and endo- 
crine function which profoundly influence not only 
those aspects of behavior relating to sexual behav- 
ior and reproductive function in the two sexes but 
many other aspects as well. But behavior is also 
influenced by social and cultural pressures, so that 
the resulting sex differences in behavior as ob- 
served by the psychologist are especially good examples
of a phenotype that must be the end
product of both genetic and environmental forces.
There is a large literature on sex differences in 
human behavior and a sizable one on such differ- 
ences in animal behavior, but there has been little 
attempt to assess this pervasive variation in terms 
of the relative contribution of genetic and environ- 
mental determinants. This is partly because of the 
technical difficulties of the problem, in the sense 
that all subjects must be of either one sex or the 
other—crossing males with females will always 
result in the same groups as those one started with, 
either males or females—there being, in general, 
no genetically intermediate sex against which to 
evaluate either and identical twins being inevitably 
of like sex. It is also partly because the potential of 
genetic analyses that do not involve direct experi- 
mentation has not been realized. This is especially 
so since the causal routes whereby genetic determi- 
nants of sex influence many of the behavioral 
phenotypes observed are often better understood 
than in other cases where the genetic determinants 
underlying individual differences manifest in a 
population are not so clear-cut. [See INDIVIDUAL 
DIFFERENCES, article on SEX DIFFERENCES.]

Sex linkage. There is one exception to the gen- 
eral lack of interest in the biometrical analysis of 
sex differences having behavioral connotations: 
sex-linked conditions. That is to say, it is demon- 
strated or postulated that the gene or genes re- 
sponsible for the behavior—often a defect, as in 
the case of color blindness, which has a significantly 
greater incidence in males than in females—are 
linked with the sex difference by virtue of their 
location on the sex chromosome determining ge- 
netic sex. Thus it is that sex can be thought of as 
a chromosomal difference of regular occurrence, 
as opposed to aberrations of the sort which give 
rise to pathological conditions, such as Down’s 
syndrome. Indeed there are also various anomalies 
of genetic sex that give rise to problems of sexual 
identity, in which the psychological and overt be- 
havioral consequences can be of major importance 
for the individual. While the evidence in such cases 
of environmental modification of the causative 
genetic conditions is less dramatic than in phenyl- 
ketonuria, interaction undoubtedly exists, since 
these chromosomal defects of sex differentiation 
can in some cases be alleviated by appropriate 
surgical and hormonal treatment. [See SEXUAL BE- 
HAVIOR, article on SEXUAL DEVIATION: PSYCHOLOG- 
ICAL ASPECTS; and VISION, article on COLOR VISION 
AND COLOR BLINDNESS.]
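
Why a sex-linked recessive defect such as color blindness should be so much commoner in males can be sketched numerically. The sketch assumes that the gene lies on the X chromosome, that the defective allele is recessive, and that mating is random with respect to it; the allele frequency used is a hypothetical round figure, not a measured one. A male has a single X chromosome and is affected whenever it carries the allele, whereas a female is affected only if both of her X chromosomes carry it.

```python
def x_linked_recessive_incidence(allele_frequency):
    """Return (male incidence, female incidence) for an X-linked recessive trait.

    Assumes random mating: males are affected with probability q,
    females with probability q squared.
    """
    if not 0.0 <= allele_frequency <= 1.0:
        raise ValueError("allele frequency must lie between 0 and 1")
    male_incidence = allele_frequency
    female_incidence = allele_frequency ** 2
    return male_incidence, female_incidence


ASSUMED_ALLELE_FREQUENCY = 0.08  # hypothetical value for illustration

males, females = x_linked_recessive_incidence(ASSUMED_ALLELE_FREQUENCY)
print(f"males affected:   {males:.4f}")
print(f"females affected: {females:.4f}")
print(f"ratio of male to female incidence: {males / females:.1f}")
```

On these assumptions the ratio of male to female incidence is simply the reciprocal of the allele frequency, so the rarer the allele, the more exclusively male the condition appears.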

Human psychogenetics. It is abundantly clear 
that most of the phenotypes the behavioral scientist
is interested in are multidetermined, both environmentally
and genetically. The previous exam-
ples, however, are the exception rather than the 
rule, and their prominence bears witness that our 
understanding of genetics and behavior is as yet 
so little advanced that the simpler modes of genetic 
expression have been the first to be explored. In 
genetics itself, the striking differences in seed con- 
figuration used by Mendel in his classic crosses of 
garden peas are determined by major genes with
full dominance acting simply. But such clear-cut 
expression, especially of dominance, is unusual in 
human psychogenetics, and more complex statis- 
tical techniques are necessary to evaluate multiple 
genetic and environmental effects acting to pro- 
duce the observed phenotype. 

Whatever the analysis applied to the data gath- 
ered in other fields, in human psychogenetics the 
method employed cannot be the straightforward 
Mendelian one of crossbreeding which, in various 
elaborations, remains the basic tool of the geneti- 
cist today. Neither can it be the method of selection 
—artificial, as opposed to natural—that is other- 
wise known as selective breeding. Indeed, none of 
the experimental techniques that can be applied to 
any other organism, whatever the phenotype being 
measured, is applicable to man, since experimental 
mating is effectively ruled out as a permissible 
technique in current cultures. It may be remarked 
in passing that such has not always been the case. 
The experiment of the Mogul emperor, Akbar, who 
reared children in isolation to determine their 
“natural religion” (and merely produced mutes) 
and the eugenics program of J. H. Noyes at the 
Oneida Community in New York State in the nine- 
teenth century are cases in point. The apparent 
inbreeding of brother with sister among the rulers 
of ancient Egypt in the eighteenth dynasty (sixteenth
to fourteenth century B.C.), which is often
quoted as an example of the absence in humans of 
the deleterious effects of inbreeding (“inbreeding 
depression”), may not be all it seems. It is likely 
that the definition of “sister” and “brother” in this 
context did not necessarily have the same biologi- 
cal relevance that it has today but was rather a 
cultural role that could be defined, at least in this 
case, at will. 

Twin study. In the absence of the possibility 
of an experimental approach, contemporary re- 
search in human psychogenetics must rely on nat- 
ural genetic experiments. Of these, the one most 
widely used and most industriously studied is the 
phenomenon of human twinning. Credit for the 
recognition of the value of observations on twins 
can be given to the nineteenth-century English scientist
Francis Galton, who pioneered many fields
of inquiry. He may be justly regarded as the father 
of psychogenetics for the practical methods he in- 
troduced into this field, such as the method of twin 
study, as well as for his influence which extended, 
although indirectly, even to the American experi- 
menters in psychogenetics during the early decades 
of the present century. 

Twin births are relatively rare in humans and 
vary in frequency with the ethnic group. However, 
the extent to which such ethnic groups differ 
among themselves behaviorally as a result of the 
undoubted genetic differences, of which incidence 
of multiple births is but one example, is contro- 
versial. As is well known, there are two types of 
twins: the monozygotic or so-called identical twins, 
derived from a single fertilized ovum that has split 
into two at an early stage in development, and the 
dizygotic or so-called fraternal twins, developed 
from two separate ova fertilized by different sper- 
matozoa. These two physical types are not always 
easy to differentiate, although this difficulty is relatively
minor in twin study. Nonetheless, they have
led to two kinds of investigation. The first relates 
to differences in monozygotic twins who have iden- 
tical hereditary make-up but who have been reared 
apart and thus subjected to different environmental 
influences during childhood; and the second relates 
to the comparison of the two types of twins— 
usually restricted to like-sex pairs, since fraternal 
twins can differ in sex. The latter method supposes 
all differences within monozygotic pairs to be of
environmental origin, whereas the (greater) difference
within dizygotic pairs is of environmental
plus genetic origin. Thus, the relative contribution 
of the two sources of variation can be evaluated. 
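
The logic of the second method can be sketched as a short calculation. It takes the method's own supposition at face value: differences within monozygotic pairs are environmental, and differences within dizygotic pairs are environmental plus genetic, so the excess of the dizygotic over the monozygotic within-pair variance is attributed to heredity. The function names and the test scores are hypothetical, invented for the example, and the sketch assumes that the two types of twins experience comparable environments.

```python
def within_pair_variance(pairs):
    """Within-pair variance: mean squared difference between co-twins, halved."""
    if not pairs:
        raise ValueError("at least one twin pair is required")
    return sum((a - b) ** 2 for a, b in pairs) / (2 * len(pairs))


def genetic_share(mz_pairs, dz_pairs):
    """Share of dizygotic within-pair variance attributed to heredity."""
    v_mz = within_pair_variance(mz_pairs)  # environmental only, by supposition
    v_dz = within_pair_variance(dz_pairs)  # environmental plus genetic
    if v_dz == 0:
        raise ValueError("dizygotic pairs show no within-pair variation")
    return (v_dz - v_mz) / v_dz


# Hypothetical test scores for four like-sex pairs of each type.
mz_pairs = [(100, 102), (95, 94), (110, 107), (88, 90)]
dz_pairs = [(100, 106), (95, 91), (110, 103), (88, 93)]

print(f"MZ within-pair variance: {within_pair_variance(mz_pairs):.2f}")
print(f"DZ within-pair variance: {within_pair_variance(dz_pairs):.2f}")
print(f"share attributed to heredity: {genetic_share(mz_pairs, dz_pairs):.3f}")
```

The result is only as good as the supposition behind it: if monozygotic twins are also treated more alike than dizygotic twins, part of the share credited to heredity is in fact environmental.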

Findings obtained from either method have not 
been especially clear-cut, both because of intrac- 
table problems regarding the relative weight to be 
placed upon differences in the environment in 
which the twins have been reared and because of 
the sampling difficulties, which are likely to be 
formidable in any twin study. Nevertheless, inter- 
esting inferences can be drawn from twin study. 
The investigation of separated monozygotic twins 
has shown that while even with their identical 
heredity they can differ quite widely, there exists 
a significant resemblance in basic aspects of per- 
sonality including intelligence, introversion, and 
neurotic tendencies, and that these resemblances 
can persist despite widely different environments 
in which the members of a pair are reared. Such 
findings emphasize the need to consider the con- 
tribution of genotype and environment in an inter- 
active sense—clearly some genotypes represented
in the personality of monozygotic twin pairs are
sensitive to environmentally induced variation, 
whereas others are resistant to it. 

Comparisons between monozygotic and dizygotic 
twins reared together suggest that monozygotic 
twins more closely resemble each other in many 
aspects of personality, especially those defining 
psychological factors such as neuroticism and 
introversion—extroversion. The increase in the dif- 
ferences between the two types of twins when 
factor measures are used—as opposed to simple 
test scores—suggests that a more basic biological 
stratum is tapped by factor techniques, since the 
genetic determination seems greater than where 
individual tests are employed. Here again, the de- 
gree to which any phenotype is shown to be heredi- 
tary in origin is valid only for the environment in 
which it developed and is measured; different en- 
vironments may well yield different results. The
problems of environmental control in human sam- 
ples are so intractable that some students of the 
subject have questioned whether the effort and 
undoubted skill devoted to twin study have been 
well invested, in view of the inherent and persist- 
ing equivocality of the outcome. 

Multivariate methods. Methods of twin study, 
introduced largely to improve upon the earlier 
methods of familial correlation (parents with off- 
spring, sib with sib, etc.), have been combined 
with them. Familial correlation methods them- 
selves have not been dealt with here, since within- 
family environments are bound to be even greater 
contaminants in determining the observed behavior 
than environments in twin study methods. Never- 
theless, used on a large scale and in conjunction 
with twin study and with control subjects selected 
at random from a population, multivariate methods 
show promise for defining the limits of environ- 
mental and genotypic interaction. So far, the solu- 
tions to the problems of biometrical analysis posed 
by this type of investigation have been only partial, 
and the sheer weight of effort involved in locating 
and testing the requisite numbers of subjects stand- 
ing in the required relationships has deterred all 
but a few pioneers. Despite the undoubtedly useful 
part such investigations have played in defining 
the problems involved, the absence of the possi- 
bility of experimental breeding has proved a draw- 
back in the provision of socially useful data. 

Animal psychogenetics. Recourse has often been 
had to nonhuman subjects. The additional problem 
thereby incurred of the relevance of comparative 
data to human behavior is probably balanced by 
the double refinements of the control of both the 
heredity and the environment of the experimental
subjects. Two major methods of genetics have been
employed, both intended to produce subjects of pre- 
determined genotype: the crossbreeding of strains 
of animals of known genotype; and phenotypic 
selection, the mating of like with like to increase 
a given characteristic in a population. 

Selection. Behavioral phenotypes of interest 
have been studied by the above methods, often 
using laboratory rodents. For example, attributes 
such as intelligence, activity, speed of condition- 
ing, and emotionality have been selectively bred 
in rats. 

Selection for emotional behavior in the rat will 
serve as an example of the techniques used and 
the results achieved. Rats, in common with many
other mammalian species, defecate when afraid. A 
technique of measuring individual differences in 
emotional arousal is based on this propensity. The 
animal under test is exposed to mildly stressful 
noise and light stimulation in an open field or 
arena. The number of fecal pellets deposited serves 
as an index of disturbance, and in this way the 
extremes among a large group of rats can be 
characterized as high or low in emotional respon- 
siveness. Continued selection from within the 
“high” and “low” groups will in time produce two 
distinct strains. Control of environmental variables 
is achieved by a rigid standardization of the con- 
ditions under which the animals are reared before 
being subjected to the test as adults. Careful checks 
on maternal effects, both prenatal and postnatal, 
reveal these effects to be minimal. 

Such an experiment does little beyond establish- 
ing the importance of the genetic effect on the 
given strains in the given environment. While 
there are techniques for assessing the relative im- 
portance of the genetic and environmental con- 
tributions to the variation observed under selection, 
they are better suited to the analysis of the out- 
come of experiments using the alternative major 
genetical method, that of crossbreeding of inbred 
strains. 

Crossbreeding. Strains used in crossbreeding 
experiments have usually been inbred for a pheno- 
typic character of interest, although not usually a 
behavioral one. However, this does not preclude 
the use of these inbred strains for behavioral 
studies, since linkage relationships among genes 
ensure that selection for factors multidetermined 
genetically often involves multiple changes in char- 
acteristics other than those selected for, and be- 
havior is no exception to this rule. Moreover, the 
existence of such inbred strains constitutes perhaps 
the most important single advantage of animals as 
subjects, since it enables simplifying assumptions 
regarding the homozygosity or genetic uniformity
of such strains to be made in analysis of the out- 
come of crosses. Members of inbred strains are 
theoretically as alike as monozygotic twin pairs, so 
that genetic relationships—which in human popu- 
lations can be investigated only after widespread 
efforts to find them—can be multiplied at will in 
laboratory animals. 

This approach allows a more sensitive analysis 
of the determinants, both environmental and ge- 
netic, of the behavioral phenotype under observa- 
tion. In addition, the nature of the genetic forces 
can be further differentiated into considerations of 
the average dominance effects of the genes in- 
volved, the extent to which they tend to increase or 
decrease the metrical expression of the behavioral 
phenotype, and the extent to which the different 
strains involved possess such increasers or de- 
creasers. Finally, rough estimates of the number 
of these genes can be given. But the analysis de- 
pends upon meeting requirements regarding the 
scaling of the metric upon which the behavior is 
measured and is essentially a statistical one. That 
is, only average effects of cumulative action of the 
relatively large number of genes postulated as in- 
volved can be detected. Gone are the elegantly 
simple statistics derived from the classical Men- 
delian analyses of genes of major effect, often dis- 
playing dominance, like those encountered in cer- 
tain human inborn errors of metabolism. There is 
little evidence of the existence of comparable genes 
of major effect mediating behavior in laboratory 
animals, although some have been studied in in- 
sects, especially the fruit fly. 

A typical investigation of a behavioral phenotype 
might take the form of identifying two inbred 
strains known to differ in a behavioral trait, meas- 
uring individuals from these strains, and then sys- 
tematically crossing them and measuring all 
offspring. When this was done for the runway 
performance of mice, an attribute related to their 
temperamental wildness, the results, analyzed by 
the techniques of biometrical genetics, showed that 
the behavior was controlled by at least three groups 
of genes (a probable underestimate). The contribu- 
tions of these groups were additive to each other 
and independent of the environment when meas- 
ured on a logarithmic scale but interacted with 
each other and with the environment on a linear 
scale. These genes showed a significant average 
dominance effect, and there was a preponderance 
of dominant genes in the direction of greater wild- 
ness. The heritability ratio of the contributions of 
“nature” and “nurture” was around seven to three. 

The use of inbred lines may be restricted to first 
filial crosses if a number of such crosses are made
from several different lines. This increases preci- 
sion of analysis in addition to allowing a propor- 
tionate decrease in the amount of laboratory work. 
One investigation examined the exploratory behav- 
ior of six different strains of rats in an open field 
of the kind used for the selection mentioned above. 
On a linear scale there were no untoward environ- 
mental effects, including specifically prenatal ma- 
ternal ones. The heritability ratio was high, around 
nine to one; and while there was a significant 
average dominance component among the genes 
determining exploration, there was no preponder- 
ance of dominants or recessively acting genes 
among increasers or decreasers. The relative stand- 
ing in this respect of the parental strains could be 
established with some precision. 
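
The two "heritability ratios" quoted above (about seven to three for runway performance in mice, about nine to one for open-field exploration in rats) can be restated as fractions of the observed variation. The following is a minimal sketch, assuming that each ratio compares the genetic with the environmental share of the variance; the function name is illustrative only and does not come from the studies cited.

```python
from fractions import Fraction


def heritability_from_ratio(nature: int, nurture: int) -> Fraction:
    """Return the genetic share of the variation, given a nature:nurture ratio.

    Assumption: the ratio compares genetic with environmental variance,
    so the genetic share is nature / (nature + nurture).
    """
    if nature < 0 or nurture < 0 or nature + nurture == 0:
        raise ValueError("ratio terms must be non-negative and not both zero")
    return Fraction(nature, nature + nurture)


# Ratios quoted in the text.
runway_mice = heritability_from_ratio(7, 3)      # runway performance of mice
open_field_rats = heritability_from_ratio(9, 1)  # exploration in rats

print(float(runway_mice), float(open_field_rats))
```

Read in this way, roughly 70 per cent and 90 per cent of the variation would be attributed to heredity, and then only for the strains and the environment actually used.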

Limitations. While the methods described above 
have allowed the emergence of results that ulti- 
mately may assist our understanding of the mech- 
anisms of behavioral inheritance, it cannot be said 
that much substantial progress has yet been made. 
Until experiments explore the effect of a range of 
different genotypes interacting with a range of en- 
vironments of psychological interest and conse- 
quence, little more can be expected. Manipulating 
heredity in a single standard environment or 
manipulating the environment of a single standard 
genotype can only provide conclusions so limited 
to both the genotypes and conditions employed 
that they have little usefulness in a wider context. 
When better experiments are performed, as seems 
likely in the next few decades, then problems of 
some sociological importance and interest will arise 
in the application of these experiments to the tasks 
of maximizing genetic potential and perfecting en- 
vironmental control for the purpose of so doing. 
A new eugenics may well develop, but grappling 
with the problems of its impact on contemporary 
society had best be left to future generations. 


P. L. BROADHURST 
DIFFERENCES, article on SEX DIFFERENCES; INSTINCT;
INTELLIGENCE AND INTELLIGENCE TESTING; MENTAL
RETARDATION; PSYCHOLOGY, article on CONSTITUTIONAL
PSYCHOLOGY.]
II
DEMOGRAPHY AND POPULATION GENETICS 


The best available definition of population genet- 
ics is doubtless that of Malécot: “It is the totality 
of mathematical models that can be constructed to 
represent the evolution of the structure of a popu- 
lation classified according to the distribution of its 
Mendelian genes” (1955, p. 240). This definition, 
by a probabilist mathematician, gives a correct idea 
of the “constructed” and abstract side of this branch 
of genetics; it also makes intelligible the rapid de- 
velopment of population genetics since the advent 
of Mendelism. 

In its formal aspect this branch of genetics might 
even seem to be a science that is almost played out. 
Indeed, it is not unthinkable that mathematicians 
have exhausted all the structural possibilities for 
building models, both within the context of general 
genetics and within that of the hypotheses—more 
or less complex and abstract—that enable us to 
characterize the state of a population. 

Two major categories of models can be distin- 
guished: determinist models are those “in which 
variations in population composition over time are 
rigorously determined by (a) a known initial state 
of the population; (b) a known number of forces 
or ‘pressures’ operating, in the course of genera- 
tions, in an unambiguously defined fashion” (Malé- 
cot 1955, p. 240). These pressures involve muta- 
tion, selection, and preferential marriages (by 
consanguinity, for instance). Determinist models,
based on ratios that have been exactly ascertained 
from preceding phenomena, can be expressed only 
in terms of populations that are infinite in the 
mathematical sense. In fact, it is only in this type 
of population that statistical regularities can emerge 
(Malécot 1955). In these models the composition 
of each generation is perfectly defined by the com- 
position of the preceding generation. 

Stochastic models, in contrast to determinist 
ones, involve only finite populations, in which the
gametes that, beginning with the first generation,
are actually going to give birth to the new genera- 
tion represent only a finite number among all pos- 
sible gametes. The result is that among these active, 
or “useful,” gametes (Malécot 1959), male or fe- 
male, the actual frequency of a gene will differ 
from the probability that each gamete had of car- 
rying it at the outset. 

The effect of chance will play a prime role, and 
the frequencies of the genes will be able to drift 
from one generation to the other. The effects of 
random drift and of genetic drift become, under 
these conditions, the focal points for research. 
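
The behavior of a stochastic model can be made concrete with a small simulation. The sketch below is not one of Malécot's models; it assumes the simplest textbook scheme, in which each generation is formed by drawing a finite number of "useful" gametes at random, each carrying the gene with a probability equal to its frequency in the parental generation. The function name, the population size, and the seed are illustrative assumptions.

```python
import random


def drift(p0: float, n_individuals: int, generations: int, seed: int = 0) -> list[float]:
    """Follow the frequency of one gene through successive finite generations.

    Each generation draws 2 * n_individuals gametes; every gamete carries
    the gene with probability equal to the current frequency.
    """
    rng = random.Random(seed)          # fixed seed so a run can be repeated
    n_gametes = 2 * n_individuals
    p = p0
    trajectory = [p]
    for _ in range(generations):
        carriers = sum(1 for _ in range(n_gametes) if rng.random() < p)
        p = carriers / n_gametes       # realized frequency, not the expected one
        trajectory.append(p)
    return trajectory


small = drift(0.5, n_individuals=20, generations=50)
large = drift(0.5, n_individuals=2000, generations=50)
print(small[-1], large[-1])
```

Under these assumptions the realized frequency wanders from generation to generation, the more so the smaller the population, and a gene that reaches a frequency of zero or one stays there.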

The body of research completed on these assump- 
tions does indeed form a coherent whole, but these 
results, in spite of their brilliance, are marked by 
a very noticeable formalism. In reality, the models, 
although of great importance at the conceptual 
level, are often too far removed from the facts. In 
the study of man, particularly, the problems posed 
are often too complex for the solutions taken di- 
rectly from the models to describe concrete reality. 

Not all these models, however, are the result of 
purely abstract speculation; construction of some 
of them has been facilitated by experimental data. 
To illustrate this definition of population genetics 
and the problems that it raises, this article will limit 
itself to explaining one determinist model, both 
because it is one of the oldest and simplest to under- 
stand and because it is one of those most often 
verified by observation. 

A determinist model. Let us take the case of a 
particular human population: the inhabitants of 
an island cut off from outside contacts. It is obvi- 
ous that great variability exists among the genes 
carried by the different inhabitants of this island. 
The genotypes differ materially from one another; 
in other words, there is a certain polymorphism in 
the population—polymorphism that we can define 
in genetic terms with the help of a simple example. 

Let us take the case of an autosomal (“not connected
with sex”) gene a, transmitting itself in a mono- 
hybrid diallely. In relation to it individuals can be 
classified in three categories: homozygotes whose 
two alleles are a (a/a); heterozygotes, carriers of 
a and its allele a’ (a/a’); and the homozygotes who 
are noncarriers of a (a’/a’). At any given moment
or during any given generation, these three cate- 
gories of individuals exist within the population in 
certain proportions relative to each other. 

Now, according to Mendel’s second law (the law
of segregation), the population born out of a cross
between an individual who is homozygous for a
(a/a) and an individual who is homozygous for
a’ (a’/a’) will include individuals a/a, a/a’, and
a’/a’ in the following proportions: one-fourth a/a,
one-half a/a’, and one-fourth a’/a’. In this population
the alleles a and a’ have the same frequency,
one-half, and each sex produces half a and half a’.
If these individuals are mated randomly, a simple
algebraic calculation quickly demonstrates that individuals
of the following generation will be distributed
in the same proportions: one-fourth
a/a, one-half a/a’, and one-fourth a’/a’. It will be
the same for succeeding generations.

It can therefore be stated that the genetic struc- 
ture of such a population does not vary from one 
generation to the other. If we designate by p the 
initial proportion of a/a individuals and by q that 
of a’/a’ individuals, we get p + q = 1, or the total- 
ity of the population. Applying this system of sym- 
bols to the preceding facts, it can be easily shown 
that the proportion of individuals of all three cate- 
gories in the first generation born from a/a and 
a’/a’ equals p², 2pq, q². In the second and third
generations the frequencies of the three categories will
remain the same: p², 2pq, q².

Until this point, we have remained at the indi- 
vidual level. If we proceed to that of the gametes 
carrying a or a’ and to that of genes a and a’, we 
observe that their frequencies intermingle. In the 
type of population discussed above, the formula p²,
2pq, q² still applies perfectly, therefore, to the
gametes and genes. This model, which can be re- 
garded as a formalization of the Hardy-Weinberg 
law, has other properties, but our study of it will 
stop here. (For a discussion of the study of isolated 
populations, see Sutter & Tabah 1951.) 
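
The “simple algebraic calculation” referred to above can be checked mechanically. The following is a minimal sketch, assuming random mating in an effectively infinite population with no selection, migration, or mutation; the function name is illustrative. Exact fractions are used so that the equality of successive generations is not blurred by rounding.

```python
from fractions import Fraction


def random_mating(aa, ab, bb):
    """Genotype proportions (a/a, a/a', a'/a') after one round of random mating."""
    aa, ab, bb = Fraction(aa), Fraction(ab), Fraction(bb)
    total = aa + ab + bb
    p = (aa + ab / 2) / total   # frequency of a among the gametes
    q = 1 - p                   # frequency of a'
    return p * p, 2 * p * q, q * q


# Founders: half a/a and half a'/a', as in the example in the text.
first = random_mating(Fraction(1, 2), 0, Fraction(1, 2))
second = random_mating(*first)
print(first, second)

# Unequal founders: p = 3/10 of a/a and q = 7/10 of a'/a'.
first_uneven = random_mating(Fraction(3, 10), 0, Fraction(7, 10))
second_uneven = random_mating(*first_uneven)
print(first_uneven, second_uneven)
```

In both cases the first generation already shows the proportions p², 2pq, q², and further rounds of random mating leave them unchanged.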

Model construction and demographic reality. 
The Hardy-Weinberg law has been verified by 
numerous studies, involving both vegetable and 
animal species. The findings in the field of human 
blood groups have also been studied for a long time 
from a viewpoint derived implicitly from this law, 
especially in connection with their geographic dis- 
tribution. Under the system of reproduction by 
sexes, a generation renews itself as a result of the 
encounter of the sexual cells (gametes) produced 
by individuals of both sexes belonging to the living 
generation. In the human species it can be said 
that this encounter takes place at random. One can 
imagine the advantage that formal population ge- 
netics can take of this circumstance, which can be 
compared to drawing marked balls by lot from two 
different urns. Model construction, already favored 
by these circumstances, is favored even further if 
the characteristics of the population utilized are 
artificially defined with the help of a certain num- 
ber of hypotheses, of which the following is a 
summary description: 

(1) Fertility is identical for all couples; there is 
no differential fertility. 
(2) The population is closed; it cannot, therefore,
be the locus of migrations (whether immigration
or emigration).

(3) Marriages take place at random; there is no 
assortative mating. 

(4) There are no systematic preferential marriages
(for instance, because of consanguinity).

(5) Possible mutations are not taken into consideration.

(6) The size of the population is clearly defined. 

On the basis of these working hypotheses, the 
whole of which constitutes panmixia, it was pos- 
sible, not long after the rediscovery of Mendel’s 
laws, to construct the first mathematical models. 
Thus, population genetics took its first steps for- 
ward, one of which was undoubtedly the Hardy- 
Weinberg law. 

Mere inspection of the preceding hypotheses will 
enable the reader to judge how, taken one by one, 
they conflict with reality. In fact, no human popu- 
lation can be panmictic in the way the models are. 

The following evidence can be cited in favor of 
this conclusion: 

(1) Fertility is never the same with all couples. 
In fact, differential fertility is the rule in human 
populations. There is always a far from negligible 
sterility rate of about 18 per cent among the large 
populations of Western civilization. On the other 
hand, the part played by large families in keeping 
up the numbers of these populations is extremely 
important; we can therefore generalize by em- 
phasizing that for one or another reason individ- 
uals carrying a certain assortment of genes repro- 
duce themselves more or less than the average 
number of couples. That is what makes for the fact 
that in each population there is always a certain 
degree of selection. Hypothesis (1) above, essential 
to the construction of models, is therefore very far 
removed from reality.

(2) Closed populations are extremely rare. Even 
among the most primitive peoples there is always 
a minimum of emigration or immigration. The only 
cases where one could hope to see this condition 
fulfilled at the present time would be those of 
island populations that have remained extremely 
primitive. 

(3) With assortative mating we touch on a 
point that is still obscure; but even if these phe- 
nomena remain poorly understood, it can never- 
theless be said that they appear to be crucial in 
determining the genetic composition of populations. 
This choice can be positive: the carriers of a given 
characteristic marry among each other more often 
than chance would warrant. The fact was demon- 
strated in England by Pearson and Lee (1903): 
very tall individuals have a tendency to marry each
other, and so do very short ones. Willoughby
(1933) has reported on this question with respect 
to a great number of somatic characteristics other 
than height—for example, coloring of hair, eyes 
and skin, intelligence quotient, and so forth. In- 
versely, negative choice makes individuals with the 
same characteristics avoid marrying one another. 
This mechanism is much less well known than the 
above. The example of persons of violent nature 
(Dahlberg 1943) and of red-headed individuals has 
been cited many times, although it has not been 
possible to establish valid statistics to support it. 

(4) The case of preferential marriages is not 
at all negligible. There are still numerous areas 
where marriages between relatives (consanguine- 
ous marriages) occur much more frequently than 
they would as the result of simple random encoun- 
ters. In addition, recent studies on the structures 
of kinship have shown that numerous populations 
that do not do so today used to practice preferential 
marriage—most often in a matrilinear sense. These 
social phenomena have a wide repercussion on the 
genetic structure of populations and are capable of 
modifying them considerably from one generation 
to the other. 

(5) Although we do not know exactly what the 
real rates of mutation are, it can be admitted that 
their frequency is not negligible. If one or several 
genes mutate at a given moment in one or several 
individuals, the nature of the gene or genes is in 
this way modified; its stability in the population 
undergoes a disturbance that can considerably 
transform the composition of that population. 

(6) The size of the population and its limits 
have to be taken into account. We have seen 
that this is one of the essential characteristics im- 
portant in differentiating two large categories of 
models. 


Demography and population genetics 


The above examination brings us into contact 
with the realities of population: fertility, fecundity, 
nuptiality, mortality, migration, and size are the 
elements that are the concern of demography and 
are studied not only by this science but also very 
often as part of administrative routine. Leaving 
aside the influence of size, which by definition is of 
prime importance in the technique of the models, 
there remain five factors to be examined from the 
demographic point of view. Mutation can be ruled 
out of consideration, because, although its impor- 
tance is great, it is felt only after the passage of a 
certain number of generations. It can therefore be 
admitted that it is not of immediate interest. 

We can also set aside choice of a mate, because 
the importance of this factor in practice is still 
unknown. Accordingly, there remain three factors
of prime importance: fertility, migration, and pref- 
erential marriage. Over the last decade the pro- 
gressive disappearance of consanguineous marriage 
has been noted everywhere but in Asia. In many 
civilized countries marriage between cousins has 
practically disappeared. It can be stated, therefore, 
that this factor has in recent years become con- 
siderably less important. 

Migrations remain very important on the genetic 
level, but, unfortunately, precise demographic data 
about them are rare, and most of the data are of 
doubtful validity. For instance, it is hard to judge 
how their influence on a population of Western 
culture could be estimated. 

The only remaining factor, fertility (which to 
geneticists seems essential), has fortunately been 
studied in satisfactory fashion by demographers. 
To show the importance of differential fertility in 
human populations, let us recall a well-known cal- 
culation made by Karl Pearson in connection with 
Denmark. In 1830, 50 per cent of the children in 
that country were born of 25 per cent of the par- 
ents. If that fertility had been maintained at the 
same rate, 73 per cent of the second-generation 
Danes and 97 per cent of the third generation 
would have been descended from the first 25 per 
cent. Similarly, before World War I, Charles B.
Davenport calculated, on the basis of differential 
fertility, that 1,000 Harvard graduates would have 
only 50 descendants after two centuries, while 
1,000 Rumanian emigrants living in Boston would 
have become 100,000. 

Measurement of fertility. Human reproduction 
involves both fecundity (capacity for reproduction)
and fertility (actual reproductive performance). 
These can be estimated for males, females, and 
married couples treated as a reproductive unit. Let 
us rapidly review the measurements that demog- 
raphy provides for geneticists in this domain. 

Crude birth rate. The number of living births 
in a calendar year per thousand of the average pop- 
ulation in the same year is known as the crude 
birth rate. The rate does not seem a very useful 
one for geneticists: there are too many different 
groups of childbearing age; marriage rates are too 
variable from one population to another; birth con- 
trol is not uniformly diffused, and so forth. 

General fertility rate. The ratio of the number 
of live births in one year to the average number of 
women capable of bearing children (usually de- 
fined as those women aged 15 to 49) is known as 
the general fertility rate. Its genetic usefulness is 
no greater than that of the preceding figure. More- 
over, experience shows that this figure is not very 
different from the crude birth rate. 
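
The two rates just defined differ only in their denominators. The sketch below assumes that both are expressed per thousand, which the text states for the crude birth rate and which is taken here as a convention for the general fertility rate; the population figures are hypothetical and serve only to show the arithmetic.

```python
def crude_birth_rate(live_births: int, mean_population: int) -> float:
    """Live births in a calendar year per 1,000 of the average population."""
    return 1000 * live_births / mean_population


def general_fertility_rate(live_births: int, women_15_to_49: int) -> float:
    """Live births in a year per 1,000 women aged 15 to 49 (assumed convention)."""
    return 1000 * live_births / women_15_to_49


# Hypothetical population, not taken from any census.
births = 20_000
population = 1_000_000
women_of_childbearing_age = 250_000

cbr = crude_birth_rate(births, population)
gfr = general_fertility_rate(births, women_of_childbearing_age)
print(cbr, gfr)
```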


Age-specific fertility rates. Fertility rates accord- 
ing to the age reached by the mother during the 
year under consideration are known as age-specific 
fertility rates. Demographic experience shows that 
great differences are observed here, depending on 
whether or not the populations are Malthusian, in
other words, whether they practice birth control 
or not. In the case of a population where the fer- 
tility is natural, knowledge of the mother’s age is 
sufficient. In cases where the population is Mal- 
thusian, the figure becomes interesting when it is 
calculated both by age and by age group of the 
mothers at time of marriage, thus combining the 
mother’s age at the birth of her child and her age 
at marriage. This is generally known as the age- 
specific marital fertility rate. If we are dealing with 
a Malthusian population, it is preferable, in choos- 
ing the sample to be studied, to take into consider- 
ation the age at marriage rather than the age at 
the child’s birth. Thus, while the age at birth is suf- 
ficient for natural populations, these techniques can- 
not be applied indiscriminately to all populations. 

Family histories. Fertility rates can also be cal- 
culated on the basis of family histories, which can 
be reconstructed from such sources as parish regis- 
tries (Fleury & Henry 1965) or, in some countries, 
from systematic family registrations (for instance, 
the Japanese koseki or honseki). The method for 
computing the fertility rate for, say, the 25-29- 
year-old age group from this kind of data is first 
to determine the number of legitimate births in the 
group. It is then necessary to make a rigorous count 
of the number of years lived in wedlock between 
their 25th and 30th birthdays by all the women in 
the group; this quantity is known as the group’s 
total “woman-years.” The number of births is then 
divided by the number of “woman-years” to obtain 
the group’s fertility rate. This method is very useful 
in the study of historical problems in genetics, since 
it is often the only one that can be applied to the 
available data. 
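
The woman-years computation described above can be sketched as follows. The record layout, the names, and the three family histories are hypothetical; a real reconstruction from parish registers would also have to deal with incomplete observation, which this sketch ignores. Only births occurring within the union and within the age band are counted.

```python
from dataclasses import dataclass


@dataclass
class FamilyHistory:
    age_at_marriage: float
    age_at_end_of_union: float       # death, widowhood, or end of observation
    ages_at_births: tuple[float, ...]


def woman_years(history: FamilyHistory, lo: float = 25.0, hi: float = 30.0) -> float:
    """Years lived in wedlock between the lo-th and hi-th birthdays."""
    start = max(history.age_at_marriage, lo)
    end = min(history.age_at_end_of_union, hi)
    return max(0.0, end - start)


def births_in_band(history: FamilyHistory, lo: float = 25.0, hi: float = 30.0) -> int:
    """Births within the union while the mother was aged lo to hi."""
    return sum(
        1
        for age in history.ages_at_births
        if lo <= age < hi
        and history.age_at_marriage <= age < history.age_at_end_of_union
    )


def marital_fertility_rate(histories, lo: float = 25.0, hi: float = 30.0) -> float:
    """Births divided by woman-years for one age group."""
    total_years = sum(woman_years(h, lo, hi) for h in histories)
    if total_years == 0:
        raise ValueError("no woman-years observed in this age group")
    return sum(births_in_band(h, lo, hi) for h in histories) / total_years


# Three hypothetical reconstructed families.
SAMPLE = [
    FamilyHistory(22.0, 45.0, (23.0, 25.5, 28.0)),
    FamilyHistory(27.0, 50.0, (28.0, 29.5, 31.0)),
    FamilyHistory(24.0, 26.0, (25.0,)),
]
print(marital_fertility_rate(SAMPLE))
```

In this invented sample the three women contribute five, three, and one woman-years respectively, so that five births are divided by nine woman-years.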

Measurement of reproduction. Let us leave fer- 
tility rates in order to examine rates of reproduc- 
tion. Here we return to more purely genetic con- 
siderations, since we are looking for the mechanism 
whereby one generation is replaced by the one that 
follows it. Starting with a series of fertility rates 
by age groups, a gross reproduction rate can be cal- 
culated that gives the average number of female 
progeny that would be born to an age cohort of 
women, all of whom live through their entire re- 
productive period and continue to give birth at the 
rates prevalent when they themselves were born. 
The gross reproduction rate obtaining for a popu- 
lation at any one time can be derived by combining 
the rates for the different age cohorts. 
A gross reproduction rate for a real generation
can also be determined by calculating the average 
number of live female children ever born to women 
of fifty or over. As explained above, this rate is 
higher for non-Malthusian than for Malthusian 
populations and can be refined by taking into con- 
sideration the length of marriage. 
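
A gross reproduction rate computed from a series of age-specific fertility rates, as described above, can be sketched in a few lines. The rates below are hypothetical, and the share of female births (0.488) is an assumed round figure rather than one given in the text; mortality is ignored, as the definition of a gross rate requires.

```python
def gross_reproduction_rate(
    rates_per_woman_year: list[float],
    width_years: int = 5,
    female_share: float = 0.488,   # assumed proportion of births that are female
) -> float:
    """Average number of daughters per woman surviving the whole reproductive period.

    Each rate is births per woman per year within one age group; a woman
    spends width_years in each group.
    """
    return width_years * female_share * sum(rates_per_woman_year)


# Hypothetical rates for the age groups 15-19, 20-24, ..., 45-49.
RATES = [0.02, 0.12, 0.15, 0.10, 0.06, 0.02, 0.00]

grr = gross_reproduction_rate(RATES)
print(round(grr, 4))
```

A value above one would mean that, mortality apart, each generation of women is more than replaced by its daughters.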

We have seen that in order to be correct, it is 
necessary for the description of fertility in Malthu- 
sian populations to be closely related to the date of 
marriage. Actually, when a family reaches the size 
that the parents prefer, fertility tends to approach 
zero. The preferred size is evidently related to 
length of marriage in such a manner that fertility 
is more closely linked with length of marriage than 
with age at marriage. In recent years great progress 
has been made in the demographic analysis of fer- 
tility, based on this kind of data. This should en- 
able geneticists to be more circumspect in their 
choice of sections of the population to be studied. 

Cohort analysis. Americans talk of cohort anal- 
ysis, the French of analysis by promotion (a term 
meaning “year” or “class,” as we might speak of 
the “class of 1955”). A cohort, or promotion, in- 
cludes all women born within a 12-month period; 
to estimate fertility or mortality, it is supposed that 
these women are all born at the same moment on 
the first of January of that year. Thus, women born 
between January 1, 1900, and January 1, 1901, 
are considered to be exactly 15 years old on Jan- 
uary 1, 1915; exactly 47 years old on January 1, 
1947; and so forth.

The research done along these lines has issued 
in the construction of tables that are extremely 
useful in estimating fertility in a human popula- 
tion. As we have seen, it is more useful to draw up 
cohorts based on age at marriage than on age at 
birth. A fertility table set up in this way gives for 
each cohort the cumulative birth rate, by order of 
birth and single age of mother, for every woman 
surviving at each age, from 15 to 49. The progress 
that population genetics could make in knowing 
real genic frequencies can be imagined, if it could 
concentrate its research on any particular cohort 
and its descendants. 
This rapid examination of the facts that demography
can now provide in connection with fertility
clearly reveals the variables that population genet- 
ics can use to make its models coincide with reality. 
The models retain their validity for genetics be- 
cause they are still derived from basic genetic con- 
cepts; their application to actual problems, how- 
ever, should be based on the kind of data mentioned 
above. We have deliberately limited ourselves to the
problem of fertility, since it is the most important
factor in genetics research. 

The close relationship between demography and 
population genetics that now appears can be illus- 
trated by the field of research into blood groups. 
Although researchers concede that blood groups are 
independent of both age and sex, they do not explore
the full consequences of this, since their
measures are applied to samples of the population 
that are “representative” only in a demographic 
sense. We must deplore the fact that this method 
has spread to the other branches of genetics, since 
it is open to criticism not only from the demo- 
graphic but from the genetic point of view. By 
proceeding in this way, a most important factor is 
overlooked—that of genic frequencies. 

Sample structure. Let us admit that the choice 
of a blood group to be studied is of little impor- 
tance when the characteristic is widely distributed 
throughout the population—for instance, if each 
individual is the carrier of a gene taken into ac- 
count in the system being studied (e.g., a system 
made up of groups A, B, and O). But this is no 
longer the case if the gene is carried only by a few 
individuals—in other words, if its frequency attains 
0.1 per cent or less. In this case (and cases like 
this are common in human genetics) the structure
of the sample examined begins to take on prime 
importance. 

A brief example must serve to illustrate this 
cardinal point. We have seen that in the case of 
rare recessive genes the importance of consan- 
guineous marriages is considerable. The scarcer 
that carriers of recessive genes become in the pop- 
ulation as a whole, the greater the proportion of 
such carriers produced by consanguineous mar- 
riages. Thus if as many as 25 per cent of all in- 
dividuals in a population are carriers of recessive 
genes, and if one per cent of all marriages in that 
population are marriages between first cousins, 
then this one per cent of consanguineous unions 
will produce 1.12 times as many carriers of re- 
cessive genes as will be produced by all the unions 
of persons not so related. But if recessive genes 
are carried by only one per cent of the total popu- 
lation, then the same proportion of marriages be- 
tween first cousins will produce 2.13 times as many 
carriers as will be produced by all other marriages. 
This production ratio increases to 4.9 if the total 
frequency of carriers is 01 per cent, to 20.2 if it 
is .005 per cent, and to 226 if it is .0001 per cent. 
Under these conditions, one can see the importance 
of the sampling method used to estimate the fre- 
quency within a population, not only of the indi- 
viduals who are carriers but of the gametes and 
genes themselves.

Genealogical method. It should be emphasized
that genetic studies based on genealogies remain 
the least controversial. Studying a population where 
the degrees of relationship connecting individuals 
are known presents an obvious interest. Knowing 
one or several characteristics of certain parents, we 
can follow what becomes of these in the descend- 
ants. Their evolution can also be considered from 
the point of view of such properties of genes as 
dominance, recessiveness, expressivity, and pene- 
trance. But above all, we can follow the evolution 
of these characteristics in the population over time 
and thus observe the effects of differential fertility. 
Until now the genealogical method was applicable 
only to a numerically sparse population, but prog- 
ress in electronic methods of data processing per- 
mits us to anticipate its application to much larger 
populations (Sutter & Tabah 1956). 

Dynamic studies. In very large modern popu- 
lations it would appear that internal analysis of 
cohorts and their descendants will bring in the 
future a large measure of certainty to research in 
population genetics. In any case, it is a sure way to 
a dynamic genetics based on demographic reality. 
For instance, it has been recommended that blood 
groups should be studied according to age groups; 
but if we proceed to do so without regard for 
demographic factors, we cannot make our observa- 
tions dynamic. Thus, a study that limits itself to, 
let us say, the fifty- to sixty-year-old age group will 
have to deal with a universe that includes certain 
genetically “dead” elements, such as unmarried and 
sterile persons, which have no meaning from the 
dynamic point of view. But if a study is made of 
this same fifty- to sixty-year-old age group and then 
of the twenty- to thirty-year-old age group, and if in 
the older group only those individuals are con- 
sidered who have descendants in the younger 
group, the dynamic potential of the data is max- 
imized. It is quite possible to subject demographic 
cohorts to this sort of interpretation, because in 
many countries demographic statistics supply series 
of individuals classified according to the mothers’ 
age at their birth. 

Other demographic factors. This discussion 
would not be complete if we did not stress another 
aspect of the genetic importance of certain demo- 
graphic factors, revealed by modern techniques, 
which have truly created a demographic biology. 
Particularly worthy of note are the mother’s age, 
order of birth, spacing between births, and size 
of family. 

The mother’s age is a great influence on fe- 
cundity. A certain number of couples become in- 
capable of having a second child after the birth of 
the first child; a third child after the second; a
fourth after the third; and so forth. This sterility
increases with the length of a marriage and espe- 
cially after the age of 35. It is very important to 
realize this when, for instance, natural selection 
and its effects are being studied. 

The mother’s age also strongly influences the 
frequency of twin births (monozygotic or dizygotic), 
spontaneous abortions, stillborn or abnormal births, 
and so on. Many examples can also be given of the 
influence of the order of birth, the interval between 
births, and the size of the family to illustrate their 
effect on such things as fertility, mortality, mor- 
bidity, and malformations. 


It has been demonstrated above how seriously 
demographic factors must be taken into considera- 
tion when we wish to study the influence of the 
genetic structure of populations. We will leave 
aside the possible environmental influences, such 
as social class and marital status, since they have 
previously been codified by Osborn (1956/1957) 
and Larsson (1956-1957), among others. At the 
practical level, however, the continuing efforts to 
utilize vital statistics for genetic purposes should 
be pointed out. In this connection, the research of 
H. B. Newcombe and his colleagues (1965), who 
are attempting to organize Canadian national sta- 
tistics for use in genetics, cannot be too highly 
praised. The United Nations itself posed the prob- 
lem on the world level at a seminar organized in 
Geneva in 1960. The question of the relation be- 
tween demography and genetics is therefore being 
posed in an acute form. 

These problems also impinge in an important 
way on more general philosophical issues, as has 
been demonstrated by Haldane (1932), Fisher 
(1930), and Wright (1951). It must be recognized, 
however, that their form of Neo-Darwinism, al- 
though it is based on Mendelian genetics, too often 
neglects demographic considerations. In the future 
these seminal developments should be renewed in 
full confrontation with demographic reality. 


JEAN SUTTER 


[Directly related are the entries COHORT ANALYSIS; 
FERTILITY; FERTILITY CONTROL. Other relevant ma-
terial may be found in NUPTIALITY; RACE; SOCIAL
BEHAVIOR, ANIMAL, article on THE REGULATION OF 
ANIMAL POPULATIONS.]


III
RACE AND GENETICS


Since 1900, the study of genetics has made two 
major contributions to the theoretical understand-
ing of the biology of race. First is the replacement
of earlier theories of racial heredity based on the
blending of characteristics by the newer, genetical 
notion of a breeding population incorporating gene 
frequencies based on the particulate, or Mendelian, 
theory of heredity. Because of the use of a mislead- 
ing theory of heredity, nearly all of the writing on 
subspecific taxonomy of the human species before 
1900, and unfortunately much written since then, 
is incorrect. 

The second contribution of genetics has been to 
place race in the perspective of a general theory 
of organic evolution. This interpretation of racial 
biology derives from those parts of population ge- 
netics initially developed by Wright (1931; 1951) 
and Haldane (1932) and interpreted variously 
by Dobzhansky (1962), Mayr (1963), Simpson 
(1961), and Morton and Yasuda (1963). We can- 
not adequately understand the evolutionary history 
of mankind either in terms of individuals or in 
terms of the species as a whole because individuals 
do not evolve and the details of the evolutionary 
process are not necessarily uniform for all subdi- 
visions of the species. This is why it is of theoret- 
ical importance to recognize subspecies in evolu- 
tionary biology, whether or not they are given 
taxonomic recognition. 

Race and subspecies. A “race” is a category logi- 
cally more inclusive than the notion of “individual” 
and less inclusive than the notion of “species.” 
Most contemporary authorities equate the anthro- 
pological concept of race with the zoological con- 
cept of subspecies. 

Subspecies and races are two or more genetically 
distinguishable populations of a species that have 
separate distributions but which can, and do, inter- 
breed freely in overlap zones or when brought into 
contact. The frequency of gene exchange between 
races is highest in the contact zone and decreases 
away from it. 

Since the geographical change in gene frequency 
is gradual, any attempt to separate sharply local 
populations into races having different gene fre- 
quencies is arbitrary. Often groups defined by one 
set of genetically determined characteristics show 
a clear geographical distribution. The distribution 
of a group defined by another set of alleles may be 
mapped with equal certainty; usually the geograph- 
ical boundaries of two such classifications will not 
coincide very closely. 

Knowledge of race. Statements about race in- 
volve two distinct kinds of knowledge, one about 
attributes, the other about relationships. Usually 
our knowledge of the phenotypes used to define 
races is direct and observational; in principle there 
may be full agreement among experts regarding 
the attributes which characterize a given group of
people. Our knowledge about the biological rela-
tionships among individuals often is indirect, being
based on interviews or inferred from geographical, 
societal, or cultural circumstances. Two individuals 
are biological relatives when one developed from a 
zygote formed by a gamete from the other or both 
developed from zygotes formed by gametes from 
a common ancestor. Each individual is connected 
by gametes to his parents, to the members of one 
or more races, and, ultimately, to the species. In 
practice it is rare to know directly all relevant 
parent-child links connecting the members of a 
race. Genetics makes rules for predicting the distri- 
bution of phenotypes—genotypes among individuals 
of known relationship. In the study of race such 
rules are used to infer relationships of populations 
of individuals whose phenotypes—genotypes are 
known. 

Particulate versus blending heredity. The mo- 
lecular, particulate nature of the genetic material 
and the facts of gene segregation and recombina- 
tion have great theoretical significance for the 
biology of race. 

If the hereditary materials were blended (as 
nineteenth-century biologists assumed) like solutes 
in solvents, every member of a breeding population 
would soon reach hereditary uniformity and, bar- 
ring mutation and hybridization, a “pure” race 
would be established in each locality. The genetic 
material would become unique to each local breed- 
ing population, and genotypes of offspring would 
necessarily be a midway blend of those of their 
parents. But, in fact, segregation and recombina- 
tion do produce novel genic variation within the 
local breeding population. Genes in body cells oc- 
cur in pairs (AA, Aa, aa). When gametes are 
formed, the members of each pair segregate so 
that each gamete has only one gene from each pair 
(A or a) and each child receives only one member 
of a pair from each parent (i.e., two parents of 
genotype Aa may produce a child who is AA, Aa, 
or aa). Also, each pair of genes at other loci (non-
alleles) is capable of undergoing segregation sep-
arately, so that two pairs of genes which are to-
gether in one parent (say AAbb or aaBB) may be
recombined in the child (AaBb), the recombination 
rate being slower for linked than for independent 
nonalleles. 
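
The segregation rule just described can be made concrete by simple enumeration. The following Python sketch is illustrative only: the function names `gametes` and `cross` are hypothetical, and the loci are assumed to be unlinked (independent), so the slower recombination of linked nonalleles is not modeled.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def gametes(genotype):
    """Return the equally likely gametes of a genotype such as 'Aa' or 'AAbb'.

    The genotype string lists the two genes of each locus side by side;
    each gamete receives exactly one gene from each pair.
    """
    pairs = [genotype[i:i + 2] for i in range(0, len(genotype), 2)]
    return ["".join(choice) for choice in product(*pairs)]

def cross(parent1, parent2):
    """Return offspring genotypes with their probabilities (loci assumed unlinked)."""
    counts = Counter()
    for g1, g2 in product(gametes(parent1), gametes(parent2)):
        # Pair the genes locus by locus; sorting makes 'aA' and 'Aa' one genotype.
        child = "".join("".join(sorted(a + b)) for a, b in zip(g1, g2))
        counts[child] += 1
    total = sum(counts.values())
    return {g: Fraction(n, total) for g, n in counts.items()}

one_locus = cross("Aa", "Aa")      # AA, Aa, and aa children in the ratio 1:2:1
two_loci = cross("AAbb", "aaBB")   # every child is the recombined type AaBb
print(one_locus)
print(two_loci)
```

Exact fractions are used so that the familiar 1:2:1 ratio appears without rounding error.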

Before the effective start of genetics in 1900, it 
was believed that hereditary material was a homo- 
geneous substance which could be mixed or diluted 
but was identical in all members of a “pure” race. 
If two individuals were alike in one hereditary as- 
pect, they would be alike in all others. Within a 
“pure” race all variation was due to environmental 
effects. Genetics now shows that hereditary mate-
rial is a heterogeneous collection of separate par-
ticles which need not be the same even in close
relatives. Two closely related individuals may have 
many genes in common but may differ in others. 
Likewise, because of parallel mutation, unrelated 
people may have some genes of the same function 
even though these genes are not derived from a 
common ancestor. 

Before genetics, it was believed that the heredi- 
tary material differed in kind between races of the 
same species, just as two elements differ. The pre- 
genetic view of race involved what Aristotle called 
the “essence” of material bodies. Just as atoms of 
copper and tin are of different construction, so 
two races were thought to have hereditary material 
different in essence; the difference was, accord- 
ingly, not in degree but in kind. As atoms of copper 
can join those of tin to make bronze, so it was 
thought “pure” races come together to form a mixed 
race. It was possible to think of individuals of 
mixed races as being “mostly Alpine” or “mostly 
Nordic,” in the same way that it was possible to 
think of alloys with varying fractions of different 
metals. Using an analogy from ceramics, certain 
anthropologists even as late as the 1940s spoke 
of race X having a “wash” of race Y. As is shown 
below, such beliefs are refuted by the identification 
of homologous genes in different races. 

Kinds of gene differences between races. It is 
clear we should not speak of racial characteristics, 
only of racial differences. Differences between any 
two races express differently occurring frequencies 
of autosomal genes. These may be written as fol- 
lows: 


Population 1:

\[ (pA + qa)^2 \, (p'B + q'b)^2 \cdots (p''N + q''n)^2 = 1, \]

Population 2:

\[ (rA' + sa')^2 \, (r'B' + s'b')^2 \cdots (r''N' + s''n')^2 = 1. \]

Here $A, a, \ldots, N', n'$ represent sets of alleles with
frequencies $p + q = 1$, $r + s = 1$, where $0 < p, r < 1$.
(The arrays of possible genotypes can be derived 
as the products of the squares of the above bino- 
mials. If there are k alleles at a locus, the binomial 
is replaced by a k-nomial. For sex-chromosomal 
loci, the array of genotypes in the heterogametic 
sex is given by the first power of the k-nomial.) By 
identifying and comparing the frequencies of genes 
between populations, we may distinguish four 
kinds of differences: 

(1) If the same kinds and frequencies are pres- 
ent in both, the two populations are identical. 

(2) If gene frequencies equal zero or unity in 
one population but are not fixed (0 < p,r < 1) in 
the other, or if some but not all frequencies are 
unity in one and zero in the other, the two popula-
tions are qualitatively different in gene frequency
at one or more but not at all loci; they show a 
specific difference in kind. 

(3) If the allele at all loci is zero in one popula- 
tion and unity in the other, there is a general dif- 
ference in kind. 

(4) If none of the frequencies equal zero or 
unity, the populations differ in degree not in kind. 
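
The genotype array and the four kinds of difference can be sketched in code. The Python example below is a hedged illustration, not a procedure from the literature cited here: the function names are hypothetical, each locus is reduced to the frequency of a single chosen allele, and mixed cases that the list leaves unstated (for instance, both populations fixed for the same allele at one locus while differing in degree at another) are assigned to sort (4) by assumption.

```python
import math

def genotype_array(p):
    """Expand (pA + qa)^2 for one autosomal locus with two alleles."""
    q = 1.0 - p
    return {"AA": p * p, "Aa": 2 * p * q, "aa": q * q}

def is_fixed(freq):
    """A frequency is 'fixed' when it equals zero or unity."""
    return freq in (0.0, 1.0)

def classify_difference(pop1, pop2):
    """Classify two populations by the four kinds of difference.

    pop1 and pop2 map each locus name to the frequency of one chosen allele;
    both are assumed to list the same loci.
    """
    pairs = [(pop1[locus], pop2[locus]) for locus in sorted(pop1)]
    if all(math.isclose(a, b) for a, b in pairs):
        return "(1) identical"
    # Opposite fixation: unity in one population and zero in the other.
    if all(is_fixed(a) and is_fixed(b) and a != b for a, b in pairs):
        return "(3) general difference in kind"
    # Fixed in at least one population, with a different value in the other.
    if any((is_fixed(a) or is_fixed(b)) and a != b for a, b in pairs):
        return "(2) specific difference in kind"
    return "(4) difference in degree"

# Made-up frequencies, for illustration only.
print(genotype_array(0.5))
print(classify_difference({"locus1": 1.0, "locus2": 0.4},
                          {"locus1": 0.5, "locus2": 0.4}))
```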

Prior to the twentieth century, most anthropolo- 
gists and biologists assumed the differences be- 
tween local populations within the same race were 
of sort (1) and those between races were of sort 
(3). The early students assumed the genetic ma- 
terial of (pure) races was homogeneous within, 
and qualitatively different between, races. 

Today we have frequency information on nearly 
fifty genes in which some of the alleles at each 
locus are common enough in one or more human 
populations to be of anthropological interest. Sev- 
eral hundred rare major genes are known to medi- 
cal genetics. The knowledge gained from both 
human and general genetics demonstrates that 
“pure” races in the sense of homogeneous breeding 
populations differing in the kind of genes present 
at all loci do not exist in the human species or in 
any known sexual species of animals or plants. 

For at least eight known human chromosomal 
loci, specific differences in kind of sort (2) are 
known to hold between some pairs of human races. 
For instance, the gene associated with blood group 
A of the ABO series is present in all known Euro- 
pean populations, as is the gene for B, but gene A 
is absent in some, and gene B is absent in many 
American Indian populations. The genes for the 
Diego, Henshaw, Hunter, Kell, Lutheran, Rhesus, 
and hemoglobin ACS loci show detectable differ-
ences in kind between two or more human races.

Differences of sort (4), i.e., cases where the 
same kinds of genes are present in different fre- 
quency, constitute the most common kind of ge- 
netic divergence observed between local and geo- 
graphical races of man for both normal and nearly 
all of the identified deleterious major genes. (See 
Dobzhansky 1962 or recent textbooks of human ge-
netics for further data on the major genes men-
tioned above.)

Typology and race. The pregenetic typological 
approach to racial anthropology used at least three 
different concepts of “types.” All are unacceptable 
to modern genetics, each for a different reason. 

(a) The average type of a population is defined 
by a set of traits used to identify members of the 
(“pure”) race. Thus Nordics in Sweden and else- 
where are characterized by tall stature, long heads, 
blond hair, and blue eyes. From the genetic point of
view there are two basic difficulties in this ap-
proach: Only a small fraction of the members of
any local breeding population exhibit the full set 
of traits used to define the type, and the vast 
amount of genetic variability known to be present 
in all sexually reproduced populations is not rec-
ognized. Retzius and Fürst found only 10.07 per
cent of a large sample of Swedish conscripts had 
all four of the above diagnostic traits for Nordics 
(Dahlberg 1942). Since a breeding population has 
no average genotype, it cannot have an average 
phenotype; thus, it is misleading to pick out a typ- 
ical (average) member of any race. 

(b) Morphological types, which occur in groups 
of two to about ten in a given population, are also 
defined by a cluster of traits. The racial history of 
a population is explained by the supposed coming 
together and intermixture of individuals belonging 
to the several types, all wrongly assumed to be 
stable over time and to represent ancestral stocks 
(see, e.g., Hooton 1926; 1931). 

An example of misinterpreted history comes 
from western Europe, where most people in the 
north are tall, blond, and blue-eyed; most in the 
south are short, brunet, and brown-eyed; and those 
in the midland are assorted in stature and pigmen- 
tation, with short blonds, tall brunets, brown-eyed 
blonds, etc. Those who use morphological types to 
explain racial history assume the people of the 
midland are a mixture of migrants from the north 
and south. But this is not necessarily what hap- 
pened in history; the “mixed” group may be the 
actual common ancestor of the two “pure” groups 
concentrated by natural selection in the two ex- 
tremes of the geographical range. The rather scanty 
information with historical depth suggests this lat- 
ter explanation is the correct one for the origin of 
the “morphological races” of western Europe. 

The processes of gene segregation and recombi- 
nation, together with those of parallel and recurrent 
mutation, negate the possibility of sorting the mem- 
bers of an interbreeding population into morpho- 
logical types representing ancestral stocks. A sim- 
ple hypothetical example will illustrate this. The 
initial generation includes two “races” (Table 1). 
Genes A and a are autosomal and lack dominance. 
Since the A allele is fixed in one and the a allele in 
the other, each “race” is genotypically invariant for 
the traits. Now if north and south send migrants to
midland and there hybridize, the F1 hybrids will all
have genotype Aa and phenotype XY. Intercrossing
of the F1's will produce all three genotypes in subse-
quent generations of the new midland “race.” But
in this mixed “race,” individuals of type XY are
more closely related (a distance of one parent-child
step) to their homozygous parents of type X or Y
than are individuals of type X or Y who turn up in
subsequent generations (distant by two or more
parent-child steps). It may also happen that,
through mutation rather than descent, members of
the population may come to possess genes identical
to those of the founding generation.

Table 1

              North    South
Phenotype     X        Y
Genotype      AA       aa

Sameness of phenotype represents neither degree 
of closeness nor the fact of relationship in a mixed 
population. Further, sameness of some specific 
phenotype does not guarantee that a given individ- 
ual in a mixed population will inherit other desir- 
able or undesirable traits from an ancestor of the 
same specific phenotype. (See Dahlberg 1943 for 
extension of this argument to other kinds of major 
genes and to polygenes.)

(c) The individual types of the Czekanowski 
group of Polish anthropologists are “racial ele- 
ments” defined by a cluster of traits assumed to be 
controlled by one, or several closely linked, pleio- 
tropic gene(s). These genes simultaneously affect 
the set of type-defining traits in whatever popula- 
tion they are found. This typology is compatible 
with population genetics if the frequencies of the 
“racial elements” have the empirical properties of 
the frequencies of pleiotropic genes, a possibility, 
however, not supported by family studies or by the 
fit of observed population frequencies of “racial 
elements” to those expected for genotypes in equi-
librium Mendelian populations (Bielicki 1965).

Modes of change in gene frequency. The theory 
of population genetics builds upon a population 
model in which gene frequencies are in equilib-
rium. The relative frequencies of genes and of
genotypes remain steady in a randomly mated 
breeding population if mutation does not prefer- 
entially add or subtract genes, if genes flowing 
into the population from the outside differ neither 
in kind nor in frequency from those native to the 
local population, if there is no differential fertility 
or mortality between genes and genotypes, and if 
the breeding population is large enough in numbers 
that genetic drift does not occur. If these equilib- 
rium conditions do not hold, gene and/or genotype 
frequencies will change until a new steady state is 
attained. 

Inbreeding and assortative mating, the two most 
important departures from random mating, do not 
by themselves change gene frequencies but only
the relative frequencies of genotypes; such changes
are usually in the direction of increased homozy- 
gosity. 
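
That inbreeding shifts genotype frequencies while leaving gene frequencies untouched can be checked numerically. The Python sketch below relies on the standard textbook relation using an inbreeding coefficient F, which is an assumption brought in here rather than a formula stated in this article; the function names are hypothetical.

```python
def genotype_frequencies(p, F=0.0):
    """Genotype frequencies at a two-allele locus with inbreeding coefficient F.

    F = 0 corresponds to random mating; F = 1 to complete homozygosity.
    """
    q = 1.0 - p
    return {
        "AA": p * p + F * p * q,
        "Aa": 2.0 * p * q * (1.0 - F),
        "aa": q * q + F * p * q,
    }

def allele_frequency(genotypes):
    """Frequency of allele A recovered from the genotype frequencies."""
    return genotypes["AA"] + 0.5 * genotypes["Aa"]

random_mating = genotype_frequencies(0.3)
# F = 1/16 is the usual coefficient for the offspring of first cousins.
first_cousins = genotype_frequencies(0.3, F=1 / 16)

print(random_mating, allele_frequency(random_mating))
print(first_cousins, allele_frequency(first_cousins))
```

In both cases the recovered frequency of A is 0.3 (up to floating-point rounding); only the share of heterozygotes falls.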

Mutation. Mutation is the primary source of all 
gene variation. The molecular basis of mutation is 
change in the sequence of nucleotides in DNA. 
Estimated spontaneous mutation rates in man vary 
from about one to one hundred mutations per 
million gametes. The rates are specific for loci and 
alleles but may vary somewhat according to the 
mutagenic nature of the environment. Probably the 
mutation rates for the more common human genes 
are sufficiently small compared with the rates for 
other modes of change that mutation is not a major 
determinant of observable local and regional differ- 
ences in gene frequencies. However, it should be 
emphasized that mutation is repetitive; within a 
species, a mutation is not likely to occur only once, 
nor to occur in one major race and not in another. 
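
A rough calculation shows why mutation alone changes common gene frequencies so slowly. The Python sketch below uses the standard recurrent-mutation recursion, which is an assumption added here and not a formula given in this article; `u` and `v` are hypothetical names for the forward and reverse mutation rates.

```python
def mutate(p, u, v=0.0):
    """One generation of recurrent mutation at a two-allele locus.

    p is the frequency of allele A, u the rate A -> a, v the rate a -> A.
    """
    return p * (1.0 - u) + (1.0 - p) * v

def after_generations(p, u, v, generations):
    """Apply the mutation step repeatedly and return the final frequency."""
    for _ in range(generations):
        p = mutate(p, u, v)
    return p

# Upper end of the range quoted in the text: 100 mutations per million gametes.
p_end = after_generations(0.5, u=1e-4, v=0.0, generations=100)
print(p_end)  # still close to the starting value of 0.5
```

Even at the highest rate quoted above, a hundred generations of one-way mutation move a gene frequency of 0.5 by less than 0.01.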

Gene flow. Gene flow refers to introduction of 
genes from outside the local breeding population. 
The process is general for loci, haploid sets of 
genes being introduced on each occasion. In terms 
of the change in frequency, the most important 
type of gene flow involves recurrent, more or less 
regular introduction of gametes from neighboring 
groups which are partially isolated reproductively. 
When the parental populations are sufficiently dis- 
similar in gene frequencies, gene flow is called race 
mixture. Mixture may result in very rapid changes 
in gene frequencies. When clines are reflections of 
gene flow, the gene frequencies of several loci 
should show similar cline distribution when the 
incoming and native populations differ in those 
frequencies. 
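
The speed of change under recurrent gene flow can be illustrated with a simple mixing model. The Python sketch below is an assumption-laden illustration: it uses the standard one-way migration recursion, in which a fixed fraction `m` of each generation's gametes comes from a single source population, and all names are hypothetical.

```python
def gene_flow(p_local, p_source, m):
    """Local gene frequency after one generation in which a fraction m
    of the gametes is introduced from the source population."""
    return (1.0 - m) * p_local + m * p_source

def generations_of_flow(p_local, p_source, m, generations):
    """Return the local gene frequency in every generation, starting value first."""
    history = [p_local]
    for _ in range(generations):
        p_local = gene_flow(p_local, p_source, m)
        history.append(p_local)
    return history

# A gene absent locally but fixed in the neighboring group, with 10 per cent flow.
history = generations_of_flow(0.0, 1.0, m=0.1, generations=10)
print(history[-1])
```

The gap between the local and source frequencies shrinks by the factor (1 - m) in each generation, which is why mixture can alter frequencies far faster than mutation.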

Recurrent gene flow has been an important fac- 
tor preventing the splitting of mankind into allo- 
patric species. Before the development of extraso- 
matic modes of transportation, the vast majority of 
gene flow involved neighboring populations. With 
the development of more rapid transportation, the 
rate and distance of gene flow increased. Since A.D. 
1600 mixture has been the most significant factor 
in population differentiation. 

Selection. Selection is the main guiding force 
in race formation. Genes or genotypes which, on 
the average, increase genetic fitness will become 
more frequent in later generations. Genetic fitness is 
defined solely in terms of differential fertility and/ 
or mortality. For a given environment, genes may 
be harmful, neutral, or beneficial. The population 
frequencies of harmful genes are determined by 
the balance between their rate of entry by mutation 
and their rate of removal by selection. Neutral 
genes must be very rare, their spread very slow,
and their replacement by more successfully adap-
tive genes very rapid. An important class of genes
are those with increased fitness in heterozygotes; 
such genes spread rapidly through local popula- 
tions and, given gene flow, through those sections 
of the species population with similar environmen- 
tal conditions. These genes with a selective advan- 
tage of the heterozygote may have stable equilib- 
rium frequencies largely determined by selection 
coefficients and nearly independent of initial gene 
frequencies and mutation rates. If polygenes are 
additive, selection acts on each locus independently 
of the others. Linkage between polygenes, unless 
very strong, is unimportant for selection rates. 
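
The statement that heterozygote advantage leads to an equilibrium nearly independent of the initial gene frequency can be verified by iteration. The Python sketch below assumes the standard one-locus selection model with fitnesses 1 - s, 1, and 1 - t for AA, Aa, and aa; these symbols and the function names are illustrative assumptions, not notation from this article.

```python
def select(p, s, t):
    """One generation of selection with fitnesses AA: 1 - s, Aa: 1, aa: 1 - t."""
    q = 1.0 - p
    w_AA, w_Aa, w_aa = 1.0 - s, 1.0, 1.0 - t
    mean_fitness = p * p * w_AA + 2.0 * p * q * w_Aa + q * q * w_aa
    return p * (p * w_AA + q * w_Aa) / mean_fitness

def run(p, s, t, generations=2000):
    """Iterate selection and return the final frequency of allele A."""
    for _ in range(generations):
        p = select(p, s, t)
    return p

s, t = 0.1, 0.3
predicted = t / (s + t)            # equilibrium frequency of A under this model
from_rare = run(0.01, s, t)        # A starts rare
from_common = run(0.99, s, t)      # A starts common
print(predicted, from_rare, from_common)
```

Both runs settle at the same value, determined by the selection coefficients alone.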

Genetic drift. The two genes at each autosomal 
locus in an individual are a sample, one from each 
parent, of the four genes at that locus in the par-
ents. With small population numbers, random fluc-
tuation in this sampling process may change gene
frequencies, leading to concentration or fixation of 
some alleles and loss of others in local populations. 
The resulting irregular and patchy gene-frequency 
distributions may be nonadaptive. 
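
The sampling process behind drift can be imitated with a short simulation. The Python sketch below uses the Wright-Fisher scheme, a standard idealization that this article does not name: each generation's 2N genes are drawn at random from the gene pool of the previous generation. The function name and parameters are hypothetical, and the seed only makes a run repeatable.

```python
import random

def drift(p, population_size, generations, seed=0):
    """Follow one allele's frequency under random sampling alone.

    p is the starting frequency; population_size is the number of
    diploid individuals, so 2 * population_size genes are drawn each time.
    """
    rng = random.Random(seed)
    n_genes = 2 * population_size
    history = [p]
    for _ in range(generations):
        # Each gene of the new generation is the allele with probability p.
        copies = sum(rng.random() < p for _ in range(n_genes))
        p = copies / n_genes
        history.append(p)
    return history

small = drift(0.5, population_size=20, generations=50)
large = drift(0.5, population_size=2000, generations=50)
print(small[-1], large[-1])
```

Frequencies in the small population typically wander much further from 0.5 than those in the large one; once an allele reaches 0 or 1 it stays there, which corresponds to the loss and fixation described above.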

The origin of races. The formation of races is 
not characteristic of all sexual species. Race forma- 
tion requires partial reproductive isolation. Change 
in gene frequency is the fundamental event in the 
formation of races. Race formation, i.e., adaptation 
to local conditions, must occur in all successful, 
widespread species with partially isolated local pop- 
ulations. Given the raw materials of gene diversity, 
the genetic subdivision of a species depends largely 
on local selection pressures, on the one hand, and 
gene flow, on the other. If there is much gene flow, 
local races cannot form; if there is less gene flow, 
clines may develop; if there is little gene flow, local 
races will differentiate. If any of the local popula- 
tions are small in number, say under one hundred, 
there may be considerable loss and fixation of genes 
by random genetic drift. 

Race differences in behavior. Human races dif- 
fer in languages and culture as well as in genes. 
One of the most significant contributions of the 
social sciences to the general knowledge of the 
twentieth century is the demonstration that a large 
fraction of functionally important differences be- 
tween human groups in learned behavior is largely 
independent of race, or as we put it, independent 
of genic differences distinguishing races. 

Differences in the frequency of major genes be- 
tween local populations as well as major geograph- 
ical races are well established for some behavioral 
traits, e.g., the ability to taste phenylthiocarbamide 
and the four kinds of X-linked partial color blind-
ness, although these behavioral variations are of
limited functional significance in human societies.
One sort of theoretical model in population ge- 
netics suggests we should observe statistical differ- 
ences in the frequencies of polygenes making up 
the genetical components of intellectual abilities 
and temperament (e.g., Hogben 1932, p. 169; 
Sturtevant 1954). Equally valid theoretical models 
support the opposite conclusion by suggesting that 
the genotypes making up the capacity underlying 
cultural learning are a species characteristic main- 
tained in equilibrium in all human populations 
(e.g., Dobzhansky 1962, p. 320). Since the genetic 
basis of intelligence is polygenic and since we can- 
not identify polygenes in man, precise information 
about racial, i.e., inherited, differences in general 
intelligence is simply not available. But it seems 
clear that if differences between major races exist 
in general intellectual abilities, these differences 
are small in magnitude compared with the range 
of genetic variation within all major races (Spuhler
& Lindzey 1967).

J. N. SPUHLER


[See also EVOLUTION, article on HUMAN EVOLUTION;
RACE.]


GENNEP, ARNOLD VAN


Charles-Arnold Kurr van Gennep (1873-1957), 
ethnographer and folklorist, who is best known for 
his work on rites of passage, was born in Württem-
berg, Germany, of a Dutch father and a French
mother. At his father’s death van Gennep was 
adopted by a French doctor, and thus his already 
strong ties with France were further strengthened. 
He received most of his education in France. 

His career was not primarily an academic one. 
He served the French government for two periods, 
from 1903 to 1910 and again from 1919 to 1921, 
and worked for such cultural organizations as the 
Alliance Française and the International Congress
of Popular Art. For relatively brief periods he held
university appointments at Neuchâtel, Oxford, and
Cambridge. 

In The Rites of Passage (1909), van Gennep 
systematically compared those ceremonies which 
celebrate an individual’s transition from one status 
to another within a given society; he concluded 
that most such ritual observances have a tripartite 
sequence. The major phases of this sequence are 
separation (séparation), transition (marge), and 
incorporation (agrégation). Van Gennep went be- 
yond this analysis of rites of passage to an interpre- 
tation of their significance for the explanation of 
the continuing nature of life. He believed that rites 
of passage, with their symbolic representation of 
death and rebirth, illustrate the principles of the 
regenerative renewal required by any society. 

Although van Gennep followed the prevailing 
system of classification of beliefs and rites associ- 
ated with magic, when he analyzed rites of passage 
he introduced a new approach. Instead of utilizing 
a priori categories as the units of his taxonomy, he 
abstracted these units from the structure of the 
ceremonies themselves. This procedure led him to 
differentiate the rites-of-passage phases: separa-
tion, transition, and incorporation. His inductive
procedure included consideration of the variables 
of time and space. Over a quarter of a century later 
Julian Huxley was to label this method of classifi- 
cation, in biology, as the “new systematics.” 

Van Gennep was obviously impressed with the 
importance of the transitional, or liminal, phase 
that he had noted within a ceremony. One aspect 
of its importance is related to his concept of 
social regeneration. When individuals or groups 
are in a state of suspension (limen), separated 
from their previous condition and not yet incorpo- 
rated into a new one, they constitute a threat to 
themselves and to the entire group. In this state 
they are outside the sphere of normal control and 
must be reintegrated in order to avoid becoming 
disruptive. The liminal period also has its own 
internal structure, and it is possible to observe 
stages of entry into the period, the period itself, 
and departure from the period of transition. 

Some other contributions by van Gennep deserve 
brief mention. He established, for example, that 
the time of so-called puberty, or initiation, rites 
does not coincide with physiological puberty; 
rather, they are scheduled according to societal defi-
nitions. He also emphasized the importance of rit-
ual exchange and thus anticipated Malinowski’s
analysis of reciprocity. Finally, he noted the sim- 
ilarity between the structure of individual and 
group rites when either kind is in a state of change. 
The subsequent designation of group ceremonies 
as “rites of intensification” by Chapple and Coon 
(1942) is a welcome conceptual extension. 

Van Gennep was highly regarded for his work 
in the field of European folklore. He did much to 
change its orientation from its historical and anti- 
quarian origins by introducing the methods and 
perspective of ethnography. European scholars of 
his time tended to view the customs of rural peo- 
ples as quaint reminders of the past and to treat 
them as elements of history. Van Gennep saw folk 
literature and practices as aspects of living culture, 
to be examined within the context of changes 
among individuals in their relations with each 
other. His concern with changes in traditional cul- 
ture led him to study the disappearance of some of 
its elements in the urban centers of France and 
the spread of cosmopolitan culture. 

Van Gennep was enormously energetic in the 
accumulation of folklore materials and in their pub- 
lication. His writings include several score articles, 
numerous monographs, and the monumental Man-
uel de folklore français contemporain (1937–
1958). Van Gennep’s editorial activity in this field
also was prodigious: for over thirty years, from 
1906 to 1939, he edited the section “Ethnographie-
Folklore-Religions-Préhistoire” in the Mercure de
France, and he founded and directed several French 
journals of ethnography and folklore. 

Van Gennep’s adherence to the comparative 
method in ethnography helps us to understand his 
theoretical formulations. He based his interpreta- 
tions on the assumption that man is a part of na- 
ture and is therefore subject to the great natural 
laws of stability, variability, and change. He con- 
sidered his goal to be the formulation of general 
syntheses. On this basis he challenged the adequacy 
of the principles of individual psychology and 
apriorism. He thought regeneration a necessity of 
social and individual existence, noting similarities 
between the cyclical shift of seasons and the pas- 
sage of the individual through the stages of life. 
In the ceremonies which celebrate both transitions, 
he noted the recurring theme of death and rebirth.
For him, neither the individual nor the rites can 
be divorced from the social context, nor from time 
and space. He insisted that ceremonial patterns 
should be examined as wholes and that comparison 
should be based upon similarities in structure 
rather than upon content. 

There are resemblances between van Gennep’s 
concerns and those of his contemporaries, but his 
approach was so radically different from theirs that 
the resemblances are largely of a superficial kind. 
For example, the interest in religion, magic, and 
myth, which he shared with Lang, Frazer, Marett, 
and others, proceeded from entirely different con- 
ceptual assumptions. Their rational, a priori ap- 
proach was rooted in a historical perspective which 
sought to isolate stages of development such as 
fetishism, animism, animatism, or totemism and 
to provide definitions for each. Frazer in particular 
sought to ground his conclusions in universal psy- 
chological characteristics—the principle of associ- 
ation and the law of sympathy. In contrast, van 
Gennep was more interested in the dynamics of 
change than in the statics of classification and 
definition. For him, impersonal and personal power 
(dynamism and animism) unite to constitute re- 
ligion, while magic consists of the techniques of 
control. In this he differed from Frazer, in whose 
view magic preceded religion and was in conflict 
with it. Van Gennep also differed from Hubert and 
Mauss, who defined religion as official doctrine and 
magic as socially prohibited (a view incorporated 
by Durkheim in his polarization of the sacred and 
the profane). 

Thus, van Gennep saw the tradition, methods, 
and goals of ethnography as different from those 
of both the French sociologists and the British
school of anthropology. The full exposition of these
differences awaits future scholarship. A recently 
renewed interest in the study of rites of passage is 
evidenced by the writings of Whiting and Child 
(1953), Young (1965), and Turner (1964). This 
resurgence may extend to further study of van 
Gennep’s ethnographic method and of his theoret-
ical formulations.

SOLON T. KIMBALL


[For the historical context of van Gennep’s work, see 
the biographies of FRAZER; MARETT; MAUSS; for
discussion of the subsequent development of his
ideas, see LIFE CYCLE; MYTH AND SYMBOL; RITUAL;
THEOLOGY, PRIMITIVE.]
GEOGRAPHY


The articles under this heading describe the 
main fields of contemporary geography and the 
field of statistical geography, which is an approach 
to geography rather than a discrete field. Other ma- 
terial of direct or related interest to geography may 
be found under AREA; CARTOGRAPHY; CENTRAL 
MENT; ENVIRONMENTALISM; INDUSTRIAL CONCEN-
SOURCES. Biographical articles of relevance to
I. THE FIELD
II. POLITICAL GEOGRAPHY
III. ECONOMIC GEOGRAPHY
IV. CULTURAL GEOGRAPHY
V. SOCIAL GEOGRAPHY
VI. STATISTICAL GEOGRAPHY


Richard Hartshorne 
Harold H. Sprout 
Richard S. Thoman 
Edward T. Price 
Anne Buttimer 
Brian J. L. Berry 


I 
THE FIELD 


Geography is neither a purely natural nor a 
purely social science. From its early development 
as an organized field of knowledge in classical 
Greece, geography has included animate as well 
as inanimate things, man and his works as well as 
nature. This was of little concern as long as man 
was regarded as an integral part of nature. But 
geography, although a very old subject, did not 
become established as a university discipline with 
an organized academic profession until after the 
natural and social sciences had become divided 
into separate faculties. Regular university depart- 
ments of geography were first established in Ger- 
man-speaking countries in the 1870s and 1880s; 
in France a little later; and in Great Britain and 
the United States generally in the present century. 
In each country the first generation of professors 
of geography had been trained in other fields, in 
most cases the natural sciences. Self-taught in ge- 
ography, few of them outside Germany were fa- 
miliar with its past development. 

One consequence was that geography tended to 
separate into two parts: one natural, more com- 
monly called physical geography, and one human 
or social, sometimes called economic geography. 
(In different countries and in different institutions 
in the same country practice varies as to whether 
the subject is part of the natural science faculty, 
part of the social science faculty, or split between 
the two.) For many students, on the other hand, 
it was the connections between the two parts, the 
man-land relationships, that constituted the dis- 
tinctive subject matter of geography. During the 
period of its initial establishment as a university 
subject in the English-speaking countries, geog- 
raphy was commonly defined as the study of the 
relationships, in whichever direction, between the 
natural environment and man. This “environmen- 
talist” concept of the field seems to have been first 
formulated in Germany, and then in France, late in 
the nineteenth century, but it was very soon dis-
carded, both in theory and practice, by German
geographers and, to a large degree, in practice by 
those of France; in both countries geography re- 
turned to its historic focus of interest in the study 
of the distinctive character of the areas of the 
earth. This concept and its historical background 
were made familiar to American and British geog- 
raphers only in the 1920s and 1930s. And although 
today few geographers would assert the environ- 
mentalist concept, many of its aspects, notably the 
emphasis on a man-nature dichotomy, still color 
much of their writing. [See ENVIRONMENTALISM.]

Modern geography, like the geography of past 
centuries, studies the earth as the space in which 
man lives—his habitat, milieu, or environment. 
This includes not just part of the environment, the 
physical or natural part, but the total environment; 
in any inhabited area the environment of today has 
been in part produced by man, and the existing 
population constitutes a living factor in the present 
environment. Geography, of course, is not alone in 
studying man’s environment. Many fields in the 
natural and social sciences study a particular cate- 
gory of phenomena, not excluding its distribution 
and variations over the earth. What geography, 
and geography alone, studies is the areal character 
of the earth in which man lives—the form, the 
content, and the function of each areal part, region, 
or place and the pattern of and interconnections 
between the areal parts. 

If the total diversity of places and their interrela- 
tions were simply the sum of areal variations and 
connections of physical, biological, and social phe- 
nomena, the subject could readily be divided into 
distinct fields: physical geography, biogeography, 
and human, or social, geography; or possibly two 
parts, the geography of nature and the geography 
of man. In reality, however, the phenomena of 
these several abstract categories are in many cases 
very closely interrelated in their areal variations and 
connections from place to place. Indeed, what the 
geographer observes as individual features—i.e., a 
soil, river water, a farm, a transport route—are ele- 
ment complexes in which factors of physical, ani- 
mate, and social origin are so intricately interwoven 
as to require study within a single field. 

Places or areas, large or small, may be studied 
either specifically or generically. Human interest 
in individual places is indicated by the practice 
from earliest times of giving each area a proper 
name—“Hudson River,” “Pennsylvania,” or “the 
South.” Geography, like history, is ultimately con- 
cerned with attaining maximum comprehension of 
individual cases. An essential step in the descrip- 
tion as well as the understanding of the individual 
area is the determination of its generic character-
istics. When we speak of places as “deserts,” “can-
yons,” “cities,” “farms,” or “culture areas,” we limit
the criteria in each case to a few closely interrelated 
features, overlooking aspects in which places of the 
same type may be radically different. Comparative 
study of the characteristics of places by kind may 
reveal indications of significant correspondence, 
leading to hypotheses of generic relationships. [The 
use of modern statistical methods to discover and 
determine such correlations is discussed below in 
GEOGRAPHY, article on STATISTICAL GEOGRAPHY.]

Among the social sciences geography, like his- 
tory, overlaps the fields which study a particular 
category—economic, political, or sociological. In 
geography, as in history, it is the integrated combi- 
nations of diverse elements, in their complex inter- 
relationships, that form the direct subjects of study. 
While the ultimate objective in geography is com- 
prehension of the full integration of areas, analysis 
requires focusing successively on partial integra- 
tions. Comparative study of areas, to establish 
generic concepts and relationships, must be limited 
to partial integrations over many areas or over the 
whole world. Such studies may be confined to a very 
narrow topic, such as the relation of crop yield to 
rainfall, or may cover the whole group of features 
which form the economy of areas. [Divisions of 
human geography based on common groupings of 
social features are treated in the articles that fol- 
low.] 


RICHARD HARTSHORNE 
II
POLITICAL GEOGRAPHY 


Political geography may be defined from the dis- 
ciplinary perspective of either geography or politi- 
cal science. From the former perspective, political 
geography appears as “the study of political phe- 
nomena in their areal context” (Jackson 1964, 
p. 1). This is amplified in the statement that political
geography is “the study of areal differences and 
similarities in political character as an interrelated 
part of the total complex of areal differences and 
similarities” (Hartshorne et al. 1954, p. 178). At-
tention to areal dimensions and patterns—sug-
gested by such terms as location, distance, space,
distribution, configuration—is also evident in re- 
search under the rubric of political science; the 
same holds for research in political history and 
politically oriented research in sociology and other 
disciplines.


Concepts and techniques 


Classical writings on political subjects contain 
many speculations regarding man’s relations to the 
earth (Thomas 1925). Several contemporary politi- 
cal scientists have given special attention to areal 
aspects of political institutions, processes, relation- 
ships, and policies (e.g., Spykman 1938; 1942; 
1944; Spykman & Rollins 1939; Deutsch 1953; 
Smuckler 1953; Herz 1957; Sprout 1931; 1963; 
Sprout & Sprout 1946; 1960; 1962; 1965). But with 
very few exceptions (e.g., Van Dyke 1960, p. 128), 
commentaries on the political science discipline 
deal with the areal focus trivially or not at all. 

Geographers have given more attention to areal 
aspects of political phenomena. Although its ante- 
cedents go back to the nineteenth century and 
earlier (Hartshorne 1935), the modern field of 
political geography dates in America from World
War I (e.g., Bowman 1921) and in Europe from
somewhat earlier (Ratzel 1897; George 1901; 
Mackinder 1902; 1904; 1911-1923; Fairgrieve
1915; and others). The American college cata-
logues of 1930 announced few courses in political
geography. Thirty years later the number exceeded 
300, a growth accompanied by proliferation of 
teaching materials and buttressed by theoretical 
and substantive research. Contributors to theory 
include Whittlesey (1939), Hartshorne (1935; 
1950), Gottmann (1952), and Jones (1954a; 
1954b). 

In the idiom of modern geography, geographic 
quality attaches to any phenomena, human as 
well as nonhuman, intangible as well as tangible, 
exhibiting areal dimensions and associations that 
“give character to particular places” (James & 
Jones 1954, p. 4). To anticipate a point that will 
be stressed later, areal patterns of behavior and 
other intangible human phenomena are becoming 
increasingly central to research in political geog- 
raphy. 

Similarly, the “political” in political science re- 
fers to more than the formal apparatus of govern- 
ment; political quality attaches to any aspect of 
power and influence in society. A community or- 
ganized on the basis of power is by definition a 
political community. Every political community 
(though not every political organization) has a 
territorial base. Country, often used as a synonym 
for state, denotes the territorial aspect of a state. 
Province, city, village, school district, port author- 
ity, and other subdivisions of a state all carry 
territorial connotations. The same holds for em- 
pire, political bloc, coalition, and other terms that 
identify units and combinations of units in im- 
perial and international relationships. 

Political areas. In the idiom of geography, any 
section of the earth’s surface delineated by refer- 
ence to political criteria is a political area. These 
criteria include de jure jurisdiction and authorita- 
tive decision making. Political areas so delineated 
include nation-states, their formally constituted 
subdivisions, and empires. These are unquestion- 
ably significant political areas, and a great deal of 
political geography has been written in terms of 
them. As geographers have emphasized (e.g., Whit- 
tlesey 1935), the “impress” of political authority 
changes both the physical and social aspects of 
landscapes: it affects, for example, inspection sta- 
tions and other boundary structures; transporta- 
tion grids that conform to political requirements 
(e.g., Wolfe 1963); movements of goods and peo- 
ple within a frame of migration and commercial 
laws; and linguistic and other cultural homoge- 
neities imposed by political authority. 

But delineating political areas by reference only 
to political authority and legal jurisdiction leaves 
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many phenomena uncovered: for example, it does 
not account for frontier zones that exhibit political 
homogeneities of personal behavior at variance 
with de jure jurisdiction (Hartshorne 1950), and 
areal patterns of behavior that are within a recog- 
nized territorial jurisdiction but not coterminous 
with its boundaries, such as the region of “isola- 
tion” in the United States (Smuckler 1953) or 
the Washington–Boston “megalopolis” (Gottmann
1961). 

The criteria of jurisdiction and authority com- 
pletely fail to delineate areas that exhibit patterns 
of political interaction but no overarching organi- 
zation of authority. These are the characteristic 
patterns of international politics, whether of the 
society of nations as a global whole or of less-than- 
global areas such as the communist “realm,” the 
so-called free world, the Atlantic “community,” 
and many others. 

Geographers have traditionally emphasized the 
more tangible aspects of political areas. This em- 
phasis is evident in Sauer’s morphological concep- 
tion of political geography as “the study of political 
landscapes”—i.e., “the administrative centers, the 
boundaries, and the defensive lines and positions” 
(1927, p. 208). With reference to political bound- 
aries, Fischer (1949) noted that geographers have 
usually stressed the stabilizing influence of physio- 
graphic factors, often to the neglect of historical 
and other cultural processes. Stephen B. Jones 
(1959) reviewed the geographical literature on 
boundaries, analyzing the ways in which these 
have been conceived in different places and periods. 

Geographers have given increasing attention to 
intangible factors and to processes of social change. 
This trend is evident in the writings noted above. 
In a plea for more “functional” political geography, 
Hartshorne (1950) emphasized the importance of 
“centripetal” and “centrifugal” ideas and social 
movements in the evolution of state areas. Gott- 
mann (1951) introduced the concepts of circula- 
tion (a French term for which the nearest English 
equivalent is probably “movement”) and icono- 
graphie (symbols that foster loyalty, solidarity, 
and conformity) as organizing ideas for the anal- 
ysis of change and resistance to change in political 
areas. Jones (1954a) brought these and other ideas
together in a “unified field theory.” 

Political potential. In a period of history, the 
results of political interaction, whether within a 
single national community or upon the broader 
stage of international politics, exhibit areal patterns 
of coercion–submission and influence–deference.
Within nation-states and empires, these patterns 
evolve under processes of authoritative decision 
making, no matter how primitive or obscure these
processes may be. The patterns of international 
politics, however, have evolved in the absence of 
legitimized overriding authority. In consequence, 
these patterns are derived in the main from less 
sharply delineated images of the distribution of 
power and influence among the interacting na- 
tional communities. 

There is no standard term to denote the aggre- 
gate pressure, pull, attraction, or simply effect 
that one nation or coalition exerts on the behavior 
of others. The term “power,” with its strong mili- 
tary connotations, is too restrictive, since relations 
of influence–deference are derived from a much
broader spectrum of behavior than coercion–sub-
mission, i.e., brute force and the threat of it. The
term “political potential” has been suggested to 
denote this broader spectrum (Sprout & Sprout 
1962, p. 158). 

The concept of political potential has definite 
areal connotations. It expresses areal variations in 
the intensity and efficacy of a government's de-
mands on other nations, i.e., when and how it gets
what where. In addition, political potential ex- 
presses the total, or aggregative, effect of a nation’s 
statecraft plus effects deriving from that nation’s
sheer presence on the international landscape, 
which are evident in the behavior of other nations. 
Common sense, confirmed by observation, indicates 
that location, distance, space, the configuration of 
lands and seas, and the distributions of population, 
raw materials, technology, institutions, ideologies, 
and other phenomena may all have some bearing 
on the political potential of every nation and on 
the resulting patterns of political interaction and 
relationship [see MILITARY POWER POTENTIAL]. 

Maps of the international potentials of both par- 
ticular states and the major regions and the society 
of nations as a whole might somewhat resemble 
a topographic contour map. Such maps, however, 
do not exist; perhaps the available data are too 
amorphous and ambiguous to render trustworthy 
mapping possible in the present state of knowledge. 
But studies of political potential from a geographic 
perspective, utilizing geographic methods, should 
help to clarify the areal concepts implicit in the 
vocabulary of international politics—e.g., bipolar- 
ity, polycentricity, balance of power, sphere of in- 
fluence, political orbit, and many others in common 
use [see INTERNATIONAL POLITICS]. 

Geographic areas and political systems. Although 
geographers and political scientists share an inter- 
est in political phenomena, their disciplinary frame- 
works are recognizably different. Political scientists 
exhibit interest in areal dimensions and patterns
only to the extent that these seem to contribute
to an understanding of institutions, processes, rela- 
tionships, and issues of public concern. In conse- 
quence, political analysts tend to view geographic 
dimensions and patterns from a predominantly 
ecological perspective, i.e., relations between “po- 
litical actors” and their milieux. The ways in which 
students of politics generally frame problems tend 
to focus attention especially on the psychological 
linkages between actor and milieu (Sprout & Sprout 
1965). 

For most geographers (but there are many ex- 
ceptions) interest in ecological relationships, al- 
though generally active, tends to be subsidiary to 
areal patterns per se. However, this contrast in 
perspective should not be exaggerated. When one 
considers the increasing attention that geographers 
are giving to intangible social patterns and their 
evolution through time and to the values and moti- 
vations that underlie such patterns, it may be more 
nearly correct to say that “area” is simply the frame 
of reference within which geographers study polit- 
ical behavior and its results. 

In political theory the concept of system has 
come to occupy a position somewhat analogous to 
the concept of area in geography [see SYSTEMS 
ANALYSIS]. These organizing ideas are interestingly 
relatable. What appears from the geographic per-
spective to be a political area—city, province, state,
empire, major political region, etc.—may appear 
from the viewpoint of political science to be a 
system, i.e., a constellation of political units (in- 
dividuals, groups, or organized communities) that 
interact and relate in describable patterns. Hart- 
shorne and others have emphasized the comple- 
mentarity of these perspectives (James & Jones 
1954, p. 174). 

This complementarity comes through strongly in 
Jones's “unified field theory of political geography” 
(1954a). Jones’s model consists of five intercon-
necting categories: “political idea–decision–move-
ment–field–political area.” These are likened to a
“chain of lakes or basins . . . at one level, so that 
whatever enters one will spread to all the others.” 
There are close counterparts to Jones’s “basins” in 
other vocabularies. His term “political idea” ap- 
proximates the concepts of image and goal-orienta- 
tion in behavioral theory. “Decision” is just what 
it is elsewhere. “Movement” and “field” seem to be 
more or less analogous to a course of action that 
changes, even as it is changed by, the encompas-
sing milieu. Finally, “political area” includes “any
politically organized area” that has “recognized lim-
its, though not necessarily linear or permanent.”
Thus, the communist international system, the
Atlantic alliance, the British Commonwealth, or
any other international system constitutes (with 
suitable change in perspective and terminology) a 
political area, just as does a state, a subdivision 
thereof, or any other areally expressible system of 
political interaction. 

Flow from idea to area, in Jones’s model, is the 
process by which people control and alter their 
milieu. The idea of man as a geographic agent, 
refashioning the landscape along with the physical 
processes of nature, is an important concept of 
modern geography. It was given arresting expres- 
sion in the mid-nineteenth century (see Glacken 
1956, pp. 70 ff.) and, more recently, by Sauer 
(1925) and others. The increasing capacity in 
technically advanced societies to alter the milieu 
has immense political implications. 

The concept of flow from idea to area, in Jones's 
model—i.e., from image and purpose to operational 
result in the behavioral idiom—rests (more often 
implicitly than explicitly) on the general man- 
milieu hypothesis of “possibilism.” This is the 
proposition that the capacities of the actor and the 
properties of his milieu set limits to his accomplish- 
ment with reference to any given course of action 
and that these limitations may be operative irre- 
spective of whether or how he perceives and takes 
them into account. One corollary is the hypothesis 
that the higher the level of technology, the greater 
becomes human capacity to control and modify 
nonhuman components of the milieu. Another cor- 
ollary is that an operator's ability to affect the 
behavior of human components of his milieu de- 
pends on his capabilities in relation to theirs at 
the place where their relative capabilities are tested. 

The reverse flow, from area to idea, in Jones's 
model, is the process by which the milieu is said 
to condition human behavior. This conditioning 
process has been a focus of controversy, largely 
because of the teleological imagery to which many 
writers (though not Jones) are addicted. The influ- 
ences ascribed to nature or to other aspects of the 
milieu can be expressed, free from teleological 
overtones, by such psychological concepts as per- 
ception, cognition, recognition, stimulus, response, 
feedback, etc. (Sprout & Sprout 1965). Expressing 
the conditioning process in such terms emphasizes 
the complementarity of geography and behavioral 
science. With certain exceptions, usually unimpor-
tant in political contexts, the phenomena of psy-
chological stimulus and response provide the only
demonstrated path of influence from milieu to ac- 
tor, from environment to environed organism, from 
“area” to “ideas.” The psychological nature of en- 
vironmental conditioning of behavior (from which 
many areal patterns are derived) has long been
understood, although not always clearly delineated 
(e.g., Mackinder 1919, p. 28; Febvre 1922, p. 171; 
James & Jones 1954, p. 13). At least one contem- 
porary geographer has explicitly restated this proc- 
ess in the idiom of psychological theory (Kirk 
1952). 

Geographic techniques. A major contribution 
which geographers have made to the study of po- 
litical phenomena is the development of graphic 
techniques. Most political areas are too large to be 
directly perceived in toto. The eye may not differ-
entiate and relate various categories of phenomena
distributed over the area, even when they are di- 
rectly perceivable; hence the value of graphic modes 
of research, analysis, and presentation, by means 
of model globes, maps, cartograms, photographs, 
etc. (Bowman 1934, chapter 4; Boggs 1948). Maps 
delineating selected features of an area can be 
compared and superposed (e.g., Bowman [1921] 
1928, pp. 146, 460). Large segments of the earth’s 
surface can be examined from different perspec- 
tives (Harrison 1944; Boggs 1945). High-altitude 
and low-altitude photographs and oblique-angle 
pictures add new dimensions and textures to the 
perception of smaller areas (e.g., Gutkind 1956, 
pp. 1 ff.). Maps and cartograms can deceive as 
well as inform and hence are powerful instruments 
of political propaganda (Boggs 1946). Maps and 
other graphic tools not only carry preconceived 
messages but, when studied, may also evoke new 
insights and hypotheses [see CARTOGRAPHY]. 


Research in political geography 

The substantive literature divides roughly into 
two categories: (1) works that focus on political 
areas as such; and (2) works that utilize areal 
concepts and patterns to explain or to predict po- 
litical events. This cleavage more or less follows 
disciplinary lines, but by no means consistently. 
Some of the more important theoretical works have 
been cited, and a few teaching books are included 
in the bibliography, along with works cited in this 
text. A more comprehensive bibliography is ap- 
pended to the long essay on political geography in 
James and Jones (see Hartshorne et al. 1954). 

With respect to research on particular political 
areas, one should distinguish between works that 
deal primarily with political phenomena in an areal 
context and those that merely utilize political 
boundaries as a frame of reference for a wider 
range of phenomena—e.g., works on areal distri- 
butions of agriculture, industry, communication 
grids, etc. 

Works that are politically oriented in the stricter 
sense cluster around various focuses. One of these
is the formation, expansion, and disintegration of 
political areas (e.g., Bowman 1921; Whittlesey 
1939; Hartshorne 1950; Deutsch 1953). The fol- 
lowing are of particular interest: Herz’s analysis 
of the formation of modern “large-area” states and 
of the advances in technology that are making 
these states progressively vulnerable to economic, 
psychological, and military penetration (1957);
Vevier's essay on geographical ideas in the terri- 
torial expansion of the United States (1960); and 
Hart's hypotheses regarding the logistical requisites 
of political areas (1949). There are many studies 
of functioning political areas which reflect the 
varied research perspectives and methods of several 
disciplines. The Searchlight Series, edited by G. W. 
Hoffman and G. E. Pearcy, offers a continually 
growing list of short books of this type. 

A second focus is on the analysis of political 
areas in the light of population growth, spreading 
communication grids, industrialization, and urban 
sprawl (e.g., Gottmann 1961; Wolfe 1963). Worthy 
of special mention is Mumford’s historical study of 
cities (1961). 

A third focus is on areal patterns delineated not 
by political authority but by civic attitudes and 
preferences. This involves studies of integrative 
and disintegrative ideas and movements within 
state areas (e.g., Hartshorne 1950); attitudes to-
ward the “national space” (e.g., Herman’s study of
communist China, 1959); regional variations in 
civic postures toward public problems such as mili- 
tary defense and foreign policy (e.g., Beard 1934; 
Sprout & Sprout 1939; Smuckler 1953). 

A fourth focus is on the role of political author- 
ity in the development, use, depletion, conserva- 
tion, and renewal of resources. These questions 
are approached from different angles in International
Symposium on Man’s Role . . . (1956) and
also in Udall (1963) and Herber (1962). The ef- 
fects of resource use and of regulations governing 
use constitute important exhibits of the “impress” 
of political authority upon the earth. 

A fifth research focus is on political regions 
larger than nation-state areas. These include an- 
cient and modern empires (e.g., Fawcett 1951; 
Fisher 1950) and international regions delineated 
in various ways (Jones 1955a). Boggs’s essay on 
the Western Hemisphere (1945) focused attention
on the need for precise criteria for delineating ma- 
jor political regions. This need is exemplified in 
some textbooks, in which political regions are 
variously delineated by physiographic, historical, 
broadly cultural, or other criteria, sometimes with- 
out clear demonstration of political relevance. 


The geographic dimensions and patterns of international
politics have been analyzed from various
perspectives. Virtually every textbook on international
politics gives attention to the uneven
distribution of people and things among nations.
Areal variations are recognized to be strategic in
the estimation of state capabilities (e.g., Sprout &
Sprout 1962; Jones 1954b). Such variations form
the basis of hypotheses invented to explain or to
predict patterns of interaction in the society of nations
(Jones 1955a; 1955b; Sprout 1963). Such
hypotheses represent attempts to identify factors
whose uneven distribution in space provides a
plausible explanation of international patterns.


Geopolitics 


Most attention has been given to hypotheses
derived from the global and regional configuration
of lands and seas. These include Mahan’s sea-power
interpretation of history (1900; Sprout & Sprout
1962); Mackinder’s hypothesis of a trend toward a
world empire based in the “heartland” of Eurasia
(1904; 1919), later modified considerably (see
especially 1911-1923, vol. 2; 1943); and variants
and critiques of Mahan’s and Mackinder’s theories
too numerous to list here [see MACKINDER; MAHAN;
see also, e.g., Fairgrieve 1915; Dorpalen 1942;
Spykman 1944; East & Moodie 1956, chapter 18].


Climatic variations have inspired another set of 
geopolitical hypotheses and critiques (e.g., Hunt- 
ington 1915; Wheeler 1946; Mills 1949; Missenard 
1954). International political patterns have also 
been linked with the uneven distribution of the 
various raw-material requisites of modern industry. 
There is some disposition to regard areal differentials
in technology as the critical variable (e.g.,
Brown 1956), a hypothesis that has been linked
with demographic distribution to produce a prediction
that international political patterns will
ultimately be determined by the latter. The prediction
is based on the premise that technological
primacy will vary with relative numbers of supe- 
rior scientists and other gifted individuals, the 
incidence of such individuals varying, in the long 
run, with the size of population (Blount 1957; 
critique by Sprout 1963). 

The adjective “geopolitical” requires some explanation.
Political geography in general, and international
political geography in particular, is
often confused with geopolitics. This word entered 
the English language as a loose translation of 
Geopolitik, which came, in the interwar period, 
1919-1939, to denote mobilization of areal knowl- 
edge for purposes of state—in short, geo-policy. 
Geopolitics was associated in particular with the
Institut für Geopolitik in Munich, directed by Karl
Haushofer, a general turned geographer and propagandist,
who is widely believed (perhaps mistakenly)
to have contributed significantly to Hitler's
—— of conquest (e.g., Dorpalen 1942; Fifield 
1945). 

Because certain Germans exploited the concept 
of Lebensraum and other geopolitical ideas for 
aggressive purposes, many in America and else- 
where illogically concluded that any mixing of 
geography and politics must be tainted with war 
and conquest. Geographers insisted that geopolitics 
was a part of political science. Political scientists 
tossed the pariah subject back to the geographers. 
Time has blurred the odious policy connotations of 
geopolitics, perhaps more so in America than in 
Europe. The term has even acquired some respect- 
ability, especially in the context of military-defense 
analysis. 

The adjective “geopolitical,” never as value-laden 
as the noun “geopolitics,” was employed sparingly 
in the 1930s (e.g., Whittlesey 1939), and increas- 
ingly in more recent years, to denote the areal 
aspect of any political pattern and, in particular, 
hypotheses that purport to explain or to predict 
areal distributions and patterns of political poten- 
tial in the society of nations. All such hypotheses 
represent assessments of opportunities and limita- 
tions implicit in the properties of the interacting 
political communities and of the milieu in which 
they operate. Such assessments (in the idiom of 
ecological theory) are essentially possibilistic, even 
though they may be expressed in deterministic or 
near-deterministic rhetoric (Sprout & Sprout 1965). 


HAROLD SPROUT 


[See also ECOLOGY; INTERNATIONAL POLITICS. Other 
relevant material may be found under ENCLAVES 
AND EXCLAVES; INTERNATIONAL RELATIONS.]

III
ECONOMIC GEOGRAPHY 


The subject matter of economic geography is re- 
lated substantively and historically to both disci- 
plines from which the field receives its name. It 
obtains from geography an emphasis upon simi- 
larities and differences from area to area, large and 
small, on the earth’s surface, and upon linkages or 
circulations between areas. It acquires from eco- 
nomics an interest in the production, distribution, 
exchange, and consumption of goods and services. 
Economic geography therefore may be defined as 
an inquiry into similarities, differences, and link- 
ages within and between areas in the production, 
exchange, transfer, and consumption of goods and 
services. Particular attention is given to the location
of economic activity, considered both in theoretical
and practical terms [see SPATIAL ECONOMICS].

Relationship to geography and economics. Economic
geography is so intimately a part of the
whole of geography that separating it from the
general field is difficult. Because gaining a livelihood
not only is essential to human existence but
also involves a wide range of cultural and natural 
(physical and biological) features and interrela- 
tionships between those features, most matters of 
concern within economic geography are of concern 
within the over-all discipline, and vice versa. This 
is especially true if natural and noneconomic cul- 
tural features are considered in terms of their 
positive and negative implications for human use 
of earth space in gaining a livelihood. However, 
the emphasis upon livelihood in economic geogra- 
phy does mean a corresponding reduction of atten- 
tion to those cultural or natural conditions which 
may be only loosely related to spatial aspects of 
livelihood. Thus, for example, neither the cultural 
origin of a religious belief nor the process of land- 
form development is of direct interest to the eco- 
nomic geographer unless applicable in some way— 
as an advantage or constraint—to the location and 
interrelationships of economic activities.

As in the whole of geography, the spectrum of 
the totality of interrelationships in economic geog- 
raphy may be horizontal (involving different areas 
or different points within a given area), vertical 
(involving a morphological column of cultural and 
natural features at a specific point on the earth’s 
surface), or both horizontal and vertical. Hori- 
zontal relationships are stressed in most work now 
being carried on. 

We may consider economic geography, there- 
fore, as emphasizing the livelihood aspects of the 
whole of geography, rather than as a compartment 
of the parent discipline. The field has a direct tie 
to economics and, by way of the whole of geog- 
raphy, indirect ties to other disciplines in the social 
and natural sciences. So considered, the field by 
definition is sufficiently broad in scope to antici- 
pate any methodological changes from time to 
time and place to place, although specific ap- 
proaches, concepts, points of view and emphasis, 
immediate objectives, and methods have ranged 
rather widely with change of either time or place. 

Trends. However, among economic geogra- 
phers there is lack of agreement as to specific direc- 
tion, particularly in the United States and Canada. 
For a time after its emergence as a separate field 
in the United States, during the early years of the 
twentieth century, economic geography relied primarily
upon the inductive approach, with individual
scholars aggregating ideas and data from
field and library work into descriptions, classifications,
and qualitative interpretations, utilizing
numerical evidence when possible. Partially be- 
cause of limitations on the amount of information 
obtainable in these ways, emphasis was placed on 
the unique or at least the distinctive features and 
interrelationships in both systematic and regional 
work; generalizations were made when possible. 
Although this approach continues to be utilized 
profitably by many economic geographers, the past 
decade has marked the emergence of a new school 
of thought, with immediate roots going as far back 
as the 1930s and indirect roots into the early por- 
tion of the nineteenth century. This school has 
chosen an explicitly theoretical approach, empha- 
sizing nomothetic research and depending ap- 
preciably upon mathematical abstraction. In the 
early 1960s the suggestion was made that geog- 
raphy basically is concerned with systems analysis 
(Ackerman 1963) and that the overriding problem 
of geography is understanding the man-land system
of the earth. This giant system in turn is considered
by Ackerman to comprise a large number
of hierarchically arranged subsystems and proc- 
esses. Such a concept places economic geography 
in the position of searching for laws involving 
livelihood within a context of systems analysis ap- 
plied to earth space. The degree to which the con- 
cept has been generally accepted is not yet certain. 

Two basic rationales. Like the over-all disci- 
pline, economic geography in the United States and 
Canada can be visualized in terms of two funda- 
mental rationales—the topical, or systematic (not 
to be confused with system in systems analysis), 
and the regional. In economic geography both 
rationales focus on primary activities (here, as gen- 
erally in economics, considered to be agriculture, 
grazing, forest-products industries, mining, fishing, 
and hunting), secondary activities (manufactur- 
ing and construction), and tertiary activities (all 
other occupations). The two rationales differ espe- 
cially as to initial starting position. The topical, or 
systematic, rationale begins with structural aspects 
and works toward their earth-space expression and 
relationships, whereas the regional begins with 
space and works toward structure. The respective 
starting positions are usually reflected in various 
emphases in the completed works. Both rationales 
can be divided into subjects which themselves have 
become research interests for certain economic 
geographers. In practice these subjects are usually 
called systematic or topical specialties. Agricul- 
tural geography, manufacturing geography, and 
marketing geography have been of long-standing 
interest. Transportation geography and theoretical 
approaches to certain tertiary activities and domestic
trading patterns have been developed actively
within the past decade [see CENTRAL PLACE]. In 
addition, increasing attention is being given to 
recreation geography and to aspects of tertiary ac- 
tivities not yet accorded full consideration, to geo- 
graphical aspects of international trade, and to the 
economic geography of primary activities other 
than agriculture. Work has begun on the geogra- 
phy of consumption and the geography of price. 

Because economic activities are earth based and 
may be clustered, as to location and/or function, 
into patterns of differing kinds and intensity, 
regions are important to economic geography. As 
in the whole of geography, regions in economic 
geography may be either homogeneous or nodal. 
The homogeneous region, sometimes called the 
formal region, is more or less an inventory of static 
features and relationships within an area that has 
been delimited on the basis of prevailing homo- 
geneity of at least one feature. Both cultural and 
natural features and relationships may be so classi- 
fied: the spring wheat belt, the manufacturing belt 
between Chicago and New York, and the Appa- 
lachian Mountains are three examples in North 
America of this classification. Each example has 
been delimited on the basis of a single criterion, 
but multiple-criterion regions are possible at higher 
levels of generalization. 

The nodal, or functional, region classifies hu- 
man organization of earth space. This region has a 
point of focus (such as a city or town), an organ- 
ized area of mutual interdependence with respect 
to that point and associated territory (such as the 
trading area of a city or town), and connecting 
lines to the territory (such as transportation and 
communication routes) providing the linkage (such
as commuter and freight traffic and communica- 
tions flows) between the point and the area. 

Both the homogeneous and the nodal regions 
are here envisaged as means of classification and 
not as objective entities to be discovered in a scien- 
tific sense. Either type of region can be considered 
at various levels of observation and detail. When 
several levels are superimposed, a pyramidlike 
framework with hierarchical tendencies may be 
recognizable. 

Realms of interdisciplinary contact. Besides its 
subfields, economic geography extends into realms 
of interest shared with other disciplines. The uti- 
lization and conservation of natural and human re- 
sources is of long-standing interest to some eco- 
nomic geographers. More recently, the development 
of regional science is of definite interest to some 
economic geographers, many of whom participated 
in that development. Especially in the 1960s some 
economic geographers have become very interested
in regional inequalities of economic development,
whether within a country or at a continental or 
global level of observation. 

Theoretical and practical implications. The
theoretical importance and practical significance 
of economic geography are inextricably intertwined. 
Theories seek optimum circumstances and efficien- 
cy in human utilization of earth space: what are 
the most desirable size, spacing, and intermix of 
specific economic units of production, exchange, 
transfer, and consumption within selected typolo- 
gies of cultural and natural conditions? On the 
other hand, evaluations of historical and current 
practices indicate the degree to which theoretical 
models are actually applicable, especially in view 
of the cultural institutions, personalities and wills 
of key individuals, and specific natural conditions 
of a given area. Such evaluations also provide a 
degree and type of qualitative insight not obtain- 
able through hypothesis alone. 

Once a satisfactory relationship between theory 
and practice has been established, a logical step is 
application to planning procedures. In economic 
geography the value of both theoretical and prag- 
matic study to planning is clear, whether the view- 
point of the planner is the broad outlook of the 
regional analyst, concerned with an intermix of 
varied economic activities and resources, or the 
more restricted and highly specialized view of the 
expert in finding locations for individual units of 
economic activity. 

Methods of study. Inasmuch as developments 
and trends in the United States from 1904 to 1954, 
the first 50 years of geography’s existence as a uni- 
versity discipline, have been evaluated elsewhere 
(James & Jones 1954), the events of succeeding 
years will be emphasized here. A survey of the 
literature indicates that research since 1954 can 
be divided into three categories, on the basis of 
approach and method: qualitative interpretation, 
usually with substantial numerical evidence and 
sometimes making use of the case-study technique; 
quantitative classification, in a more or less de- 
scriptive sense, with qualitative elaboration and 
explanation and involving a specific procedure 
applicable to different areas and time periods; and 
formulation and testing of specific hypotheses and 
models. These approaches have been applied, with 
varying degrees of intensity, to most facets of eco- 
nomic geography, but they will be discussed here 
with respect to agriculture, manufacturing, trade, 
and transportation. 

Agricultural geography. All three approaches 
are utilized in agricultural geography, which is still 
a subject of keen interest. Books and articles involving
qualitative interpretation are diverse as to
specific subject matter, but most can be classified
under certain broad headings. Approximately one- 
third of the articles on agriculture appearing in 
four professional journals of the United States and 
Canada (the Annals of the Association of American
Geographers, Canadian Geographer, Economic Geography,
and Geographical Review) treated a specific agricultural
activity in a definite area, evaluating such
aspects as type and size of enterprises, natural 
environmental advantages or constraints, combina- 
tions of selected crops and livestock with other 
crops and livestock, allocation of land, general 
farming practices, and trends. One-fourth empha- 
sized over-all use of agricultural land in a specified 
area, considering other aspects of agricultural 
geography in a subsidiary way. Nearly one-fifth 
were concerned primarily with reclamation of agri- 
cultural land—irrigation, drainage, erosion control, 
etc. A final one-fourth involved miscellaneous in- 
terests, such as land redistribution in a given area, 
agricultural colonies of a minority group in a given 
area, land tenure in a specified place, the economic 
development of a country that is heavily dependent 
on agriculture, and the historical geography of agricultural
change in a selected location. These
four categories of qualitative interpretation, which 
cover the most numerous writings on agricultural 
geography in literature published in the United 
States and Canada, provide valuable insight into 
conditions evaluated by each author but have 
almost no common denominator. 

Descriptive and analytic interpretation of a 
quantitative nature involves a classification based 
on numerical information, preferably official data 
continually available. The classification reveals 
pattern when plotted on a map and hence may be 
used to construct generic regions (typologies of 
regions) based on quantitative criteria. Once the 
criteria become standardized on a world-wide basis, 
the classifications will become standardized and 
applicable to all parts of the world. In principle 
both the homogeneous and the functional region 
may be so constructed, but in practice the homo- 
geneous region has received the most attention to 
date in agricultural work. Prior to 1954 most 
studies were based on inadequate quantitative evi- 
dence, rather highly generalized, and presented at 
continental or even global levels of observation. 
Weaver (1954, especially pp. 175-184) applied a 
greater measure of objectivity to an area of inter- 
mediate size, aggregating data from county units 
to compute, for the Middle West of the United 
States, areal differences in crop combinations on 
the basis of degree of variance from a theoretical 
curve of optimal combinations. Subsequent work 
by other geographers includes a quantitative sampling
approach to agricultural regionalization, the
application of multiple correlation and regression 
analysis to rural farm population densities in the 
Great Plains, and the statistical association of cash 
grain farming in the Middle West with landforms. 
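
The crop-combination procedure attributed above to Weaver can be sketched in a few lines. The sketch follows the way the method is usually summarized, not a formula given in this article: the crops of an areal unit are ranked by their share of cropland, each ranked prefix of k crops is compared with a theoretical combination in which every crop holds 100/k percent, and the k with the smallest mean squared deviation is taken as the unit's combination. The function name and the sample shares are hypothetical.

```python
def weaver_combination(shares):
    """Return (k, variance) for the best-fitting crop combination.

    shares: percentages of cropland occupied by each crop in one areal
    unit (hypothetical input; any order).
    """
    ranked = sorted(shares, reverse=True)
    best_k, best_var = None, None
    for k in range(1, len(ranked) + 1):
        ideal = 100.0 / k  # theoretical share of each crop in a k-crop combination
        # mean squared deviation of the k leading crops from the ideal share
        var = sum((s - ideal) ** 2 for s in ranked[:k]) / k
        if best_var is None or var < best_var:
            best_k, best_var = k, var
    return best_k, best_var


if __name__ == "__main__":
    # Invented county: two crops dominate, two are minor.
    print(weaver_combination([50, 45, 3, 2]))
```

Mapping the resulting k (and the names of the k leading crops) for every county is what turns the computation into a regionalization.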

The application of hypotheses to agricultural 
geography has been based on a rediscovery of implications
of Thünen’s pioneer work treating the
effect of transport cost and market price on crop 
and livestock combinations (1826-1863). Cur- 
rent research suggests that, while the influence of 
distance to market on agricultural land use does 
not result today in patterns so simple as those set 
down by Thünen, the influence of market price and
transport cost does exist and can be analyzed 
mathematically [see RENT]. 
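
The kind of mathematical analysis referred to here can be illustrated with the standard textbook formalization of locational rent, which is an assumption of this sketch and not a formula stated in the article: rent per unit of land equals yield times (market price minus production cost), less yield times freight rate times distance to market. The crop names and all figures are invented.

```python
def locational_rent(yield_per_ha, price, cost, freight, distance):
    """Rent per hectare at a given distance from the market (textbook form)."""
    return yield_per_ha * (price - cost) - yield_per_ha * freight * distance


def best_use(crops, distance):
    """Return (crop, rent) for the highest-rent use, or (None, 0.0) if none pays."""
    rents = {
        name: locational_rent(*params, distance) for name, params in crops.items()
    }
    name, rent = max(rents.items(), key=lambda item: item[1])
    return (name, rent) if rent > 0 else (None, 0.0)


# Hypothetical parameters: (yield per ha, market price, production cost, freight rate)
CROPS = {
    "vegetables": (10, 20, 8, 2.0),  # high value, costly to ship
    "grain": (4, 15, 5, 0.5),        # lower value, cheap to ship
}

if __name__ == "__main__":
    for d in (1, 5, 25):
        print(d, best_use(CROPS, d))
```

With these invented figures the high-value crop wins near the market, the cheaply shipped crop wins farther out, and beyond a certain distance neither yields a positive rent, which reproduces the concentric pattern in its simplest form.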

Manufacturing geography. Qualitative interpre- 
tation of manufacturing geography, as indicated 
in publications of the United States and Canada, 
has been applied especially to areas other than 
Anglo-America, usually where numerical information
is not fully available. Such an article or monograph
may be an appraisal of a specific industry
or group of industries or may involve a general 
examination of all manufacturing. The method of 
the historical geographer, providing the time di- 
mension, may or may not be utilized. The case 
study is seldom used. Work involving only quali- 
tative interpretation constitutes a relatively small 
percentage of all studies in manufacturing geography,
largely because numerical data are available
to a greater degree than, for example, in agri- 
culture. 

Quantitative classifications are numerous in 
manufacturing geography. Labor force and value 
added are usually the criteria of measurement, al- 
though the list of possibilities is long and includes 
value of product, wages paid, amount of energy 
consumed, area of ground space, area of floor
space, and land value. One such classification, with
labor force used as the measuring criterion, has
revealed structural and spatial changes in the
manufacturing of the United States. Another has
been based on magnitude (number of employees, 
wages paid, and value added) and intensity (ratios 
of labor force and value added to selected national 
totals), the classification being applied, with allow- 
ance for kind and amount of data available, to the 
United States, Japan, and the Soviet Union. In still 
another classification based on labor force and 
value added, the author developed a technique for 
showing change over time on a single map and 
applied his technique not only to conditions within 
specific regions but also to differential rates of 
regional growth, as measured by national totals. 
Another study classified cities by labor force on
the basis of prevailing industries, after first removing
from consideration the city-serving functions
of the industries. Several indices of industrial diversification
also have been developed, with labor
force the prime criterion of measurement. A
typology of manufacturing flows relies, on the one 
hand, upon the orientation of manufacturing to 
raw materials or agglomerations of input factors 
and, on the other, upon access to both national and 
regional markets. 

Increased attention is also being accorded theo- 
retical approaches to manufacturing geography. 
Here, especially, work is being shared by economic
geography and regional science [see REGIONAL
SCIENCE]. Among initial geographical models
was a construction by Harris (1954) showing
the importance of market potential to industrial 
localization. Other geographers have used models 
to measure association tendencies in manufactur- 
ing, concluding that agglomeration is a very im- 
portant force. Models have been used to examine 
tendencies of high-value-added manufacturing to 
concentrate in certain areas, and they have also 
been used to associate circular and cumulative 
causation with the growth of manufacturing and 
associated urbanization. [See ECONOMIES OF SCALE 
and EXTERNAL ECONOMIES AND DISECONOMIES.] 
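
The notion of market potential mentioned above in connection with Harris is commonly written as the sum, over all markets, of market size divided by distance (or transport cost) from the point being evaluated. The sketch below assumes that form; the straight-line distances, the fixed within-place distance, and the three sample places are hypothetical simplifications introduced for illustration.

```python
import math


def market_potential(places, self_distance=1.0):
    """Compute the potential of each place as the sum of market / distance.

    places: name -> (x, y, market_size), all hypothetical.
    self_distance: assumed distance from a place to its own market,
    needed to avoid dividing by zero.
    """
    potential = {}
    for i, (xi, yi, _) in places.items():
        total = 0.0
        for j, (xj, yj, market) in places.items():
            d = self_distance if i == j else math.hypot(xi - xj, yi - yj)
            total += market / d
        potential[i] = total
    return potential


if __name__ == "__main__":
    sample = {"A": (0, 0, 100), "B": (3, 4, 50), "C": (6, 8, 10)}
    for name, value in market_potential(sample).items():
        print(name, round(value, 2))
```

Places with high potential are those with the best aggregate access to markets, which is the sense in which the measure bears on industrial localization.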

Geography of trade. Attention to trade in eco- 
nomic geography has focused especially on domes- 
tic trade. Theoretical work is being pursued vigor- 
ously under the stimulus of central-place theory. 
Pragmatic work, whether qualitative interpretation 
or quantitative classification, also has been of keen 
interest. This pragmatic work is usually called mar- 
keting geography, although some studies in domes- 
tic commodity flow would not necessarily be in- 
cluded under such a heading. Early research of this 
kind was associated especially with urban geogra- 
phy, which developed as a field in the first half of 
the twentieth century and is becoming increasingly 
important. This early research has been a valuable 
antecedent to both central-place theory and mar- 
keting geography. From numerous studies of indi- 
vidual urban units and, subsequently, from use of 
census information three significant contributions 
were made: the idea of the city-region, a functional 
region of interdependent units, including an urban 
unit; the classification of all cities in a country by 
relatively dominant functions, as measured by 
nationwide norms; and the concept of an economic 
base comprised, on the one hand, of trading and 
related activities or portions of activities which 
provide financial support for a specified area and, 
on the other, of activities or portions of activities 
which merely provide local interaction and have 
no influence outside that area (James & Jones
[1954] 1964, pp. 142-166, 245-251). Subsequently,
Murphy and Vance (1954) developed a technique 
for delimiting the central business district (CBD), 
based on land use involving the retailing of goods 
and services and the provision of office space. 
Ullman (1957) developed a series of maps in- 
terpreting commodity flow within the United 
States, emphasizing principles of complementarity, 
transferability, and intervening opportunity. Other 
writers have been interested in the most appropri- 
ate location for shopping centers, especially with 
respect to market areas shared by competing firms. 
Questions have been raised as to the necessity of 
formal theories in marketing geography, but as yet 
no definite answers have emerged. Meanwhile, in 
a related area efforts are being made to map and 
evaluate the spatial distribution of finance as an 
economic activity and to associate this distribution 
with trading areas of cities. 

Most geographical work in international trade 
has been pragmatic. One study has associated 
broad regional patterns (countries or groups of 
countries) and degree of dependence upon cate- 
gories of exports and imports, and another has 
classified countries by degree of dependence upon 
exports. Still another has examined the free port, 
concluding that it may have outlived its usefulness 
in technically advanced, highly industrialized coun- 
tries. Thoman and Conkling (1967) appraised na- 
tional and bloc trends in international trade be- 
tween 1938 and 1963, dependence of individual 
economies upon exports and export specialties, and 
logistics and mechanics of such trade. 

Transportation geography. Although transpor- 
tation long has been treated in the literature of 
economic geography, its full significance is only 
coming to be realized as the functional region, 
which depends for interpretation largely upon the 
flow of commodities and people to and from central 
places, comes to the forefront of attention. This is 
especially true of linkage studies involving trans- 
portation and trade—the carrier and route, plus 
shipping costs, plus direction and composition of 
commodity and passenger movement, plus alterna- 
tive opportunities for such movement. Such link- 
age studies reveal the dynamic aspects of an area, 
whether for a specified time or with change over 
time. Again, all three categories of approach are 
to be seen, with qualitative interpretation utilized 
particularly to present an unusual idea or to ap- 
praise general conditions over a wide area where 
adequate numerical information is lacking. Some 
geographers, however, have produced excellent 
qualitative interpretations concerning areas, small 
or large, that are covered rather fully by census and 
comparable data. In addition, historical geographers
have provided insight into the development
of somewhat analogous transportation routes that 
have evolved at different times and for different 
purposes. 

Classifications by density of route and by gen- 
eral direction and function of route, again usually 
at continental or global levels of observation, are 
not new in transportation geography; but these 
have been augmented, usually at national or re- 
gional levels, by more-detailed and more-meaning- 
ful studies. Models have also been used to indicate 
the impact of highways on geographic change, to 
anticipate the development of transportation net- 
works, and to explain differential rates of growth 
in passenger traffic between leading airports. 

Regional economic geography. Several recent 
books and monographs have applied regional ap- 
proaches to economic geography. Gottmann (1961) 
evaluated the urban-economic aspects of Megalopo- 
lis, an urbanized section of the Atlantic seaboard 
of the United States stretching from Boston south- 
ward beyond Washington, D.C. Hance (1964) pre- 
sented a regional examination of Africa, based on 
many years of study there. Camu and his associates
(1964) produced a regional-systematic survey
of the economic geography of Canada, introducing 
a multiple-criterion regional construct and several 
new ideas in regionalization, including the areal 
distribution of the total amount and types of capital 
investment. These and similar works have carried 
forward the continuing aspects of traditional eco- 
nomic geography as expressed regionally. In addi- 
tion, as has been shown, many economic geogra- 
phers have contributed to regional science and to 
regional economic development. 

Other viewpoints. Economic geography is wide- 
ly accepted outside the United States and Canada 
and generally is defined in terms already stated. 
The field is especially comprehensive in the Soviet 
Union, by definition virtually replacing human 
geography (Geograficheskoe Obshchestvo SSSR 
[1961] 1962, especially pp. 31-44). Under the 
stimuli of dialectical materialism and national de- 
velopment, economic and physical geographers of 
the Soviet Union have devoted close attention to 
pragmatic aspects of their respective subdivisions 
of the over-all discipline. A keen interest also exists 
in the Soviet Union in planning, particularly in the 
roles of theory and measurement. An emerging 
school of thought there holds that geography 
should not be compartmentalized but be considered 
as a unit—a view somewhat similar to Ackerman’s 
concept of the discipline as a man-land system.

Europe and the United Kingdom have continued 
a long-standing interest in economic geography. 
German geographers have built on the work of
Thünen and Alfred Weber, fusing these studies
with evaluations of management practices, enter- 
prises, and land use to produce a well-founded in- 
ductive-deductive concept of the field (Otremba 
1953). The term “applied geography” has come 
into use, particularly in France but also in other 
parts of Europe and, recently, in the International 
Geographical Union (Phlipponneau 1960). Al- 
though not limited to economic geography, applied 
geography stresses maximum efficiency in man’s 
use of earth space. In the United Kingdom eco- 
nomic geographers have bridged the gap rather 
smoothly between the deductive and inductive ap- 
proaches, providing valuable and well-written re- 
views that take cognizance of historical develop- 
ment (Estall & Buchanan 1961; Chisholm 1962). 
Planning is of long-standing interest to British 
economic geographers, and recent work evinces 
growth of theoretical work (Haggett 1965). In 
Scandinavia, notably Sweden, theoretical approach- 
es have been utilized actively for a long time, 
although pragmatic work continues. 

There are many examples in other countries of 
application of the approaches already mentioned. 
Qualitative interpretation frequently is a detailed 
inventory of available resources under specified 
conditions and cutoff limits. Classifications are be- 
coming more numerous, and there is some experi- 
mentation with hypotheses. 


No one of the three approaches in economic 
geography necessarily is superior to the others, and 
more work is urgently needed in all. As additional 
data become available and increasingly standard- 
ized internationally, classifications and theories 
probably will become more numerous and will be 
produced mainly by committee or team efforts. 
Qualitative interpretations by individuals, however, 
always will be necessary—not only to provide un- 
usual stimuli and insights but also to bring together 
cogently the threads of complex ideas, a result that 
cannot be expected from anthologies. 


RICHARD S. THOMAN 


[See also CENTRAL PLACE; CONSERVATION; REGIONAL 
SCIENCE; TRANSPORTATION.]


IV
CULTURAL GEOGRAPHY 


Cultural geography as treated here is peculiar to 
American geography and can be understood as a 
complement to certain of the trends in American 
geography in the early part of this century. Cul- 
tural geography in a broader sense deals with any 
part of man’s culture in the same way that plant 
geography deals with the distribution of plant spe- 
cies and vegetation or that economic geography is 
concerned with the production and distribution of 
goods and services. Cultural geography in the nar- 
rower sense used here is also characterized by 
certain cultural topics with which it deals, although 
its unifying thread is its manner of using the an- 
thropological idea of culture to give meaning to its 
material. The tracing of continuity in space and 
time can help account for cultures and culture 
traits whose presence may not seem satisfactorily 
explained by their function in meeting overt ends. 
The subject matter of cultural geography has been 
winnowed by its need of such probing into origins. 
Cultural geography is not a self-sufficient field of 
study that produces all its own data and examines 
them as part of a closed system; it is rather an
exchange in which data and interpretations from
many sources are examined from one general point 
of view. 

Development. Physiography was emphasized in 
the formative period of American geography, which 
occurred around the turn of the century. The first 
human geography then admitted inquiry into se- 
lected relationships between man and his physical 
environment. Later, concern with the productive 
capacities of the land came to be coupled with the 
growth of an economic geography that concen- 
trated on production and trade. Economic geog- 
raphers either assumed or looked for a functional- 
ism or an adherence to economic laws that assured 
efficiency. 

Carl Sauer (1925; 1931) outlined a new cultural 
geography dealing with those elements of material 
culture that give character to area through being 
“inscribed into the earth’s surface.” The focus was 
to be on those works of man rather than on man 
himself. The study was to be empirical and historical,
without preference for environmental or
any other selected class of explanation. The ele- 
ments studied were to be broadly economic as well 
as material, although Sauer (1941) was later to 
expand their scope. 

Sauer’s proposals produced, first, a general 
change in the direction of American geography and, 
later, the more special cultural geography, whose 
early growth was chiefly through his own students. 
This cultural geography has increasingly occupied 
the territory its name and rationale have staked 
out; its content has been limited chiefly by what 
other fields have previously claimed. 

European geography contributes a great portion 
of the material of cultural geography, especially in 
dealing with Europe’s own rich heritage, but mostly 
under the heading of a general human geography 
or a less inclusive social geography. The models 
for Sauer’s program were heavily German, espe- 
cially Friedrich Ratzel’s work on culture spread, 
Eduard Hahn’s work on agricultural development, 
and regional studies focused on settlement history. 
Kulturgeographie continues as a broad division of 
geography in Germany, where the modern idea of 
culture developed, but it does not parallel the Amer- 
ican cultural geography as a specific hub for the 
swapping of ideas (National Research Council . . . 
1965). The absence of a recognized cultural geog- 
raphy in Britain and France is hardly surprising, 
for the culture concept is less used in those coun- 
tries (the genre de vie of Vidal de la Blache is 
similar in use but much less inclusive).

Content. Most studies in cultural geography 
develop one or more of the following subjects:

The growth of man’s exploitation of his habitat.
The study of man’s use of his environment includes 
such topics as the early use of tools and imple- 
ments, domestication of plants and animals, and 
the various economies of food production. Human 
development and human invention both have geo- 
graphic dimensions in that they are composed of 
specific events occurring in specific places. The 
geographer’s concern with both the spatial arrange- 
ment and the qualities of habitats qualifies him for 
taking part in the reconstruction of man’s past as 
well as in the understanding of the present. The 
cultural geographer’s interest in the past begins 
with the beginning of man and follows his wander- 
ings with ever more cultural equipment into new 
surroundings (Sauer 1952). 

Archeology, history, physical geography, and 
field observation provide much of the raw material 
for work in this field. Archeology and history, each 
limited by its sources of data, must remain incom- 
plete records. Cultural geography often asks ques- 
tions they are least likely to have answered. Thus, 
many early chapters in man’s growth can at best 
be speculative theories that may remain unverified. 

Physical change induced on the surface of the 
earth by man. Man’s material advance leaves its 
mark on the earth he works. Physical change may 
be an inadvertent product of man’s use of the 
land: soil erosion induced by cultivation, soil en- 
richment around human habitations, and vegeta- 
tion change induced by grazing or burning. It may 
be a deliberate change, such as the clearing of 
woods or terracing of hillsides for farming (Spen- 
cer & Hale 1961). The actions of man and nature 
have been of such duration and of such intermixture
as to be often no longer separable.

American concern with man’s part in the proc- 
esses of physical geography dates from George P. 
Marsh’s writing, in 1864, of Man and Nature. A 
more recent study, Man’s Role in Changing the 
Face of the Earth (International Symposium . . .
1956), explores the earth as the imprint of man’s
way of life, as a record of his past, and as a material
resource for the present and future. The processes
that explain the past may serve as guides for
the future. 

Settlement forms—rural and urban. Settlement 
forms make up a large part of the features of the 
man-made landscape. Study of rural settlement has 
dealt mostly with house types, the arrangement of 
houses and other structures in relation to each 
other and to road networks, and the arrangement 
of fields. Although the settlement of the United 
States is recent, the history of its house forms is 
already partly lost, and tracing them is difficult
(Kniffen 1965). The settlement patterns, more
easily reconstructed, have converged on variations 
of the isolated farmstead. For most of Anglo- 
America the field pattern had to fit the rectangular 
survey; older areas show the confusion of metes 
and bounds, but traditional patterns survive, par- 
ticularly in the old French riverine settlements. 
Reconstruction of the history of American settle- 
ment forms has not proceeded nearly so far as the 
inventorylike studies of rural patterns in Germany 
or of rural houses in Italy. Even where inventories 
are thorough, the forms have resisted satisfactory 
explanatory generalization. 

American geographers have done relatively little 
with urban settlement forms. Enticing opportuni- 
ties for study may be found in regionally dominant 
town and city plans, the pattern of street and lot 
layout, and cross-city comparison of house types, 
building materials, and architectural styles. 

Nonmaterial culture, such as language and re- 
ligion. Languages are considered the most reliable 
ethnic tracers, since the arbitrary choosing of words 
from a very large pool makes accidental repetition 
most improbable. Further, language, as a means 
and a mark of intercommunication, is a maker and
a product of group cohesion and, hence, of cul- 
tures. Analysis of languages and their distributions 
has usually been the work of the specialist, but the 
results are widely used in geography. Toponyms 
are components of language that are very close to 
geography because of their fixture on the land and 
their frequent reference to its qualities.

Religion, also a conservative marker of peoples, 
is a social institution with significant spatial struc- 
ture and a molder of the cultural landscape. Geog- 
raphy deals with religion in ways that range from 
the distribution of specific religions to the expres- 
sion of a primal sense of order in the landscape— 
for example, orientation of streets or property lines 
with the compass (Isaac 1965). 

Origin and spread of cultures and civilizations. 
The spread of culture is most simply studied 
through particular culture traits, but often an en- 
tire complex of culture traits may be welded to- 
gether by a powerful or influential people and 
spread over a wide area in relatively uniform 
fashion. A way of living or a civilization may be 
traced from its inception to its expansion into a 
greater territorial base, until finally it reaches its 
limits and is absorbed or replaced by another ex- 
panding culture. The growth of potamic civiliza- 
tions from their home in Mesopotamia, the growth 
of the Chinese nation from its culture hearth in 
the Wei Valley, or the grafting of the Marxist 
politico-economic complex onto a variety of cultural
trunks in the past fifty years would be suitable
subjects for such analysis.

Geographers have treated cultures in a variety 
of ways, such as dividing the world into major 
culture regions (Russell & Kniffen 1951), exam- 
ining the development of cultures and subcultures, 
mapping the core and fringe areas of cultures 
(Meinig 1965), and studying culture islands. 

Cultural evaluation of the environment. The 
favorite theories of geography are often themselves 
generalized evaluations of environment. Their 
change with the passage of time is evidence that 
they too, however reasoned, are part and parcel of
changing culture. What man does with his natural 
resources depends on his technology, on his per- 
ception of his natural resources and of his place 
among them, and on a complex of values concern- 
ing the present and future. Clarence Glacken 
(1956) has studied man’s place in nature from 
the viewpoint of changes in Western ideas about 
the habitable world. Non-Western conceptions, as 
shaped by cosmologies, modified by experience, and 
revealed in language, are also essential to interpre- 
tation of living patterns (Lowenthal 1961). 

The environment may be graded in aesthetic 
terms. Modification of the landscape, including the 
productive portion of the landscape, may then be 
guided by aesthetic senses and axioms. Yet the 
perception of the environment and the responses to 
environmental stimuli are probably not purely cul- 
tural. Sonnenfeld (1965) has asserted the need for 
isolating the noncultural parts of the behavior that 
relate to the environment and effect the shaping of 
the landscape. To do so would better sort the vari- 
ables with which the cultural geographer deals and 
would clarify his ideas of causality. 

Purposes. Many studies are undertaken to pro- 
vide factual answers to specific questions. Some 
studies are primarily in the geographical tradition 
of exploration; these include many of the regional 
inquiries in cultural geography (e.g., Wagner 
1958) as well as more specific data-collecting trips 
(e.g., Zelinsky 1958). Other studies seek links in 
the solution of specified larger problems, be they 
geographic in nature or otherwise. A major theme 
of Kniffen’s work has been the use of a particular 
culture trait as an index to migration and diffusion 
and the culture regions they shape. 

H. C. Brookfield (1964), in a thoughtful critique 
of American cultural geography, noted its fre- 
quent reluctance to compare, generalize, or explain, 
especially if doing so meant going into social organ- 
ization or social attitudes. American cultural geog- 
raphers have often preferred to focus their imme- 
diate interest on filling selected gaps in knowledge 
rather than on using the material gained primarily
as a means to further the abstract concepts of the 
field. There are advantages in developing a reliable 
body of elaborated description relatively free of the 
data selection that would be suitable for testing 
prechosen abstractions. However, cultural geog- 
raphy should also work ahead with generalization 
as fast as the data and advance of theory permit. 
Some division of labor may optimize the exchange 
between description and abstraction. 

Purposes of scholarly study are often indetermi- 
nate. Idle curiosity may impel the investigator, 
even while specific hope of practical application 
accounts for his financial support. Many of the 
world’s problems are not abstract generalizations 
but are stresses that arise from unique local con- 
ditions. A geography that views every place as 
sui generis is a likely source of help. The theme of 
man’s use of his environment, enmeshed in most 
of cultural geography, gives geography its widest 
views and most likely application on a world scale. 
Certainly the cultural geographer is not much 
drawn to his study by the hope of immediate appli- 
cation, even though it is with past and present 
application that he constantly deals. Rather he is 
lured on by the prospect of a better articulated view 
of man’s work and works in their terrestrial frame. 

Methods. Studies of small areas are most likely 
to depend for their data on field observation. Those 
on a world-wide scale necessarily depend on sec- 
ondary sources. Spatial and chronological analysis 
of the data often form the logical core of the study. 
The former seeks simplification through the demonstration
of spatial order or pattern; the latter
emphasizes change in historical depth or the more 
detailed sequence of change known as process. 

Spatial arrangement of data is an essential char- 
acteristic of geography. The map is the visual 
means of arranging the data in its spatial order. 
(To spread out or to unfold in this manner approxi- 
mates the literal meanings of explanare and ex- 
plicare, bases for the verb explain in English and 
the modern Latin languages). Mapping the distri- 
bution of a culture trait or complex does not con- 
stitute an explanation, but it leads to hypotheses as 
to how the culture trait developed. One common 
method of map interpretation is to seek correlative 
distributions that may also be causal associations. 
A second method is to treat the distribution as a 
changing product of diffusion and extinction and 
to seek the conditions that have governed the 
changes (Zelinsky 1958; Spencer & Hale 1961). 
An isochronic map is useful in combining both 
temporal and spatial analysis. 

Chronological arrangement of data is essential
to the study of culture. The conservative nature of
culture, which follows earlier models even in the 
process of change, encourages one to study it by 
tracing its continuity. If a device is not demon- 
strably functional in every aspect of its design, it 
may be explained genetically by tracing its origins 
and movements. John Leighly emphasized this type 
of cultural explanation in his proposal for a study 
of the tangible works of man in the landscape in 
the terms of art history, stressing “the essential 
time-bond of culture rather than its looser place- 
bond” (1937, p. 135). The culture concept pro- 
vides a means of giving intelligibility to what, at 
least to people removed from the particular cul- 
tural context, seems irrational. 

Plumbing the basic reasons for cultural choices 
may provide a functional explanation of what was 
once considered irrational. Cultural geography 
would become less dependent on genetic explana- 
tion as its mainstay to the extent that culture 
change—or lack of it—could be explained in terms 
of human satisfactions. Predictions of the direc- 
tion of culture change will lead to more application 
of the findings of cultural geography. Any theory 
of culture change could have a corresponding the- 
ory of cultural geography as the spatial expression 
of the change. Uses of the culture concept in re- 
gional study by geographers and anthropologists 
have been considered by Thomas (1957). 

A battery of general methods is summed up by 
Wagner and Mikesell (1962, p. 24): “Who? Where? 
What? When? and How? The themes of culture, 
culture area, cultural landscape, culture history, 
and cultural ecology respond to these queries.” The 
themes imply cross sections of investigation, most 
of them with conceptual dimensions rather than 
geometrical ones as in the case of the map. Since 
cultural geography deals with such a great range 
of phenomena, its methods must also be varied. 

Persistent questions. The objects of study in 
cultural geography may be seen as forms—abstract 
or concrete, single or complex. Many of the general 
questions of cultural geography hinge on the der- 
ivation of these forms, which is variously sought 
in environment, function, ideology, technology, 
ornamentation, previous forms, and the chances 
of invention and accident. The derivation of one 
form does not necessarily serve as a model for the 
derivation of others. 

The role of the physical environment. Cultural 
geography can deal with man’s relations with his 
physical environment at any depth of understanding
of this environment. Sauer’s work with early
man leaned heavily on assessments of environmen- 
tal change and of environmental opportunity to 
supply gaps in the a posteriori record. On the other
hand, the continuity of culture can be traced over
the earth without attention to the physical environ- 
ments through which the culture spread, although 
to do so is to ignore a part of the possible ex- 
planation. 

The reaction against environmental determinism 
(the doctrine that the physical environment deter- 
mines the way in which man lives in an area) has 
united with a culturally oriented geography to pro- 
duce a new geographic etiology that stresses cul- 
tural determinism (a doctrine that emphasizes the 
role of culture as opposed to that of the environ- 
ment). One statement of cultural determinism is 
that culture determines what the environment 
means to man. An even stronger position holds 
that man’s perception of the environment is all that 
matters about the physical environment; since perception
is culturally controlled, explanation of human
behavior is then deemed to be cultural. Cultural
determinism presents useful views of the
continuity of culture growth, but is less successful 
as an analysis of cause (a precursor without which 
the result would have been different). One must 
remember, first, that perception, by its definition, 
is not merely hallucination (i.e., independent of 
external stimulus) and, second, that the environ- 
ment may not respond to man’s management in 
just the way his perception of it orders. 

The difficulties of the man-nature and culture-environment
dichotomies are sometimes dissolved
in union. Man is often conveniently viewed as a 
part of nature. Man, culture, and environment in 
a land have been treated as one in sketches of 
regional “personality.” 

Form versus function. What part does function 
play in cultural design and what part do previous 
forms play? How fast do forms change to conform 
to changing needs? Are forms also molded by 
aesthetic considerations? A barn is built for certain 
purposes, and a given design is likely to be retained 
only so long as it fulfills its function reasonably 
well. But the differences between two barns per- 
forming the same functions must be explained on 
some other basis. Even if the differing features 
mirror cultural traditions, they may still have origi- 
nated in functions of other times and other places. 
Purely decorative features, too, sometimes origi- 
nate in function. Relic forms may continue to be 
built or may survive in structures that have outlived
their original functions. Whether function or
form is stressed in any particular comparison may 
depend on whether the similarities or the differ- 
ences are sought. 

Form and ideals. A simple feature in the land- 
scape, say a post, serves its function and reflects 
some particular post-making tradition. It probably
has no particular relation to the ethos or set of
ideals of the people who use it. A more complicated 
form with a more complicated function, say the 
physical form of the village or community built to 
facilitate a way of living, is more likely to reflect 
the distinctiveness of that way of living or of the 
ideals that lie behind it. An ascetic people, not 
given to social intercourse, probably would not pro- 
vide their town with plazas and esplanades or dec- 
orate it in bright colors. The few geographic studies 
that deal specifically with form in relation to ideol- 
ogy suggest that rather different peoples may adopt 
the same readily available forms for the same overt 
purposes. An account of a Dutch Reformed settle- 
ment in Michigan (Bjorklund 1964) tells how im- 
migrants largely gave up their old forms in favor 
of the common American forms, but it also shows 
how the American forms still conformed to the old 
ideology. If identical forms conform to different 
ideologies, we must assume that the differing peo- 
ples see the forms as fulfilling different inner func- 
tions and gain different satisfactions from them. 
In another study Philip Wagner concludes: “Nicoya 
suggests that thoroughgoing transformations in the 
social sphere may sometimes produce only moder- 
ate variations in technology and landscape, and 
thus that two or more very dissimilar societies may 
differ little in the way they conceive and utilize a 
given habitat” (1958, p. 248). Conversely, peoples 
with similar ideologies are likely to evolve different 
community forms for their living. Inquiry into the 
regulation of form by ideals should be carried 
further. 

Forms of distributions. The forms of distribu- 
tions are also subject to systematization. The sig- 
nificance of continuous and discontinuous distribu- 
tions, the relation of area of origin to area of 
greatest intensity, or the persistence of the core 
area of a culture region are all aspects of the struc- 
ture or form of culture areas. 

Relation to other fields. Historical geography is 
the division of geography most closely related to 
cultural geography. The two complement each 
other in a cultural-historical geography, in which 
history provides the explanation of culture and 
culture provides the organizing concept for the sub- 
jects of most geographic interest in history. His- 
torical geography can stand alone, however, for it 
is often not organized around the culture concept. 
On the other hand, culture is always dependent on 
the past for its explanation. 

Economic geography and cultural geography 
could go separate ways as long as one depended 
primarily on economic laws for its explanation 
while the other depended on the patterns of the 
past. But an entire system of economic laws may
be found to be a culture complex that has evolved
somewhat accidentally and not in an inescapable 
mesh of cosmic law. Even in our market economy 
the prices bid for goods may be but expressions of 
cultural preference (for example, the American 
preference for corn-fed ham versus the European 
for barley-fed). And within a changing economy 
new institutions, products, and types of enterprise 
originate and diffuse in a fashion suggestive of the 
culture traits that they are. On the other hand, the 
relevant economic laws are also necessary parts of
the cultural explanation. Thus, economic and cul- 
tural geography join in any broad view of the two 
fields. 

Political geography, like economic geography, is 
supported by an independent discipline that has its 
own laws (although less precise), and its systems 
fulfill stated functions. A nation represents an idea 
that had an origin and has been spread with some 
show of power to the borders of the land. The 
generic idea of nation is again a cultural concept 
that had an origin and a diffusion that just now 
seems about to complete the circuit of the earth. 
Each nation depends on a community of interests, 
often including such culture traits as a national 
language and religion. The behavior of its voters, 
too, reflects persistent patterns in its regional 
culture. 

Cities have both economic and political dimen- 
sions, although the economic has loomed larger in 
urban geographical studies. City planning and ur- 
ban sociology are cognate disciplines that help pro- 
vide the rationale for an independent urban geog- 
raphy. At the same time, cities are concentrations 
of culture that are very sensitive to cultural differ- 
ences. Internally, the distribution of subcultures 
within a complex city and the city landscape are 
equally concerns of cultural geography. Externally, 
urban functions and rural-urban attitudes vary 
from culture to culture. 

The mathematical formulations now popular in 
economic and urban studies will be most success- 
ful if they can describe the regularities and impli- 
cations of cultural behavior. Swedish studies have 
originated quantitative analysis of culture trans- 
mission and have attempted to simulate both its 
rational and random qualities. 

The idea of culture provides a frame into which 
man-made functional systems fit. The narrower 
cultural geography and the fields that deal with the 
functional systems that are also a part of culture 
can fuse into a broader cultural geography in 
which the culture concept constantly insists on a 
proper relativity in time and space. Grasp of the 
concepts of culture and cultures is perhaps the 
quickest route to a viewpoint not entirely bound
by one’s own culture and from which one can even
see one’s own culture in some perspective. Culture 
is a fit mediator in a study whose point of de- 
parture is the comparison of different peoples and 
lands. 


EDWARD T. PRICE 


[Other relevant material may be found in CARTOG- 
RAPHY; CULTURE; ENVIRONMENTALISM; LANDSCAPE; 
and in the biographies of MARSH; RATZEL; SAUER;
VIDAL DE LA BLACHE.]


V
SOCIAL GEOGRAPHY 


No generally accepted definition of social geog- 
raphy exists. The variety of literature which has 
appeared under the title of social geography is 
astounding; even within particular schools there 
are wide disparities of approach and definition. 
With some notable exceptions, for example, in 
Sweden and Holland, social geography can be con- 
sidered a field created and cultivated by a number 
of individual scholars rather than an academic tra- 
dition built up within particular schools. Further- 
more, for many people the term “social geography” 
itself is in disfavor because of its past association 
with various forms of determinism that postulated 
a causal connection between society and the geo- 
graphical environment. 

Perhaps, therefore, the best way to examine 
social geography is to establish a general theoretical 
outline of the field and, on this basis, to review the 
existing literature. Naturally, many of the works 
relevant to what is here called social geography will 
have been written as contributions to some other 
discipline. 

The argument that social geography is a neces- 
sary discipline can be made in at least two ways. 
One is by analogy with other, better established 
branches of geography. A widely accepted definition
of “human geography” is that it deals with
mankind in the context of his total geographical 
milieu. For the purposes of analysis this milieu has 
been subdivided into separate categories corre- 
sponding to various orders of human activity, for 
example, the economic, the political, and the cul- 
tural. Therefore, one could postulate that social 
geography is the subdivision of geography that 
deals specifically with the social order, or that it is 
the systematic study of the social dimension in 
areal differentiation. 

An alternative way is to begin with the definition 
of geography as the study of similarities and con- 
trasts between places on the face of the earth. 
Society, that is, social organization and values, pat- 
terns of social movement and interaction, and 
social dynamics and change, plays such an impor- 
tant role in producing similarities and contrasts 
between places on the earth that it justifies syste- 
matic consideration within the discipline. 

The question immediately arises as to how to 
isolate this social dimension for independent study. 
In fact, since human activities characteristically 
are group activities, how can human geography be 
anything else but social? The virtually interchange- 
able use of the terms “human” and “social” by 
several geographers in the British and Dutch schools 
serves to emphasize the logical (and etymological)
basis for this question. Yet, although in the evolution
of human geography emphasis has been placed
in varying degrees on purely social elements—and 
although languages, races, and religions have rarely 
been excluded from consideration—the function of 
these social elements in the total conceptual frame- 
work has not been very clear. In fact, the idea that 
such social elements could be systematized into a 
general framework for geographical analysis has 
been only recently proposed (Bobek 1959; van 
Paassen 1965). 

There are two primary questions social geog- 
raphy must answer: How do mankind's social 
characteristics vary through space? How do these 
characteristics affect (or reflect) man’s adaptation 
to and adaptation of his total geographical milieu? 
Since such questions touch every aspect of human 
geography, it is difficult to conceive of social geog- 
raphy as a separate field. Its distinctive feature 
would thus appear to lie more in its focus and 
objectives than in any clearly delineated subject 
matter. In practical terms, the traditional twofold 
method of geography can be applied to these cen- 
tral questions in the following way: by the exam- 
ination of spatial variations in the distribution and 
interaction of social groups within their total geographical
milieus and by the examination of differential
patterns of society's use of the earth, as
indicated in settlement forms, livelihoods, circula- 
tion networks, and land use patterns. While the 
first method implies a morphological or formal 
study of world social patterns, the second method 
implies a functional interpretation of such patterns 
in terms of their underlying social processes.

Having thus outlined, in broad terms, the place
and function of social geography, let us now see 
how these fundamental questions have been studied 
in the past. From such a general and necessarily 
eclectic survey we may discern some of the major 
conceptual and technical ingredients from which a 
definition of social geography can be formulated. 


The development of social geography 


Studies explicitly or implicitly directed toward 
the exploration of social geography can be con- 
sidered under two broad headings: first, the his- 
torical precedents, which fall roughly into three 
major stages, each one characterized by a different 
approach; and second, the works of twentieth- 
century geographers. 

Historical precedents. Descriptive reports writ- 
ten by explorers and men of letters during classical 
times, for example, the writings of Herodotus, 
Thucydides, Strabo, and others, provide the first 
written recognition of world social differences. Such 
encyclopedic descriptions continued to appear in- 
termittently in the Occident up to the seventeenth 
century, for example, the accounts of Marco Polo 
and the lettres édifiantes of Jesuit missionaries. 
The twofold implication of these works was that 
social life takes various forms in different parts of 
the world and that these differences are caused by, 
or at least are associated with, differences in the 
physical—particularly climatic—environment. 

A second phase consisted of the various philo- 
sophical reflections on these and later geographical 
discoveries. On the one hand, speculative thinkers 
sought normative principles for an ideal social 
order from natural law, and, on the other hand, the 
positivists insisted that such principles should be 
sought in the existing and empirically observable 
conditions of society. The essential message of this 
second phase was that there is a rational order in 
world society and that this order can be discovered 
deductively (speculative approach) or inductively 
(positivistic approach).

A third and far more significant phase began in 
the nineteenth century, accompanied in France by 
the emergence of the idea of democracy, in Ger- 
many by the rise of national consciousness, and 
elsewhere by the slow yet effective permeation of 
a “scientific” approach to knowledge. Ethnographers 
and historians were among the first to study world
social variations in a systematic way. As early as 
1725 Giambattista Vico suggested that human de- 
velopment followed an identical series of stages 
and that the actual variations in world society at 
any particular time were due to their differential 
positions within that series. Later in the eighteenth 
century, Johann Gottfried Herder in Germany and 
Condorcet in France expressed similar ideas. The 
geographer Johann Georg Kohl examined the social 
function and significance of various settlement 
types; later, his colleague E. Hahn (1896) studied 
the evolution of livelihoods and demonstrated the 
religious and social origins of some economic prac- 
tices. Yet this “scientific” approach to the study of 
mankind’s social differences was also associated 
with exaggerated single-factor explanations, for ex- 
ample, the biological interpretation first expounded 
by A. Schäffle (1875-1878) and the psychological
interpretation, which found its fullest expression in 
the Durkheimian school in France. Friedrich 
Ratzel’s Anthropogeographie (1882-1891) incorpo- 
rated both these elements: the ecological view of 
society within its natural environment and the role 
of human intelligence (the “idea”) in enabling man 
to overcome physical barriers (1901). Unfortu- 
nately, the latter perspective did not emerge too 
clearly in his monumental work—on which the 
whole tradition of anthropogeography has been pat- 
terned—and so his name has been linked with the 
idea of society being determined by the physical 
environment. His Politische Geographie (1897) 
and some articles (1876; 1901) in fact contained 
hypotheses that were far more relevant to social 
geography than the Anthropogeographie. 

One of the most significant precedents to social 
geography in the nineteenth century was the work 
of Frédéric Le Play. Disdainful of the various 
a priori explanations of society prevalent in his 
day, he set out to study the actual social conditions 
of worker families in France. His famous monograph
technique produced an encyclopedic inventory
of social facts, and from a great number of
studies he deduced certain basic types, which then 
served as bases for comparison. Traces of Le Play’s 
analytical formula lieu-travail-famille, later adapted
by Geddes (1915) into the formula “place-work-folk,”
can be found in the writings of such
early British geographers as H. J. Fleure (1918). 
French geographers inherited important elements 
from Le Play, for example, the monograph technique
in empirical field studies, but the most important
legacy of lieu-travail-famille was the social
survey movement, which flourished in Britain and
America during the early part of the twentieth century.

Many geographers, such as Ritter, von Humboldt,
Hassinger, Ruhl, and Hettner in Germany,
Reclus in France, George Perkins Marsh in Amer- 
ica, and H. J. Mackinder in Britain, deserve recog- 
nition as pioneers of social geography. However, 
the three major channels of thought that contained 
the most useful concepts were those initiated by 
Le Play (the social survey movement), Ratzel 
(anthropogeography), and Durkheim (social morphology).

Twentieth-century social geographers. The mutual
relation of society and environment was a
subject that aroused great speculation and interest 
at the turn of the century. Yet there was no dis- 
cipline equipped to embrace the entire question. 
Ratzel had made an abortive attempt to do so, and 
his environmentalist disciples exaggerated rather 
than corrected the deterministic premise of an- 
thropogeography. Many scholars, particularly the 
Durkheimian sociologists, remained unconvinced 
that geography had any right to entertain such a 
monumental task. 

At this juncture came one of geography’s great- 
est entrepreneurs, Paul Vidal de la Blache. Society 
for Vidal (1896; 1902) and his school could not 
be explained entirely in terms of biological, psychological,
or environmental interpretations. It was
rather an intricate network of ideas and bonds that 
provided stability and orientation to human life 
within particular geographical milieus. In his clas- 
sical studies of the Mediterranean world and of 
monsoon Asia (1917-1918), Vidal demonstrated 
the complex, yet harmoniously balanced, interplay 
between human institutions and particular natural 
settings. Genres de vie (literally, patterns of living) 
were the concrete expressions of a society’s ongoing 
contact with nature: sets of techniques, cemented 
through tradition, whereby human groups secured 
the material necessities of life within a functional 
social order (Vidal 1911; Sorre 1948). Repeated 
experiences in meeting life's common problems 
within a particular geographical milieu occasioned 
the development of community consciousness, 
which made a genre de vie truly an ecological sys- 
tem. Variations of this basic concept appear in the 
literature of other disciplines, for example, social 
anthropology (Kroeber & Kluckhohn 1952; Redfield 
1955), American human ecology (McKenzie 1934) 
and urban sociology (Park & Burgess 1921). By 
means of genre de vie and other concepts, the 
French school of human geography replaced the 
exaggerated Ratzelian notions of environmental de- 
terminism by the more elastic concepts of possi- 
bilism and dismissed the charges made in the 
Année sociologique between 1890 and 1910 more 
by substantive works than by theoretical arguments.
“La géographie humaine,” thus formulated,
was a social geography in the broad, integral sense:
all other dimensions of the human milieu were 
studied from the vantage point of society. Many 
British and American human geographers followed 
almost identical lines, while the Dutch “sociale 
geografie” was the direct equivalent of the French 
“géographie humaine.” The kernel of this orienta- 
tion, namely, society as the source and framework 
for all human activity, reappears in the work of 
Hans Bobek (1959) in Vienna. Lucien Febvre’s
famous apologia (1922) articulated the philosophical
and historical raison d’être of such a discipline.

To Vidal's essentially ecological approach, his 
disciple Jean Brunhes added the important dimen- 
sion of group psychology, asking, for example, why 
similar environments were used in entirely differ- 
ent ways at different periods in history. He defined 
social geography as the third level of complexity in 
human geography’s fourfold structure. The fourfold 
structure included the primary groups of family, 
kin, and culture; the secondary groupings of liveli- 
hood and special interest; the various forms of 
spatial interaction within and among these groups; 
and, finally, the legal systems which institutionalize 
a society's subdivision and access to land and property
([1910] 1924, pp. 36-46). This definition, admirably
suited to the study of European, particularly
French, rural society of the early twentieth
century, remained the basic framework for social 
geography among British, French, and Dutch scholars
up to World War II. Most of the early studies
in social geography were regional in character, and 
their excellence consisted more in their artistic 
cohesion and integrative descriptions than in their 
analytical or theoretical expertise. The empirical 
conditions which favored the use of the regional 
framework by French scholars did not exist to the 
same extent elsewhere; this partly explains the 
divergence of orientation and method which devel- 
oped among the various schools of human geog- 
raphy. 

During the 1930s, British social geographers 
were involved in methodological controversy. Does 
social geography consist in merely mapping man- 
kind’s social characteristics, or must it also analyze 
the processes involved in relating a society to its 
geographical environment? What is the relation 
between social geography and human ecology? 
Why not replace the term “human” by “social” as 
the generic term to signify all the nonphysical 
aspects of geography? 

The fundamental dichotomy between a formal 
and functional approach expressed in this British 
debate reiterated the duality that had developed in 
Holland since the 1920s. While at Utrecht the 
study of social groups within their territorial 
framework (de Vooys 1950) was being pursued
along the lines of the French school, at Amsterdam 
Steinmetz’ “sociography” was used to study the 
entire social content of space as a system in itself 
—aside from any considerations of a group's rela- 
tion to its natural environment. The birth of soci- 
ology in Holland—particularly rural sociology as 
a separate discipline—has no doubt modified the 
original disciplinary orientations of these two 
schools (van Paassen 1965). 

Prior to World War II little attempt was made to
systematize the elements of social geography. In 
general, the important associations evident in the 
spatial organization of society—particularly in the 
United States—appeared in the literature of human 
ecology (Theodorson 1961) and urban sociology 
(Park & Burgess 1921). One major exception, of 
course, was the work of the environmentalists in 
examining connections between human behavior 
and the geographical environment (Thomas 1925). 

Pierre George and Maximilien Sorre (1943- 
1953) were the first great systematizers of social 
geography. In George’s works a close link is main- 
tained between social and economic aspects of hu- 
man behavior, the social being one facet of the 
economic (1946, p. 1). For Sorre (1948, pp. 13- 
16, 66-122) society represented a system of tech- 
niques—family and kinship systems, livelihoods 
(genres de vie), languages, and religions, each one 
having a specific influence on the spatial organ- 
ization of mankind and his work. Sorre’s schema 
does not make clear, however, whether social geography
consists of a series of systematic subfields
based on these various kinds of techniques, or 
whether a distinction is to be made between the 
“social” and “political” techniques. In his work all 
forms of organization from family and kin groups 
to giant political blocs form a continuum (1961, 
pp. 211-264). Gourou’s more comprehensive con- 
cept of civilisation (1964) comprises both material 
techniques (modes of production) and spiritual 
techniques (ideas, values). These three approaches 
at generalization are important because they try to 
maintain the integral and holistic character of 
social geography at the same time that they estab- 
lish some order and a basis for comparative work. 
Bobek has made a similar attempt to construct 
a spatiotemporal framework for world society 
(1959). His work is a fertile synthesis of French 
and German traditions: his systematic framework 
is based on a holistic approach involving types of 
societies defined in terms of their actual use of 
their geographical environment (1961). 

Several other attempts to formulate the prob- 
lem of society in geography in terms of a particular 
systematic framework have appeared: for example, 
those of Wagner (1960), Ackerman (1963), and
van Paassen (1965). Yet more characteristic of 
postwar work is the development of individual sys- 
tematic lines of enquiry, for example, geography of 
rural and urban life, population studies, and geog- 
raphy of religions and political behavior. Associated 
with this is a more lively va-et-vient between geog- 
raphers and scholars in other disciplines, particu- 
larly concerning questions of rural and urban life 
(Friedmann 1953) and regional planning (Phlip- 
ponneau 1960). Studies are still being made within 
a regional framework, but the focus has changed. 
Juillard in Alsace (1953) studied particular social 
problems from a regional perspective, while Roche- 
fort in Sicily (1961) studied regional life from the 
perspective of the social processes at work. Such 
reorientations have, of course, raised new method- 
ological problems and prospects. Chatelain (1947; 
1953), for example, postulates a duality between 
the geography of social classes (a kind of social 
morphology) and the geography of social life (a 
sociological geography). Claval (1964) envisions 
the latter as the most feasible future direction for 
the discipline, citing the work of W. Hartke at 
Munich as an example. It is difficult, however, to 
see how these two aspects of the field can be 
separated. 

To label the research being done at Munich as 
sociological geography may be misleading. Certainly
the perspective is social: social-geographic
differentiation (sozialgeographische Differenzie- 
rung) implies that social values—as expressed in 
the occupational structure—are the primary agents 
of landscape differentiation. Thus, maps of socio- 
professional structure (Sozialkartierung) for a 
series of periods are collated with a corresponding 
series of land-use patterns (Nutzflächenkartierung),
and significant associations are sought. This
basic formula has been applied successfully both 
in rural and urban contexts. Geipel’s study (1952) 
of one German region, for example, demonstrated 
that the sources of regional unity—which varied 
at different periods—are found essentially in 
the collective decision-making mechanism of the 
regional community. This is quite a contrast to 
the sources of regional unity commonly sought 
in the natural (physiographic) or economic (agri- 
cultural) landscape. Hartke (1956) demonstrated 
that regions where this phenomenon existed had 
similar geographic (regional) characteristics. Associations
found in urban studies are even more
interesting. Hartke’s intraurban corridors (Pas- 
sagen) suggest some qualifications to the tra- 
ditional concentric zone and sector theories of 
urban structure, while his study of urban expansion
patterns provides new bases for the classification
of cities (1961). 

In marked contrast to the inductive, empirical, 
and microscopic approach of the Munich social 
geographers is the more highly developed theo- 
retical and deductive approach found in Sweden. 
Torsten Hägerstrand (1952) and Sven Godlund
(1956) have applied refined mathematical techniques
to the study of migration, rural-urban interaction,
circulation, and other dynamic aspects of
the field. One of the most interesting developments 
has been the use of simulation models for the 
analysis and prediction of spatial movement.

This approach has been adopted and modified in 
the postwar period by a number of American geographers.
At Iowa spatial models have been used
to study the distribution patterns of schools, 
churches, and settlements, often with a view to 
spatial planning. Morrill’s study of Swedish towns 
(1963) exemplifies this approach. Yet, in general,
social geography in the United States is not a uni- 
fied field: on the one hand there are holistic re- 
gional studies, for example, Platt’s Saarland study 
(1961) and Broek’s southeast Asian study (1944), 
and, on the other hand, there are a growing num- 
ber of systematic studies in racial, linguistic, re- 
ligious, and other spheres. Some interesting associa- 
tions have been elaborated, for example, between 
religion, land use, and livelihood (Isaac 1959), 
between cultural pluralism and political integration 
(Lowenthal 1961), and between migration and po- 
litical behavior within ethnic groups (Lewis 1965). 
However, the exciting developments in the actual 
social geography of America have been treated 
mainly by foreigners (Gottmann 1961) or by 
scholars in other disciplines. 

Résumé of contemporary social geography. In 
general, the empirical record would seem to char- 
acterize social geography as a multifaceted per- 
spective on the spatial organization of mankind. 
The implication is that some important sources of 
areal differentiation emanate from society, thus 
reversing the premises of anthropogeography and 
other deterministic explanations of social differentiation.
Analysis of this social dimension in human
geography has involved two basic approaches:
the examination of the formal distributions of 
social phenomena as indices of areal differentiation 
and the interpretation of these distributions in the 
light of their underlying social processes. A recent
development, particularly in northwestern Europe, 
is the involvement of social geographers in inter- 
disciplinary research and regional planning. 

Nevertheless, the social dimension is one of the 
least studied aspects of human geography. Social
geography lacks definite boundaries and has
neither a central unifying concept nor even an 
agreed content. Instead, there are scattered indi- 
vidual efforts to analyze the changing social pat- 
terns of the modern world. Generalizations regard- 
ing the nature and potential function of the field, 
therefore, can only be proffered as suggestions, 
based on the substantive research directions and 
ideas of contemporary experts in the field and on 
the current trends and technical possibilities in 
other social science disciplines. 


The future of social geography 


The challenge. Social geography faces a set 
of challenges that are unprecedented. Revolutionary 
changes in world social patterns have rendered past 
analytical techniques obsolete, while philosophical 
and cultural currents within modern social life 
tend to increase the propensity to change of both 
reality itself and its social-psychological significance.
Thus, while technological, economic, and
commercial evolution tends to produce a certain 
degree of standardization in society's spatial order, 
there is a universal tendency to emphasize social, 
that is, ethnic, religious, or linguistic, differentia- 
tion. The philosophical problems of intersubjectivity 
and coexistence are ubiquitously discussed. “The 
home of contemporary man,” wrote Plattel ([1960] 
1965, pp. 1-2) “does not lie primarily in a localized 
environment, but in his fellow-man.” The traditional 
methods and objectives of social science are being 
fundamentally challenged. Analysis must some- 
how be broadened so as to arrive at a more holistic 
vision of social reality: the classical Cartesian 
premises underlying accepted research method- 
ology led to the discovery of systems, but mechanics 
and structures of systems constitute only a partial 
view of reality. Today both subjective (internal)
aspects of reality and objective (external) aspects 
of reality must be analyzed. Modern psychology 
and sociology have endeavored to meet this chal- 
lenge by forging new analytical techniques, and 
many other social science disciplines have adopted 
a decidedly behavioristic approach in recent years. 

In the light of these developments, the spatial 
patterns of world society assume a new signifi- 
cance; the immediate challenge for social geogra- 
phers would seem to be the collaboration with other 
scholars in the monumental task of describing 
world society within its geographical setting. For 
such an endeavor, social geography needs a uni- 
fying theme, a conceptual framework that will en- 
able it to contribute toward and benefit from the 
research efforts of scholars in related social science 
disciplines. Such a unified framework seems to be 
emerging from the work of some contemporary
social geographers. Some of its characteristics are 
described below.

Social space as central theme. Claval’s critique 
of contemporary social geography concludes that 
“to understand the geography of a place means to 
understand the social organization of those who 
inhabit it, their mentality, their beliefs, their ‘repre- 
sentations’” (1964, p. 123). Watson's study of 
Hamilton demonstrates how “The spatial pattern 
is, in the last analysis, a reflection of the moral 
order” ([1951] 1965, p. 476). In this article I have
postulated that the raison d'être of social geography 
rests on the fact that the social order is distinct 
from (even if closely interrelated with) the other 
orders of human activity in space. In order to de- 
scribe adequately this social dimension or order, 
contemporary thought would seem to demand the 
use of both internal and external perspectives. Is 
this possible? 

Sociologists, for example, Chombart de Lauwe 
(1956) and Gaston Bardet (1951), and human 
ecologists, for example, Firey (1960), have dem- 
onstrated the technical possibility of exploring a 
society's perception of its geographical milieu. 
Geographers, for example, Rochefort (1961), Bur- 
ton and Kates (1964), and Pataki (1965), have 
also shown that space has different meanings for 
different societies, and thus distance and spatial 
movement can no longer be considered in tra- 
ditional geodesic terms but must be considered in 
terms of those dimensions perceived by their hu- 
man occupants. For example, groups of Italians, 
Poles, Pakistanis, and Negroes may live side by 
side in one section of a city. Yet each group, be- 
cause of economic, historical, cultural, or other 
reasons, may possess an entirely different concep- 
tion of space. Some groups may have a social 
horizon that scarcely transcends the block in which 
they live or the set of stores in which they work or 
shop, while others may have social contacts with 
relatives thousands of miles away. Whether con- 
tact with distant relatives is frequent or rare does 
not influence the fact that a bond is perceived 
which ignores the barriers of space and time. The 
social geography of urban neighborhoods cannot 
ignore these differential attitudes toward space. 

This illustration, which challenges traditional 
notions of space, may lead to the impression that 
only the social-psychological conception of space 
matters. Rochefort (1963), in discussing this prob- 
lem, strongly emphasized that the real dimensions 
of geographic space must always be kept in mind. 
Therefore, the central conceptual problem in social 
geography is to define space in such a way that 
both subjective and objective dimensions are included.

Sorre’s response (1957) to this challenge was 
the concept of social space: the synthesis of real 
and perceived dimensions of space. The subjective 
component of social space in his view is embodied 
in the distribution of fundamental social groups, 
while the objective component consists of their 
concrete geographic setting.

Bobek’s concept of social landscape already ex- 
pressed the main idea that a unit of social space 
is a region or place in which one or several groups 
live and have a common set of ideas of their en- 
vironment (1943; 1948). The fundamental merit 
of this concept, as a central theme for social geog- 
raphy, is that it incorporates the traditional ele- 
ments of groups and environment, while redefining 
them in terms which are relevant to the examina- 
tion of modern society. Let us see how the method- 
ology of contemporary social geography could be 
organized around such a central theme. 

Subjective component—social groups. Sociolo- 
gy has shown how the dimensions and meaning 
of space are colored by the beliefs and group affilia- 
tions of its human occupants. Sociologists speak 
of ethnic space, religious space, and other spaces, 
and social morphology maps the distribution of 
groups on the premise that their formal spatial 
configurations imply the values held by the group 
(Halbwachs 1938). Social geography must go 
further: these groups, the subjective component of 
social space, must be studied not only as morpho- 
logical patterns on the earth but also as formative 
influences in molding a society's perception of its 
environment. The relevant groups include those 
which determine or condition the spatial distribu- 
tion and interaction of people, for example, lan- 
guage and ethnic groups; those which influence a 
society’s use of space, for example, religious and
kin groups; and, most significantly, those which 
develop as a result of society’s mode of material 
subsistence, namely, the genres de vie or livelihood 
groups. The bonds and values engendered by par- 
ticipation in these groups are not directly observa- 
ble on the earth’s surface, but they are essential to 
the understanding of the spatial movements and 
distribution of people on the earth. Classical 
French geography used such formal categories of 
relevant groups, but profound transformations in 
social structure have occurred since the analytical 
framework of Brunhes, or even Sorre, was first 
formulated. Even though the choice of relevant 
grouping will demand close cooperation with soci- 
ologists and others, the social geographer does not 
have to abandon entirely the analytical techniques
of his predecessors. Rather, such traditional concepts
should be re-examined in the light of the new
analytical possibilities which appear in many social 
science disciplines. One concept which might
merit re-examination is the Vidalian
notion of genre de vie. Settlement forms, land
use, social interaction, and even political integra- 
tion have been explained by geographers in terms 
of genres de vie. Many feel that the concept has 
lost its applicability to modern social life (George 
1951; Le Lannou 1949), but others argue that it 
can be reformulated (Sorre 1948; Varagnac 1948). 
By discounting the various modifications which 
have accrued through the years and by re-examin- 
ing the original notion in the light of contemporary 
developments both in world society and in social 
science, guidelines for a reformulation may become 
apparent. A genre de vie, in Vidal’s opinion (1911), 
implies more than a means of material subsist- 
ence; its geographical significance stems in large 
part from its spiritual component, the structures 
mentales which persist even after the external 
modalities of livelihood change. The important 
point is that both material and spiritual elements 
are harmoniously integrated in the genre de vie 
community within a particular milieu. Such a conception
closely resembles the notion of “community”
in rural sociology (Hillery 1950).
Even without any change, the concept has
some applications in the modern world. Witness
the adaptation problems of immigrants from rural
to urban areas, the psychological problems involved 
in the retraining of unemployed miners, the social 
consequences of colonialism and economic restruc- 
turing within the “third world.” In the urban indus- 
trial world, however, livelihood is a less compelling 
basis for community consciousness than other 
similarities, for example, a common racial, pro- 
fessional, or linguistic background or similar con- 
sumption habits (Fourastié 1963). But whatever 
the source, if a recognizable consistency in a 
group's perception and consequent use of its environment
is associated with a common structure
mentale, why not consider this pattern as a genre 
de vie, for example, that of travel agents, of sales- 
men, of truck drivers, of commuting students, of 
social scientists? Chombart de Lauwe (Chombart 
de Lauwe et al. 1952, p. 243) showed how a deep 
social rift could prevail in a small dormitory vil- 
lage because the inhabitants belonged to two dif- 
ferent genres de vie. The same could be said of 
immigrant ethnic groups in some urban centers 
(Taeuber & Taeuber 1965). Ideally, within either 
an urban or a rural region, one could thus identify
the component genres de vie and see if there is
a hierarchy of importance among them, the dominant
one giving a character to the place, as in
pilgrimage, market, or university towns. Many 
other possibilities exist, but much more substantive 
work, preferably in conjunction with other disci- 
plines, is needed before any formal categories of 
modern genres de vie can be made. Until this is 
done, the existing formal groupings of language, 
religion, race, etc., may serve to constitute the sub- 
jective component of social space; however, if these 
sociological categories can somehow be integrated 
into the more geographical concept of genre de vie, 
the result would be an ideal subjective ingredient 
for social geography. 

Objective component—the social environment. 
The term “social environment” is used here to con- 
note all the socially significant aspects of the total 
geographic milieu. Traditionally, geographers have 
tended to exaggerate the distinction between the 
natural (physical—biotic) environment and the arti- 
ficial network of human establishments created by 
society. This dualistic conception tends to ignore 
the fact that mankind’s environment-creating ap- 
paratus has by no means entirely destroyed the 
natural framework and that the interplay of natu- 
ral and artificial assumes very different forms 
throughout the world. The social environment, as 
objective component of social space, includes more 
than these two levels. It includes, for example, the 
relation of social attitudes and traditions to nature, 
resource use, and the ethics of group relations. 

Social geographers are far from a satisfactory 
definition of the social environment; they lack sub- 
stantive studies which would provide the raw ma- 
terial for such a definition. What is the social sig- 
nificance, for example, of purely physical elements, 
such as humidity, temperature, or altitude? Geog- 
raphers have added very little to the “findings” of 
the Huntingtonian environmentalists. Yet the be- 
havioral sciences are interested in knowing the 
connections, real or perceived, between society 
and its natural environment. The research chal- 
lenges proposed in Sorre’s Géographie psycho- 
logique (1954) remain virtually untouched. In 
addition, little is known about the “synthetic en- 
vironment” (Herber 1962): the various conse- 
quences of atmospheric and oceanic pollution, or 
the consumption of medicated foods, stimulants, 
and sedatives. What are the physiological and path- 
ological consequences of changes in the environ- 
ment, for example, housing, communication, and 
diet? 

Recently some geographers have viewed the en- 
vironment as an amalgam of systems (Wagner & 
Mikesell 1962; Ackerman 1963; van Paassen
1965). This approach is satisfactory from the
theoretical and technical points of view, but does 
it admit of nonsystematic (dysfunctional) ele- 
ments which often play such an important part in 
social life? The social geographer must be sensitive
to the local exceptions which give special character
to individual places, as Rochefort (1961)
demonstrated in her study of the Sicilian social
environment.

Approaches to the study of social space. We 
have seen that the study of social groups within 
their territorial (environmental) framework has 
constituted the basic traditional methodology of the 
Dutch, British, and some French social geogra- 
phers. In theory, this has involved a combination 
of a morphological approach (mapping of social 
groups) and an ecological approach (relations of 
groups to environment). Today, however, the latter
(vertical) dimension is perhaps less significant
than the horizontal one, namely, the spatial patterns
of interaction between social groups, as
Lowenthal’s Caribbean study illustrated (Lowenthal
1961). A psychological approach to group
attitudes, such as one finds in the Revue de psy- 
chologie des peuples, may provide clues to the ori- 
gins of some spatial discontinuities in social 
interaction. 

In terms of the notions of group and environ- 
ment, as redefined above, let us see what analytical 
methods can be used in the study of social space. 
Two of the many possible approaches are (1) to 
consider social space as a mosaic of social areas 
defined in terms of the occupant groups, for exam- 
ple, genres de vie or ethnic groups; and (2) to view 
social space as nodally organized, that is, as a net- 
work of spatial relations radiating around certain 
centers (Sorre’s points privilégiés) and permeated 
by the arteries of circulation. 

Formal approach—social areas. Initially, the 
formal approach examines the spatial patterns and 
characteristics of social groups in virtually the 
same fashion as that used by the disciples of Stein- 
metz in Amsterdam. On the basis of these distribu- 
tions a series of regions, homogeneous in terms of 
individual characteristics, can be compared and 
associations can be sought. Such associations, how- 
ever, must then be examined in terms of the social 
environment in which these social characteristics 
occur, that is, an ecological approach must supple- 
ment the more formal “sociographical” stage of 
analysis. In addition to these two steps, the geog- 
rapher must endeavor to see how all these elements 
combine to form the social whole within a particular
region and must seek explanations for the variations
through space in the incidence and functional
character of these social wholes. Jones sees
“social regions” within the city of Belfast (1960), 
for example, as a product of historical and religious 
forces, while the “social area analysis” tradition in 
American human ecology (Theodorson 1961) has 
demonstrated the use of various other indices in 
the establishment of intraurban social regions. 

Functional approach. A more dynamic and in- 
creasingly popular approach is to consider social 
space in terms of its nodal organization. The orbit 
of group activities and the related horizons of social 
consciousness can be examined (cartographically)
in terms of their use of these nodes, for example, 
markets, cinemas, and schools (Chombart de 
Lauwe et al. 1952). The hinterland of each of these 
nodes varies in scale and significance, and these 
variations provide crucial insights into the social 
character of particular places. The study of nodal 
regions and of circulation are two examples of a 
functional approach to the study of social space. 

Sorre (1961) suggests that settlement units— 
towns, cities, metropolises—provide a primary set 
of nodes on a world scale. Within each of these 
nodes is an internal system of centers (schools, 
churches, cinemas) whose social significance can 
also be examined cartographically. Here again the 
social geographer can collaborate with and utilize 
some of the existing principles of central place 
theorists and perhaps somewhat qualify defini- 
tions of centrality currently based on commercial 
and industrial criteria. Edgar Kant’s Umland
studies (Kant et al. 1951), J. Labasse’s circulation
studies (1955), and Pierre George’s studies of the
urban fringe (1962) provide orientation for this 
kind of study. As world society becomes more 
urbanized, social geographers will concern them- 
selves more with the urban field, and, here, col- 
laboration with other scholars will be imperative. 

The essential clue to the internal dynamism of 
social space can be found in its circulation system. 
Circulation here includes all kinds of movement of 
goods, services, people, and ideas—any kind of 
spatial movement which occasions social communi- 
cation. As the Paris study (Chombart de Lauwe 
et al. 1952) demonstrated, the actual and poten- 
tial use of a circulation system indicates the con- 
crete social horizons of the group it serves; changes 
within it may indicate or produce changes in the 
relation between groups and between a group and 
its social environment. 

A vast number of research questions emanate 
from this dimension of social space, particularly 
now that the processes of social differentiation and 
cultural standardization are so closely tied with
large-scale, mass-produced goods and services.
Interregional traffic, the currents of the tourist 
world, pilgrimages, daily and seasonal commuting 
—these are only a few samples of the many activi- 
ties the student of circulation could investigate. 


To summarize, social geography can be defined 
as the study of the areal (spatial) patterns and 
functional relations of social groups in the context 
of their social environment; the internal structure 
and external relations of the nodes of social activ- 
ity; and the articulation of various channels of 
social communication. 

Although the discussion has distinguished be- 
tween various elements and approaches to social 
geography, it must be emphasized that one of the 
fundamental characteristics of the field has been, 
and must remain, its integral, holistic character. 
Like social history, it must endeavor to maintain 
the holistic view, that is, to show how the individ- 
ual parts and their functional connections integrate 
to give a specific character to the social whole. 
French geographers have supplied ample precedent 
for this kind of holism; so, indeed, have the social 
anthropologists of the Anglo-American world. The 
more the field becomes theoretically systematized, 
the greater will the challenge of integration become. 

For social geography to fulfill its potential, the 
various approaches to the field need to be coordi- 
nated into a systematic conceptual framework. 
Sorre’s concept of social space could provide a cen- 
tral theme for such a framework. Its ingredient 
elements could be considered as bases for system- 
atic subdivisions, for example, geography of lan- 
guage, of religions, and of diet, each of which con- 
tributes a valuable perspective on society’s spatial 
order. The definition given here seems to incorpo- 
rate the various elements which have belonged to 
the field of social geography and which, given the 
trends in contemporary social science, could con- 
stitute fruitful future directions for the discipline. 


ANNE BUTTIMER


STATISTICAL GEOGRAPHY


Statistical geography is to geography what econ- 
ometrics is to economics, sociometrics to sociology, 
psychometrics to psychology, or even jurimetrics 
to jurisprudence—an approach to the field rather
than a subdivision of that field. Like these other
approaches, it is of recent development—the mani- 
festation within geography of the trend to a more 
quantitative approach that has characterized all 
the social sciences since the end of World War II.
As is also the case with these other approaches, 
many of the pioneering contributions to statistical 
geography have come from workers in other fields. 
The parallel term “geometrics” is not used to de- 
scribe the work of the statistical geographer, how- 
ever, not so much because it provides an inappro- 
priate picture of the statistical geographer’s attempts 
to identify and measure regularities observable in 
spatial distributions as because the branch of 
mathematics that originated in Greek attempts to 
measure the earth has a two-thousand-year priority 
in its right to the name. In addition, mathematical 
geography, concerned as it is with map projections, 
is the branch of geography that today relies most 
heavily and directly upon geometry. 

The syndrome that characterizes statistical geog- 
raphy today includes a more formal theoretical 
orientation than was true of geography in the past, 
a reliance upon statistical inference and numerical 
analysis in empirical research, the use of mathe- 
matical programming and simulation procedures in 
applied research, a basic concern with model con- 
struction, and an involvement with high-speed 
computers, mass-data banks, and automated mapping
devices.

Yet this assemblage of interests is not mono- 
lithic. There are differences in emphasis, corre- 
sponding to each of the four main traditions of 
geographic research: spatial, area studies, man-land,
and earth science (Pattison 1964; National
Research Council 1965). Statistical geography orig- 
inated within the spatial tradition, with its em- 
phasis upon analysis of spatial distributions and 
associations, and initially relied heavily upon exer- 
cises in distribution fitting and upon regression 
and correlation analysis. It spread to area studies 
when the spatial tradition began using multivariate 
analysis, then to the man-land tradition via be- 
havioral studies of environmental perception and 
individual decision making, and to the earth- 
science tradition as spatial studies and related 
systematic sciences began using systems analysis. 
To understand these several facets of statistical 
geography, with their differences in use of theory, 
choices, and timing of applications of statistical 
methods, and their contacts with the rest of sci- 
ence, therefore requires some understanding of the 
four research traditions. These, in turn, can best 
be viewed within a formal overview of approaches 
to regional analysis (Berry 1964a).


Approaches to regional analysis


Virtually any sort of regional analysis may be 
considered as starting from information about one 
or more places. For each place, one or more prop- 
erties, or characteristics, may be measured, and 
the measurements may be made at one or more 
points in time. It is often convenient to think in
terms of a three-dimensional array X of order
$n \times p \times t$. Cell $x_{ijk}$ of this array (or matrix) records
the value of variable i at place j in time k.
Rows of the array—vectors of array values for 
fixed i, k—record the distribution of variables over 
places at some point in time. Column vectors in- 
ventory the properties of places at time t. Time 
vectors report on the t states of variable i at place j. 

Operational specification of variables, places, 
and times varies with the interests of the particular 
analyst, but a matrix such as X is (albeit usually 
implicitly) the object of all forms of geographic 
study. A row vector of X is a spatial distribution 
that can be mapped. A column vector is a loca- 
tional inventory. Such row and column vectors are 
the bases of systematic (topical) and regional geography,
respectively. Time vectors report changes
in spatial distributions and at locations and are 
basic to historical geography. 
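
As a minimal sketch of the array just described, the toy example below stores a small X as nested Python lists and extracts the three kinds of vectors. The variable names, place labels, and values are invented for illustration, and the function names (row_vector, column_vector, time_vector) are hypothetical labels rather than terms from the literature.

```python
# Hypothetical toy data: 2 variables x 3 places x 2 points in time.
variables = ["population", "altitude_m"]
places = ["A", "B", "C"]
times = [1950, 1960]

# X[i][j][k] holds the value of variable i at place j at time k.
X = [
    [[100, 120], [250, 240], [80, 95]],       # population
    [[300, 300], [15, 15], [1200, 1200]],     # altitude_m
]


def row_vector(X, i, k):
    """Spatial distribution: variable i over all places at time k."""
    return [X[i][j][k] for j in range(len(X[i]))]


def column_vector(X, j, k):
    """Locational inventory: all variables at place j at time k."""
    return [X[i][j][k] for i in range(len(X))]


def time_vector(X, i, j):
    """Successive states of variable i at place j."""
    return list(X[i][j])


if __name__ == "__main__":
    print(row_vector(X, 0, 1))     # population of every place in 1960
    print(column_vector(X, 1, 0))  # inventory of place B in 1950
    print(time_vector(X, 0, 2))    # population of place C through time
```

In these terms a mappable spatial distribution is a call to row_vector, a regional description is a call to column_vector, and a historical series is a call to time_vector.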

The data. Cell $x_{ijk}$ may contain a variety of
different records, depending upon specification of 
i, j, and k. Columns, for example, may be defined 
as places with area, such as countries, states, 
counties, census tracts, or quarter-mile-square cells 
of a half-mile grid. Alternatively, they may be di- 
mensionless points; three such examples are tri- 
angulation points used for measuring altitude, 
weather stations, and soil-sample-core locations
(Kao 1963). Rows may be defined as properties 
of the places (scalar quantities, such as population 
residing in each of a set of census tracts or altitude 
at each of a set of triangulation points), or they 
may refer to connections between places (vector 
quantities, such as flow of coal from southern 
Illinois to each of the Midwestern counties). Rows 
also might be airline traffic flow between cities; 
in this case columns would represent pairs of 
cities. In addition, any level of measurement may 
appear, whether nominal, ordinal, interval, or ratio 
[see STATISTICS, DESCRIPTIVE, article on LOCATION 
AND DISPERSION, for a discussion of levels of meas- 
urement]. 

Spatial tradition. Geography’s spatial tradition 
is founded upon cartographic portrayal and subse- 
quent study of spatial distributions, thus having 
as its base row-wise analysis. A few examples of 
the kinds of spatial distributions studied follow: 


(1) A land-use map—scalar, nominal, of areas, 
an areal distribution. 

(2) A map showing the location of major cities 
by dots—scalar, nominal, at points, a point distri- 
bution. 

(3) An airline-route map—vector, nominal, join- 
ing points. 

(4) A map of soil quality—scalar, ordinal, of 
areas.

(5) A map showing average annual temperature 
—scalar, interval. If the map shows averages by 
states or districts, it is also discontinuous and areal
(a choropleth map, if different shadings are applied
to the units), but if it shows the temperature
varying over the country as a surface, it is a continuous
generalization. The generalization is of
point observations (usually isopleth, since the surface
will be depicted by contours) if, for example,
weather-station data are used, or of areal data if 
interpolations were made, for example, with re- 
spect to the district averages treated as points 
central to each of the districts. 

(6) A highway map, with routes classified by 
quality—vector, ordinal, joining points. 

(7) A map of city-to-city air-passenger move- 
ments—vector, ratio, joining points. 

Two dominant themes emerge in spatial anal- 
ysis of such maps: evaluation of the pattern of 
scalar distributions and of similarities in pattern 
over a number of such distributions and evaluation 
of the connectivity evidenced by vector distribu- 
tions and of similarities in the connectivity of 
several such distributions. Apparently, the funda- 
mental properties of pattern include absolute loca- 
tion (position), relative location (geometry), and 
scale, with a family of interesting derived prop- 
erties, including density and density gradients, 
spacing, directional orientation, and the like. Simi- 
larly, accessibility is central to the study of con- 
nectivity, and from it are derived such properties 
as centrality itself, relative dominance, degree of 
interdependence, etc. The two themes merge in 
spatial-systems analysis, where pattern and con- 
nectivity are examined in their association. For 
example, urban land values decline with increasing 
distance from the city center, and type of farming 
varies with distance from market. Such examples 
are readily generalized to the dynamic, that is, 
time-dependent, case. 

Area-studies (chorographic) tradition. Just as 
row-wise analysis is the basis of the spatial tradi- 
tion, so columnwise analysis provides the base for 
geography’s area-studies tradition. The essential
problems of this tradition are those of regional 
intelligence: the characterization of place in terms
of the associations between characteristics localized
in that place. This approach is often restricted to 
those features of place that are directly observable 
as landscape but, especially among the French 
school of human geographers, is also extended to 
an evaluation of both tangible and intangible aspects
of “regional character” and of the differentiation
of places (the study of “areal differentiation”)
[see LANDSCAPE]. 

An appetite for information, a penchant for the 
peculiar, emphasis upon field work, attachment to 
the people and language of a particular part of the 
world, a strong literary bent, a companionship with 
history and great reliance upon historical modes 
of explanation (in contrast to the functional, de- 
terministic, and probabilistic modes of the other 
geographic traditions)—all these serve to identify
the work of the student in the area-studies tradi- 
tion, whatever the areas examined: countries or 
continents, regions or culture areas. 

Man-land tradition. In the tradition of medi- 
eval philosophy, classical geographers distinguished 
between two major sets of variables: the physical 
(inorganic plus biotic) and the cultural. The as- 
sociated methodological argument was that these 
provided the bases of the two major segments of 
the field: physical geography and cultural geog- 
raphy. Finer groupings of variables within these 
categories led to the variety of systematic branches 
of the subject, such as the geographies of land- 
forms, of plants, of industry, of cities, or of lan- 
guage (the many topical fields and subfields might 
thus be identified as nested subsets of rows of the 
matrix X). 

The classical modes of thought, however, also 
led to a particular tradition in geography, the 
man-land tradition, in which the relationships be- 
tween physical variables and human characteristics 
and activities were examined. In combination with 
the social Darwinism of the late nineteenth cen- 
tury, simple one-way studies were made of the 
effects of environment on man. These were later 
complemented by studies of the effects of man on 
environment, from which emerged much of the 
original thinking in the field of conservation. More 
recently the ancient dichotomy has been relaxed 
with, for example, studies of the effects of environ- 
mental perception on resource evaluation and de- 
cision making in resources management (Kates 
1962) and with the adoption of a systems-analysis 
frame (Ackerman 1963). 

Earth-science tradition. During the eighteenth
century, geography was an integral and substantial
part of natural philosophy. At that time geographical
study embraced all aspects of the earth, air,
and waters. Since then, however, most aspects of
these studies have branched off as separate sys- 
tematic sciences, and geography's earth-science 
tradition has been left with such concerns as the 
study of landforms and their evolution (geomor- 
phology), descriptive climatology, and certain as- 
pects of the geographies of soils, plants, and ani- 
mals, together with the attempt to achieve some 
spatial synthesis of these in order to identify “natu- 
ral environmental complexes.” It is to this latter 
end that systems-analysis procedures have recently 
been employed in this research tradition (Chorley 
1962). 


Antecedents and stimuli 


Prior to World War II, few papers of a statistical
nature had been published by geographers. Per- 
haps the only contributions worthy of note were 
Matui’s (1932) fitting of the Poisson distribution 
to quadrat counts of settlements in a portion of 
Japan and Wright’s (1937) discussion of Lorenz
measures of concentration in the spatial case. 
However, two general antecedents can be distin- 
guished, in addition to the pioneering contributions 
of workers in other fields: centrography and social 
physics. From the former came the idea of develop- 
ing a special family of descriptive statistics for 
spatial distributions, and from the latter the recog- 
nition of certain classes of regularities in such 
distributions. 

Centrography. During the early part of the 
twentieth century there was a lively debate among 
statisticians concerning such measures as the 
center of population. Part of the debate stemmed 
from publication by the U.S. Bureau of the Census 
of a piece entitled Center of Population and Me- 
dian Lines . . . (U.S. Bureau of the Census 1923). 
Many articles were published, notably in the Jour- 
nal of the American Statistical Association and in 
Metron, concerning the relative advantages of alternative
centers: the center of gravity, the spatial
median, and the center of minimum aggregate 
travel. (The U.S. Census Bureau's geography 
branch still reports on the center of gravity of the 
United States’ population after each census.) This
debate gradually subsided in the United States, but 
in the Soviet Union centrography flourished during 
the 1920s and 1930s. A centrographic laboratory 
was founded at Leningrad, under the auspices of 
the Russian Geographic Society, in 1925, and its 
director, E. E. Sviatlovsky, pursued studies of the 
“actual” and “proper” centers and the distributions 
of all manner of phenomena. However, the set of 
“proper” centers of economic activities prepared 
for the Gosplan of 1929 was at odds with the
second Five-Year Plan, and as a result, the laboratory
was finally disbanded. Porter (1963) provides
a fairly complete bibliography of the relevant lit- 
erature on centrography. Recently, the Israeli stat- 
istician Bachi (1963) has attempted to revive cen- 
trography, with the development of a variety of 
measures of dispersion and association of spatial 
distributions. Current interest within geography is 
slight, however, except as embraced by the social 
physicists (Stewart & Warntz 1958). 
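
The alternative centers debated by the centrographers can be compared numerically. The sketch below computes a weighted center of gravity and approximates the center of minimum aggregate travel by Weiszfeld's iterative procedure. The choice of that procedure, the function names, and the sample coordinates are assumptions made for illustration; nothing here reconstructs the methods of the studies cited.

```python
import math


def center_of_gravity(points, weights):
    """Weighted arithmetic mean of the coordinates."""
    total = sum(weights)
    x = sum(w * px for (px, _), w in zip(points, weights)) / total
    y = sum(w * py for (_, py), w in zip(points, weights)) / total
    return (x, y)


def aggregate_travel(center, points, weights):
    """Total weighted straight-line distance from all points to center."""
    cx, cy = center
    return sum(w * math.hypot(px - cx, py - cy)
               for (px, py), w in zip(points, weights))


def center_of_minimum_travel(points, weights, iterations=200, eps=1e-12):
    """Approximate the point of minimum aggregate travel (Weiszfeld iteration).

    Simplification: a data point that coincides with the current estimate
    is skipped for that step to avoid division by zero.
    """
    cx, cy = center_of_gravity(points, weights)
    for _ in range(iterations):
        num_x = num_y = denom = 0.0
        for (px, py), w in zip(points, weights):
            d = math.hypot(px - cx, py - cy)
            if d < eps:
                continue
            num_x += w * px / d
            num_y += w * py / d
            denom += w / d
        if denom == 0.0:
            break
        cx, cy = num_x / denom, num_y / denom
    return (cx, cy)


if __name__ == "__main__":
    # Hypothetical settlements: four clustered places and one remote outlier.
    pts = [(0, 0), (10, 0), (0, 10), (10, 10), (50, 50)]
    wts = [1, 1, 1, 1, 1]
    print(center_of_gravity(pts, wts))
    print(center_of_minimum_travel(pts, wts))
```

With one remote outlier the center of gravity is pulled farther toward it than the center of minimum aggregate travel, which is one reason the choice among centers was contested.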

Social physics. The attempt to describe human 
phenomena in terms of physical laws has a long 
history in every social science [see RANK-SIZE
RELATIONS].

In geographical studies this has been expressed 
in two major ways: (1) by use of “gravity models” 
to describe spatial interaction; and (2) by use of 
“potential models” as general summaries of inter- 
dependency between all places in large areas. Such 
models are said by their advocates to summarize 
a wide variety of social and economic distributions 
in economically advanced societies. Gravity models
were first used in a relatively formal way by E. G. 
Ravenstein, in his seminal study “The Laws of 
Migration” (1885; 1889). Thereafter, these models 
found wide application, for example, in marketing 
geography—Reilly’s “law of retail gravitation” 
(1931)—and in urban transportation studies de- 
scribing interzonal travel. Carrothers (1956) re- 
views this work and the basic postulate that 
interactions or movements between places are 
proportional to the product of the masses and 
inversely proportional to some exponent of distance:
that is, $I_{ij} \propto M_i M_j / d_{ij}^{b}$, where $b$ is the
distance exponent. Such gravity analogs
were generalized by the astronomer J. Q.
Stewart (1947) to the case of the potential sur- 
face, which simultaneously describes the interac- 
tions of each place and every other. Thus, the 
potential at any point i is given by 


$$P_i = \sum_{j=1}^{n} \left( P_j / d_{ij} \right).$$


The surface is interpolated from such measures 
for a sample set of points. There is still consider- 
able interest within geography in social physics 
(Stewart & Warntz 1958), and new applications 
are continually being developed. Mackay (1959), 
for example, used gravity models to translate the 
depressing effects upon telephone communications 
of the French-English language boundary in Can- 
ada and of the United States-Canadian political 
boundary into their physical-distance equivalents, 
thus showing how social space can be transformed 
into the metric of physical space. Similar applica- 
tions are to be found in all branches of cultural 
geography today. 
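
The gravity postulate and the potential measure described above can be sketched in a few lines. The function names, the constant k, the default exponent, and the sample masses and coordinates are all hypothetical. Because the potential sum includes the term for j = i, where the distance would be zero, the sketch assumes a nominal self_distance of one unit, which is a common working convention and not something specified in the text.

```python
import math


def gravity_interaction(mass_i, mass_j, distance, exponent=1.0, k=1.0):
    """Interaction proportional to the product of the masses and inversely
    proportional to distance raised to an exponent (k is an assumed
    constant of proportionality)."""
    return k * mass_i * mass_j / distance ** exponent


def potential(i, masses, coords, self_distance=1.0):
    """Potential at place i: sum over all places j of mass_j / d_ij.

    The place's own contribution uses the assumed self_distance,
    since d_ii would otherwise be zero.
    """
    xi, yi = coords[i]
    total = 0.0
    for j, (mass, (xj, yj)) in enumerate(zip(masses, coords)):
        d = self_distance if j == i else math.hypot(xj - xi, yj - yi)
        total += mass / d
    return total


if __name__ == "__main__":
    masses = [100, 50, 10]               # hypothetical populations
    coords = [(0, 0), (3, 4), (6, 8)]    # hypothetical locations
    print(gravity_interaction(100, 50, 5.0, exponent=2.0))
    print([potential(i, masses, coords) for i in range(len(masses))])
```

Evaluating potential at a sample of points and interpolating between them gives the potential surface; fitting the exponent to observed flows is a separate regression problem not shown here.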


Pioneering contributions from elsewhere. Work- 
ers in other fields provided several significant ex- 
amples of the application of statistical methods to 
geographic problems, identified the major statisti- 
cal problems of regional analysis, and prepared 
the first text on statistical geography, thereby doing 
much to set the pace and tone of statistical geog- 
raphy today. A statistician, M. G. Kendall (1939), 
for example, showed how principal-components 
analysis could be used to develop a multivariate 
index that would portray the geographical distribu- 
tion of crop productivity in England. M. D. Hagood 
(1943), an agricultural statistician, used multiple- 
factor analysis to define multivariable uniform 
regions. An economist, C. Clark (1951), showed 
that the negative exponential distribution fitted 
population density patterns within cities. G. K. 
Zipf (1949), a philologist, developed the rank-size 
distribution of cities. A mathematical social scien- 
tist, H. A. Simon (1955), showed the bases of 
this distribution in simple stochastic processes, 
and sociologists found that a repetitive three-factor 
structure characterized the social geography of 
cities (Berry 1964b; 1965). G. U. Yule and M. G. 
Kendall ([1911] 1939) identified the problem of 
modifiable units: if data are of areas rather than 
at points, results of any analysis will be in part de- 
pendent upon the nature of the areal units of 
observation utilized. W. S. Robinson (1950) pro- 
vided the relevant relationship between individual 
and ecological (areal, set-type) correlations, sub- 
sequently extended by Goodman (1959) in the 
context of ecological regression. A second problem, 
that of contiguity, or spatial autocorrelation, has 
been examined by Moran (1948) in the nominal 
case and by Geary (1954) more generally. Geary 
also applied his measures of contiguity to evaluat- 
ing lack of independence of residuals from regres- 
sion in studies of spatial association. These studies 
are reviewed in Statistical Geography (Duncan 
et al. 1961), the first general book in the field, 
written by sociologists. Further examination of 
autocorrelation in spatial series is to be found in 
“Spatial Variation” (Matérn 1960), a major con- 
tribution to areal sampling by a mathematician. 
Spatial analysis remains basic to quantitative plant 
ecology and to epidemiology, and from these fields 
have come many of the ideas used today in the 
study of pattern in point distributions and of spa- 
tial diffusion processes. 


Statistical geography—1950 to 1965 


Centrography, social physics, and external stim- 
uli, facilitated by developments in computer tech- 
nology that for the first time enabled the mass data 
of geographic problems to be handled conveniently,
combined to stimulate workers in geography’s spatial
tradition to work quantitatively. The older
forms of cartographic analysis provided firm bases 
for this development, and many of the early studies 
were simply quantitative extensions of analyses 
cartographically conceived and executed. Arthur 
H. Robinson (1962), a cartographer, for example, 
utilized correlation and regression analysis to im- 
prove the ways by which he could map spatial 
associations. McCarty and his associates (1956) 
used similar procedures to replace older carto- 
graphic means of comparison. Thomas (1960) 
showed the various ways in which residuals from 
regression could be treated cartographically so as 
to draw upon traditional geographical means of 
map analysis in model reformulation and refine- 
ment. King (1962) applied the “nearest-neighbor” 
methods of the quantitative plant ecologist to the 
study of pattern in point distributions, with, like 
the 1932 Matui study, expectations derived from 
the Poisson distribution. These represent but a few 
examples of the spatial studies concerned with 
distribution fitting as a means of studying spatial 
pattern or with uses of correlation and regression 
in studies of spatial association. Many other exam- 
ples are to be found in uses of regression to fit 
gravity models and obtain the distance exponents 
for different phenomena (Carrothers 1956) or to 
fit negative exponential distributions to urban pop- 
ulation densities and the like (Berry 1965). 
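
The nearest-neighbor comparison with Poisson expectations mentioned above can be sketched as follows. The statistic shown is the ratio of the observed mean nearest-neighbor distance to the distance expected for a random pattern of the same density, in the form commonly associated with Clark and Evans in plant ecology. The function name and the sample patterns are hypothetical, and edge effects are ignored.

```python
import math


def nearest_neighbor_ratio(points, area):
    """Observed mean nearest-neighbor distance divided by the value
    expected under a random (Poisson) pattern of equal density.

    Roughly 1 for a random pattern, below 1 for clustering,
    above 1 for regular spacing. Boundary effects are not corrected.
    """
    n = len(points)
    total = 0.0
    for i, (xi, yi) in enumerate(points):
        total += min(math.hypot(xi - xj, yi - yj)
                     for j, (xj, yj) in enumerate(points) if j != i)
    observed = total / n
    expected = 1.0 / (2.0 * math.sqrt(n / area))
    return observed / expected


if __name__ == "__main__":
    # Hypothetical settlements on a regular 4 x 4 lattice in a 4 x 4 region.
    lattice = [(0.5 + c, 0.5 + r) for r in range(4) for c in range(4)]
    print(nearest_neighbor_ratio(lattice, area=16.0))

    # Hypothetical tight cluster in the same region.
    cluster = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1)]
    print(nearest_neighbor_ratio(cluster, area=16.0))
```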
These kinds of studies represent the beginnings, 
from which statistical geography has grown rap- 
idly. Dacey and Tung (1962) have made major 
advances in point-pattern analysis, for example, 
by transforming the distribution-fitting exercise 
into an explicit hypothesis-testing frame, with 
relevant expectations derived from settlement the- 
ory. Curry (1964) views many urban phenomena 
as the outcome of known-probability mechanisms. 
The Swedish geographer Hägerstrand (1953) was 
the first to show that many spatial patterns might 
be considered as the outcome of diffusion processes 
that could be simulated, using Monte Carlo meth- 
ods, and his work led to a burst of similar simula- 
tion studies in the United States (Morrill 1963). 
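
A much simplified Monte Carlo diffusion run, in the spirit of the simulations just described, might look like the sketch below. It is not Hägerstrand's model: his work used an empirically calibrated contact field, whereas this sketch assumes contact probabilities that fall off as a simple inverse power of distance. The grid size, the decay parameter, the rule that each adopter tells one cell per generation, and the function name are all assumptions for illustration.

```python
import math
import random


def simulate_diffusion(size=9, generations=5, decay=1.0, seed=0):
    """Spread an innovation over a size x size grid of cells.

    In each generation every current adopter contacts one other cell,
    chosen at random with probability proportional to distance ** -decay.
    Returns the final set of adopting cells and the adopter count after
    each generation (starting with the single initial adopter).
    """
    rng = random.Random(seed)
    cells = [(r, c) for r in range(size) for c in range(size)]
    adopters = {(size // 2, size // 2)}  # one initial adopter at the center
    history = [len(adopters)]

    for _ in range(generations):
        newly_told = set()
        for (r, c) in sorted(adopters):  # sorted for reproducibility
            others = [cell for cell in cells if cell != (r, c)]
            weights = [math.hypot(cell[0] - r, cell[1] - c) ** -decay
                       for cell in others]
            target = rng.choices(others, weights=weights, k=1)[0]
            newly_told.add(target)
        adopters |= newly_told
        history.append(len(adopters))

    return adopters, history


if __name__ == "__main__":
    final, counts = simulate_diffusion()
    print(counts)
```

Repeating the run with different seeds gives a family of simulated patterns against which an observed distribution can be compared.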
New approaches to spatial analysis have also 
been developed. Most of the examples outlined 
above use scalar data. Garrison (1960) showed 
that the mathematical theory of graphs provided 
an excellent base from which to examine vector 
distributions, and Nystuen and Dacey (1961) 
extended his argument to the case of organizational 
regions, using graph-theoretic measures of accessi- 
bility of places to communications networks to 
define relatively independent subsets of relatively 
interdependent places. Tobler (1963) showed how
a generalization of map projections, traditionally
studied by the mathematical geographer, could be 
used as the basis for mapping social, economic, 
cultural, or political space into physical space, as a 
further means for merging geographical applica- 
tions of various statistical and mathematical meth- 
ods with the more traditional means of geographi- 
cal analysis. Finally, in addition to developments 
of the descriptive kind, statistical geography has 
extended its work to embrace investigations of a 
prescriptive nature. Garrison and Morrill (1960), 
for example, applied the techniques of spatial price- 
equilibrium analysis to determining what should 
be the patterns of interregional trade in wheat and 
flour in the United States. Other research workers 
are now much concerned with the procedures of 
spatial programming. Haggett (1965) has provided 
an excellent review of the substance of the first 
decade of quantitative work in the spatial tradition. 
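
The Monte Carlo approach to diffusion mentioned above can be illustrated with a minimal sketch. It is not Hägerstrand's model: the grid size, the eight-neighbour contact field, and its weights are hypothetical choices made only to show the mechanism, in which each adopter tells one nearby cell per period and nearer cells are the more probable targets.

```python
import random

# Hypothetical contact field: the eight neighbouring cells, with
# edge-adjacent cells twice as likely to be contacted as diagonal ones.
OFFSETS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1) if (dr, dc) != (0, 0)]
WEIGHTS = [2 if 0 in (dr, dc) else 1 for dr, dc in OFFSETS]

def simulate_diffusion(size, seeds, steps, rng):
    """Spread an innovation over a size-by-size grid for a number of periods.

    Each period every current adopter contacts one randomly drawn
    neighbouring cell; a contacted cell inside the grid becomes an adopter.
    """
    adopters = set(seeds)
    history = [len(adopters)]          # number of adopters after each period
    for _ in range(steps):
        newly_told = set()
        for r, c in sorted(adopters):  # sorted for a reproducible draw order
            dr, dc = rng.choices(OFFSETS, weights=WEIGHTS, k=1)[0]
            nr, nc = r + dr, c + dc
            if 0 <= nr < size and 0 <= nc < size:
                newly_told.add((nr, nc))
        adopters |= newly_told
        history.append(len(adopters))
    return adopters, history

# One simulated run from a single initial adopter at the grid centre.
adopters, history = simulate_diffusion(size=9, seeds=[(4, 4)], steps=10,
                                       rng=random.Random(0))
```

Repeating the run with different random seeds gives a family of simulated patterns against which an observed spatial pattern could be compared.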

With the use of multivariate analysis, statistical 
geography has spread from the spatial tradition to 
that of area studies. A traditional geographic prob- 
lem in this latter tradition is that of regionalization 
—the attempt to derive areas relatively uniform 
in terms of a complex of associated characteristics 
and also relatively different from other areas in 
terms of that complex. Such problems, involving 
mass-data analysis, were traditionally handled by 
overlaying maps. This earlier procedure has been 
replaced, however, by the use of the modern com- 
puter, applying such multivariate procedures as 
factor analysis to reduce many variables to a few 
factors representing “complexes” of associated 
characteristics, and the application of numerical 
taxonomy to get optimal classification (minimizing 
within-group variance) of observations into regions 
on the basis of the distances between observations 
in the factor space (Berry & Ray 1966). Output 
from the entire procedure of data analysis and re- 
duction includes the complexes of characteristics 
that define “regional character,” measures of the 
similarity of the observations, and the regions [see 
CLUSTERING]. 
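
The grouping step of this procedure can be sketched as follows. The sketch assumes that factor scores have already been computed, and it uses a Ward-style merging rule (at each step, join the two groups whose union adds least to the within-group sum of squares) as one plausible reading of "minimizing within-group variance"; the function name and the sample scores are hypothetical and are not taken from the studies cited.

```python
from itertools import combinations

def ward_regions(scores, n_regions):
    """Group observations into n_regions by stepwise merging.

    At each step the two groups are merged whose union produces the
    smallest increase in the within-group sum of squared distances.
    """
    # Each group is (member indices, centroid in factor space).
    groups = [([i], list(p)) for i, p in enumerate(scores)]
    while len(groups) > n_regions:
        best = None
        for a, b in combinations(range(len(groups)), 2):
            (ma, ca), (mb, cb) = groups[a], groups[b]
            d2 = sum((x - y) ** 2 for x, y in zip(ca, cb))
            # Increase in within-group sum of squares if a and b are merged.
            cost = len(ma) * len(mb) / (len(ma) + len(mb)) * d2
            if best is None or cost < best[0]:
                best = (cost, a, b)
        _, a, b = best
        (ma, ca), (mb, cb) = groups[a], groups[b]
        n = len(ma) + len(mb)
        centroid = [(len(ma) * x + len(mb) * y) / n for x, y in zip(ca, cb)]
        groups = [g for k, g in enumerate(groups) if k not in (a, b)]
        groups.append((ma + mb, centroid))
    return sorted(sorted(members) for members, _ in groups)

# Hypothetical scores for six areal units on two factors.
factor_scores = [(0.1, 0.2), (0.0, 0.3), (0.2, 0.1),
                 (2.0, 2.1), (2.2, 1.9), (1.9, 2.0)]
regions = ward_regions(factor_scores, 2)
```

With these scores the six units fall into two regions of three units each, one for each tight cluster in the factor space.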

Statistical work also characterizes the man—land 
tradition, largely by virtue of either simple correla- 
tion and regression studies that include physical 
variables, on one side, and cultural variables, on 
the other (for example, correlations of annual 
precipitation and population densities in the high 
plains), or through uses of probability theory. It is 
the latter, indeterministic type of study that repre- 
sents new departures. Curry (1962a), for example, 
shows how livestock management in the intensive 
grassland-farming areas of New Zealand is related 
to probabilities of fodder availability, which in turn 
are derived from probabilities of requisite climatic 
conditions. Much of the basic research goes into 
establishment of the relevant probabilities, in this 
case, of the probabilities of repetitive events that 
play a central role in farm management. Kates 
(1962), on the other hand, examined relations of 
management of flood-plain property to flood haz- 
ards, rare events. He found management practices 
to be conditioned, not by reasonably precise evalu- 
ations of the situation, as in the case of the New 
Zealand farmers, but by a widely varying set of 
preconceptions, at variance with the actual prob- 
ability mechanisms. 

Work in the earth-science tradition of geography 
has also become statistical and ranges from the 
attempt to reformulate the geography of landforms 
generated by fluvial mechanisms in the framework 
of general-systems theory (Chorley 1962) through 
studies of climatic change as a random series 
(Curry 1962b) to the analysis of precipitation 
climatology using harmonic methods (Sabbagh & 
Bryson 1962) or to the development of linear mod- 
els predictive of some characteristic through prior 
multivariate analysis, so as to satisfy the assump- 
tions of the model ultimately to be produced (Wong 
1963). There is today perhaps more work of a 
statistical kind in the earth-science tradition than 
in either the area-studies or man—land tradition. 


Statistical geography—analysis of both the sta- 
tistical and the mathematical kind—is to be found 
in all branches of geography today. However, in the 
methods utilized, certain differences between geog- 
raphy’s four main research traditions are to be 
noted. In the spatial tradition, distribution fitting, 
correlation and regression analysis, uses of such 
methods as the mathematical theory of graphs, and 
prescriptive uses of spatial programming dominate, 
along with uses of probability mechanisms to study 
diffusion processes. The area-studies tradition re- 
lies upon multivariate analysis, particularly factor 
analysis, and upon numerical taxonomy, to facili- 
tate mass-data analysis. In the man—land tradition, 
a neat contrast is to be noted between those of 
traditional deterministic outlook, who use regres- 
sion methods, and those concerned with decision 
making in resources management, who focus upon 
probabilities of the a priori and a posteriori kinds. 
Finally, in the earth-science tradition those pro- 
cedures that facilitate systems analysis have been 
those most rapidly adopted and used. At the end 
of World War II geography was nonquantitative. 
Statistical geography has played an integral, even 
critical, part in the transformation of geography 
into a modern social science in the postwar years. 


BRIAN J. L. BERRY 


[See also CARTOGRAPHY; CENTRAL PLACE; REGIONAL 
SCIENCE. Other relevant material may be found 
in CLUSTERING; FACTOR ANALYSIS; MULTIVARIATE 
ANALYSIS. ] 
GEORGE, HENRY 


Henry George (1839-1897) grew up in Phila- 
delphia. As the author of Progress and Poverty he 
became one of the most telling speakers for social 
protest anywhere in the world. His father was the 
son of an English immigrant who had succeeded 
as a shipper; his mother’s family had settled in the 
city before the Revolution. His father, a Democrat, 
was a customhouse clerk and for some years a 
publisher of Episcopalian Sunday-school books; he 
sought to cultivate in his family both Democratic 
and low-church loyalties. Politically, young Henry 
George differed from his parents in that he was 
strongly opposed to slavery and voted Republican 
during the Civil War and the early Reconstruction 
period. However, he never abandoned his Episcopalian 
sympathies: his lifelong belief in the triumph 
of justice secularized the millennial hope 
that he had learned as a youngster. 

George’s family had little money to spare, and 
he left school when he was only 13. How- 
ever, the deprivations he suffered were no worse 
than those of any child in a large family on a 
white-collar income. He learned how hard life 
could be for workingmen when, at age 15, ship- 
ping to India as a cabin boy, he witnessed a sailors’ 
mutiny. For George himself the trip was pure ad- 
venture. His next long voyage, in 1857, which took 
him to California, was also inspired by restlessness 
rather than financial need. It was not until he had 
been in California for about seven years and was 
responsible for supporting a wife and two children 
that his inability to find steady work as a printer 
made him directly aware of the desperation of 
poverty. 

George’s journalistic career began with a job on 
the San Francisco Alta California in 1865; subse- 
quently he worked on several other San Francisco 
papers, the Times, the Chronicle, and the Herald. 
In the late 1860s he returned to the Democratic 
party and became an adviser to Henry H. Haight, 
the state's governor, who was crusading against 
monopoly. Haight put George in charge of the 
Sacramento Reporter, a party organ, but in 1871 
George returned to San Francisco as founder, pro- 
prietor, and editor of the independent Democratic 
Daily Evening Post. As a business venture (George’s 
only one), the paper failed, but it was a journalistic 
success. It provided George with the first platform 
from which he was able to develop fully his critical 
and reformist ideas. 

George’s experience with journalists and jour- 
nalism made him familiar with some of the prob- 
lems of California’s spectacular and troubled econ- 
omy, but it was several years before he found the 
“solution” to these problems. When he was still a 
printer in Sacramento, James McClatchy, the 
famous editor of the Sacramento Bee, acquainted 
him with the nation’s homestead policy and the 
frustrations of that policy. On the San Francisco 
Times, George fought against private domination 
of land ownership, arguing that as a legal entity 
the city of San Francisco had the true ownership 
of the land it occupied, by devolution of title from 
the king of Spain through the pueblo of San Fran- 
cisco. This claim was put forward with some suc- 
cess to reduce the claims of land-grabbers and 
speculators. 

While working for the Herald, George came face 
to face with a more general problem, that of technological 
monopoly. The paper sent him to New 
York to obtain contracts for telegraphing the news 
from the Associated Press by way of Western 
Union. The failure of the negotiations appeared to 
George to jeopardize the freedom of the press. 
Deeply disturbed, he experienced “a thought, a 
vision, a call,” and dedicated himself to combat the 
evil and want he had observed in New York. A few 
months later he had a second “vision.” Riding in 
the foothills above Oakland, he looked down on the 
flatland near San Francisco Bay, which the trans- 
continental railroad had fantastically increased in 
value. “Like a flash it came upon me that there was 
the reason of advancing poverty with advancing 
wealth. With the growth of population, land grows 
in value, and the men who work it must pay for the 
privilege” (quoted in The Life of Henry George, by 
Henry George, Jr. [1871-1900] 1906-1911, vols. 
9-10, p. 210). 

As early as 1870, when he attributed to every 
man a “natural right” to a parcel of the earth’s sur- 
face, George had begun to convert his reformist 
convictions into economic formulas. In 1871 his 
pamphlet Our Land and Land Policy (1871-1900, 
vol. 8) appeared, which was sizable enough to be 
considered his first book. It is a classic of economic 
criticism. George denounced both the prodigality 
of the national government in giving away the 
arable domain and that of the California state gov- 
ernment, which permitted speculators to acquire 
much of the best land. He reasoned that in spite 
of past errors the homestead policy could yet be 
made to succeed if oversize holdings already 
granted were reduced by land taxes or inheritance 
taxes. Although he still accepted private property 
in land as a useful institution, George began to 
insist that “in nature” there is “no such thing as 
fee simple in land.” The following year, in the 
Daily Evening Post, he carried this crucial limitation 
on land ownership a step further: “The land 
of a country rightfully belongs to all the people of 
that country; . . . there is no justification for private 
property in land except the general convenience 
and benefit; . . . private rights in land should always 
be held subordinate to the general good” 
(Barker 1955, pp. 176-177). 

Gradually George systematized his economic 
ideas. Some time in the 1860s in Sacramento he 
had attended a meeting on the protective tariff, a 
policy in which he had long believed, and he had 
made an impromptu speech against the tariff. Free 
trade thus became his first publicly announced 
economic doctrine, and one from which he never 
departed; in the Post he was to advocate it repeat- 
edly and dogmatically. 


He opposed land monopoly because it denied 
free and just access to the bounties of earth; con- 
cerning the new natural monopolies which had 
been created by technology he felt differently. He 
demanded in the Post that the latter be transferred 
from private to public ownership—railroads and 
telegraphs to be owned by the national government 
and gas and water systems by the cities. His goal 
was to turn from private to public advantage the 
benefits of all monopolies that were not artificial 
and deserving of destruction. His editorials favored 
trade unions, a policy of high wage levels (other 
California papers were deflationist), and a policy 
of reserving the arable domain to actual home- 
steaders. Although in many of these ideas he was 
stimulated by socialists, he rejected the general 
doctrine of socialism. 

When Progress and Poverty (1871-1900, vol. 1) 
first appeared in 1879, in San Francisco, the aca- 
demic profession of economics was inchoate in the 
eastern United States and had no existence what- 
ever in California. Economic thought in the east 
followed mainly the British classical school; the 
American nationalist school, of which H. C. Carey 
had been the principal figure, was a minority move- 
ment among economists, but its influence on 
Republican and national policy gave it dispropor- 
tionate importance. The new analytical and institu- 
tional economics, then rising in Germany, Austria, 
and England, was little known at the time in 
America, and known not at all to George. There- 
fore, when George used and adapted the terms and 
logic of Smith, Ricardo, and John Stuart Mill, he 
was moving in an accepted tradition, and when 
he rejected the wages fund doctrine and the iron 
law of wages, he was merely revising the tradition, 
much as Francis A. Walker in the United States 
and Mill himself had done. 

The central four-fifths of Progress and Poverty 
is essentially a syllogism. The first premise appears 
in Book III, “The Laws of Distribution.” There 
George stated his proposition that economic rent, 
being a product of monopoly—usable land is by 
nature monopolistic, not something replaceable— 
always reduces wages and interest. The second 
premise occurs in Book IV; on the basis of observation, 
George asserted that modern industrialism 
increases rent and that, as resource sites and urban 
sites rise in value, rent flowing into private hands 
creates social inequity, depression, and poverty. 
From these two premises (the first, that the landowner 
inevitably takes from the laborer and the 
capitalist, and the second, that with private land 
ownership rent increases exacerbate social injustice) 
George reasoned to his synthesis, “The Remedy,” 
presented in Book VI. It is the historic proposal 
to do away with private property in land, or 
at least with the private privileges of tenure. In 
the following three books, George spelled out the 
procedures involved in his “Remedy”: either land 
might be nationalized, or there might be taxation 
that would transfer economic rent from the land- 
holder to the community (the latter alternative 
eventually became the “single tax”). The rent so 
placed at the disposal of the state could be distrib- 
uted or invested, according to public interest. 

The remaining one-fifth of the book contains the 
15 pages of introduction and the 90 pages of con- 
clusion, which may be called George’s moral and 
religious sequence. There the author restated his 
paradox—that modern material “progress” actually 
widens the gap between the economically privileged 
and the unprivileged—and he stated his belief that 
a socially just civilization can be based only on a 
Christian and democratic conscience: “Association 
in equality is the law of progress.” 

It was in Europe that Progress and Poverty was 
first received with understanding, if not always 
with praise. Emile de Laveleye, the Belgian “so- 
cialist of the chair,” discussed it with critical ap- 
preciation in the Revue scientifique de la France 
et de l'étranger (1880); Adolf Wagner of the Uni- 
versity of Berlin was unfavorable; but Gustav 
Schmoller found freshness in the work, although 
he also found much to criticize. George wanted 
attention in the United States so much that even 
a highly critical review by William Graham Sumner 
in Scribner's pleased him. 

George’s early international reputation increased, 
however, not so much by reason of his writings 
as because of his involvement in public affairs 
abroad. By 1881 he had moved from California to 
New York, and the Irish World of that city sent him 
as correspondent to Ireland. Conditions in Ireland 
had not yet been improved by Gladstone’s land re- 
form, and George hoped to affect the situation as 
well as to report it. He distributed Progress and 
Poverty in Ireland, as well as a pamphlet on The 
Land Question ([1881] 1906-1911, vol. 3). He also 
went to London, where he established an uneasy 
relationship with H. M. Hyndman, the wealthy 
Marxist; made friends with Helen Taylor, the lit- 
erary executor of John Stuart Mill, who said that 
Mill would have accepted George’s ideas as an 
extension of his own; and became acquainted with 
Alfred Russel Wallace, the theorist of evolution 
who was also interested in land nationalization, 
and with other prominent men like John Morley 
and Joseph Chamberlain. George made the speech 
which George Bernard Shaw says converted him 
to social reform. George proudly assessed the long, 
semifavorable review of Progress and Poverty in 
the London Times as evidence that he was being 
taken seriously in high places. 

The first visit to Europe led to four subsequent 
ones. The last one, in 1890, rounded out a trip 
that included a triumphant speaking tour in Aus- 
tralia. As far as the influence of George’s ideas in 
the United Kingdom is concerned, the visits of 
1883-1884 and 1884-1885 were the heyday. All 
the major British journals reviewed Progress and 
Poverty, and distinguished academic economists 
reacted to the book, albeit often negatively. Sir 
Henry Fawcett of Cambridge objected to George’s 
opposition to compensating the landlords whose 
property rights he would destroy; Alfred Marshall, 
while admitting “freshness and earnestness” in 
George, concentrated a barrage of statistics on 
weaknesses in his findings about wages; Philip H. 
Wicksteed, a Unitarian minister and distinguished 
economist, acknowledged a major debt to George; 
and Arnold Toynbee, the young pro-labor Oxford 
economist, tried just before his untimely death to 
refute George in public lectures in London. In 
Scotland, George discovered many admirers: the 
Land Restoration League there shared his prefer- 
ence for land taxation as the best means of reform. 
Keir Hardie became the political link between 
George’s impact on Britain in the 1880s and the 
modern British Labour party. During an Oxford 
visit, the students treated George outrageously; at 
Cambridge, things went better. As J. A. Hobson 
observed in 1897: “. . . Henry George may be con- 
sidered to have exercised a more directly powerful 
formative and educative influence over English 
radicalism of the last fifteen years than any other 
man” (p. 844). 

After his return from Ireland, George’s activities 
in America took several directions. He wrote a 
series of articles for Frank Leslie’s Illustrated News- 
paper which was polemically directed against a 
previous series of articles on current problems writ- 
ten by Sumner for Harper’s Weekly. In book form 
Sumner’s articles became What Social Classes Owe 
to Each Other, the recognized classic of social 
Darwinist thought. George’s articles, gathered and 
published as Social Problems (1871-1900, vol. 2), 
initially were better received than Progress and 
Poverty. 

George also became active on the political scene. 
In 1886 a major New York City labor union 
persuaded him to be the Labor party candidate for 
mayor. He drew considerable support from early 
social gospelers, the labor unions, and recent immigrant 
voters. At the polls he came in second, 
close enough to the winning candidate to give 
credence to the charge that he would have won if 
votes had not been stolen. He did succeed in draw- 
ing maximum attention to his ideas and in con- 
solidating his followers. 

George’s next venture was the single-tax reform. 
Although the phrase “single tax” does not appear 
in Progress and Poverty, the reform idea does. It 
was George’s lasting conviction that land-value tax- 
ation could and should be used to transfer the 
whole product of economic rent from private own- 
ers to the community. A transfer of this kind would 
be politically easier to achieve in America than land 
nationalization and would have the same economic 
effect. George began using the term “single tax” 
during the mid-1880s, but it was a convert to his 
ideas, Thomas G. Shearman, a New York lawyer, 
who transformed the term into a slogan and the 
name of a reform movement. By 1888 single-tax 
meetings were being held in the principal cities on 
the eastern seaboard, in the Middle West, and in 
California. The long-term leaders of the movement 
—Louis F. Post, Tom Loftin Johnson, Warren 
Worth Bailey, William Lloyd Garrison II, Jackson S. 
Ralston, and Lawson Purdy—were already active, 
and before 1890, 130 single-tax organizations had 
appeared. The early movement was strained by an 
inner struggle between “single-tax, limited” men, 
like Shearman, who wanted only as much rent 
appropriated as was necessary for ordinary public 
services, and “single-tax, unlimited” men, like 
George, who wanted all the rent taken. After 
George’s death, the proponents of “single-tax, lim- 
ited” predominated. The movement narrowed, but 
it endured. During the early 1900s its program was 
incorporated into municipal reform programs and 
enactments, conspicuously in Ohio; and during the 
1910s it influenced Wilsonian, Democratic progres- 
sivism. From 1887 to the present, through organi- 
zations which have varied from clubs to schools to 
endowments, the movement has continuously pro- 
moted land-value taxation and free trade. 

During the 1890s, Henry George left the advo- 
cacy of the single tax largely in the hands of others, 
as he had earlier abandoned labor politics. With 
the exception of his round-the-world trip and the 
final campaign for the New York mayoralty in 
1897, he gave his time to writing three books, 
which were essentially testaments. In The Condition 
of Labor ([1891] 1906-1911, vol. 3) he challenged 
Pope Leo XIII to permit Catholics to enter 
his movement; in A Perplexed Philosopher ([1892] 
1906-1911, vol. 5) he fully dissociated himself 
from the materialism of Spencerian and like-minded 
belief in progress and repeated his program 
for land-value taxation; in The Science of Political 
Economy ([1897] 1906-1911, vols. 6-7), which he 
never completed, he tried, with new elaboration 
but without mastery of current modes of economic 
thought, to persuade readers that his economic 
theories were scientifically sound. These codicils 
never had the impact of his earlier books, although 
they too revealed George’s passion for justice and 
freedom and his intellectual boldness and gift for 
persuasion. The final statements rounded out a 
dedicated and prophetic life. 


CHARLES A. BARKER 


[See also RENT; TAXATION, article on PROPERTY TAXES; 
and the biographies of MILL; SUMNER; TOYNBEE; 
WALKER; WICKSTEED.] 
GERSON, JEAN DE 


Jean Charlier de Gerson (1363-1429) was born 
at Gerson-les-Barbey in Champagne. He began his 
studies at the University of Paris in 1377, becoming 
a bachelor of arts in 1381 and a doctor of theology 
in 1392. For much of his life Gerson was a 
leading academic theologian of his university; he 
was made chancellor in 1395, succeeding his friend 
Pierre d'Ailly, with whom he was later to share the 
intellectual leadership of the conciliar movement. 

Gerson was prominent in support of the attempt 
of the Council of Pisa, 1409, to end the great 
schism in the papacy by appealing to the allegedly 
superior authority of the General Council in church 
government. Later he was a member of the French 
delegation to the Council of Constance, 1414-1418, 
where he formulated his mature conciliarist theories. 
He had alienated the Burgundian faction 
in French court politics by his strong condemnation 
of their use of the doctrine of tyrannicide 
to justify the murder of the anti-Burgundian Duke 
Louis of Orleans in 1407, and their hostility prevented 
him from returning to Paris; his surviving 
years were spent in Vienna and Lyons. 

Although he wrote a number of academic theo- 
logical treatises, in the scholastic manner, Gerson 
was primarily concerned with the active implemen- 
tation of the demands of the Christian life. His 
considerable pastoral experience was gained in 
Paris and in Bruges, where he was for a time dean 
of the Church of St. Donatien. A large number of 
his sermons and pastoral and spiritual treatises, 
including influential writings on mysticism, sur- 
vive. His preoccupation with the problem of how 
best to end the scandal of the papal schism grew 
out of his pastoral responsibilities. 

Theories of church government. Gerson’s ideas 
on church authority in general, and on the posi- 
tion of the General Council within the government 
of the church in particular, are among the most 
original and permanently influential parts of his 
works. He arrived at a full-blown conciliar position 
only after a slow process of disillusionment with 
the executive power of the papacy. It was as late 
as 1409, on the eve of the Council of Pisa, that 
Gerson first put forward the idea that in case of 
grave necessity the ordinary canon law of the 
church could be set aside. Gerson was here deal- 
ing with the problem of how a council to end the 
schism could be summoned without papal approval, 
hitherto considered essential by the canonists. He 
solved the difficulty by appealing to the principle 
of epikeia, or equity, which set aside the letter of 
the law in order to preserve its spirit (a concept 
going back via Aquinas and other Scholastics to 
Aristotle). 

Gerson’s mature conciliar theory is set forth in 
De potestate ecclesiastica, written in 1416-1417, 
during the Council of Constance. Gerson argued 
that the General Council, representing the totality 
of the faithful, was superior to any other institution 
in the church, including the papacy, even though, 
as the executive head of the church, the papacy 
had the prerogative to oversee its day-to-day gov- 
ernance. The theory of conciliar supremacy was 
not new; it had been discussed freely in the four- 
teenth century, but Gerson gave it its most compre- 
hensive expression. His application of the theory 
of representation to the church, the most articulated 
social organism of the Middle Ages, is his endur- 
ing contribution to political science. 

There has been some controversy about Gerson’s 
position in the medieval philosophical and theolog- 
ical spectrum. He certainly had close affinities with 
the Ockhamist school, sharing with them a dislike 
of the excessive intrusion of man-made intellectual 
concepts into the data of revelation, a practice 
characteristic of the Scotists. But he referred freely 
also to the authority of pre-Ockhamist writers of 
the realist tradition, such as St. Bonaventure, St. 
Thomas Aquinas, and St. Albert the Great, as well 
as to the authority of patristic sources. The truth 
may be that Gerson’s system was an eclectic one, 
evolved with a constant eye on the needs of prac- 
tical Christian teaching. 

JOHN B. MORRALL 


[Other relevant material may be found in the biographies 
of AQUINAS; MARSILIUS OF PADUA; OCKHAM.] 
GERVAISE, ISAAC 


Isaac Gervaise, merchant and writer on eco- 
nomics, was a French Huguenot born in Paris after 
the mid-seventeenth century. He became a natural- 
ized British citizen. In 1691 his father helped to 
finance the Royal Lustring Company of England 
(manufacturers of a fine black silk), and shortly 
thereafter both father and son became officers of 
the company. 

Gervaise is known for only one work, The Sys- 
tem or Theory of the Trade of the World, which 
was published in 1720 but in effect rediscovered 
by Jacob Viner only in 1935 (see Foreword to Ger- 
vaise [1720] 1954). It is a remarkable essay, formu- 
lating the first general equilibrium treatment of the 
international mechanism of adjustment. Gervaise’s 
presentation marks a pioneer application of the 
“income” or “macroeconomic” approach to prob- 
lems of international equilibrium and disequi- 
librium. It analyzes the process that produces dis- 
equilibrium and shows under what conditions 
adjustment is restored or maladjustment persists. 
Gervaise drew a distinction between disturbances 
which may arise from fluctuations in trade per se 
and those which may arise from comparative 
stability in prices. His formulation embraces 
domestic and international equilibrium; credit, in- 
flation, and their effects on trade; the role of 
exchange rates, monopoly, tariffs, and the determinants 
of free trade. 

Although Gervaise’s work deserves more atten- 
tion for its masterly theorizing than for its policy 
recommendations, it is nevertheless notable as one 
of the very few unambiguous expressions of a free- 
trade position in the English economic literature 
before Adam Smith. Gervaise argued in uncom- 
promising terms that monetary instability and 
monopoly must be held in strict control. In an age 
when most writings on economics consisted of 
special pleading, Gervaise’s scientifically objective 
work was a most unusual achievement. It is the 
more remarkable since Gervaise was an officer in a 
company that operated with special monopolistic 
privileges, and the public policy he advocated was 
in sharp conflict with his private interests. To be 
sure, it may have been easier and less noble for 
him to take an antimercantilist position in 1720, 
for in that year the Royal Lustring Company had 
lost its charter. Gervaise wrote in his preface: 
“. . . this System . . . hath the ill Fate to appear at 
a time, when I myself could wish it false” ([1720] 
1954, p. 3). The statement seems to imply that 
while repeal was in the public interest it was in- 
jurious to his private interests. 

Gervaise had not only the advantage of prac- 
tical experience but also the capacity for general 
inquiry. A man of his experience and training was 
in a singularly fortunate position to examine the 
then existing international mechanism of adjust- 
ment, and he examined it with realistic detail and 
penetrating insight. Every sentence reveals incisive 
first-hand knowledge of the phenomena he de- 
scribed. The economy of words, the deep insight 
into the motivations of men and nations, and the 
vigor of his analysis all contribute to making Ger- 
vaise’s book a profound study, the wisdom of which 
appears to grow with each reading. 


JOHN M. LETICHE 


[See also INTERNATIONAL MONETARY ECONOMICS; IN- 
TERNATIONAL TRADE.] 
GESELL, ARNOLD L. 


Whatever else Arnold Gesell (1880-1961) may 
have accomplished, he made his name a house- 
hold word in the United States: his books and 
articles on child development are legion, and some 
of the books were best sellers. For example, by 
1943 Infant and Child in the Culture of Today 
had gone through 15 printings. At least one genera- 
tion of American infants and preschoolers was 
reared according to the manuals provided by Gesell 
and his colleagues. 

Gesell was born in Alma, Wisconsin; by the time 
he graduated from the local high school he was 
already deeply committed to science and the pro- 
fession of teaching. At Clark University, where he 
earned his Ph.D. degree in 1906, he came under 
the influence of G. Stanley Hall, one of the earliest 
psychologists to study child development. Gesell’s 
early dedication to teaching, combined with his 
contact with Hall, probably pointed him in the di- 
rection that led to his fame as the “father of sci- 
entific child study.” 

After graduate study at Clark, Gesell spent sev- 
eral years as a teacher and principal in a public 
school, as a settlement worker, and as a normal- 
school and university professor. Finally, he found 
his niche at Yale, where he founded the Clinic of 
Child Development in 1911 and remained its di- 
rector until his formal retirement in 1948. Gesell 
had already gained substantial professional status 
when he decided that medicine would give him 
greater depth as a research worker; accordingly, 
he studied medicine at Yale at the same time that 
he was an assistant professor, and he received his 
M.D. degree in 1915. Retirement 33 years later was 
only a formality for Gesell—he continued his clini- 
cal work and research for the Harvard Pediatric 
Study and at the Gesell Institute of Child Develop- 
ment. 

Gesell’s power to attract able, almost fanatically 
devoted disciples is well known to both scientists 
and laymen. Most of these followers bear his stamp 
indelibly, have reflected credit on his name and on 
their apprenticeship to him, and are themselves 
well known in the field of child development. Not 
until the 1940s did a group of Young Turks in the 
fields of child and clinical psychology come to con- 
sider Gesell only as a recorder of norms, neatly la- 
beled and classified but without the explanatory 
power that is psychology’s central interest. 

Gesell did a basically respectable, and even a 
notable, job of combining clinical work with scien- 
tific observation of children. His photographic and 
one-way mirror studies of infants brought a new 
exactitude to the study of the young child. But 
his data about older children seem to have been 
less rigorously gathered. His coauthored books, e.g., 
The First Five Years of Life (1940) and The Child 
From Five to Ten (1946), tend to lose the indi- 
vidual child and to specify rather dogmatically an 
invariance and a genetic or constitutional uni- 
versality of child development. These works simply 
assert the cyclical character of child development 
and underestimate the influence of the complex 
culture in which the child develops and learns. 
Development emerges as something mechanical: 
invariant, unchangeable, almost fatelike. The question
may well be asked whether Gesell, with his
rigid norms, his lack of individual variation, his 
strong emphasis on the constitutional, and his neg- 
lect of the cultural, has not hindered parents in 
their rearing of children more than he has helped 
them. 

The Gesell Development Schedules were per- 
haps not intended as measures of infant intelli- 
gence, but they were widely so used and were long 
considered nearly infallible. In fact, however, they 
have failed, for the most part, to predict later in- 
tellectual development of children, individually or 
in groups. The harm that has been done by the 
trusting employment of the schedules is incalcu- 
lable: many children have been denied adoption 
and condemned to prolonged and blighting custody 
in infant-care institutions because their Develop- 
mental Quotients were low (and, predictably, be- 
came lower the longer the children stayed in the 
institutions). 

Gesell wrote with a bland, almost Olympian cer- 
tainty about how children develop, and unless his 
writings are examined carefully they carry the 
implication that his conclusions are based on the 
careful study, over an extended period, of untold 
hundreds of infants and children. Even the fine 
print does not reveal precisely the actual number 
of subjects who were studied at different ages. A 
sympathetic biographer has referred to “follow-up 
examinations on about 175 cases, with referrals 
from different agencies and persons of 600 to 
700, mostly of preschool age, and 1,000 or more 
guidance and observational contacts centering on 
nursery children” (Miles 1964, p. 55). The con- 
siderably less sympathetic Milton J. E. Senn, 
Gesell’s successor at Yale, has pointed out that the 
sample for Gesell’s basic 1925 survey was all white 
and numbered only 107 middle-class children from 
a single city in New England, while studies of older 
children were based on even smaller samples— 
for example, for six-year-olds, only eleven girls and 
seven boys (1955). 

This is slight evidence, indeed, for statements 
describing the six-year-old as follows: “He tends 
to go to extremes . . .” (Gesell & Ilg 1946, p. 90), 
or “An outstanding characteristic of the 6-year-old 
is his meager capacity to modulate” (p. 92), or 
“He does not only smile,—he fairly dances with 
joy. He cries copiously when unhappy, kicks and 
shakes with his grief. Even during sleep he pitches 
his whole organism into his dreams. Hence the 
gross arousals of his nightmares, which come to 
a peak at the age of six” (p. 94). And so on and on, 
for 42 pages. 

In sum, Gesell no longer appears the superman
of child development that he was considered in his
heyday, in the 1930s and 1940s. Serious questions 
must be raised about the quality of his social in- 
fluence and the advice to parents by his disciples— 
for example, the flat statement by Louise Ames 
that the time of first tooth eruption is perhaps the 
best predictor of later reading success. Yet none 
can deny him a measure of greatness: he was the 
most scientific of the early students of infant be- 
havior. He did give great impetus to moving “child 
study” toward “a science of child behavior.” He 
fathered and fostered one of the great centers for 
child study. He trained some of the best workers 
in the field of child behavior and development and 
influenced (and still influences) them all. These 
are no mean achievements. 

BOYD R. MCCANDLESS


[For the historical context of Gesell’s work, see the 
biography of HALL; for discussion of the subsequent
development of Gesell’s ideas, see ADOLESCENCE; 
DEVELOPMENTAL PSYCHOLOGY; INFANCY; MORAL 
DEVELOPMENT; SENSORY AND MOTOR DEVELOPMENT; 
SOCIALIZATION.] 


WORKS BY GESELL 

1940 YALE UNIVERSITY, CLINIC OF CHILD DEVELOPMENT 
the Preschool Child. New York: Harper. → Contains
“Early Mental Growth” by Arnold Gesell and “The 
Study of the Individual Child” by Arnold Gesell and 
Catherine S. Amatruda. 

(1943) 1949 GESELL, ARNOLD; and ILG, FRANCES L. Child
Development: An Introduction to the Study of Human 
Growth. Part 1: Infant and Child in the Culture of 
Today. New York: Harper. 

(1946) 1949 GESELL, ARNOLD; and ILG, FRANCES L. Child
Development: An Introduction to the Study of Human 
Mass.: Clark Univ. Press.
GESTALT THEORY


The problem that gestalt theory confronts is that 
of an extended event, whether an experience or an 
action, that cannot be adequately described as a 
sum of smaller, independent events. Such an event 
is called a gestalt; this term can be translated as
“form,” “configuration,” or “structure.” Facts of this
character were largely ignored in the atomistic psychology
of the nineteenth and early twentieth centuries,
although a few thinkers had begun to question
this neglect.

The gestalt movement introduced a new ap- 
proach to the treatment of psychological facts. It 
arose in Germany in the second decade of the 
twentieth century as a reaction against atomistic 
psychology. Its founders and pioneers were Max 
Wertheimer, Wolfgang Köhler, and Kurt Koffka. 
The first 25 years of its existence were a period of 
rapid development of ideas and of intensive investi- 
gation. Beginning with the 1920s it commanded 
wide attention in the psychological world. With the 
rise of Nazism the leading gestalt psychologists left 
Germany for the United States, and the subsequent 
course of the movement became, in a measure, part 
of the history of American psychology. [See the
biographies of KOFFKA; KÖHLER; WERTHEIMER.]

The gestalt movement advanced a searching ex- 
amination of the presuppositions of atomism; more 
important, it introduced an alternative conception 
that became the basis of many investigations and 
theoretical proposals. The earliest contributions of 
gestalt psychology were concerned mainly with 
problems of perception. Subsequently, its investiga- 
tions extended to the areas of thinking, memory, 
and learning, and, more recently, to social psy- 
chology and the psychology of art; it also made 
contact with certain aspects of logic and ethics. As 
its investigations came to include new areas, gestalt 
theory stood revealed as a systematic orientation 
within psychology. 

The source of the gestalt movement can be 
traced to one paramount concern. As a rule the 
events of mental life possess form, sense, and 
value; these are its striking characteristics. Yet, the 
prevailing scientific psychology contained virtually 
no reference to these attributes, and its accounts 
appeared correspondingly limited and barren. The 
customary reply to this stricture was that scientific 
procedure requires analysis into elements and 
therefore permits no other outcome. Accepting this 
reasoning, some thinkers concluded that important 
human phenomena and problems necessarily fall 
outside the reach of science. Others, among them 
the vitalists, appealed to higher and undefined agen- 
cies as sources of form and sense. Gestalt psychol- 
ogists rejected such a solution on the ground that 
it adopted a questionable postulate about the de- 
mands of scientific investigation. An approach that 
fails to do justice to the most obvious facts of 
experience cannot, they held, be scientifically correct.
They saw in this situation a challenge to
re-examine the starting point of psychology. 
Perceptual organization


The formulations of gestalt theory were first 
tested in perception. This was the most advanced 
part of systematic psychology and the area in which 
atomism was most strongly entrenched. In the psy- 
chology of the early twentieth century the elements 
of perception were simple and irreducible sensa- 
tions, each having its unique quality and a constant 
relationship to a particular excitation of the sense 
organs. To explain the combinations of these iso- 
lated sensations (and their images), this psychology 
invoked the mechanism of association by con- 
tiguity, which, it held, brought order into the initial 
chaos of sensations. The goals of this psychology 
were to identify the elements of experience and the 
corresponding stimulus energies and to describe the 
manner in which the elements become associated, 
all based on the method of analytic introspection. 

The issue was brought to a head when Wert- 
heimer took the radical step of denying the reality 
of sensory elements as parts of perceptual experience.
His study of apparent movement ([1912-1920] 1925,
pp. 1-105), which marks the formal
beginning of gestalt theory, provides a specific illus- 
tration of this thesis. The experience of motion was 
traditionally described as a sum of successive sen- 
sations of position, each corresponding to succes- 
sive local excitations. Apparent movement results 
when two stationary objects in distinct positions— 
for example, two lines—are exposed in succession 
with a suitably small time interval between them; 
the observer sees one line moving from one to the 
other position in a manner indistinguishable from 
real motion. This phenomenon, which is at the 
basis of cinematography, was well known. Wert- 
heimer pointed out that apparent movement is not 
a series of sensations but an effect of two stimulus 
events cooperating to produce a new, unitary out- 
come; perceived movement cannot be split up into 
successive stationary sensations. The static char- 
acter of the external situation is not represented in 
experience, while the movement perceived has no 
counterpart in the objective situation. From the 
assumption that experience consists of having one 
sensation followed by another, one cannot account 
for the experience of change inherent in motion, a 
conclusion that applies equally to the perception of 
real motion. 

A further and more important step in this devel- 
opment was the gestalt account of grouping, or 
unit formation, in perception. Visual experience 
consists of things that may, in turn, form groups; 
certain parts of the visual field appear to cohere 
and to form units that separate themselves from 
the surrounding space and from other units. The
units of perception are trees, houses, persons—not 
innumerable sensory elements. It was the contribu- 
tion of gestalt theory to show that the formation 
and segregation of units cannot be taken as self- 
evident, as it seems to common sense, but consti- 
tutes a central problem for the psychology of per- 
ception. The physical energies that are reflected 
from the points of an external object are entirely 
discrete; each hits the eye independently of the 
others, with no indication of whether each comes 
from the same object. They form a mere mosaic 
of stimuli that may be grouped in numerous ways 
and that provide no basis for the veridical organi- 
zation of the percept. Unity of the physical object 
does not account for unity of the percept. How then 
do units emerge from discrete stimulation? Wert- 
heimer described certain fundamental principles 
of grouping, or unit formation, in perception, among 
them those of proximity, similarity, closure, com- 
mon fate, and good continuation (Wertheimer 
1925). Working with discontinuous points or lines, 
he demonstrated that they tend to fall into groups, 
in accordance with relative spatial proximity and 
qualitative similarity; grouping also occurs in accordance
with closure and good continuation. Wertheimer
considered one principle, that of Prägnanz,
fundamental and inclusive of the others. The prin- 
ciple of Prägnanz maintains that grouping tends 
toward maximal simplicity and balance, or toward 
the formation of “good form.” 
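
The principle of proximity can be made concrete with a toy sketch. The following is only an illustration, not Wertheimer's own formulation: it assumes dots on a single line, and the function name `group_by_proximity` and the fixed `max_gap` threshold are hypothetical simplifications of what the text calls relative spatial proximity.

```python
def group_by_proximity(positions, max_gap):
    """Split 1-D dot positions into groups wherever the gap
    between neighbouring dots exceeds max_gap (an assumed threshold)."""
    ordered = sorted(positions)
    if not ordered:
        return []
    groups = [[ordered[0]]]
    for prev, cur in zip(ordered, ordered[1:]):
        if cur - prev <= max_gap:
            groups[-1].append(cur)   # close neighbour: same unit
        else:
            groups.append([cur])     # large gap: a new unit begins
    return groups


# Six dots arranged as three close pairs.
dots = [0, 1, 5, 6, 10, 11]
pairs = group_by_proximity(dots, max_gap=2)
print(pairs)
```

The same six dots could logically be grouped in many other ways; the sketch yields only the pairing that the spacing favors, which is the sense in which grouping is selective.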

The facts of grouping establish, first, that sets of 
stimuli produce effects not derivable from the effects 
of the single stimuli. These effects, observable only 
in extended wholes, are dependent upon strictly 
objective conditions—namely, specific geometrical 
relations between stimuli. Second, the stimulus re- 
lations logically permit other groupings that do not 
in fact occur. Thus, the facts of grouping give evi- 
dence of selective principles according to which 
sensory data are organized, and the units of per- 
ception must therefore be considered products of 
organization or specific effects of processes result- 
ing from certain relationships. Third, the same 
sensory conditions of grouping that usually give 
rise to veridical perception sometimes produce non- 
veridical perception. As the facts of camouflage 
show, physically real units are not necessarily per- 
ceived, and perceptual units sometimes arise in the 
absence of corresponding physical units. Fourth, 
Wertheimer concluded that he had identified pri- 
mary principles of grouping in perception. He ex- 
plicitly included past experience as one determinant 
of grouping but maintained that it cannot account 
for the other grouping tendencies that are themselves
necessary conditions of learning. Last, the
principles of grouping have a broad range of appli- 
cation; foremost is their capacity to explain object 
and form perception in general. 

The treatment of part-whole relations, which is 
central to the gestalt position, may best be illus- 
trated with the contribution of von Ehrenfels, who 
in 1890 described perceptual facts that are not a 
sum of independent local components. A property 
of a visual entity, such as roundness or symmetry, 
does not reside in its separate parts or in their 
sum; the same is true of the character of a temporal
unit, such as a melody. Such properties are
also transposable; a melody is recognized in a new 
key, although it shares no tones with that heard 
originally, and a square is recognizable as such 
when it is enlarged or reduced or when it appears 
in a new part of the field. There are innumerable 
facts of this order that refer to qualities in wholes 
only, among them those we call straight, closed, 
hard, smooth, translucent. 
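
The transposability of a melody can be shown in a small sketch. It is offered only as an illustration under stated assumptions: pitches are written as MIDI-style note numbers, the four-note melody is invented for the example, and the helper `intervals` is a hypothetical name. The point is that the relations between tones survive transposition even when no single tone does.

```python
def intervals(melody):
    """Return the successive pitch steps of a melody."""
    return [b - a for a, b in zip(melody, melody[1:])]


# A hypothetical four-note melody as note numbers (assumed encoding).
original = [60, 62, 64, 65]
# The same melody moved up six semitones into a new key.
transposed = [note + 6 for note in original]

shared_tones = set(original) & set(transposed)
same_shape = intervals(original) == intervals(transposed)
print(shared_tones, same_shape)
```

Here the two versions have no tone in common, yet their interval patterns are identical, so the whole-property is preserved while every element has changed.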

These form qualities, or Gestaltqualitäten, posed
a problem for a psychology that took sensations as 
the sole contents of experience. Reversing the tradi- 
tional formulation, Wertheimer proposed not only 
that a coherent whole has properties and tenden- 
cies not discoverable in its isolated parts but also 
that a part has properties which it does not possess 
when it stands alone or when it belongs to another 
unit. The character of a whole often determines 
whether one of its parts will be perceptible or not 
and what its properties will be. Given three dots in 
a linear array, one is perceived as middle, the others 
as ends; these properties are relationally deter- 
mined and do not exist for the isolated components. 
This thesis of part-whole determination asserts 
that a part is a dependent property of its whole and 
thus draws a basic distinction between “part” and 
“element.” 

Furthermore, a host of discoveries demonstrated 
that the identical stimulation, at different points in 
time, of a given region can produce markedly dif- 
ferent effects, depending upon the stimulation oc- 
curring in neighboring regions. The perceptual 
constancies and the so-called illusions revealed 
striking discrepancies between what is in fact ob- 
served and what should be observed if local sensa- 
tions alone were the content of experience. The 
shapes and sizes of objects remain within limits 
approximately constant as their orientation and dis- 
tance are varied, and the colors of objects tend to 
look the same when the conditions of illumination 
change widely. The same proximal stimulation may 
cause perception of bright or dark, of upright or 
tilted, of large or small, of motion or rest, of motion
at a high or at a low velocity, depending upon other
stimulus conditions. In an effort to bolster the 
classical position the interpretation advanced was 
that the sensations in question were in fact un- 
changed but were corrected by judgments or 
“unconscious inferences” formed in the course of 
past experience. A clearer and more consistent ex- 
planation of these and other facts could be given 
using the assumption that they were effects of per- 
ceptual organization initiated by specific stimulus 
relations. [See PERCEPTION, articles on PERCEPTUAL 
CONSTANCY and ILLUSIONS AND AFTEREFFECTS.]

Important support for the gestalt treatment of 
perception came from the demonstration by Rubin 
(1915) of the distinction between “figure” and 
“ground,” between the thing-character of the former 
and the formlessness of the latter. A step in the 
same direction was the subsequent discovery by 
Michotte (1946) that particular conditions of suc- 
cessive stimulation produce the experience of caus- 
ality. When figural units are perceived to move in 
relation to each other at certain rates, they are ex- 
perienced as functionally connected; the observer 
refers to the motion of one object as the cause, 
while the motion of another object is perceived as 
the effect. Still other patterns of movements, which 
can also be clearly specified, produce the impression 
of animated movement. In the light of these and re- 
lated findings atomism in perception ceased to be 
a viable position. 

Physical and physiological gestalten. The con- 
cept of gestalt received a fundamental elaboration 
in the work of Köhler (1920; 1940). As a first step 
Köhler called attention to a striking similarity be- 
tween certain aspects of field physics and facts of 
perceptual organization. He pointed to certain in- 
stances of functional wholes in physics that cannot 
be compounded from the action of their separate 
parts. There are macroscopic physical states that 
tend to develop toward an equilibrium and in the 
direction of maximum regularity. One can describe 
the local conditions in such functional wholes with 
any desired degree of precision, but they do not 
function as independent parts. Systems of this 
character, of which there are numerous instances, 
are physical gestalten. They meet the criteria of 
von Ehrenfels (1890) for a gestalt quality. 

Following the lead of the phenomenal data, 
Köhler proposed that there are macroscopic field 
processes in the brain, involving interactions which 
account for the effects of grouping and segregation 
and for the operation of the Prägnanz principle.
Traditionally, cortical action was described in terms 
of separate excitations conducted along insulated 
fibers to circumscribed areas. The relational determination
of experiences implies that the neural
processes corresponding to separate stimulations 
must influence each other across distances in a 
manner that depends on their relative properties. 

Köhler proposed a fundamental change in the 
conception of cortical functioning. A region such 
as the optic sector may be considered an electro- 
lyte; the processes within it occur according to 
physical laws of self-distribution rather than ac- 
cording to the microanatomy of neural networks. 
Local states of excitation are surrounded by fields 
that represent these states in their environment 
and interact with other local states similarly rep- 
resented. On this basis Köhler put forward the hy- 
pothesis that there are physiological processes 
which are special instances of physicochemical 
gestalten and that these are the correlates of phe- 
nomenal gestalten. 

Implicit in the preceding examination is the 
assumption of psychophysical isomorphism, or the 
proposition that brain processes include some struc- 
tural features that are identical with those of or- 
ganized experience. Isomorphism refers not to 
metrical but to topological correspondences; brain 
processes are assumed to preserve the functional 
relations of symmetry, closedness, and adjacency, 
not the exact sizes and angles of patterns projected 
on the retina. This formulation diverges from the 
widely accepted view that phenomenal and physio- 
logical events are lawfully correlated but have no 
further likeness between them. The postulate of 
isomorphism is intended as a heuristic guide to 
investigation. In this manner Köhler sought a uni- 
fied explanation for facts in neurophysiology and 
psychology among certain facts of physics. [See 
NERVOUS SYSTEM, article on STRUCTURE AND 
FUNCTION OF THE BRAIN.] 


Memory and association 


There is a natural transition from the gestalt 
study of perception to memory. When a form has 
been perceived it may be subsequently recognized 
and recalled: thus, the products of perceptual or- 
ganization are among the contents of memory. The 
persistence of past experiences requires a concept 
of memory traces; further, the resemblance be- 
tween memories and original experiences implies 
that memory traces preserve the organized char- 
acter of earlier processes. Gestalt studies of mem- 
ory start from this assumption; a first effort to 
elaborate a theory of memory-trace action will be 
found in Koffka (1935). 

The preceding formulations lead directly to one 
of the problems of memory—recognition. The facts 
of transposition to which von Ehrenfels first called 
attention imply the recognition of wholes or gestalten
and, further, that recognition occurs on the
basis of gestalt similarity and in the absence of 
identical elements in past and present situations. 
Since recognition depends upon the activation of 
specific memory traces and is highly selective, the 
gestalt proposal is that such memory-trace contact 
occurs on the basis of distinctive similarity, anal- 
ogous to grouping by similarity in immediate ex- 
perience. This formulation further implies that if 
the process corresponding to a present experience 
is to contact a corresponding memory trace, it must 
have its effect beyond its immediate locus and that 
neural conduction along insulated nerve fibers 
alone does not suffice to explain recognition. Thus, 
according to this account recognition depends upon 
an interaction that is relationally determined.

With respect to the concept of association two
points are of main consequence. First, the gestalt
account of perceptual organization was intended 
as a direct alternative to the interpretation that 
perceptual units are made up of associations be- 
tween elements. Second, association theories have 
in practice all but abandoned the phenomena of 
perception; they tend to take the presence of per- 
ceptual units for granted and concentrate instead 
on connections formed between one unit and an- 
other. In treating these facts, one principle, that of 
association by contiguity, was increasingly singled 
out as basic. According to this principle, temporal 
contiguity is the crucial condition of association. 
In this account an association has the character 
of a bond that does not alter the terms it connects.

From the standpoint of gestalt theory the concept
of an association as a mere bond is not a
satisfactory basis of explanation. Processes in na- 
ture are as a rule relationally determined. In this 
connection Köhler (1929; 1941) proposed that an 
association is not a new process, but an aftereffect 
of organization and that it is dependent upon the 
relative properties of the respective terms. When 
two items are connected they form a unit and leave 
a corresponding unitary trace; subsequent excita- 
tion of a part of this trace will spread to the entire 
trace. Given this starting point there is no reason
to single out the relation of contiguity, to the ex- 
clusion of others; all relations, such as those of 
similarity and good continuation, should bind 
events to each other. More generally, conditions 
favorable to organization should be conditions of 
association. Accordingly, the formation of associa- 
tions and perceptual organization receive a unified 
interpretation. There remain unresolved issues in 
this area, but the available evidence supports the 
conclusion that relations other than contiguity exert
pronounced effects on the formation of associations
(Asch 1960). [See FORGETTING.]


Thinking 

Two themes have been most prominent in the 
gestalt treatment of thinking: one concerns the 
occurrence of understanding or insight; the other, 
the occurrence of processes of discovery. Of these, 
understanding is the more general phenomenon; it 
occurs often in the absence of the discovery of 
solutions and provides a basis for them. To under- 
stand is to have an awareness of a required relation 
between immediately given facts. When such un- 
derstanding is present the relation is experienced 
as “following from” the given facts—that is, the 
nexus between them is itself understandable. Given 
two premises and a conclusion the latter either 
develops out of the former or contradicts it. Such 
relations, which have the character of “if A, then 
B and only B,” contrast most strongly with the association
between heterogeneous facts; the terms
and their relation form a unit all parts of which 
are dependent upon one another. An understand- 
able relation between two terms is not a third term 
added to them; given any two of the parts the third 
is demanded. The relation in question is thus a 
dependent part-property of a whole. The first point 
of the gestalt account of thinking is that under- 
standing or insight in the sense here described per- 
vades human experience and that no thinking is 
possible in its absence. Understandable relations 
have the character of requiredness, or “oughtness.” 
This is the outstanding trait of facts of aesthetics 
and ethics as well as of logic; in each of these 
realms requiredness is relationally determined, be- 
ing a property of an interdependent situation. 
Thus, the concept of value becomes related to that 
of organization. One observes an important aspect 
of requiredness when a situation is incomplete; in 
such cases the gap has particular properties that 
produce tendencies toward completion in accord- 
ance with the character of what is given. Gestalt 
theorists have sought to explore the conditions of 
requiredness out of a concern for establishing 
whether there are ethical invariants; these invari- 
ants would provide an alternative to a relativistic 
foundation for ethics.

Connections between concrete empirical events 
are not, however, understandable in the same way 
as logical connections. That heavy bodies fall when 
dropped cannot be automatically deduced; the 
underlying functional connections are hidden, and 
conclusions concerning them must be based on 
induction. Accordingly, the prevailing tendency of 
psychology since David Hume has been to stress
the role of purely factual regularities in our knowledge
of causal action. Gestalt psychology proposes
that empirical events too are often related in ways 
that are structurally simple and that these rela- 
tions facilitate the learning of the causal interplay. 
Duncker (1935) has pointed out that there are far- 
reaching correspondences between the phenomenal 
properties of causes and of their effects. They are 
often coincident in space and time and thus stand 
out against a background of more indifferent 
events. A sound is heard where an object is seen to 
strike; a sheet of paper acquires a crease where it 
is folded; fire burns shortly after a match is ap- 
plied to an object. There are also pronounced simi- 
larities of content and form between cause and 
effect. The shape of a footprint corresponds to that 
of a shoe; a hot object transmits heat to its sur- 
roundings; a wet object moistens things in contact 
with it. Further, variations of cause often produce 
parallel variations of effect. The accelerated rhythm 
of the motions of knocking parallels the changing 
rhythm of the sounds produced; the stronger the 
push one applies to an object, the faster and farther 
it moves. These relations make possible a system- 
atic ordering of empirical facts, although the rela- 
tions are not fully intelligible. 

Gestalt psychology treats productive thinking as 
the development of new structures or organizations. 
The discovery of a solution begins with a situation 
and a goal that cannot be directly reached; what 
requires explanation is how the gap is bridged. The 
principal point of the gestalt account is that the 
operations of thinking do not occur piecemeal but 
are effects of organization and reorganization. 
First, thinking is a directed process based on an 
initial view of a coherent but incomplete situation. 
The direction arises from the problem itself—more 
accurately, from the gap between the view of the 
given conditions and the goal. The urge to over- 
come the difficulty creates the tensions and vectors 
that lead to a re-examination of the materials and 
of the problem. This formulation asserts a distinc- 
tion between an aggregate of independent facts and 
a structure; there can scarcely be productive think- 
ing when the possibility of grasping a principle is 
excluded. Further, under the stress of the initially 
incomplete view the material is reorganized; parts 
and relations previously unnoted or in the back- 
ground emerge, often abruptly, analogously to the 
reversal of perceptual forms, and parts previously 
separated become united. These changes in the 
meaning of parts, including changes of relation and 
direction, produce the transition to a new view that 
has greater coherence. From the outset the steps 
are guided by the main lines of the problem and 
are taken with reference to each other. The operations
of centering and recentering, of separating
fundamental from peripheral features, spring from 
the whole character of the situation or from a struc- 
tural view of the gap and its stresses. These formu- 
lations account for the fact that the organization 
of the problem situation often changes before the 
more detailed steps can be elaborated. 

The preceding account represents only some first 
steps toward a theory of discovery. There is at this
time no satisfactory explanation of the occurrence 
of sudden reorganization that favors the emergence 
of a solution. Reference to understanding or insight 
does not constitute an explanation, since these are 
descriptive terms that do not clarify the underlying 
operations. 

The treatment of thinking in gestalt psychology 
was formulated in explicit opposition to the as- 
sociationism of the early decades of the twentieth 
century, which excluded reference to understand- 
able relations and, indeed, to relations generally. 
Associationism postulated that connections be- 
tween psychological events are neutral and devoid 
of meaning—that is, given events A and B, nothing 
in the character of A points to B rather than to any 
other event. Associationism also excluded refer- 
ence to operations of organization and reorganiza- 
tion. Accordingly, it described the emergence of 
changed views and of new solutions in terms of the 
reshuffling of associative chains, the components 
of which remain constant. This approach defined 
knowledge as a repertoire or inventory of specific 
data and of connections between them. From the 
standpoint of gestalt theory the striking powers of 
thinking seem to disappear under the association- 
istic treatment. Thinking involves functions differ- 
ent from association, although it draws some of its 
materials from associations. No purely contingent 
associations, however strong, can provide under- 
standing. 

Another point at issue is the role of past experi- 
ence in the process of solution. Associative accounts 
treat thinking mainly as a product of past experi- 
ence. Gestalt theory does not question the contribu- 
tion of past experience but asserts that thinking 
involves more than recall. Since innumerable as- 
sociations lead out from a given problem situation, 
a solution cannot arise on the basis of associative 
reproduction alone. Some selection must take place. 
Further, it is questionable whether the products of 
past experience enter into thinking in unaltered 
form; they may have to be reorganized in order to 
meet the demands of the problem. Nor may one 
neglect the fact that recalled material is itself often 
the product of understanding that has occurred in 
the past; reference to past experience does not ex-
clude understanding. Finally, the solution of a
problem may not even require recall of added facts; 
this is the case when the appropriate facts are 
given as part of the problem. Conversely, one may 
fail to solve a problem under these conditions. 

In recent decades a revised associationism has 
attempted to encompass the organized character of 
thinking operations within the framework of a 
stimulus-response analysis. Realizing the inade- 
quacy of the earlier one-stage associative paradigm, 
it has postulated the presence of intervening medi- 
ational processes, which would bridge the gap 
between apparently noncontiguous events (e.g., 
Osgood 1953). The inferred mediating events in 
question are assigned the same properties as overt 
stimulus-response connections. Their function is 
to introduce further associative links between 
overtly observable connections and, thus, to pro- 
vide a replacement for cognitive operations. This 
elaboration admits no organizing principle other 
than that of association; it continues to adhere to 
a linear model of thinking operations, treating 
them as chains of stimulus-response connections.
It is too early to evaluate this effort; at this time 
it is not clear how it can accommodate the presence 
of rules or principles, the seeing of given materials 
in a new way, or the achieving of a view in terms 
of which one can understand a mass of detail. 

The psychology of thinking touches upon ques- 
tions of education, of teaching and learning, since 
there is no sharp separation between discovering 
a solution and understanding it when it is ex- 
plained. This aspect has been treated most ex- 
tensively by Wertheimer in Productive Thinking 
(1945). In this connection he has contrasted learn- 
ing by drill and by understanding. A pupil may 
memorize the steps of a solution and reproduce 
them without error, but if he has failed to under- 
stand, he will become helpless or make senseless 
errors when the details of the problem are changed. 
If he has understood the relation of the steps to 
the goal, he will be able to adapt the solution to a 
new set of conditions that retains the essential 
structural relations. Indeed the ability to produce 
the necessary transpositions constitutes an opera- 
tional test of understanding (see, e.g., Katona 
1940). In teaching and learning, as in the dis- 
covery of a solution, to leave out the relation of a 
given fact to the whole is to take away the essen- 
tials of thinking. Educational practices that stress 
piecemeal preoccupation with details, exactness of 
repetition, and instantaneous response tend to be 
inimical to thinking. [See LEARNING; PROBLEM 
SOLVING; THINKING.]


Social psychology 

Gestalt theory has in recent decades provided 
the starting point for a number of systematic 
efforts in social psychology, among them those of 
Lewin and Heider. The following are some selected 
examples of problems studied from this point of 
view. 

(1) Social action in man depends upon the ca- 
pacity of the participants to perceive and under- 
stand one another. These operations involve refer- 
ence to the mental processes of others; in everyday 
life one makes sense of the actions of persons by 
referring to their feelings, perceptions, intentions, 
and ideas. Yet, it is widely accepted that there is no 
access to these internal events in others, that one 
can only observe their actions, and that these 
actions need not be expressive of internal events. 
How, then, is one to explain the conviction that 
another is in pain or is angry or that his voice is 
charged with sorrow? According to one account 
such conclusions can only be reached indirectly, 
on the basis of association and inference by analogy 
with one’s own experiences. A more behavioristic 
account disregards reference either to the experi- 
ence of the observer or of the observed; the actions 
in question are said to acquire significance on the 
basis of association with other actions and environ- 
mental conditions. Each of these formulations 
treats the perceived actions of others as initially 
neutral. Gestalt theory proposes a fundamentally 
different conception of the relation between action 
observed in others and their experience, holding 
that these are structurally closely similar. Fearful- 
ness, joy, hesitation, boldness are expressed in 
action as much as in the dynamics of experience. 
The outward form of an action is an expression of 
underlying forces. If so, understanding the mental 
life of another person is not primarily a question 
of generalizing from the physical to something 
unrelated to it. 

Perception and understanding of other persons 
depend to a marked degree on the observation of 
their expressive, or physiognomic, characteristics. 
For gestalt theory physiognomic facts are a crucial 
part of perception (Arnheim 1954; Koffka 1940). 
Percepts, including those that are inanimate and
static, are rarely neutral. One perceives in visual
and auditory patterns the dynamic characteristics 
of tension, balance, rhythm; these are indeed the 
primary contents of everyday perception. Expres- 
sive qualities depend upon the pattern of an entire 
situation; they tend to be lost when one concen- 
trates on separate parts. They are often more im- 
mediately and more vividly observable than shape
and color; the clouds that hang in the sky are
ominous as well as dark, a face is alert as well as 
elongated. Expressive qualities are functionally 
important, since they determine approach and 
withdrawal. Gestalt theory assumes that certain 
patterns and movements are initially perceived as 
inviting or repelling, cheerful or somber; further, 
the perception of identity of expressions across dif- 
ferent media and modalities depends upon simi- 
larity of form qualities. Gestalt theory therefore 
opposes theories that try to derive the expressive 
character of a whole from those of its separate 
components. Illustrative of this is the effort to 
arrive at the expressive characteristics of a face 
from an analysis of its individual parts or to read 
character from handwriting based on a list of iso-
lated characteristics. [See EXPRESSIVE BEHAVIOR.]

(2) The proposition that each individual acts in 
accordance with his wishes and needs has received 
a particular interpretation in contemporary psy- 
chology—namely, that human relations are with- 
out exception based on self-interest. So incontro- 
vertible has this assumption appeared that it has 
dictated that membership in groups and even con- 
cern for others and action in behalf of others be 
treated as derivatives of self-interest. Gestalt psy- 
chology calls attention to the far-reaching ambi- 
guity and vagueness in this position. It begins with 
the observation that the phenomenal self is only 
one part of the phenomenal field. The world that is 
represented in the experience of each person and 
that spurs him to action includes far more than the 
self; in fact, the self constitutes a small portion of 
it. The question then arises of specifying the kinds 
of relations that obtain under different conditions 
between the phenomenal field and that part of it 
we call the self; this is a special case of part-whole
relations. Seen in this light it becomes evident that 
the customary formulation confuses the phe- 
nomenal self with the entire psychological field. In 
consequence the correct—and tautological—prop- 
osition that motivational vectors have their source 
in the individual is equated with the quite different 
proposition that the vectors arise from the self. The 
latter proposition too is often correct, but its uni- 
versal validity is no longer self-evident. In percep- 
tion one finds that under certain circumstances the 
coordinates of the surroundings become the coordi- 
nates for the self—that is, that the self is perceived 
and localized with respect to the surroundings, and 
not conversely. Thus, there are conditions that in- 
duce a person to perceive himself in motion when 
he is stationary or tilted when he is upright. 

By analogy it becomes necessary to ask whether 
there are not also motivational conditions in which 
the individual feels and is moved to act as part of
the social field, subject to its demands. In the light 
of this analysis it is dogmatic to presuppose that 
acting in accordance with the needs of others or 
with the demands of a situation must be inter- 
preted as a version of self-centeredness. There may 
rather be circumstances when self-centered action 
is unnatural. Actions called right and wrong are 
particularly instructive in this connection. They 
cannot be simply equated with preferences since 
they often go against personal inclination, nor are 
they always in accord with convention or social 
approval. The issue as restated becomes a factual
one: under what conditions does action become 
self-centered, and under what conditions is it in 
accordance with needs and requirements that are 
located outside the self and to which the self is 
responsive? No propositions of gestalt psychology 
prescribe an answer to this question; rather, the 
example illustrates the role of phenomenological 
analysis and of a concern with part-whole relations
in the formulations of a psychological problem.
[See SYMPATHY AND EMPATHY.]

(3) Forming an impression of a person has a 
place of obvious importance in social psychology. 
Is it adequate to say, in accordance with an atom- 
istic interpretation, that to have an impression is 
to possess a number of facts about a person, to 
know that he has these and these characteristics? 
Certain initial observations stand in the way of 
such a conclusion. An impression of a person ap- 
pears in some sense unitary; also, certain aspects 
are felt to be more fundamental than others. To 
this one must add that a change in one character- 
istic may alter the character of the entire impres- 
sion. Thus, the discovery of one new fact about a 
person may have drastic repercussions upon one’s 
entire view of him; one may be compelled to reor- 
ganize one’s view and to conclude that one had not 
really known him. Furthermore, even when one 
“knows” a man, at some point one may realize, 
without the benefit of new information, that his 
characteristics are really organized quite differently 
than one had originally supposed and that one had 
missed the main point about him. 

These observations form the basis of a gestalt 
theory of impression formation that emphasizes the 
interrelations among observed characteristics and 
the manner in which these characteristics modify 
each other (Asch 1952). An impression has the 
characteristics of a structure the parts of which 
cooperate to produce a particular organization. 
From these initial assumptions the following, more 
specific propositions emerge from investigation: 
(a) The items of knowledge about a person do 
not remain isolated but interact and mutually alter
each other. (b) The interactions depend upon the 
properties of the items in their relation to one an- 
other. (c) In the course of interaction the char- 
acteristics group themselves into a structure in 
which some become central and others dependent. 
(d) The resulting interdependence creates a uni- 
tary impression which tends to be subjectively com- 
pleted in the direction of becoming more consistent 
and coherent. (e) It follows that a given item of 
information or a characteristic functions as a de- 
pendent part, not as an element. (f) If so, the 
“same” trait in two persons is not necessarily the 
same. The issue under discussion is not limited to 
persons; the same questions arise when one con- 
siders the knowledge of any extended and inter- 
related situation, whether it concerns the perceived 
character of a group or the structure of an attitude. 
[See PERCEPTION, article on PERSON PERCEPTION.]

(4) Acts and utterances of persons and groups 
are constantly judged and evaluated; how these 
judgments are made is a problem that has attracted 
attention. One general observation has been the 
starting point of considerable investigation: a given 
act or assertion is often differently evaluated de- 
pending upon its source. Thus, one may accept an 
opinion from one person or group but reject it when 
it comes from another. The customary explanation 
of this effect has been in terms of prestige sugges- 
tion; a judgment or opinion is said to be changed 
by the attachment to it of positive or negative pres- 
tige. Of theoretical consequence is the assumption 
that an object of judgment is one thing, its evalua- 
tion another, and that these separate factors can 
be connected at will to produce an arbitrarily de- 
sired outcome. An alternative interpretation as- 
sumes that actions and evaluations are relationally 
determined (Asch 1952). Specifically, an act or 
assertion does not retain a fixed character when it 
is related to two different sources but functions 
as a dependent part of its context, altering in con- 
tent and significance as it is referred to different 
sources. If so, the datum that is accepted is not
psychologically the same as that which is rejected,
and the effect in question does not concern pri-
marily a change in the evaluation of an object but
of the object being evaluated, not a change of re- 
sponse to a fixed condition but a change in the 
condition to which one is responding. This inter- 
pretation diverges importantly from the usual one 
for both critical and uncritical judgments. [See 
PERCEPTION, article on SOCIAL PERCEPTION.]

(5) A formally similar problem arises at a more 
comprehensive level in the psychological interpre-
tation of culturally determined values. Thinking
must guard against two opposed dangers: taking
profound cultural differences too lightly and ac- 
cepting their incommensurability too readily. Be- 
havioristic learning psychology derives cultural 
values from the operations of conditioning and re- 
ward. The most striking application is to ethical 
judgments, which are said to be learned by the 
application of reward and punishment. From this 
standpoint it follows that the same act evaluated 
as morally right by one society may well be re- 
garded with indifference or as wrong by another. 
Gestalt theory introduces several considerations 
that this approach ignores. First, ethical tendencies 
may be considered vectors or requirements that 
follow directly from observation of specific condi- 
tions. Second, the relation between conditions hav- 
ing a specific character and the ethical judgment 
they generate is invariant. This postulate would 
suffer disproof if it were shown that a situation 
possesses the same cognitive character for those 
who judge it differently. However, much evidence 
in support of ethical relativism fails to consider the 
definition of the situation, which often varies con- 
siderably as a function of differences in knowledge 
and factual assumptions. Indeed, on the assump- 
tion that there are invariant principles of right and 
wrong, differences of situational meaning must 
produce differences of evaluation. There is no rea- 
son to expect that two people would value identi- 
cally what they see and hear if they do not see and 
hear identically, even though they are facing the 
same objective conditions. A limited conclusion 
from the available evidence is that the range of 
cultural relativism is substantially narrowed when 
one takes the situational context into account. 
From this examination emerges the following re- 
formulation of the problem of ethical relativism: 
can one attach different or opposed evaluations to 
a situation that has a constant cognitive content? 
Without prejudging a full answer to this difficult 
question, it appears that such an outcome is far 
from certain. [See CULTURE, article on CULTURAL
RELATIVISM; ETHICS.] 


Major issues 


An examination of the main themes of gestalt 
theory may clarify their mutual relevance and their 
relations to other directions within psychology and 
may point to issues that remain unsolved. 

Parts and wholes. The most general goal of 
scientific inquiry is to describe and explain the 
interdependence of observed events. This has been 
the aim of both atomistic and gestalt theories in 
psychology; they are essentially formulations about 
operations or modes of dependence. Given the
atomistic assumption of discrete elements, the con-
nections formed between them are neutral or inde-
pendent of the terms they join. It follows from this 
starting point that ordered events, whether simul- 
taneous or successive, are summations of com- 
ponents. This beginning has the appeal of simplicity; 
it reduces the operations underlying the coherence 
of phenomena to a bare minimum—namely, to tra- 
ditional associations. The gestalt treatment of de- 
pendence takes as its starting point the evidence of 
experienced wholes. The observation that a change 
at one point of a coherent whole creates systematic 
changes at other points suggests that there is 
mutual determination of parts within a whole and 
that there are processes of interaction which de- 
pend upon relations between the parts. The recog- 
nizability of transposed wholes provides strong 
support for this formulation. Further, since parts of 
wholes often have a hierarchical character, their 
structure is not adequately described in terms of a 
sum of relations. 

The following examples illustrate the point at 
issue. A child who places a solid geometrical form 
in a form board appears to be guided by similarity 
of shape between the object and the area to which 
he fits it. A gestalt account of this performance 
would begin with the primary role of perceived 
similarity in steering the action. An association- 
istic account finds no place for the direct effect of 
such an intrinsic relationship; instead it begins 
with operations based on contiguity and derives the 
effects of similarity from them. The following quite 
different illustration bears upon the same point. 
How can one characterize the relation between an 
emotional experience and the apprehension of the 
conditions that produce it? It is customary to sup- 
pose that the provoking conditions function as a 
spark or trigger that releases the emotional effect. 
In this formulation the connection between the 
antecedent and consequent conditions is again 
neutral; nothing in the properties of the former 
accounts for the properties of the latter. The alter- 
native gestalt account proposes that an intrinsic 
relation connects the respective events: we flee 
from the horrible and laugh at what is amusing. 
More generally, a clarification of interdependence 
among events may involve more than the statement 
that A is followed by B; it requires an explanation 
of how one event grows out of another, how the 
character of one determines the character of the 
other.

Despite its pivotal position, the notion of part- 
whole determination has not been fully clarified. 
Some students have objected to the gestalt claim 
that parts do not enter into wholes with a fixed 
character, on the ground that a whole can alter a
part only if the latter has definite properties of its 
own. This formulation raises no logical difficulties 
but it does point toward problems that await in- 
vestigation. Gestalt psychologists have concentrated 
on two extreme types of conditions: those that give 
rise to highly coherent units and, in contrast to 
these, conditions that approximate to a mere aggre- 
gate of data. There is doubtless a multiplicity of 
intermediate instances deserving of study that 
would clarify the kinds of part-whole relations that 
occur. Thus, under certain conditions the parts of a 
unit are clearly perceived and are available in 
a relatively independent way; the omission of a part 
may have quite distinct effects depending upon the 
kind of unit in question. 

A difficulty has also arisen over the formulation 
that an additive analysis is not adequate to account 
for facts of organization. One generally attempts 
in investigation to relate a given effect to a number 
of conditions. The customary procedure in psy- 
chology is to trace the total effect to effects pro- 
duced by pairs of variables—A and B, A and C, 
B and C, etc.—and to derive the final outcome from 
an accumulation of these separate effects. This 
mode of analysis slights the fact that once inter- 
action has occurred between A and B, neither of 
these is any longer present in its original form 
when one considers their interactions with C, etc. 
Given an organized context, the effect cannot be 
decomposed into independent strands; explanation 
requires a law of the mutually interacting forces 
taken as a whole. 
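
The limits of a pairwise analysis can be illustrated with a small numerical
sketch. It is an illustration only, not part of the gestalt account: the three
two-valued conditions, the product rule in `outcome`, and the helper `contrast`
are all assumptions chosen to make the point visible. The outcome depends on the
three conditions jointly, so every single-variable and every pairwise contrast
comes out as zero, and only the contrast over all three recovers the effect.

```python
from itertools import product

def outcome(a, b, c):
    # Hypothetical effect that depends on A, B, and C only jointly.
    return a * b * c

# Every combination of the three conditions, each coded as -1 or +1.
cells = list(product((-1, 1), repeat=3))

def contrast(indices):
    """Average outcome weighted by the product of the chosen conditions."""
    total = 0
    for cell in cells:
        weight = 1
        for i in indices:
            weight *= cell[i]
        total += weight * outcome(*cell)
    return total / len(cells)

main_effects = [contrast((i,)) for i in range(3)]            # A, B, C alone
pairwise = [contrast(p) for p in ((0, 1), (0, 2), (1, 2))]   # A-B, A-C, B-C
three_way = contrast((0, 1, 2))                              # A, B, C together

print(main_effects, pairwise, three_way)
```

In this constructed case an accumulation of separate and pairwise effects would
report no effect at all, although the outcome is fully determined by the
conditions taken together.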

Certain misconceptions concerning wholes and 
parts are probably less widespread at present than 
in the past. First, it had not been sufficiently under- 
stood earlier that experienced gestalten are not re- 
lated to any possible gestalt characteristics of the 
objective conditions. These last, when considered 
as conditions of stimulation, are never gestalten; 
the consequences of organization on perceptual ex- 
perience always transcend the conditions of stimu- 
lation. Second, emphasis upon wholes does not 
imply an indiscriminate dependence of facts upon 
each other or the assumption that there are no 
facts independent of each other. Wholes are self- 
limiting, since segregation is the counterpart of 
unit formation. Third, gestalt psychology is not 
opposed to analysis. The position it has taken is
that analysis is fruitful provided it deals with the 
units and natural parts actually found in experi- 
ence. Fourth, the most general implication of 
whole-part determination is that crucial character- 
istics of local facts are ignored unless one takes 
into account their place within a wider scheme. In
the early days of gestalt psychology there was a
tendency to neglect the converse point, later in- 
vestigated by Köhler (1958), that a larger organiza- 
tion may suppress the individuality of its parts by 
obscuring its suborganizations. Last, whether a 
given experience or action is a sum of components 
or a product of organization is a wholly empirical 
question. Thus, the issue of “relational,” as against 
“absolute,” choosing in discrimination learning is 
to be resolved by evidence, as is the question 
whether an impression one forms of a person con- 
sists of a sum of data or of their transformation 
into organized form. The contribution of gestalt 
psychology to these problems has been to formulate 
the alternatives more sharply and, where possible, 
to devise procedures for testing them. 

The principle of “Prägnanz.” No proposition is
more characteristic of gestalt thinking than the
principle of Prägnanz: experienced perceptual
wholes tend toward the greatest regularity, sim-
plicity, and clarity possible under the given con-
ditions. This principle also applies to certain phys-
ical systems, and Köhler in particular has applied
it to the cortical correlates of perception. Two op- 
posed outcomes derive from the tendency of ex- 
perienced gestalten to turn into particularly simple 
and clear structures: depending upon given con- 
ditions a gestalt will have either a maximum articu- 
lation of its parts, or it will be highly simplified. 
It is also implicit in these formulations that am- 
biguous conditions, which do not fully prescribe the 
perceptual outcome, produce a special direction 
toward the completion of gaps and the resolving of 
contradictions, so that all parts are determined by 
the structure of the whole. 

Although intended to have a general application, 
the principle of Prägnanz was oriented mainly to- 
ward facts of perception. Thus, Wertheimer sub- 
sumed the laws of grouping under it. Some thinkers, 
while accepting good continuation and closure as 
clear illustrations of Prägnanz, have wondered in 
what sense the latter describes grouping by prox- 
imity or similarity. However, in gestalt theory the 
tendency toward Prägnanz is not one process among 
others, but is inherent in all processes and products. 
Accordingly, one would say that grouping per se 
introduces regularity and simplicity and that to 
group in accordance with proximity and similarity 
is to have a radically simpler phenomenal field than 
absence of grouping would yield. This formulation 
admittedly renders concrete investigation difficult; 
accordingly, investigators have sought empirical 
tests to compare actually obtained “good form” with 
other alternatives that are logically possible under 
the same stimulus conditions. While it has not been
possible to establish unambiguous criteria for good
form, a great deal of selectivity has been demon- 
strated: where stimulation is compatible with a 
nearly infinite number of different experienced 
events, only a limited number—often only one or 
two—will actually be realized. 

There is, however, a set of phenomena that 
argues convincingly for a principle of maximum 
simplicity in perceptual experience: where a given 
stimulus array can be organized in alternate ways, 
that organization is realized which yields a con- 
stant, rather than an ever-changing, perceptual 
object. A black circle moving across a homogeneous 
white ground is perceived as such, while the alter- 
native organization, namely, a succession of black 
areas, each emerging from and returning to the 
white ground, is not realized. Phenomenal identity 
as studied by Ternus (1926) and Wertheimer also 
demonstrates a powerful preference to maintain a 
constant organization when one views a moving 
form. Another important instance of this tendency 
is the kinetic depth effect that Wallach and his 
associates have described (Wallach & O'Connell 
1953). When an observer follows the deforming 
shadow of a slowly rotating object on a screen, he 
organizes the successive views so as to see a rigid 
shape moving in the third dimension; he does not 
see changing two-dimensional shapes. Wallach 
(1940) has also shown that in sound localization 
with head movement, one tends to perceive the 
sound as coming from a stationary source rather 
than from a moving source. The perceptual con- 
stancies may also be considered instances of the 
same tendency. [See PERCEPTION, article on PER- 
CEPTUAL CONSTANCY.] 

In areas other than perception the principle of 
Prägnanz has in general resisted conclusive inves- 
tigation or clear definition. An early hypothesis of 
Wulf (1922) that memory traces undergo changes 
in the direction of greater regularity and simplicity 
has not been supported. In thinking, the discovery 
of a solution often marks a transition to a simpler 
structure, but this formulation lacks explanatory 
power. Equally rudimentary is our present under- 
standing of tendencies in action toward the com- 
pletion of situations that contain a gap, in accord- 
ance with the demands of the given structure. In 
social psychology the prevalence of extremely sim- 
plified views of groups and of public issues is a 
striking fact, but investigation has not advanced 
sufficiently to be theoretically relevant. A some- 
what nearer approach is the study initiated by 
Heider (1958) of preferences for balanced con- 
figurations of interpersonal relations, and of the 
tendency to convert unbalanced configurations into
a preferred form. Despite the difficulties, interest
in the principle of Prägnanz persists and continues 
to be a concern of investigation. 
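
The notion of a balanced configuration can be made concrete with a short sketch.
It assumes the sign-product rule used in later signed-graph formalizations of
balance, which this article does not itself state: each of the three relations
in a triad is coded +1 (positive) or -1 (negative), and the triad counts as
balanced when the product of the three signs is positive. The function names
`is_balanced` and `balanced_alternatives` are hypothetical.

```python
def is_balanced(pq, qr, pr):
    """Return True when the product of the three relation signs is positive."""
    for sign in (pq, qr, pr):
        if sign not in (1, -1):
            raise ValueError("each relation must be coded +1 or -1")
    return pq * qr * pr > 0

def balanced_alternatives(pq, qr, pr):
    """List the triads reachable from an unbalanced one by reversing one relation."""
    if is_balanced(pq, qr, pr):
        return []  # already in a preferred form; nothing to convert
    return [(-pq, qr, pr), (pq, -qr, pr), (pq, qr, -pr)]

# Two people who like each other but disagree about a third party.
triad = (1, 1, -1)
print(is_balanced(*triad))
print(balanced_alternatives(*triad))
```

Under this rule, reversing any single relation of an unbalanced triad yields a
balanced one, which is one way to picture the conversion toward a preferred
form; which relation actually changes is an empirical matter the sketch does
not address.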

Nativism. Gestalt theory holds that organiza- 
tion in accordance with general principles of physi- 
cal dynamics is present from the start in psycho- 
logical functioning. This position leaves wide scope 
for unlearned processes. At the same time, the 
widespread view that gestalt theory underestimates 
the effects of past experience is oversimplified. It is 
more important to note that the concept of organ- 
ization determines the treatment of both unlearned 
and learned functions. Gestalt theory refers un- 
learned operations mainly to relationally deter- 
mined physicochemical processes rather than to the 
action of specific anatomical structures. Similarly, 
it holds that the effects of past experience are also 
products of organization, or determined by struc- 
tural requirements. 

The treatment of form perception brings this 
position into sharp relief. From the standpoint of 
gestalt theory, unlearned organizing principles de- 
termine the perception of form, and past experi- 
ence cannot exert an influence until the sensory 
processes are organized. It thus takes issue with the 
assumption of the empiricists that visual perception 
consists originally of a mosaic of sensations and 
that learning transforms them into shaped visual 
objects. There is in fact substantial evidence that 
some form perception is unlearned. Figure-ground
articulation is governed by an unlearned function, 
as is articulation in accordance with good form. 
The selective principles according to which sensory 
data are organized in perceptual grouping, in per- 
ception of visual motion, or in the perception of 
identity are similarly independent of specific learn- 
ing. Other perceptual functions also appear to be 
unlearned, among them stroboscopic motion, bright- 
ness contrast, and distance perception in lower 
organisms. Particularly instructive are observations 
of perceptually ambiguous situations; the fact that 
observers markedly agree in favoring one outcome 
over another indicates that the processes which 
order perception into coherent entities are not prod- 
ucts of specific learning. There is also evidence that 
spontaneous organization of form often takes effect 
before an influence of past experience can occur. 

The assumption of unlearned organizing prin- 
ciples does not, however, imply neglect of the his- 
tory of past stimulation. To a degree this becomes 
evident in the treatment of temporal organization 
in perception. Gestalt psychology has from the start 
considered perceptual organization of temporally 
extended events equal in importance with that of 
simultaneously given data. The role of successive 
stimulation in apparent motion was the first phe-
nomenon that Wertheimer formally investigated,
and he illustrated the principles of grouping with 
melodies as well as with static visual forms. Fur- 
ther, he identified one principle of grouping, that 
of objective set (objektive Einstellung), which di- 
rectly concerns the effect of membership in a tem- 
poral sequence upon the phenomenal characteristics 
of a given structure. In these cases, as in the work 
of Michotte on phenomenal causality, perceptual 
organization depends directly on prior stimulation. 
[See TIME, article on PSYCHOLOGICAL ASPECTS.] 

In addition, recent advances in investigation have 
brought about a further development in gestalt 
thinking, as in psychology generally, pointing to 
the conclusion that a history of prior stimulation 
is a necessary condition of adequate perceptual 
functioning. Thus the investigation of figural after-
effects by Köhler and Wallach (1944) demonstrates
that all perceptual experiences at a given moment 
are in some important respects a function of what 
a person has experienced in the past. The import 
of this conclusion is that the perceptual system re- 
quires not only an adequate amount of previous 
stimulation but particular kinds and distributions 
of stimulation. [See PERCEPTION, article on ILLU- 
SIONS AND AFTEREFFECTS.] 

The work of Helson (1964) on the organization 
of data across time illustrates the continuity of 
current investigation with gestalt contributions. 
The “adaptation level” demonstrates that tempo- 
rally separated stimulations interact, so that the 
phenomenal intensity of a given stimulation is 
lawfully determined by preceding stimulation. In 
addition to establishing that perception depends 
upon the articulation of a stimulus array in the 
temporal dimension, it also demonstrates that the 
same datum may be obtained from different struc- 
tures, each of which determines its subjective 
values. This line of inquiry brings to the fore the 
importance of sensitivity to successive arrays of 
stimulation. 

Furthermore, gestalt psychology does not ques- 
tion the more customary effects of past experience 
upon perception. Indeed, it finds a definite place 
for them and has formulated the rudiments of a 
memory-trace theory to account for them (Koffka 
1935). Gestalt psychologists have opposed unex- 
amined and ad hoc assumptions about the effects 
of past experience that were introduced to bolster 
the atomistic position. Such claims, they main- 
tained, require proof and an account of the oper- 
ations by which past experience exerts its effects. 
More specifically, they have combated an elemen- 
taristic concept of past experience and insisted that 
unorganized experience cannot organize percep-
tion. Accordingly, they have proposed that the or-
ganization of form in past experience can reorgan- 
ize a percept by means of memory-trace contact. 
The demonstration of a memory effect of three- 
dimensional form perception by Wallach (Wallach 
et al. 1953) provides a clear illustration. A pattern 
which has previously been experienced as three- 
dimensional on the basis of appropriate depth cues 
will subsequently be seen as three-dimensional in 
the absence of these cues; in this case the outcome 
is dependent upon past organization. [See PERCEP-
TION, article on DEPTH PERCEPTION.]

Gestalt psychology leaves vast room for past ex- 
perience and education in its treatment of thinking 
and learning. At the same time, it places emphasis 
in these areas on processes that have not previously 
occurred. These draw upon materials of the past, 
but the organizations that occur are not exclusively 
products of past experience. A chimpanzee needs 
relevant past experience with the functional prop- 
erties of sticks, bars, and food in order to solve 
problems, but the organization of these experiences 
is a new step. Similarly, a child’s understanding of 
a relation such as that of transitivity requires 
knowledge of the terms in question. The latter ex- 
ample raises a new question—namely, whether 
simple logical operations are learned. The implica- 
tion of the gestalt position is that once the facts 
are given, the mental operations of creating a struc- 
ture from them and of reading them off ensue 
directly. Such relations are mental products; they 
are not given as facts about the environment. 
These formulations are consistent with observa- 
tions, such as those of Piaget, that a given level of 
maturation is necessary before a child can grasp a 
relation such as transitivity, that he must pass 
through earlier intellectual stages before he is 
capable of handling it. Other developmental studies 
of the recent period raise a different problem. Har- 
low has shown that infrahuman primates require 
prolonged experience before they can handle rela- 
tions such as those of oddity and reversal. Do these 
findings demonstrate that the relations in question 
are “learned”? An alternative that must be con- 
sidered is that past experience is necessary in order 
to find out what relation is important, but that the 
relation per se is not learned. [See DEVELOPMENTAL
PSYCHOLOGY, article on A THEORY OF DEVELOP-
MENT; INTELLECTUAL DEVELOPMENT.]

Phenomenological method. Gestalt theory is 
phenomenologically oriented. It assigns a place of 
crucial importance in psychological inquiry to the 
data of immediate experience. They are part of its
subject matter and therefore require explanation;
in addition, they are indispensable as a basis for
the construction of theory. This position has a gen- 
eral affinity with the phenomenological tradition 
represented in the modern period by Husserl and 
is in sharp contrast to the direction of behavior- 
ism. [See PHENOMENOLOGY and the biography of 
HUSSERL.] 

There is an obvious relation between the char- 
acter of phenomenal events and the initial formu- 
lations of gestalt psychology. In the area of percep- 
tion the primary concern has been to explain why 
things look, sound, and feel as they do; the phe- 
nomenal facts are in this case the facts requiring 
explanation. More specifically, the observation that 
wholes and whole qualities are given in immediate 
perceptual experience was the basis for the rejec- 
tion of elementarism and for the adoption of the 
concept of part-whole determination. Similarly, in
thinking, the presence of understandable relations
and of reorganization serves as a primary observation
and as first steps toward an eventual theory. This 
is equally the case in social psychology, where a 
clarification of the ways in which persons compre- 
hend a given situation often provides crucial infor- 
mation about the phenomena to be studied and a 
basis for further exploration. The work of Heider 
(1958) is an example of the possibilities of a phe- 
nomenological procedure in the study of interper- 
sonal relations. Finally, it is hard to conceive of a 
psychology of aesthetics that ignores direct experi- 
ence [see AESTHETICS]. 

Although the explanatory concepts that gestalt 
psychology seeks are not themselves phenomenal 
facts, it assumes an intimate relation between them. 
It holds, first, that behavior cannot be adequately 
explained without reference to central processes 
and that these may be represented in immediate 
experience. Second, the postulate of psychophysical 
isomorphism suggests that an examination of what 
is phenomenally given can be both a source of 
hypotheses about neural events and a testing 
ground for them. From this perspective psychology 
possesses, in contrast to the natural sciences, a 
unique access to central processes. As Albert Ein- 
stein is reported to have once remarked in connec- 
tion with this issue: “If atoms could talk about 
their internal processes I would not believe every- 
thing they say, but I would certainly listen.” 

The main requirement of phenomenological ob- 
servation is to render an unprejudiced report of 
immediate experience under specified conditions. 
This general rule at once differentiates phenome-
nology from analytic introspection, which was
dominated by a prior theory about the character
of the events to be described. The phenomeno-
logical observer must be open to his experiences
as they appear to him, independently of prior 
beliefs or assumptions about them and should not 
exclude what is strange or contrary to preconcep- 
tions. When viewing a visual contour, he will not 
allow his knowledge to deter him from noting that 
it appears in front of the ground, while the ground 
itself appears unbroken in the area that the figure 
occupies, or that the contour delineates the figure 
and not the ground. In turn the investigator will 
consider the explanation of these facts necessary 
to a theory of figure-ground organization. Although 
this mode of observation is not quite naive and 
requires cultivation, its aim is natural observation. 
The import of phenomenology as a psychological 
procedure is that it restores everyday experience, 
with its qualitative diversities, to psychology. 

Phenomenal facts would be of reduced import 
if they were unconnected with action. In fact, the 
connection is close; persons act in a situation in 
accordance with the ways they perceive, feel, and 
think about it. If a person misperceives a situation, 
he acts in accordance with his misperception, not 
on the basis of conditions as they are. It is, conse- 
quently, of the utmost importance to relate action 
to the cognitive apprehension of given conditions. 
However, in the absence of a coherent rationale 
the relation between direct experience and action 
reduces essentially to a correlation between incom- 
mensurable data. From the standpoint of gestalt 
psychology the phenomenal facts are, as stated 
earlier, the most direct, although partial, expres- 
sions of those mediating processes that steer action. 
These processes are organized representations of 
external and internal conditions. Interposed be- 
tween stimulation and action, they have the status 
of cognitive representations, comprising relations 
and systems of relations between facts, between 
means and ends, and between grounds and conse- 
quents. Since they steer action, they have the status 
of causes and are necessary to the prediction of 
action. Phenomenal facts—percepts, ideas, hypoth- 
eses, inferences—are more closely related to these 
central processes than to any other events, and 
therefore, they throw light on the events that issue 
in action. The cognitive analysis that gestalt psy- 
chology has pursued in different areas of human 
psychology and that has opened new paths for 
investigation derives from these assumptions. 

At the same time gestalt theory does not favor a 
purely phenomenological psychology. Central proc- 
esses are more inclusive and continuous than phe- 
nomenal events; therefore, the latter alone cannot 
provide the basis for a coherent science. In all areas 
of psychology, including those of perception and 
thinking, there are functional relations that are
inaccessible to phenomenology. Other areas are at 
best poorly represented at the phenomenal level; 
habits and attitudes are formed unawares; the for- 
mation of associations and the operations of reten- 
tion are not open to inspection; and there are 
always factors outside the phenomenal field that 
determine action. These considerations point to the 
need for inductive procedures in psychology; they 
do not, however, justify the neglect of direct experi- 
ence when it is accessible. 

The importance that gestalt theory attaches to 
immediate experience rests on a far-reaching ex- 
amination that cuts across generally accepted cate- 
gories. It distinguishes sharply between phenomenal 
and functional facts, the latter referring to events 
that are outside immediate experience and that are 
the object of scientific inquiry. Direct experience 
includes both objective facts, such as rocks and 
animals, and subjective facts, such as wishes and 
pains; indeed, the distinction between objective 
and subjective is one that occurs within the phe- 
nomenal domain. Thus, secondary and tertiary 
qualities are often phenomenally highly objective, 
since they appear located in specific objects in 
outside space. 

Consequently, direct experience is a condition of 
all scientific inquiry. Observation consists in the 
first instance of phenomenal facts; it begins with 
what is seen, heard, touched. At no point do the 
procedures of science eliminate these phenomenal 
components. The most refined observations, such 
as those represented by pointer readings, are per- 
ceptual situations that depend upon the identifica- 
tion of units, the perception of motion, and the 
discrimination of positions. Moreover, the theoreti- 
cal constructions erected to explain observations, 
including the rules of logic and mathematics—in 
general, the structures of inference and proof nec- 
essary to scientific activity—are also strictly phe- 
nomenal events. There is thus a necessary continuity 
of direct experience with the concepts and proce- 
dures that science fashions; ultimately all concepts 
have a phenomenal base. The scientist's picture of 
the world is based on the achievements of percep- 
tion and thinking; his choice of units of observation 
is based on his perception of form, and his infer- 
ences must obey logical demands and the cognition 
of causal relations. Although the investigator who 
concentrates on a particular problem must take this 
background for granted, it cannot be ignored in a 
systematic account of the character of scientific 
investigation. 

These considerations throw a new light on the 
distrust of phenomenal data in behavioristic psy-
chology. Early behaviorism rejected direct experi-
ence in favor of objective data on the ground that
the former is private and inaccessible to public 
check and, therefore, cannot form part of the body 
of scientific knowledge. Since one cannot place ex- 
periences side by side and compare them, behavior- 
ists concluded that one can observe only the 
behavior of another, not his experiences. However, 
the assumption that objective data are free of phe- 
nomenal components was based on a naive realism 
that failed to consider the role of the observer. The 
behaviorist as investigator reports his observations 
and inferences—that is, the contents of his phe- 
nomenal field. It is therefore hardly consistent on 
his part to rule out the reports of other observers 
simply because these observers happen to be his 
subjects. More recently, behaviorists have accepted 
that the data of science are based on phenomenal 
reports but have proposed to include only those 
observations which command agreement among in- 
dependent observers. It is generally accepted that 
observations on which qualified observers can agree 
have a special standing in science. However, the 
implication that phenomenally subjective events 
are generally untrustworthy and ipso facto fail to 
command agreement cannot be sustained. [See 
OBSERVATION.]

There are, to be sure, sources of error to be 
guarded against and difficulties to be resolved when 
one includes the reports of observers in a system 
of functional concepts. No doubt it may be more 
difficult to find the appropriate referents to the 
report “I am feeling dizzy and confused” than to 
the report “The rat depressed the bar five times.” 
Nevertheless, the methodological purism of the be- 
havioristic position has in important respects had 
negative effects. The proposals to reduce perception 
to overt discriminative responses and to treat the 
reports of persons as “verbal behavior” have not 
been fruitful. In fact, they marked an abandon- 
ment of interest in phenomena and problems that 
are most specifically and significantly human. They 
also discouraged, because they failed to realize, the 
constructive contribution of free observation and 
description in the discovery of new phenomena. 
Consequently, they underestimated the creative 
phases of scientific activity that precede proof and 
that set the goals for proof. These attitudes have 
had a profoundly restrictive effect on human 
psychology. 

The more serious reason for the behavioristic dis- 
trust of phenomenal data has been the belief that 
they are epiphenomena, with no place in a causal- 
explanatory scheme. Gestalt theory reserves judg- 
ment on the mind-body problem, while seeking a
bridge between direct experience and the concepts
of natural science; it sees in the clarification of 
this relation a challenge to psychology and to sci- 
ence generally. The distrust, bordering on aversion, 
of direct experience that behaviorism has demon- 
strated is also motivated by the attempt to imitate 
the natural sciences in their progressive elimina-
tion of phenomenal data. From the standpoint of
gestalt theory human experience is an important 
part of nature and too significant to be discarded. 
An account of human functioning that omits ref- 
erence to direct experiences is as incomplete as 
would be the description of a musical instrument 
that included all details of its materials, construc- 
tion, and functioning but made no mention of the 
music it produces. 

Relation to behaviorism. Certain other diver- 
gences between behaviorism and gestalt psychology 
have to do with the respective problems they have 
considered. Gestalt psychology grows out of studies 
in human perception and thinking, while behavior- 
ism takes its start from the study of infrahuman 
organisms. There are also diverse tendencies within 
behaviorism itself, some of which are closely con- 
nected with gestalt positions. Thus, a concern with 
the organization of central processes unites gestalt 
theory with investigators such as Lashley, and the 
work of Tolman exemplifies an approximation to a 
gestalt behaviorism in animal psychology [see the 
biographies of LASHLEY and TOLMAN].

However, the “stimulus-response” psychologies
constitute one variety of behaviorism, highly in-
fluential on the American scene, that is strongly
opposed to gestalt psychology. Stimulus-response
accounts of behavior are atomistic; although their
units are molar, they treat behavior as composed
of chains of stimulus-response units and changes
of behavior as the addition and elimination of
such units.

From the perspective of gestalt psychology, action 
is characterized by organization, and coordination 
of complex movements poses problems analogous 
to organization in perception. Accordingly it ques- 
tions the adequacy of accounts of action that em-
ploy isolated stimulus-response units. Further, it
holds that organized cognitive representations steer 
intelligent action in animals and man; sequences 
of chained units do not do justice to the operations 
of organizing and reorganizing that external con- 
ditions initiate. Also, the “stimulus” of stimulus-
response psychologies implicitly refers to perceptual
configurations, which are not, however, treated as
such.

These problems are magnified when one turns to 
human psychology. It is a significant feature of
the stimulus-response programs that they aspire to
base a psychology of man on concepts and methods 
derived exclusively from the study of infrahuman 
organisms. This aim presupposes that no problems 
or processes will be discovered that are unique to 
human functioning. Instead of freely examining 
human achievements and asking how one can ac- 
count for them, they attempt to fit observations to 
concepts derived from another area. The question 
arises whether concepts pertaining to one range of 
facts retain their meaning or relevance when ex- 
trapolated to a new area or whether they become 
mere labels for phenomena that have not been 
examined in their own right. 


Conclusion 


Gestalt theory was the first attempt within psy- 
chology to give a fundamental treatment to prob-
lems of wholes and part-whole relations. It was
productive of new discoveries and concepts; it gen- 
erated new questions and proved relevant to basic 
issues of psychology. Its contributions laid the 
foundations for the modern study of perception; 
it broke new ground in the investigation of think- 
ing, memory, and learning; it initiated new steps 
in social psychology. These achievements deeply 
affected the outlook of psychology, not least so 
when they provoked opposition. They spurred a 
sharpening of issues and the revision of alternative 
positions; there is little work of consequence in 
psychology that has been wholly untouched by 
gestalt ideas.

It is nevertheless questionable to conclude, as 
some students have, that the contributions of this 
movement have been fully absorbed and that it has 
died of success. It is more likely that its natural 
development was blocked when it was uprooted 
from its milieu in Europe following the Nazi catas- 
trophe. To this one must add that there were bar- 
riers to a full comprehension of its perspective and 
of its conception of science in the profoundly dif- 
ferent intellectual environment that prevailed in 
the United States. It is therefore more appropriate 
to stress that gestalt theory is not a completed sys- 
tem, that many of the issues it raised await resolu- 
tion, and that it might be best described as a 
program of investigation or a region of problems. 
Thus, there is as yet little understanding of the 
physiological foundations that gestalt theory sought 
for psychology, and the postulate of isomorphism 
remains a heuristic principle. Further, gestalt psy- 
chologists were selective with respect to the prob- 
lems they studied; in general they preferred those 
that lent themselves to exact investigation and 
clear theoretical decisions. Consequently, there are 
large areas to which it has not notably contributed,
among them developmental and abnormal psychol- 
ogy and the psychologies of personality, of lan- 
guage, and of action. Neither has it contributed 
directly to the psychology of motivation, with the 
exception of Lewin and his group, whose concepts 
are related to those of gestalt theory. At the same 
time, the formulations of gestalt theory contain im-
portant implications for these areas. [See ACHIEVE-
MENT MOTIVATION; FIELD THEORY; and the biog-
raphy of LEWIN.] 

Although there are no procedures peculiar to 
gestalt investigators, a distinctive style is evident 
in their formulations of problems and in their man- 
ner of studying them that reflects an implicit atti- 
tude toward the tasks of science. Perhaps first in 
order of importance is a sensitivity to the danger 
of distorting the subject matter by obeying prior 
prescriptions about the demands of scientific pro- 
cedure. It is a natural consequence of the phe- 
nomenological orientation of gestalt psychology to 
think it presumptuous to expect phenomena to con- 
form to rules that precede observation. Connected 
with this starting point is the belief that faithful 
observation is an essential step to explanation. Con- 
sequently, gestalt psychology accords a place of 
importance to qualitative observation and, while 
welcoming exact experimentation, rejects the view 
that measurement is the sole source of valid evi- 
dence. Indeed, some of its own most significant 
discoveries were essentially systematic demonstra- 
tions. One consequence of the more customary as- 
sumption is that a phenomenon is not considered 
important if it cannot be studied experimentally; 
it is, however, more correct to say that experimen- 
tation is only one kind of observation. Indeed, there 
is danger in stressing experimentation before essen- 
tial questions have been clarified—the danger of 
aimless experimentation. 

An equally prominent theme of gestalt theory is 
that clarification of fundamentals requires atten- 
tion to fairly large areas of phenomena. This be- 
comes evident in gestalt investigations of particular 
questions, as well as in its broad conception of 
psychology. It specifically included facts of logic, 
ethics, and aesthetics as part of the subject matter 
of psychology at the same time that it sought to 
establish contact with the concepts of natural sci- 
ence. This approach questions whether the study 
of ever more detailed problems in special fields will 
yield knowledge of psychology as a whole. Psychol- 
ogy still has to discover its fundamentals; exclusive 
attention to those questions that can be studied in 
exact ways may ignore important phases of the 
subject and may even lose sight of the realization
that the discovery of fundamentals is urgently
needed. Indeed, facts in a limited area may be 
seriously misinterpreted, however carefully they 
are studied, if their reference to a wider scheme 
remains obscure. 

A sense of the human and philosophical import 
of psychology pervades gestalt writings. Their cri- 
tique of arbitrary dissection, of focusing on narrow 
facts, was based on technical considerations; they 
also expressed a concern about the destructive im- 
plications of such procedures for a conception of 
man. Science is a way to clarify major issues of 
mankind. If psychology eliminates or distorts essen- 
tial facts and if these are represented as the main 
evidence about his nature, man himself will appear 
of little account. Gestalt theory questioned the as- 
sumption that certain convictions prevalent in 
psychology were scientifically derived. The denial 
of understanding, the view that actions and judg- 
ments are in principle subjectively determined, the 
treatment of value in terms of neutral facts, and 
the consequent rejection of valid value did not, it 
claimed, necessarily follow from scientific thinking. 
There is also a quality of moral optimism in gestalt 
psychology, which some may consider extrascien- 
tific if not antiscientific. This charge cannot be 
easily leveled against a movement that strictly op- 
posed the approach of vitalism to facts of life and 
that sought to find a basis for psychology in natural 
science. The charge itself may be an expression of 
moral pessimism. 

SOLOMON E. ASCH


[Directly related are the entries FIELD THEORY and 
THINKING, article on COGNITIVE ORGANIZATION AND 
PROCESSES. Contrasting approaches to behavioral 
phenomena are discussed in FORGETTING; LEARNING, 
especially the articles on CLASSICAL CONDITIONING, 
INSTRUMENTAL LEARNING, REINFORCEMENT, and 
DISCRIMINATION LEARNING; PSYCHOANALYSIS. Other 
relevant material may be found in AESTHETICS; 
ATTITUDES; GROUPS; PERCEPTION; PHENOMENOL- 
OGY; PROBLEM SOLVING; SOCIAL PSYCHOLOGY; SYS-
TEMS ANALYSIS, article on PSYCHOLOGICAL SYSTEMS;
THINKING; and in the biographies of HUSSERL; KATZ;
KOFFKA; KÖHLER; KÜLPE; WERTHEIMER.]


BIBLIOGRAPHY 
PRIMARY SOURCES 


ARNHEIM, RUDOLF 1954 Art and Visual Perception. 
Berkeley: Univ. of California Press. 

ASCH, SOLOMON E. (1952) 1959 Social Psychology. En-
glewood Cliffs, N.J.: Prentice-Hall.

ASCH, SOLOMON E. 1960 Perceptual Conditions of Asso-
ciation. Psychological Monographs 74, no. 3.

DUNCKER, KARL (1935) 1945 On Problem-solving. Psy-
chological Monographs 58, no. 5. > First published as
Zur Psychologie des produktiven Denkens.

EHRENFELS, CHRISTIAN VON 1890 Über “Gestaltquali-
täten.” Vierteljahresschrift für wissenschaftliche Phi-
losophie und Soziologie 14:249-292.

ELLIS, WILLIS D. (1938) 1950 A Source Book of Gestalt
Psychology. London: Routledge. > Contains excerpts 
from the technical contributions of gestalt psychology 
which were first published in Psychologische For- 
schung between 1921 and 1938. 

HEIDER, FRITZ 1958 The Psychology of Interpersonal
Relations. New York: Wiley. 

HELSON, HARRY 1964 Adaptation-level Theory: An Ex-
perimental and Systematic Approach to Behavior. New 
York: Harper. 

HENLE, MARY (editor) 1961 Documents of Gestalt Psy-
chology. Berkeley: Univ. of California Press.

HENLE, MARY 1965 On Gestalt Psychology. Pages 276-
292 in Benjamin B. Wolman and Ernest Nagel (edi- 
tors), Scientific Psychology. New York: Basic Books. 

KATONA, GEORGE (1940) 1949 Organizing and Memoriz-
ing: Studies in the Psychology of Learning and Teach- 
ing. New York: Columbia Univ. Press. 

KATZ, DAVID (1911) 1935 The World of Colour. Lon-
don: Routledge. > First published in German as Die
Erscheinungsweisen der Farben und ihre Beeinfluss-
ung durch die individuelle Erfahrung. A revised and
enlarged edition was published in 1930 as Der Aufbau 
der Farbwelt. 

KOFFKA, KURT (1921) 1928 The Growth of the Mind:
An Introduction to Child-psychology. 2d ed., rev. New
York: Harcourt. > First published as Die Grundlagen
der psychischen Entwicklung: Eine Einführung in die
Kinderpsychologie. The first treatment of questions of 
learning and education from the perspective of gestalt 
psychology. 

KOFFKA, KURT 1922 Perception: An Introduction to the
Gestalt-theorie. Psychological Bulletin 19:531-585.

KOFFKA, KURT 1935 Principles of Gestalt Psychology.
New York: Harcourt. > The only comprehensive
treatment of gestalt psychology; by one of its leading
figures.

KOFFKA, KURT 1940 Problems in the Psychology of Art.
Pages 180-273 in Art: A Bryn Mawr Symposium, by 
Richard Bernheimer et al. Bryn Mawr Notes and 
Monographs, Vol. 9. Bryn Mawr (Pa.) College. 

KÖHLER, WOLFGANG (1917) 1956 The Mentality of
Apes. 2d ed., rev. London: Routledge. > First pub- 
lished in German. A paperback edition was published 
in 1959 by Random House. A renowned study of intel- 
ligence in apes and of the concept of insight. 

KÖHLER, WOLFGANG (1920) 1924 Die physischen Ge-
stalten in Ruhe und im stationären Zustand. Erlangen
(Germany): Philosophische Akademie. 

KÖHLER, WOLFGANG (1929) 1947 Gestalt Psychology.
Rev. ed. New York: Liveright. > A paperback edition 
was published in 1947 by New American Library. The 
first general exposition of the movement; a classic in 
the literature of psychology. 

KÖHLER, WOLFGANG 1938 The Place of Value in a
World of Facts. New York: Liveright. 

KÖHLER, WOLFGANG 1940 Dynamics in Psychology. New
York: Liveright. > A field theory in perception and
memory. A work that develops earlier ideas and fore- 
shadows subsequent directions of investigation. 

KÖHLER, WOLFGANG 1941 On the Nature of Associations.
American Philosophical Society, Proceedings 84:489- 
502. 

KÖHLER, WOLFGANG 1951 Relational Determination in
Perception. Pages 200-230 in Lloyd A. Jeffress (edi- 
tor), Cerebral Mechanisms in Behavior. New York: 
Wiley. 


KÖHLER, WoLFGANG 1958 Perceptual Organization and 
Learning. American Journal of Psychology 71:311- 
315. 

KÖHLER, WOLFGANG; and WALLACH, Hans 1944 Figural 
After-effects: An Investigation of Visual Processes. 
American Philosophical Society, Proceedings 88:269- 
357. 

LEWIN, KURT (1926-1933) 1935 A Dynamic Theory of
Personality: Selected Papers. New York: McGraw-Hill. 
> Papers first published in German. 

LEWIN, KURT (1939-1947) 1963 Field Theory in Social
Science: Selected Theoretical Papers. Edited by Dor- 
win Cartwright. London: Tavistock. 

MICHOTTE, A. (1946) 1963 The Perception of Causality.
London: Methuen. > First published in French.

OSGOOD, CHARLES E. (1953) 1959 Method and Theory
in Experimental Psychology. New York: Oxford Univ.
Press.

RUBIN, EDGAR (1915) 1921 Visuell wahrgenommene
Figuren: Studien in psychologischer Analyse. Copen-
hagen: Gyldendal. > First published in Danish.

TERNUS, JOSEF 1926 Experimentelle Untersuchungen
über phänomenale Identität. Psychologische Forschung
7:81-136. 

WALLACH, H. 1940 The Role of Head Movements and
Vestibular and Visual Cues in Sound Localization.
Journal of Experimental Psychology 27:339-368.

WALLACH, H.; and O’CONNELL, D. N. 1953 Kinetic
Depth Effect. Journal of Experimental Psychology 45:
205-217.

WALLACH, H.; O’CONNELL, D. N.; and NEISSER, U. 1953
The Memory Effect of Visual Perception of Three-
dimensional Form. Journal of Experimental Psychol-
ogy 45:360-368.

WERTHEIMER, MAX (1912-1920) 1925 Drei Abhand-
lungen zur Gestalttheorie. Erlangen (Germany): Philo-
sophische Akademie. > Contains three early contribu-
tions: “Über das Denken der Naturvölker” first pub-
lished in Volume 70 in the Zeitschrift für Psychologie;
“Experimentelle Studien über das Sehen von Beweg-
ung” published in Volume 61; and Über Schluss-
prozesse im produktiven Denken.

WERTHEIMER, MAX (1925) 1944 Gestalt Theory. Social
Research 11:78-99. > First published in German.

WERTHEIMER, MAX 1934 On Truth. Social Research
1:135-146.

WERTHEIMER, MAX 1935 Some Problems in the Theory
of Ethics. Social Research 2:353-367. 

WERTHEIMER, MAX 1937 On the Concept of Democracy.
Pages 271-285 in Max Ascoli and Fritz Lehmann 
(editors), Political and Economic Democracy. New 
York: Norton. 

WERTHEIMER, MAX (1945) 1961 Productive Thinking.
Enl. ed., edited by Michael Wertheimer. London: Tavi- 
stock. > Published posthumously; the product of a 
lifelong concern with problems of thinking, logic, and 
education. 

WULF, FRIEDRICH 1922 Über die Veränderung von Vor-
stellungen. Psychologische Forschung 1:333-373.


SECONDARY SOURCES

ALLPORT, FLOYD H. 1955 Theories of Perception and
the Concept of Structure. New York: Wiley.

BORING, EDWIN G. (1929) 1950 A History of Experi-
mental Psychology. 2d ed. New York: Appleton.

GUILLAUME, P. 1937 La psychologie de la forme. Paris:
Flammarion.

HEIDBREDER, EDNA 1933 Seven Psychologies. New York:
Appleton.




HENLE, MARY 1965 On Gestalt Psychology. Pages 276-
292 in Benjamin B. Wolman and Ernest Nagel (edi- 
tors), Scientific Psychology. New York: Basic Books. 

METZGER, WOLFGANG (1940) 1954 Psychologie. 2d ed.
Darmstadt (Germany): Steinkopff. 

PETERMANN, BRUNO (1929) 1932 The Gestalt Theory
and the Problem of Configuration. New York: Har- 
court. > First published as Gestalttheorie und das 
Gestaltproblem. 

PRENTICE, W. C. H. 1959 The Systematic Psychology of
Wolfgang Köhler. Volume 1, pages 427-455 in Sig- 
mund Koch (editor), Psychology: The Study of Sci- 
ence. New York: McGraw-Hill. 

SCHEERER, MARTIN 1931 Die Lehre von der Gestalt. Ber-
lin: Gruyter.

WOODWORTH, ROBERT S. (1931) 1964 Contemporary
Schools of Psychology. 3d ed. New York: Ronald. > 
See especially pages 214-250 on “Gestalt Psychology.” 


GIDDINGS, FRANKLIN H. 


Franklin Henry Giddings (1855-1931), a found- 
er of American sociological theory and research, 
was born in Sherman, Connecticut, of strict Puri- 
tan ancestry on both sides. His father was a Con- 
gregational minister. Giddings’ intellectual pre- 
cocity is shown by the fact that before entering 
college he had read extensively in the then highly 
controversial writings of Darwin, T. H. Huxley, 
John Tyndall, and Spencer. These authors, together 
with Adam Smith, Comte, and J. S. Mill, provided 
the foundation of his devotion to individualism—his 
basic philosophical position—and his primary socio- 
logical concepts. Graduating from Union College in 
1877, Giddings spent the next decade in newspaper 
work, mainly on the then famous Springfield Re- 
publican and Springfield Union. He lectured on 
political science and sociology at Bryn Mawr Col- 
lege from 1888 to 1894 and at Columbia University 
from 1891 to 1894. In 1894 he was appointed pro- 
fessor of sociology at Columbia; he was the first 
full-time professor of sociology in the United States. 

Among the founders of American sociology he 
occupies a strategic position as the most influential 
link in the transition of sociology from moral phi- 
losophy and the philosophy of history to inductive 
research. In this he was influenced by his close 
association with Richmond Mayo-Smith and by the 
concurrent rapid spread of statistical studies into 
related fields. Although he made no historically 
significant contributions to statistical research, he 
gave a powerful stimulus to quantitative studies of 
all aspects of community life through his lectures 
and books and his influence on hundreds of stu-
dents, including over fifty Ph.D. candidates.

Giddings’ basic philosophy was a combination of 
Comtean positivism and Spencerian evolutionism. 
He saw social evolution as part of cosmic evolution, 
as basically an equilibration of energy among indi-
viduals and groups that results in differentiation,
integration, segregation, and assimilation. He thus 
viewed every social order as always in a state of 
moving equilibrium, such equilibrium in power and 
status being essential for internal justice and order 
and, on a wider scale, for international peace. He 
considered sociology both a natural science and 
the basic elemental social science, giving an ac- 
count of the origin, growth, structure, and activities 
of human association through the operation of 
physical, biological, and psychological forces. At
times he thought sociology might become a quanti- 
tatively exact science. 

In his view, “social process,” or social life, results 
from the interaction of “primary causes,” the 
natural resources and accessibility of a given habi- 
tat, and the “secondary causes,” the human motives 
arising within society itself. Since all social energy 
is physical energy transmuted by means of eco- 
nomic activities, the “social composition,” or the 
number, density, and genetic heterogeneity of a 
given population, is determined by these primary 
causes. At the same time, harnessing these re- 
sources of food and power increases social dynam- 
ics and, hence, the processes of differentiation and 
integration, thus adding complexity to the “social 
constitution,” that is, the functional and purposive 
groupings, ranging from social clubs to sovereign
states. Giddings thus found the task of sociology to
be the integration of subjective with objective 
processes and concepts in terms of mental activity, 
organic adjustment, natural selection, and con- 
servation of energy. 

This basic position is generally sound if rich and 
accessible natural resources are considered primary 
only in the sense of being antecedent in time, or as 
essential, but culturally viewed, static precondi- 
tions for the development of a dense population 
and a highly dynamic social order, or if it is ac-
knowledged that both the manner and extent of
their utilization are dependent on the state of cul-
tural advancement. Giddings would probably agree,
since he saw all forms of association as “essentially 
phenomena of thought and feeling,” so that his
analyses of the stages of society’s evolution from
zoogenic to demotic forms, as well as his analyses 
of the social constitution and the social mind, were 
all couched in psychosocial terms. 

In The Principles of Sociology (1896) he de- 
clared that the original and elementary subjective 
fact of society is the “consciousness of kind,” or 
Adam Smith’s “sympathy,” plus a conscious recog- 
nition of likeness. In answer to critics who viewed 
consciousness of kind as merely another name for 
herd instinct or gregariousness, he sought re-
peatedly, but not convincingly, to show that gre-
gariousness is a purely hereditary “biophysical 
habit” to which man alone adds the purely psychic 
activity of conscious recognition of likeness and 
difference of kind. This raises the question of just 
where in the evolution of neural structures such 
recognition emerged, but in the light of studies in 
animal psychology there can be little doubt that 
it preceded primitive man. Giddings himself de- 
clared consciousness of kind to be dependent on 
in-group communication and to result in mutual 
aid, both of which traits were well developed among 
the higher apes. 

Although Giddings continued to argue this issue, 
in the outline of his “system” in Studies in the 
Theory of Human Society (1922, p. 292), he made 
“pluralistic behavior,” or like response of two or 
more persons to a given stimulus, antecedent to, 
and the basis of, consciousness of kind. He declared 
“interstimulation and response” among individuals 
to be the causal basis of all collective behavior. 
Social phenomena thus result from two variables: 
the psychosocial situation and the pluralistic re- 
sponse to it. Hence he offered a new definition: 
“Pluralistic behavior is the subject-matter of the 
psychology of society, otherwise called sociology” 
(1922, p. 252). 

Thereafter, he held that “like” response to an 
idea, symbol, or group value ensures group soli- 
darity, social control, and concerted volition. More- 
over, since “unlike” response to an ideal or purpose 
leads to segregation, competition, rivalry, and con- 
flict among social groups, the internal solidarity 
and concerted volition of every group rests upon an 
awareness of likeness in mental attitude. Differ- 
ences in the speed, intensity, and duration of re- 
sponse give rise to processes of differentiation and 
integration, to leadership and followers, the emer- 
gence of a guiding “protocracy” of more able mem- 
bers, the division of labor, and to social status. Dif- 
ferences in the intensity of consciousness of kind 
result in wide variation in the permanence of as- 
sociation and the scope of cooperative activities: 
the range extends from family to political party. 
Like and unlike responses result in an endless 
variety of social groupings, serving varied human 
purposes, from bridge clubs to nations and inter- 
national alliances, each of which by the same 
process develops its own folkways, mores, and in- 
stitutions. It seems safe to say that no elementary 
subjective sociological facts have been found that 
supersede in aptness and universality Giddings’ 
pluralistic response and consciousness of kind. 

Giddings adopted Lester Ward’s distinction be-
tween social genesis and social telesis. Like Spencer
and Ward he saw a teleological beneficence in the
evolutionary processes, aided by increasing scien-
tific knowledge. However, his utopianism was re- 
strained. In the matured views of The Scientific 
Study of Human Society (1924) he argued that the 
only means available for “societal engineering” are 
the same as those now used by religious, educa- 
tional, economic, and political agencies in current 
social adjustments, so that the processes of societal 
engineering are the same as those of social genesis. 
Indeed, the envisaged ends of telic action are the 
results of the efficient causes operating in social 
genesis, so that telesis is “a conditioned and pro- 
jected genesis.” Consequently, the fruits of telic 
efforts are more or less the same as the inevitable 
consequences of the genetic processes. Neverthe- 
less, he envisaged the ends of these processes to be 
the development of more able, more tolerant, and 
more cooperative members of society, as well as 
an enhancement of social welfare. 

These are obviously logical conclusions from his 
basic philosophy. Moreover, while the language is 
different, the meaning harmonizes, in general, with 
that of such cultural determinists as A. L. Kroeber 
and L. A. White. Giddings saw society caught up 
in cosmic evolutionary processes; Kroeber and 
White see it caught up in a self-determining cul- 
tural stream. Giddings, however, manifested a 
latent utopianism in assuming that the realization 
of his own ideals was the assured end of cosmic 
forces. 

His lectures, articles, and books abound in shrewd 
insights into human nature and social problems 
and processes. As a whole, they represent a pivotal
point in the development of American sociology. 
While his own terms did not win wide usage, they 
contained the essence of the “social distance,” the 
“social interaction,” and the “social situation” of 
later works. 

At the same time, his somewhat questionable 
distinction between “social” psychology as defining 
individual reactions and “societal” psychology as 
defining group reactions led to his ignoring almost 
entirely the then current advances in dynamic and 
social psychology. Far more enigmatic is his neg- 
lect of the concurrent development of cultural con- 
cepts and theories in anthropology. Spencer, his 
master, had added the “superorganic” as the third 
distinctive realm of phenomena; Giddings had de- 
clared in his Descriptive and Historical Sociology 
(1906, pp. 176, 183) that the past products of 
interstimulation and response become “the most 
immediate and most important stimuli in modern 
social life,” but he did not develop the implication 
of this principle in cultural terms. 
[For the historical context of Giddings’ work, see the
biographies of COMTE; DARWIN; MILL; SPENCER;
WARD, LESTER F.]


GIERKE, OTTO VON

The German jurist Otto von Gierke (1841-1921)
was born in Stettin, the son of a Prussian official. 
He was reared in a highly respectable, patriotic, 
and Prussian atmosphere. As a student at the Uni-
versity of Berlin, he was influenced by Georg
Beseler, a jurist of the Germanist school, who had 
already sketched and was teaching the idea of a 
purely German theory of associations (Genossen- 
schaftstheorie). After professorships at Breslau 
(1872-1884) and Heidelberg (1884-1887) Gierke 
succeeded to Beseler’s chair at Berlin, which he 
occupied until his death. 

At the beginning of Gierke’s career, German 
legal scholarship was dominated by the Romanist 
school of Savigny; but Gierke began and remained 
a staunch Germanist. The Germanists, like the 
Romanists, were historically minded; their re- 
search, however, did not take them back to the 
Roman Empire, Justinian’s Code, and the Recep- 
tion, but followed the path marked out by Jacob 
Grimm to the law of the ancient German Mark 
and the Gemeinde (local community), to feudal 
records, town charters, and the rules of guilds, in 
search of “truly German” legal principles. The first 
volume of Gierke’s Das deutsche Genossenschafts- 
recht (1868-1913), dedicated to Beseler, was the 
first product of his self-imposed task of broaden- 
ing the foundation for a German theory of associa- 
tions by a detailed study of successive types of 
organizations in German history. This task, dili- 
gently pursued through much of his life, was not 
quite completed when, in 1913, he published the 
fourth and last volume of his most famous work. 

He temporarily abandoned historical research 
for more immediate problems in 1888, when the 
first draft of the new civil law code disappointed 
and challenged the Germanists. Gierke wrote a 
series of critical articles and, when he and his fel- 
low Germanists failed to obtain substantial modi- 
fications of the code, settled down to his second 
major task. Convinced that the materials of a com- 
mon German law existed and that legal progress 
could come only through developing deep-rooted 
German traditions and weeding out Romanist im- 
ports, he felt a solemn obligation “to penetrate the 
new code with a Germanistic spirit; to develop its 
Germanic content upon an historical basis; to 
foster the growth of its Germanism in the future” 
(1868-1913, vol. 4, p. xii). The first volume of 
his Deutsches Privatrecht (“German Private Law,” 
1895-1917), on Personenrecht (“Law of Persons,” 
1895), was followed by a second on Sachenrecht 
(“Law of Things,” 1905) and a third on Schuld-
recht (“Law of Obligatory Relations,” 1917). Their
important impact on German civil law is generally 
acknowledged. 

Gierke’s influence on legal and political theory 
derived from his historical and systematic analyses 
of associations. The four volumes of Das deutsche
Genossenschaftsrecht trace the changing forms of
groups through four periods of history. In the 
earliest period of German law the free association 
(freie Genossenschaft) was predominant; it was 
based on natural coherence; all rights remained 
with the members collectively and no corporate ex-
istence of the Genossenschaft was postulated. In
the second period (800-1200) the lordly union 
(herrschaftlicher Verband) was predominant; in 
this, rights attached to a single individual (e.g., 
a king or feudal lord) who represented the legal 
unity of the group. In the third period (1200- 
1525), which interested Gierke most, a new type 
of association, the free union (freie Vereinigung or 
Einung), became predominant. Growing out of the
free will of its associates, the free union neverthe- 
less resembled the old Germanic association in that 
the law, rights, and duties of the association were 
attributed to the collective membership. The guild 
was the purest example of this type, but it also 
appeared in towns, leagues of towns, and many 
other associations of varying importance, perma- 
nence, and respectability. The principle of free 
union, plus that of the federal organization of 
associations into larger unions, as illustrated by the 
Hanse, at one time gave promise, Gierke thought, 
that the feudalized Empire would be reconstituted 
as a federation. But rural areas continued to be 
dominated by feudal relationships; once-free as- 
sociations became privileged corporations; finally, 
a new authoritarian principle triumphed, exempli- 
fied in the sovereign state, conceived as separate 
from and above the people and as the exclusive 
embodiment of the common interest. By the end of 
the fourth period (1525-1806), the absorption or 
dissolution of privileged corporations by the state 
and the establishment of individual liberty and 
equality before the law had opened the way for a 
rich development of free associations with all the 
varied and complex characteristics that modern 
associations have. 

The old German association, Gierke explained, 
had had no clearly defined theory; Germanic con- 
ceptions implicit in the legal characteristics of the 
manifold associations of the thirteenth, fourteenth, 
and fifteenth centuries failed to reach explicit 
formulation. The competition, in legal theory, of 
German and Romanist ideas was paralleled by the 
competition, in publicistic thought, of “truly medi-
eval” and “antique-modern” tendencies. In the de-
velopment of medieval association law, Gierke dis-
cerned manifestations of a German tendency to 
construe each group as a purposive entity that 
acted as a whole and, as a whole, was the subject 
of rights and obligations. This tendency, however,
never reached its logical conclusion, a concept of
the group as a real person. As association law was 
finally formulated by canonists and postglossators, 
German views were submerged by Romanist influ-
ences: the association was construed as an insti-
tution (Anstalt), whose legal existence derived
from a grant of powers by superior authority and
whose Rechtssubjektivität was located in an arti-
ficial personality (persona ficta) constructed by
positive law. Similarly, “truly medieval” political 
thought, which conceived society as a complex 
structure of mutually articulated group-entities, 
each with its own purpose, group-law, and organic 
unity, was defeated by “antique-modern” ten-
dencies which, progressively eroding the autono-
mous claims of intermediate groups, issued in an 
irreconcilable dichotomy between theories of the 
all-embracing sovereignty of an organic state and 
unsatisfactory attempts to explain the state itself 
as contractually constructed by the human atoms 
who alone had natural existence and natural rights. 

Gierke’s historical and systematic theses meet 
in his assertion of the Germanic doctrine that 
should be applied to modern associations. Attack- 
ing the prevailing doctrine of Savigny, which con- 
strued them as creations of positive law on the 
ground that “originally and from a naturalistic 
point of view” (1868-1913, vol. 2, p. 25) true legal 
personality belongs only to individual men, he 
propounded a doctrine which seemed to him not 
only more German, but also superior in scientific 
realism and philosophical validity. When law treats 
groups as persons, he insisted, it does not distort 
reality. Joint-stock companies, churches, trade 
unions are—like the state itself—real collective per- 
sons. They exist whether the state recognizes them 
or not; the role of the state is declaratory, not cre-
ative. The Genossenschaft is an organic unity, com-
posed of individuals or other associations, with its 
own original purpose; it organizes itself through 
its own system of “social law”; it is autonomously 
capable of willing and acting; it has thus a real 
personality, which is the proper subject of rights 
and obligations. Appreciation of the actual nature 
of associations opens the way to the only sort of 
legal theory that corresponds with fact and the 
only sort of social organization that can be ethi- 
cally satisfactory, one that resolves the conflicts in- 
herent in human strivings toward unity and liberty. 

Gierke repeatedly emphasized that his position 
lay between that of the extreme individualists, who 
would reduce human relationships to contracts 
among sovereign individuals, and that of the or- 
ganicists, who would absorb the individual and all 
society into the state. A man is born “as a member 
of a family, a race, a community, in short, as a
member of a whole” (1868-1913, vol. 2, p. 47) 
and “what man is, he owes to the association of 
man with man” (1868-1913, vol. 1, p. 1). The 
system of human associations, natural and volun- 
tary, presents a complex pattern of rich and fluid 
variety. To primitive natural associations based 
upon “purely physical” ties are added a complex 
and fluid variety of associations more deliberately 
created—some highly specialized in purpose and 
membership, others more generalized and compre- 
hensive. The process of differentiation and special- 
ization is balanced by a process of generalization. 
But, as an expression of man’s social nature, the 
lowliest and narrowest association has some of the 
same dignity and value as the highest and most 
comprehensive one. 

The state is the product of the same sort of 
process as that which produces all other associa- 
tions. But it is distinguished from other associations 
in that it is the highest and most comprehensive; 
thus its purposes include the forceful carrying out 
of “the general will” and the coercive conciliation 
of the wills of all individual and collective persons. 
Accordingly, although it is “not generically differ- 
ent,” the state differs qualitatively as well as quan- 
titatively from other associations. Moreover, its 
function requires both Genossenschaft and Herr- 
schaft. The history of the German state, Gierke 
believed, had culminated in an integration of 
Genossenschaft and Herrschaft, organically unit- 
ing the state’s associational basis with the inher- 
ent authority of the monarchy at its apex. 

Gierke’s theory of law corresponded to his theory 
of associations. In “Die Grundbegriffe des Staats- 
rechts” (1874) he attacked the formalist concep- 
tion of law as a creation of the state. “The final 
source of all law” is “the social consciousness of 
any social institution whatever” (Gierke in Lewis 
1935, p. 176); the declaration that transforms 
social convictions of right into law is made by other 
associations besides the state. In the terminology 
he used in Das Wesen der menschlichen Verbände 
(“The Nature of Human Associations,” 1902), the 
two basic categories of law are not private and 
public law, but “individual law,” through which 
the state regulates the external relations of indi- 
vidual and collective persons, and the various 
bodies of “social law,” which treat individuals only 
as members of groups. Social law is the law pro- 
duced by the collective persons themselves to regu- 
late their internal life, the relations of the whole 
with its parts, and the integration of narrower into 
more inclusive entities. The public law that organ- 
izes the structure of the state, and those of nar-
rower associations (e.g., local communities, prov-
inces) insofar as they are a part of the state 
structure, is simply one among many systems of 
autonomously developed “social law,” differing from 
other systems only in the specific characteristics 
appropriate to the specific nature and purpose of 
the state. Gierke’s concept of “social law” enables 
him to construe the internal rules of churches, 
trade unions, business corporations, etc., as inde- 
pendent of state determination and to put such 
bodies on an equal basis with human persons in 
claiming areas of freedom into which the state 
cannot intrude. 

Gierke’s conception of the nature of associations 
could suggest a highly decentralized federal po- 
litical structure, which might include as constitu- 
ent members both functional and territorial units. 
Morris R. Cohen once referred to Gierke as “a sort 
of patron saint of political pluralists.” But Gierke 
himself was not a political pluralist, nor did he 
develop the ideal of a functional federalism. The 
pluralistic elements of his theory were always care- 
fully balanced by the organic and authoritarian, 
and by the dominant role he assigned to the state 
and to its law. His devotion to Prussia and the 
monarchy and his concern for the assured unity of 
the German people tipped the balance steadily 
toward authority. He became increasingly con- 
vinced that the constitution of the Bismarckian 
Reich achieved an almost perfect harmony of as- 
sociational and authoritarian principles. Earlier in 
his career he had demanded decentralizing re- 
forms, but in 1919 his fear of the disruption that 
would follow the abolition of the monarchy made 
him a vigorous critic of the Weimar constitution. 

Gierke’s theory of social law influenced such 
writers as Léon Duguit and Hugo Krabbe; his in- 
sistence on the autonomous origins of associations 
influenced, directly or indirectly, the thought of 
S. G. Hobson, G. D. H. Cole, Harold J. Laski, and 
others. His influence on English pluralism owed 
much to Frederic William Maitland, who intro- 
duced Gierke to English-speaking academic circles 
in 1900. With a lawyer’s interest in the legal in- 
terpretation of corporations and other groups, 
Maitland emphasized Gierke’s legal doctrine of the 
real personality of associations but paid less atten- 
tion to his view of their organic nature and to the 
very special role he attributed to the state. J. N.
Figgis contributed to Gierke’s reputation as a plur- 
alist when in Churches in the Modern State he 
drew heavily from Gierke in vindicating the “real 
life and personality” of churches and other associa- 
tions against the “Leviathan state” and the Aus-
tinian concept of sovereignty. In Germany, Gierke’s
student and follower Hugo Preuss argued in his 
early writings for a transformation of the authori- 
tarian, Prussia-dominated Reich into a decentral- 
ized democratic state whose articulation would be 
unhampered by the outworn dogma of sovereignty; 
but Preuss did not minimize his divergence from 
Gierke. In the Weimar period he moved to an 
uncompromising assertion of the sovereignty of the 
united German state. 

Gierke’s historical interpretations have been 
criticized as generalizing beyond the evidence, as 
tending to transform social movements into ideo- 
logical or spiritual movements, as reading his own 
categories into past thought. But Das deutsche 
Genossenschaftsrecht, with its massive erudition 
and often perceptive statement, remains a classic 
which no historian of its topics can ignore. Gierke’s 
systematic theses have also been criticized, even 
by generally sympathetic writers, for example, in 
the penetrating analysis by Ernest Barker (1934).
Interest in Gierke’s theory has declined with the
decline of the pluralist school. The normative and 
juristic conclusions that he drew from recognition 
of the spontaneous self-assertion of groups have 
little in common with more recent descriptive anal-
yses of group action in politics.

John D. Lewis


[For the historical context and subsequent develop-
ment of Gierke’s ideas, see NATURAL LAW; PLURAL-
ISM; POLITICAL GROUP ANALYSIS; VOLUNTARY ASSO-
CIATIONS; and the biographies of DUGUIT; FIGGIS;
LASKI; MAITLAND.]


GIFFEN, ROBERT

Sir Robert Giffen (1837-1910) was a British
statistician, economist, and writer on finance. He 
was born in the Scottish town of Strathaven and 
was educated at the local village school. He showed 
an early aptitude for journalism, he and his brother 
contributing anonymous poems and articles to the 
local newspaper. At the age of 13 he entered the 
office of a Glasgow solicitor in a very minor posi- 
tion but made use of his seven years’ stay in that 
city by attending lectures at the university. Jour- 
nalism, however, attracted Giffen more than the 
law, and in 1860 he became reporter and subeditor 
of the Stirling Journal. Like many another Scot 
before him, he was lured south of the border, and 
he became connected with the Globe, then more or 
less the official organ of Lord Palmerston’s admin- 
istration. After a short time on the Fortnightly 
Review, in 1868 he became subeditor of the Econ- 
omist, then under the editorship of Walter Bagehot. 

The time that Giffen spent in the city of London 
was to serve him well in his future career. His con- 
tact with Bagehot was especially profitable, for he 
acquired from Bagehot both his style and his meth- 
odology. His later connection with the city office 
of the Daily News provided an invaluable acquaint- 
ance with the London stock exchange and the ways 
of the London financial community. 

In 1871, when the Report on Local Taxation was 
published, George Goschen, president of the Poor 
Law Board and main author of the report, acknowl- 
edged Giffen’s assistance. The reputation so earned 
led to Giffen’s appointment as chief of the statistical 
section and controller of corn returns at the Board 
of Trade. An attempt in 1876 to reorganize this 
body and transfer some of its functions to the 
Foreign Office proved unsatisfactory; the old Statis- 
tical Department was revived and reorganized with 
Giffen at its head as assistant secretary. In 1892 
he became controller-general of the Commercial, 
Labour, and Statistical departments, a position he 
retained until his retirement in 1897, having received
the K.C.B. in 1895 in recognition of his 
service. 

When Giffen accepted his position in the civil 
service he obtained permission to publish his views 
on matters of economic interest. Indeed, he was 
very active in the affairs of the Royal Statistical 
Society and the Royal Economic Society; he was 
influential in the founding of the latter body in 
1890 and president of the former from 1882 to 
1884. A frequent, almost prolific, contributor to the 
journals of both bodies, he was also editor of the 
Journal of the Royal Statistical Society from 1876 
to 1891 and provided the “city notes” in the Eco- 
nomic Journal until he died. 

Giffen’s reputation rests mainly on his work in 
“official statistics,” a field which had been pioneered 
by G. R. Porter, who was the first to organize the 
crude, incomplete mass of statistics relating to 
revenue and commerce. Richard Valpy made fur- 
ther improvements, instituting the system of statis- 
tical abstracts. Giffen was largely responsible for 
the first attempts to collect figures relating to the 
wages of manual laborers and for setting up the 
Bureau of Labour Statistics. Although Giffen ap- 
peared to have little knowledge of the mathematical 
theory of statistics, he had a remarkable intuitive 
grasp of the significance of facts and figures, a gift 
that allowed him to clear the way through the most 
complicated problems by the use of bold estimates. 
He was always inclined to caution, but his imagi- 
native approach caused him to tackle such prob- 
lems as the measurement of national income, which 
had formerly been considered too difficult a task 
by many an able statistician. In addition, his jour- 
nalistic flair did much to popularize the use of 
statistics, and his wide acquaintance with and knowledge
of the world of finance and business led government
departments and official bodies to seek his 
advice. In fact, some of his most important work is 
to be found in official reports. 

At the height of his career, Giffen’s opinion 
was respected in the fields of both applied and 
theoretical economics; today he is chiefly remembered
for his “Paradox,” mentioned in Book 3 of 
Marshall's Principles ([1890] 1961, p. 132). This 
exception to the general law of demand was held 
to operate in a community heavily dependent upon 
one commodity for its staple diet. If the price of 
this commodity falls, the real income of the com- 
munity is increased so that it can afford to buy 
superior foodstuffs and thereby cut down its con- 
sumption of the “Giffen good.” The opposite will 
occur in the case of a price rise, producing the 
unusual effect of quantity demanded increasing as 
prices rise and falling as prices fall. 
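
The mechanism can be made concrete with a small numerical sketch. The figures and names below are illustrative assumptions, not data from Giffen or Marshall: a household must obtain a fixed number of food units, spends its whole food budget, and buys the dearer superior food only as far as the budget allows.

```python
def staple_demand(staple_price, budget=14.0, superior_price=3.0, units_needed=10.0):
    """Units of the staple bought by a household that needs `units_needed`
    units of food in total and spends its whole budget (hypothetical model).

    Solves  staple + superior = units_needed
            staple_price * staple + superior_price * superior = budget
    """
    if not staple_price < superior_price:
        raise ValueError("the staple must be the cheaper food")
    if not staple_price * units_needed <= budget <= superior_price * units_needed:
        raise ValueError("budget is outside the range this sketch covers")
    superior = (budget - staple_price * units_needed) / (superior_price - staple_price)
    return units_needed - superior


for price in (0.8, 1.0, 1.2):
    print(f"staple price {price:.1f} -> staple bought {staple_demand(price):.2f}")
```

Under these assumed numbers the quantity of the staple bought rises as its price rises, because the dearer staple leaves less of the budget for the superior food, and it falls when the price falls.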

Giffen was among the earliest to analyze the 
workings of the quantity theory of money, and the 
role played by the rate of interest, within an econ- 
omy possessing a banking system. His ideas seem 
to have had a great deal of influence on Alfred 
Marshall, over and beyond the “Paradox.” Giffen 
was very much concerned with the way in which 
the volume of “bank money” affects the level of 
currency. In Stock Exchange Securities (1877) and 
two essays, “The Effects on Trade of the Supply of 
Coinage” (in [1879-1885] 1886, pp. 89-104) and 
“Gold Supply: The Rate of Discount and Prices” 
(ibid., pp. 37-88), he very clearly indicated the 
relationship that exists between a banker's cash 
reserves and the level of deposits, and the part 
played by the discount rate in bringing about an 
equilibrium between the supply of gold and the 
level of prices. 

His “dispensation” to participate 
in current controversy allowed him to become a 
leading protagonist in many of the debates of the 
latter part of the nineteenth century. A staunch 
laissez-faire economist, he was an avid opponent 
of the protectionist and socialist schools, both at 
home and abroad; but he was willing to concede 
that protection could have political advantages, es- 
pecially when it concerned colonial produce, and 
was willing to tolerate it in a mild degree if it was 
the only alternative to socialism. As might be ex- 
pected, he was also opposed to much of Gladstone’s 
financial policy and was a foremost antagonist of 
the bimetallist school. 

K. J. PENNEY 


[For the historical context of Giffen’s work, see the 
biography of BAGEHOT; for discussion of the subse- 
quent development of his ideas, see the biography 
of MARSHALL.] 

Edward Winslow Gifford (1887-1959), American
anthropologist, was born in Oakland, California.
His formal schooling ended with a high 
school diploma. From 1904 to 1912 he was assist- 
ant curator of ornithology in the California Acad- 
emy of Sciences and a member of academy expe- 
ditions to the Revillagigedo Islands in 1903 and to 
the Galapagos Islands in 1905-1906. He was ap- 
pointed assistant curator of the Museum of Anthro- 
pology at the University of California in 1912 and 
became, successively, associate curator, curator, and 
director of the museum. In 1920 he was appointed 
lecturer in the department of anthropology, in 1938 
associate professor, and in 1945 full professor. 

In addition to his early interests in natural his- 
tory (particularly ornithology and conchology), 
Gifford’s range of anthropological interests and 
competence was very broad, including archeology, 
folklore, kinship, social organization, material cul- 
ture, religion, and physical anthropology. In each 
of these fields he made both substantive and theo- 
retical contributions. 

Gifford’s major effort was directed toward the 
ethnology of California Indians. He wrote a num- 
ber of monographs on tribal groups in California 
and a number of articles on special aspects of the 
culture of Californian aboriginal societies. With 
the exception of those of A. L. Kroeber, Gifford’s 
contributions to California ethnography are quan- 
titatively larger than those of any other worker. 

Archeology was a secondary interest, but in 
1948, toward the end of his career, he began an 
investigation of Oceanian archeology that resulted 
in major excavations in Fiji, New Caledonia, and 
Yap. The archeology of northwestern Mexico at- 
tracted his interest in 1945 and 1946, and he 
made several important contributions to that field. 

Gifford’s ethnographies are filled with detail. 
Informants who contributed information beyond 
that pertaining to the direct question were appar- 
ently allowed to continue, and these bits of volun- 
teered data were usually noted and included. The 
result of this leniency with informants is that Gif- 
ford’s ethnographic accounts contain a multitude 
of data that do not occur in the published ethnographies
of any other worker, and these data 
often provide some fact or allow some insight that 
cannot be obtained when only those data are re- 
ported that relate to generalizations. 

Gifford was unusual in sensing the importance 
of recording individual experiences of aboriginal 
informants at a time long before this kind of information
was considered important. Thus, his study 
of Clear Lake Pomo society stands as a pioneer 
attempt to determine the actual composition of 
local groups by the genealogical method (1922). 
His monograph on the Northfork Mono includes an 
extraordinarily detailed history of one informant, 
recorded in 1918, who could list no fewer than 
23 settlements where she had lived in the course 
of her life, all of which were located within an 
area of approximately 150 square miles (1922). 
Such vignettes provide us with a means of viewing 
life in aboriginal California in terms of individual 
experience as well as impersonal customs and in- 
stitutions. 

Gifford did not hesitate to take on projects of 
synthesizing immense bodies of fact. Two of his 
largest works remain the main sources of primary 
data on two aspects of the California Indians: their 
physical characteristics (1926a) and their kinship 
terminologies (1922). 

Beyond being a patient collector of facts, Gifford 
was competent at seeing and solving theoretical 
or terminological problems. In a paper on Euro- 
American culture contact in Tonga published in 
1929, he became the first American anthropologist 
to use the term “acculturation” in the modern 
sense. The same is true of his use of the term 
“lineage,” which he proposed, in a broad survey of 
political organization (1926b), as the basic unit 
of aboriginal California societies. 

In archeology he was the first to devise a method, 
which has since become important, for segregating 
the components of refuse deposits, identifying 
them, and deriving from the proportions of each 
component economic interpretations, indications of 
dietary change over time, and reflections of the 
relations of the occupants to local environment. 
Gifford was among the first to employ the weight, 
rather than the count, of potsherds from archeo- 
logical excavations to assess chronological changes 
in the ceramic complex. 

In addition to his field investigations, Gifford 
guided the Museum of Anthropology at Berkeley 
for 44 years and directed its development of one 
of the major anthropological collections in the 
United States. The importance of the Berkeley col- 
lections rests not so much upon their size as upon 
the fact that an unusually large proportion of the 
materials was collected in the field by scholars, 
including Gifford himself, and is accompanied by 
information on cultural context. Gifford organized 
the collections to make the context of the objects 
as explicit as possible and constructed a card index 
for cross reference. Consequently, it is possible that 
no other major research collection in anthropology 
is more accessible to scholars. 

ROBERT F. HEIZER 


[For the historical context of Gifford's work, see the 
biographies of KROEBER and LOWIE.] 

Gild studies form one of the more obvious links 
between history and the social sciences. Originat- 
ing in nineteenth-century romantic interest in early 
Germanic institutions and the theme of liberty, 
they have played into three lines of speculation 
about the role of voluntary associations in societal 
development. The first began with controversy over 
the respective influence of “Germanist” theory 
based on the right to associate freely and of the 
Romanist theory that the right to associate is a 
concession from the prince. [See the biographies of 
GIERKE and SAVIGNY for discussion of these theories
and the controversy.] This general dialectic is 
still being reformulated as a fundamental theoret- 
ical element in studies of ecclesiastical influence on 
the philosophy of law and in studies of the devel- 
opment of representative institutions. Research on 
this last subject is now, under the direction of an 
international commission, strongly comparative. 
But it focuses on the wider political communities of 
medieval and early modern Europe rather than on 
the smaller associations, which, to social anthro- 
pology, would be of at least equal significance. 
From the start, however, a second line of thought 
linked these with the elementary bonds of kinship, 
seeing them at first as an effort to transcend kin- 
ship loyalties and later as partially reverting to 
them. Research on this problem has lagged, al- 
though there are signs of its revival in France and 
Italy. A third line of thought, prominent in much of 
the better social history written in the third quarter 
of the nineteenth century and now revived by the 
influence of twentieth-century sociology, is con- 
cerned with nonpolitical aspects of the social con- 
trols that small associations may have exercised. 

The tendency now is to interpret the term “gild” 
—the variant spelling “guild” is from the seven- 
teenth century—as applying only to occupational 
associations. This was not the original sense of the 
word. It was specific only in signifying that mem- 
bers were under the obligation to “geld,” that is, to 
make payments for purposes agreed upon. Through- 
out the Middle Ages, it was also synonymous with 
fraternitas—definable as a pseudo-kinship group 
of a character in principle congenial to Christianity. 
Since contemporary terminology varied from one 
period and one area to another, historians writ- 
ing in English have found it convenient to adopt 
“gild” as a generic term for all of the smaller asso- 
ciations with a fraternal aspect; French and Italian 
usage distinguishes the occupational types (corpo- 
rations, corporazioni) from the purely fraternal 
(confréries, confraternitate). Within Europe, his- 
torians can compare their findings on the basis of 
a common awareness of the political and social 
context in which any type of association ran its 
course. But now that ethnologists and historians of 
other civilizations are singling out occupational 
gilds for description and comment, comparative 
study will call for analyses going beyond the tradi- 
tion of European political theory. 

West European studies can, however, be a source 
of hypotheses, typologies, and, especially, of devel- 
opmental models useful in all comparative work. 
The evidence on which they rest has undergone a 
great deal of critical examination; in the North it 
covers a thousand years and in Italy almost two 
thousand, running back into republican Rome. 
West European experience was, of course, affected 
by peculiarities of political and religious organiza- 
tion and by historical contingencies that varied 
from one country to another. It will be compressed 
here into a scheme of five phases, in each of which 
the relationship between small voluntary associa- 
tions and public authority changed. The functions 
and the organization of the main types of gild 
characteristic of each phase will then be discussed, 
but space will not permit even a catalogue of the 
accompanying types of local power structures or 
any discussion of the highly influential examples 
of differentiated corporate bodies within the church 
(for which see Histoire de l’église . . . 1964). 

The course of development. The five phases of 
development may be described briefly as follows: 

(1) In the northern “barbarian” kingdoms of the 
early Middle Ages, associations contributing to or- 
der pass unchallenged by weak monarchies. 

(2) Monarchy asserts itself as the source of 
legitimacy of association. The papacy does the same 
in regard to religious orders but permits free fra- 
ternal association among the laity. This phase cul- 
minates in chartered concessions to town govern- 
ments, which become more or less autonomous 
organs of public authority. 

(3) Town governments, after some lag, permit 
the differentiation of citizen communities into oc- 
cupational gilds, which serve both public and pri- 
vate ends. 

(4) Gilds are enmeshed in a system of state- 
regulated privilege. Noneconomic associations in 
Protestant states survive only through sectarian 
organization. 

(5) Either voluntarily or in consequence of rev- 
olution, the state abandons the system of special 
privilege; economic gilds either atrophy or are 
abolished. 

Phase 1. In the societies to which phase 1 is 
applicable, customary law prevailed, and feud 
rather than royal authority was the chief sanction 
in the enforcement of law against crimes of vio- 
lence. In other words, kinship was the traditional 
source of power in local communities. It is uncer- 
tain at what point gilds arose to offer supplemen- 
tary protection (which was available also from 
lordship). The earliest known example is from the 
rules of the gild of thanes at Cambridge, written 
at some time in the late Anglo-Saxon period. This 
gild combined five functions: protection of mem- 
bers’ property, discouragement of violence, mutual 
aid in sickness, common worship, and responsibility 
for decent burial. An oath bound members, and 
banquets brought them together. They were men 
of upper middle rank. Their protective plan, if one 
of them were robbed, was to track down the thief 
and compel him to raise the large sum of 8 pounds 
as compensation. Their way of discouraging vio- 
lence was simply to add to the normal pecuniary 
penalties: the rules allow for the possibility of the 
murder of one “brother” by another and demand 
only that the murderer pay 8 pounds to the gild in 
addition to the wergild due to the kindred of his 
victim. A group strong enough to intimidate thieves 
and to fine its own members heavily was obviously 
a formidable power in any local community. Even 
fraternities professing no purpose but group wor- 
ship, conviviality, and the support of a loan and 
burial fund could mobilize power to maintain order. 

Phase 2. The second phase was inaugurated in 
Frankish territory by Charlemagne’s fear that sworn 
gilds might disrupt the feeble unity of his realm. 
Gild power, he realized, depended on the oath; the 
words “to live or die together” could be added to it 
secretly to create a group as solidary as a kindred, 
but also expansible and potentially defiant of all 
law but its own. Charlemagne banned sworn gilds 
but was unable to suppress them. His edicts, how- 
ever, bestowed a kind of legitimacy on unsworn 
gilds for mutual aid. There were probably already 
merchant gilds among them; they grew as trade 
grew and by the eleventh century had a more elabo- 
rate organization than the earlier protective gilds. 
The cloth merchants of Valenciennes, for example, 
had a salaried secretary elected for life, a treasurer, 
an almoner, a dean, their own priests, an elected 
judicial board of 12, and, as chief officer, a mayor. 

The specific legitimation of gilds by charter came 
about slowly as part of that piecemeal granting of 
judicial or fiscal privileges to successive groups of 
town residents which preceded the full develop- 
ment of town governments. In early twelfth-cen- 
tury London, two artisan gilds received charters 
authorizing them to hold private courts, and other 
unchartered gilds of unspecified nature were made 
to pay for the privilege of existing. The older free- 
dom of association survived only under the shelter 
of the church, in the form of parish gilds support- 
ing the cult of chosen saints. 

Phase 3. The differentiation of the citizen body 
into occupational gilds, which characterized the 
third phase, was a political process. It helped to 
redefine the line, which rapid immigration had 
tended to blur, between citizens and noncitizens 
resident in a town. It domesticated the primitive 
gild oath, making it a sanction of civic obedience. 
It turned gild officers into quasi-public officials and 
used gilds in the assignment of military and night-policing
duties. Finally, it fostered political consciousness.
Artisan gilds gave the towns whatever 
drive toward “democratic” reform they had; the new 
mercantile gilds mobilized resistance to this drive. 

In the matter of social controls over economic 
conduct, the role of the gilds was mainly to intro- 
duce collective decisions about how to meet new 
and difficult situations. The move toward formal 
organization among artisans did not spread until 
the late thirteenth century, when, after two cen- 
turies of expanding trade, the towns were begin- 
ning to face more competitive conditions. The basic 
values of an ambitious but hard-pressed working 
population had long been rooted in the master arti- 
san families. The need for hard work, frugality, 
and calculation had been beaten into many genera- 
tions of sons and apprentices. Market officials had 
set checks on dishonesty in selling, and in the ex- 
port industries merchants and their agents had 
done the same for manufacturing practices. Neigh- 
borhood opinion condemned open quarreling. Above 
all, Christian doctrine had set a moral and religious 
value on work and approved of honest gain. In 
their character as fraternities, both occupational 
and parish gilds had sharpened sanctions by grudg- 
ing any grant from their loan or relief funds to 
members who failed in business through laziness, 
intemperance, or “folly.” The last notion covered 
bad judgment in not following the regular mer- 
cantile practice of dividing capital among separate 
ventures to reduce risk. Enterprising artisans had 
undertaken local trade in the materials of their 
craft as an avenue of upward mobility; in twelfth- 
century Genoa they had even made small invest- 
ments in the port's booming overseas trade. The 
Genoese of that age had already adopted Ben 
Franklin’s maxim that “Time is money,” their trade 
and loan contracts being dated by the hour of the 
day. 

The new formal organizations tried to keep up 
the reputation of their members’ wares for high 
quality, to lower costs by reducing wages, and to 
cover up disputes over technical innovations that 
would lower costs. In all such matters both mercan- 
tile and artisan gilds acted to some extent as secret 
societies. Although gilds had to submit a code of 
rules to the city authorities in order to be recog- 
nized as legitimate, there was no regular check on 
the interpretation of the rules or their amendment. 
Disagreement over competitive practices was some- 
times so bitter as to lead to the realignment of dis- 
sident groups into new gilds. 

As is well known, the fund-raising power of the 
gilds was drawn on to support not only the cult 
of patron saints but also the major festivals through 
which the spiritual values of the entire community 
were reaffirmed. The persistence with which gilds 
cultivated pleasant manners within their own cir- 
cles is less well appreciated; at a gild meeting, all 
overt aggression, whether by word or deed, was re- 
lentlessly penalized. This is worth remark in an 
age when, despite centuries of Christian teaching, 
overt aggression was still common among the aris- 
tocracy, the peasantry, and the floating lower level 
of the urban population. The gilds may further be 
credited with inventing the bourgeois pattern of 
periodic, respectable festivity among married 
couples. 

The Italian gild movement of this period was 
the first to draw the secular professions into its 
pattern of organization, and in Tuscany lay “Round 
Table” clubs imitated the gild practice of employing 
notaries to draft their “statutes.” Neither universi- 
ties nor military organizations can be dealt with 
here, but it should be noted that the fraternity form 
was permitted to alien groups of artisans and that 
in the Low Countries it was used, at the instiga- 
tion of lords, to keep archers in training. In illicit, 
sworn forms it persisted among Frisian pirates, in 
French resistance to English occupation, and 
among unfree peasants raising funds to sue for 
or buy freedom. Heretic groups, on the other hand, 
preferred hierarchic forms of organization under 
charismatic leadership. 

Phase 4. The fourth phase, especially in pro- 
vincial towns in the sixteenth and seventeenth cen- 
turies, saw the social and economic activity of 
industrial gilds at its height, but political and eco- 
nomic circumstances were changing. State power 
standing above town administrations was readier 
than ever to grant enlarged powers to harass local 
competitors, gild offices fell more than ever into the 
hands of mercantile entrepreneurs, and subordi- 
nate fraternities of wage workers were suppressed. 

Phase 5. The fifth phase, during which gild 
organization of trade and industry died out, dragged 
on from the seventeenth into the nineteenth cen- 
tury, according as new industrial development and 
reaction against the old system of privilege were 
precocious or late. There had always been skepti- 
cism as to whether gilds stood, as they claimed, for 
the public interest. North of the Alps the spectacle 
of rural industry, based either on handicrafts or on 
new mechanization, progressing without benefit of 
gilds, fanned this skepticism long before it found 
theoretical exposition in new economic thought 
and propaganda. Then, too, as “fraternity” flamed 
up as a national, and international, slogan, the old 
local fraternal forms of association became obsolete.
True, the question of continuity between the 
old forms and the new craft unions, burial clubs, 
etc., intrigues historians, but the story remains 
incomplete. 


Comparisons with developmental models that 
might be constructed for other civilizations, includ- 
ing the rapidly moving kaleidoscope of present-day 
Africa, would rest, in the first instance, on the 
place of universalist principles in the legitimation 
of association. In all their phases European gilds 
were in part particularistic in spirit, but it was their 
contributions to order and religion that, from the 
first, were the basis of their legitimation. In Europe 
it was through the medieval gilds that universalist 
principles of policy protecting the consumer against 
fraud and dangers to health (e.g., from soiled sec- 
ond-hand mattress stuffing, diseased meat, and im- 
pure drugs) were first explicitly formulated. After 
the demise of the gilds, such functions had ulti- 
mately to be taken over by the state, and they have 
been the essential justification for the revival of 
professional and trade associations. [See LICENS- 
ING, OCCUPATIONAL.] In Islam, artisan gilds, whose 
appearance must now be dated no earlier than the 
thirteenth century (Goitein 1964), were the bearers 
of a universalist fraternal philosophy. Particularist 
aspects of Western gilds have been exaggerated by 
an uncritical acceptance of the notion that a son 
necessarily followed his father’s trade. Comparative 
study on this point has to be quantitative and has 
to take account of general demographic trends and 
the channeling of migration to towns. Far from 
being a worked-out field, the study of voluntary 
associations can still illuminate many aspects of 
society. [See VOLUNTARY ASSOCIATIONS.] 


SYLVIA L. THRUPP 

The work of Corrado Gini (1884-1965) has had 
a profound impact on many fields within the social 
sciences. The son of a manufacturer, Gini was 
born in Motta di Livenza, Treviso Province. He 
studied law at the University of Bologna, where he 
also took courses in mathematics. The broad scope 
of his scholarly contributions is reflected in the 
variety of subjects he taught, successively, at the 
universities of Cagliari, Padua, and Rome: statis- 
tics, political economy, demography, biometrics, 
constitutional law, and sociology. 

Gini also furthered the development of social 
science in other than academic capacities. From 
1926 to 1932 he was chairman of the Central In- 
stitute of Statistics. He directed several scientific 
expeditions to study the demographic, anthropo- 
metric, and medical characteristics of particular 
ethnic groups in Fezzan, Palestine, Mexico, Poland 
and Lithuania, Calabria, and Sardinia, as well as 
the processes of assimilation of immigrant groups 
generally. He was the founder and editor of the 
international journal of statistics Metron, in 1920, 
and of the journal Genus, the organ of the Italian 
Committee for the Study of Population Problems. 

In recognition of his achievements he was 
awarded honorary doctorates, one in economic sci- 
ence—from the Catholic University in Milan, in 
1932—and another in sociology—from the Uni- 
versity of Geneva, in 1934; and he was awarded an 
honorary doctor of science degree by Harvard at the 
1936 tercentenary celebration of that university. 

The main currents of thought in Italian culture 
at the time that Gini began his scientific activity 
were positivism and idealism. Gini’s commitment to 
positivism was limited. His works manifest a re- 
markable eclecticism, arising from his systematic 
study of a variety of disciplines: law, economics, 
statistics, mathematics, and biology. He always 
placed the phenomena he was studying in a more 
general context, bringing essential social, economic, 
and demographic elements to bear on the topic of 
his immediate concern. 

Gini thoroughly understood the importance of 
descriptive statistics, as is shown by his particular 
admiration for Luigi Bodio among Italian statis- 
ticians, but at the same time he deeply felt the 
need to study the procedures of inferential statistics 
and to find suitable criteria for evaluating those 
procedures. To this end he studied the works of 
Bernoulli, Lexis, and Czuber, as well as those of 
the Italians Angelo Messedaglia and Rodolfo Benini. 

Contributions to statistical method. Gini’s first 
scientific works deal with the statistical regularity 
of rare events. To the same period—1908 to 1911 
—belongs his initial work on the concept and meas- 
urement of probability, with special attention and 
application to the human sex ratio at birth. His 
findings were contained in a book (1908) in which 
he presented analyses of extensive empirical mate- 
rials on sex ratios from different countries. In the 
same work, he developed a theory of dispersion, 
that is, of the quantitative structure of the scatter 
or variability among measured quantities of the 
same kind. He later returned to the topic of dis- 
persion (1940; 1941). 

Gini made a lengthy critique of the principles of 
statistics in order to reinforce the logical basis of 
statistics and to rid it of naïve empiricism. He 
wished to raise statistics to the level of an inde- 
pendent science, one of whose tasks would be the 
systematic investigation of the advantages and lim- 
itations of various indexes, or descriptive statistics, 
under differing structures of error. 

In an address entitled “I pericoli della statistica” 
(1939; “The Dangers of Statistics”), inaugurating 
the first meeting of the Società Italiana di Statistica, 
Gini warned statisticians about the faulty logical 
foundations of some statistical methods and 
stressed the importance of estimating meaningful 
parameters. He published further papers on this 
general subject (1943; 1947). In the latter paper, 
“Statistical Relations and Their Inversions,” he 
pointed out that poor statistical notation may lead 
to misunderstanding and error. For example, if one 
writes $y = 3 + 7x$ as a mathematical relationship 
between the real variables $x$ and $y$, one may just 
as well write $x = -\tfrac{3}{7} + \tfrac{1}{7}y$; this is called invertibility.
On the other hand, in a statistical context of 
regression, one may loosely write $y = 3 + 7x$, meaning
that the expectation (or average) of a random 
variable $Y$ is thus linearly related to possible values 
of another random variable $X$. For clarity, one 
should use a more complete notation like the now 
conventional 

$$E(Y \mid X = x) = 3 + 7x,$$

and, in general, $E(X \mid Y = y) \neq -\tfrac{3}{7} + \tfrac{1}{7}y$. On the 
other hand, taking complete averages, $EY = 3 + 7\,EX$;
this situation is called one of subinvertibility. 
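
The distinction can be checked numerically. The following sketch is an illustration under assumed distributions (standard normal X and an independent normal error with standard deviation 7), not a reconstruction of Gini's own argument: it fits the regression of Y on X and of X on Y to simulated data in which the expectation of Y given X = x is 3 + 7x, and compares the second fit with the algebraic inverse of the first.

```python
import random


def least_squares(xs, ys):
    """Return (intercept, slope) of the least-squares line of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


rng = random.Random(0)  # fixed seed, so repeated runs give the same numbers
xs = [rng.gauss(0.0, 1.0) for _ in range(20000)]
# Simulated data with E(Y | X = x) = 3 + 7x; the error spread is an assumption.
ys = [3.0 + 7.0 * x + rng.gauss(0.0, 7.0) for x in xs]

a_yx, b_yx = least_squares(xs, ys)  # regression of Y on X
a_xy, b_xy = least_squares(ys, xs)  # regression of X on Y

print("Y on X:           ", round(a_yx, 2), round(b_yx, 2))
print("X on Y:           ", round(a_xy, 3), round(b_xy, 3))
print("algebraic inverse:", round(-3 / 7, 3), round(1 / 7, 3))
```

With these assumed variances the slope of X on Y is close to Cov(X, Y)/Var(Y) = 7/98 = 1/14, about half the 1/7 that algebraic inversion would suggest, while the fitted slope of Y on X stays close to 7.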

In a paper written in collaboration with Luigi 
Galvani, “Di talune estensioni dei concetti di media 
ai caratteri qualitativi” (1929; “Extension of Mean 
Value Theory to Qualitative Characteristics”), Gini 
made an original contribution in another area. The 
problem here was to find a meaningful analogue of 
the usual arithmetic mean for a purely qualitative 
or nominal distribution. By defining a measure of 
deviation between any two classes of the qualitative 
classification and by a minimization argument, Gini 
showed how to choose one of the classes of the classification
as a kind of mean. 
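
A minimal sketch of the idea follows. The text does not state which quantity Gini and Galvani minimized, so the frequency-weighted sum of squared deviations used here is an assumption, and the classes, counts, and deviation table are hypothetical.

```python
def qualitative_mean(frequencies, deviation, power=2):
    """Class minimizing the frequency-weighted sum of deviation(c, k) ** power."""
    def total_deviation(candidate):
        return sum(count * deviation(candidate, other) ** power
                   for other, count in frequencies.items())
    return min(frequencies, key=total_deviation)


# Hypothetical classes, counts, and pairwise deviations, for illustration only.
frequencies = {"a": 40, "b": 35, "c": 25}
table = {("a", "b"): 1.0, ("b", "c"): 1.0, ("a", "c"): 2.0}


def deviation(first, second):
    """Symmetric deviation between two classes; zero for a class and itself."""
    if first == second:
        return 0.0
    return table.get((first, second), table.get((second, first)))


print(qualitative_mean(frequencies, deviation))
```

In this made-up case the class chosen as the mean is "b", which lies between the other two, even though "a" is the most frequent class; the mean so defined need not coincide with the mode.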

The theory of generalized means was of continu- 
ing interest to Gini, and he was particularly con- 
cerned with its very general expression and with a 
wide variety of mean values (1938). In another
article, again with Galvani, he discussed exten- 
sively the topic of location parameters for bivariate 
distributions, noting especially geographical prob- 
lems (see Gini et al. 1933). 

Next to generalized means, measures of vari- 
ability are of great importance, and Gini discussed 
these at length, giving special attention to descrip- 
tive statistics not intrinsically tied to the normal 
distribution (1912). He introduced a new measure 
of variability, the mean difference, which is essen-
tially the expected absolute value of X₁ − X₂, where
X₁ and X₂ are independent random variables with
the same distribution of interest. (Gini thought
primarily in terms of the sample version of the
mean difference.)
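
A sample version of the mean difference can be sketched as follows. This is one common form, averaging the absolute differences over all pairs of distinct observations; whether pairs of an observation with itself are included is a convention, and the choice made here is an assumption rather than a statement of Gini's own definition.

```python
from itertools import combinations


def mean_difference(xs):
    """Average absolute difference over all pairs of distinct observations."""
    n = len(xs)
    if n < 2:
        raise ValueError("at least two observations are required")
    total = sum(abs(a - b) for a, b in combinations(xs, 2))
    return 2 * total / (n * (n - 1))


# The six pairwise differences of 1, 2, 4, 7 are 1, 3, 6, 2, 5, 3 (sum 20).
print(mean_difference([1, 2, 4, 7]))  # 20 / 6, about 3.33
```

Unlike the standard deviation, this statistic makes no reference to a central value, which is one reason it is not tied to the normal distribution.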

Gini also worked on measures of variability for 
qualitative or nominal distributions and on meas- 
ures of variability that are tied to the concentration 
curve and are considered important in the economic 
analysis of income and wealth (1921). 

Nontypicality of typical census areas. Gini set 
himself the task of selecting a sample of typical 
census areas in Italy. To this end, he showed that it 
is not adequate simply to select the sample in such 
a way that the averages of various characteristics 
calculated for the sample are equal to those calcu- 
lated for the entire territory of Italy. Even when 
the examination is confined to only the character- 
istics used in selecting the sample, the sample 
proves to be unrepresentative with respect to the 
variability and concentration of those characteris- 
tics and to the relations existing between them. 
Accordingly, Gini included in the criteria of repre- 
sentativeness these last two aspects of the distri- 
butions of characteristics, and he examined the 
conditions to which each of the proposed criteria 
subjects the other criteria. 

Theory of price indexes. To the theory of price 
indexes, which is of great importance in economic 
statistics, Gini contributed a classic paper, “Quel- 
ques considérations au sujet des nombres indices 
des prix et des questions analogues” (1924), in 
which he solved some logical and technical prob- 
lems connected with elimination methods. 

Distinguishing between simple and complex in- 
dices, Gini showed how, in general, the methods 
for obtaining complex indexes can be considered 
as particular cases of the method of elimination 
(which consists in separating the different circum- 
stances affecting a phenomenon in order to make 
comparisons ceteris paribus with similar phenom- 
ena at different times or places). The problem was 
to keep the phenomenon of “price variation” dis-
tinct from the phenomenon of “quantity variation.”

Operating with the general method of the “typi-
cal population” organized into four special methods,
Gini obtained various formulas for index numbers, 
at the same time providing criteria for choosing 
that one of the formulas best suited to the aims of 
particular problems. (Another of his papers on the
same topic is “Methods of Eliminating the Influ-
ence of Several Groups of Factors,” 1937.)

Theory of distributions. Gini made substantial 
contributions to the study of relationships between 
two probability distributions and between two ran- 
dom variables with a joint distribution. Particularly 
important are the following: 

(a) Transvariation. Suppose that one considers 
two probability distributions, represented for con- 
venience by (independent) random variables X, 
Y. Suppose further that EX < EY; it may still be 
quite probable that X > Y, that is, the difference 
between the observations X and Y may show a dif- 
ferent sign than the corresponding difference be- 
tween expected values. Gini proposed a specific 
measure for this phenomenon, which he called 
transvariation (1953; 1959a; see also Kruskal 
1957). 

(b) Distance between two distributions. Gini was 
one of the first to consider the problem of measur- 
ing meaningfully how much two probability dis- 
tributions differ in gross (1914a). He proposed a 
measure of distance, based on the two cumulative 
distribution functions, that was in fact a metric in 
the technical mathematical sense (see Fréchet 
1947). 

(c) Association between two random variables 
with a joint distribution. Gini made the basic dis- 
tinction between connection (any kind of statistical 
dependence) and concordance (dependence in 
which it makes sense to speak of one variable tend- 
ing to increase—or decrease—along with the 
other). The correlation ratio, for example, is a
measure of connection, while the coefficient of 
correlation is a measure of concordance. Gini ana- 
lyzed previously suggested measures of association 
and proposed new ones both for the case of nu- 
merical variates and for that of qualitative variates 
(see Goodman & Kruskal 1959, which contains 
relevant references to Gini, Weida, and Pietra). 
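
The three ideas in (a) to (c) can be illustrated with a short sketch. None of the functions below reproduces Gini's own formulas: the pair-counting probability in (a) is an illustrative stand-in for his transvariation measure, the area between empirical cumulative distribution functions in (b) is an assumed form of a distance based on the two distribution functions, and the sample values are invented for the example.

```python
from collections import defaultdict
from math import sqrt


def prob_x_exceeds_y(xs, ys):
    """(a) Share of pairs (x, y) with x > y; a stand-in, not Gini's index."""
    return sum(1 for a in xs for b in ys if a > b) / (len(xs) * len(ys))


def ecdf(sample, t):
    """Empirical cumulative distribution function of sample, evaluated at t."""
    return sum(1 for v in sample if v <= t) / len(sample)


def cdf_area_distance(xs, ys):
    """(b) Area between the two empirical CDFs (an assumed form of distance)."""
    pts = sorted(set(xs) | set(ys))
    return sum(abs(ecdf(xs, lo) - ecdf(ys, lo)) * (hi - lo)
               for lo, hi in zip(pts, pts[1:]))


def pearson_r(xs, ys):
    """(c) Coefficient of correlation, a measure of concordance."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    sxx = sum((a - mx) ** 2 for a in xs)
    syy = sum((b - my) ** 2 for b in ys)
    return sxy / sqrt(sxx * syy)


def correlation_ratio(xs, ys):
    """(c) Correlation ratio of y on x, a measure of connection."""
    groups = defaultdict(list)
    for a, b in zip(xs, ys):
        groups[a].append(b)
    my = sum(ys) / len(ys)
    between = sum(len(g) * (sum(g) / len(g) - my) ** 2 for g in groups.values())
    total = sum((b - my) ** 2 for b in ys)
    return sqrt(between / total)


x = [1, 2, 3, 10]  # mean 4.0
y = [4, 5, 6, 7]   # mean 5.5, yet x exceeds y in a quarter of the pairs
print(prob_x_exceeds_y(x, y))             # 0.25
print(cdf_area_distance([0, 1], [2, 3]))  # 2.0

u = [-2, -2, -1, -1, 0, 0, 1, 1, 2, 2]
v = [t * t for t in u]  # v depends entirely on u, but not monotonically
print(pearson_r(u, v), correlation_ratio(u, v))  # 0.0 1.0
```

The last line shows the distinction drawn in (c): the dependence of v on u is complete, so the correlation ratio is 1, but v does not tend to increase or decrease along with u, so the coefficient of correlation is 0.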

Demography and biometrics. Gini pioneered 
in studies that relate demographic phenomena to 
social and biological phenomena. Thus, he studied 
the sources of differential fertility (1949), and he 
related the phenomenon of migration to broader 
social and demographic considerations (1946b). 

Gini also proposed a unified research program 
for analyzing the eugenic and dysgenic effects of
war, with special emphasis on the measurement of
the effects of war on mortality (1915-1920). Also 
related to demographic phenomena are Gini’s 
studies of various populations in phases of expan- 
sion, regression, and extinction (1934). 

Economic and sociological research. From the 
earliest years of his career, Gini tackled the diffi- 
cult problem of estimating national wealth. He 
made a constructive criticism of the method of the 
interval of devolution and the first calculation by a 
direct method of the private wealth of Italy 
(1914b). He followed a methodological organiza- 
tion of the data with a comparative study of the 
qualitative composition of wealth, clarifying the 
conditions that make nations wealthy (1959b). 

Gini originated the idea of including the value 
of human capital in the calculation of wealth [see 
CAPITAL, HUMAN]. This makes it possible to indi- 
cate the preponderant cause for the incommensu- 
rability in time and territory of the wealth and eco- 
nomic well-being of various collectivities (1956a). 
The value-as-property of the man who labors be- 
comes a factor in estimating wealth, just as the 
share derived from human labor is included in 
income (1914a). This conception clearly links eco- 
nomic considerations with sociological and demo- 
graphic ones. 

Related to Gini’s conception of human capital is 
his cyclical theory of population. Observing the dif- 
ferential rates of reproduction of social classes, 
Gini formulated a theory of social metabolism that 
is based on an analogy to organic metabolism: the 
upper classes, having low rates of reproduction, 
will tend to extinction unless they get new members 
from the lower classes, which have higher rates of 
reproduction (1927). Gini intended this theory to 
replace Pareto’s concept of the circulation of elites. 

Gini also developed a theory of phases of popu- 
lation growth, these being related to social and 
economic phenomena (1923). According to his
theory, a particular population grows rapidly in the 
first phase of its development, then has a diminish- 
ing rate of growth until it is approximately station- 
ary, and finally begins a decline that may even lead 
to total extinction. In the first phase, capital is very 
scarce and there is little differentiation between 
social classes; later, as capital is accumulated, 
classes become increasingly differentiated, and 
there is a concomitant differentiation of reproduc- 
tive behavior. As birth control spreads, a deteriora- 
tion in the quality of the population can be avoided 
only by the influence of external factors, such as 
the immigration of young people from populations 
in the phase of demographic expansion. 

Gini’s neo-organicist approach makes it possible
to distinguish normal economic processes from 
pathological ones—those in which the state of a 
society is unbalanced—and to initiate economic 
processes of restoration. In general, he believed 
that states of imbalance could be remedied by the 
regulatory activities of political agencies and of 
economic organizations. 

In Gini’s view, an important aspect of the devel- 
opment of society is the transition from forced to 
free labor and then to spontaneous labor (1956b).
This last represents the final stage in a psycholog-
ical process that raises labor from a primordial
means of obtaining a living to a free and auton- 
omous activity that enriches the human person- 
ality. 


Gini’s pre-eminent place in the development of 
Italian statistics is based on a dual contribution: 
his work in scientific statistics, which included im- 
portant teaching and editorial activities; and his 
efforts toward the development of official statistics, 
notably, the consolidation in a single institute of 
all the various agencies for the collection of data 
and the extension of the number of items about 
which data are collected. 

TOMMASO SALVEMINI 


[Directly related are the entries SAMPLE SURVEYS; STA- 
TISTICS, DESCRIPTIVE; VARIANCES, STATISTICAL 
STUDY OF. Other relevant material may be found in 
CENSUS; GENETICS; INCOME DISTRIBUTION, article 
on SIZE; INDEX NUMBERS; MIGRATION; POPULATION;
and in the biographies of BENINI; BERNOULLI FAM-
ILY; LEXIS; PARETO.]

GIRSHICK, MEYER A.


Meyer Abraham Girshick (1908-1955), Ameri- 
can statistician, was born in Russia and immi- 
grated to the United States in 1922. After grad- 
uating from Columbia College in 1932, he did 
graduate work in statistics under Harold Hotelling 
at Columbia University from 1934 until 1937. 
(Several years later he received a doctorate from 
Columbia.) In 1937-1939 he was a statistician 
with the Bureau of Home Economics, U.S. Depart- 
ment of Agriculture, where he participated in a 
pioneer study of body measurements of 147,000 
American children that helped manufacturers to 
develop improved sizing of garments. At the same 
time, he gave evening courses in statistics at the 
U.S. Department of Agriculture Graduate School, 
courses that had a profound influence on the dis- 
semination of sound statistical methods in govern- 
ment work and encouraged many new research 
workers. In 1939-1944 and again in 1945-1946 
he was principal statistician for the Bureau of 
Agricultural Economics. For one year during World 
War II he participated in the work of the Statis-
tical Research Group at Columbia University, a
panel composed of statisticians and designed to 
develop statistical methods appropriate to wartime 
problems. Working with others on the panel, par-
ticularly Abraham Wald, had a decisive influence
on Girshick’s subsequent career. After a brief stay
at the Bureau of the Census, he became a research 
statistician and mathematician at the RAND Cor- 
poration in Santa Monica, California (a research 
organization primarily devoted to work for the 
U.S. Air Force). From 1948 until his untimely 
death in 1955 he was professor of statistics at 
Stanford University. He was elected president of 
the Institute of Mathematical Statistics (one of 
the principal learned societies for theoretical sta-
tistics) in 1951.

His early papers, written between 1936 and 
1944, were primarily concerned with problems of 
multivariate statistical analysis. His major achieve- 
ment was to find the distribution of the roots and 
characteristic vectors associated with certain de- 
terminantal equations that are used in testing the 
null hypothesis that two sets of variates are inde- 
pendent (1939). [See MULTIVARIATE ANALYSIS.] 

Subsequently he also found the distribution of 
these roots when the null hypothesis is not true. 
From this the power function of certain tests can 
in principle be obtained (1941). 

His interest in multivariate analysis and his 
substantive work at the Bureau of Agricultural Eco- 
nomics combined to turn his attention to newly 
developing methods of estimation of simultaneous 
equations in economics. These methods were origi- 
nated by Trygve Haavelmo and continued by 
Tjalling Koopmans, T. W. Anderson, and Herman 
Rubin. In collaboration with Haavelmo, Girshick 
conducted one of the first major empirical studies 
on an econometric model for the agricultural sector 
of the United States economy (1947), and through 
unpublished work he contributed importantly to 
the development of the limited-information ap- 
proach to simultaneous equation estimation. [See 
SIMULTANEOUS EQUATION ESTIMATION.] 

During the period that Girshick was associated 
with the Statistical Research Group, Wald was 
originating two major concepts of statistics: se- 
quential analysis and statistical decision theory 
[see DECISION THEORY; SEQUENTIAL ANALYSIS; 
and the biography of WALD]. Girshick’s subsequent
work was almost exclusively confined to these two 
areas (see Blackwell & Girshick 1954). 

His earliest work in sequential analysis con- 
cerned the testing of composite hypotheses in 
which the mean of one process is less than, or 
equal to, the mean of another, the alternative being 
that it is greater. He showed that the power func- 
tions of such tests were constant on certain curves 
in the parameter space, and he was able to use this 
information to derive approximate sequential tests 
for the parameters of the exponential family of
distributions (1946a). He also found exact relations 
for tests when the variables take on only a finite 
set of values. As a by-product of his interest in 
sequential analysis he found, in collaboration with 
Frederick Mosteller and L. J. Savage, a method of 
getting unbiased estimates of a parameter from 
data that have been generated by a sequential, or 
similar, sampling scheme (Girshick et al. 1946b).
He and David Blackwell found a lower bound for 
the variance of such estimates (Blackwell & Gir- 
shick 1947). 

His work in decision theory was developed in a 
series of papers, most of them coauthored with 
Blackwell, Rubin, Savage, or Arrow. The major 
results of these studies were systematically pre- 
sented in Theory of Games and Statistical Deci- 
sions (1954), written jointly with Blackwell. This 
book today represents a major study of statistical 
method using the concepts of the theory of games 
from the decision-theory point of view. The theory 
of games, which is closely allied to decision theory, 
is also given prominence in the book. [See GAME
THEORY.] Among its most noteworthy accomplish- 
ments are a study of the interrelations of various 
criteria for complete classes of solutions and re- 
lated concepts and a systematic treatment of 
Bayesian and related procedures in statistical con- 
texts. Other special features include rigorous anal- 
ysis of the concepts of sufficiency and of invari- 
ance, a clear exposition and characterization of 
sequential probability-ratio tests and their optimal 
properties, a study of Bayesian estimation pro- 
cedures with special loss functions, particularly the 
quadratic and the absolute value, and the theory of 
comparison of experiments. 

Girshick’s influence on the development of sta- 
tistical theory occurred as much through his direct 
personal relations, his enthusiasm and intelligent 
guidance, as through his published work, important 
as the latter is. 

KENNETH J. ARROW

GOBINEAU, JOSEPH ARTHUR DE


Joseph Arthur de Gobineau (1816-1882) is best 
known as the author of the Essai sur l'inégalité des 
races humaines (1853-1855), which made race 
the principal explanatory factor in history and had 
an important influence on the development of the 
concept of race in the nineteenth century. Although 
during his lifetime Gobineau received little recog- 
nition in his native France, after his death his 
racial theories aroused considerable interest, espe- 
cially in Germany. 

The mystique of elitism and race in Gobineau’s 
thought had its roots in certain circumstances of 
his life. His family was noble, but the origins of its 
nobility were obscure. His father, Louis de Gobi- 
neau, was a captain in the royal guard, and his 
mother, Anne-Louise Magdeleine de Gercy, was 
the daughter (if we believe Gobineau) of an illegit-
imate son of Louis XV. Gobineau’s adolescence was
tormented by the marital conflict between his par- 
ents, and he began very early to question whether 
“all kinds of blood and all origins are alike.” 

Gobineau went to Paris in 1835 and became a 
member of its literary, artistic, and scientific cir-
cles. He attended the salons of Madame de Serre,
of Rémusat, and of Tocqueville. Initially he seemed 
to lean toward a sort of liberalism in politics, but 
the revolution of 1848 instilled in him a permanent 
abhorrence of democracy. When Tocqueville served 
as minister of foreign affairs of the republic in 
1849, Gobineau became his private secretary, and 
upon Tocqueville’s fall, Gobineau entered the dip- 
lomatic service. The books he wrote about the coun-
tries in which he lived—Germany, Persia, Greece,
Brazil, and Sweden—were all pervaded by his the- 
ories of race. 

Aristocratic by temperament himself, Gobineau 
sought passionately all his life to justify elitism 
both scientifically and philosophically. From the 
outset, he defined the elite in ethnic terms, identi- 
fying it with a superior race, a “race of masters.” 
He asserted that the white race is superior to the 
yellow and black races by virtue of its intelligence, 
its capacity for reflection, its love of order and lib- 
erty, its sense of honor, and its pre-emption of all 
civilized values, and he claimed superiority among 
the whites for the pure Aryan. Unhappily, however, 
as he saw it, this ethnic hierarchy has no perma- 
nence, for races are constantly mixing; indeed, 
there are no more pure races. This mixing is dis-
astrous, for it lowers the superior elements to the
level of the less gifted and leads eventually to de- 
generation and to the extinction of civilization. On 
the political level, the mixing of races finds its ex- 
pression in democracy, which Gobineau considered 
the worst form of state. In a draft foreword written 
in 1877 for a second edition of the Essai, he ad- 
mitted that his race theory was “a natural conse- 
quence of [his] horror and disgust at democracy.” 
The Essai reveals Gobineau’s profound pessimism, 
ending as it does with an apocalyptic vision of the 
world in the final stage of decay—an era of uni- 
formity, mediocrity, and passivity on the part of 
individuals. 

Gobineau’s pessimism about the fate of civiliza- 
tion came to be reinforced by the justified convic- 
tion that he was not being appreciated. His travels 
to Greece, Persia, and Brazil only served to confirm 
his idea that the mixture of races causes degenera- 
tion. Full of hatred and contempt for the “mixed 
breeds,” he returned from Brazil to France in 1870 
to experience the Franco-Prussian War, defeat, 
the Commune, and the installation of the republic.
These events convinced him that the downfall of 
his country was inevitable, led as it was by the 
“Gallo-Roman rabble,” the “bourgeoisie.” 

His last ministry was in Stockholm, from 1872 
to 1876, and he did much writing there. He fin- 
ished his novel Les pléiades (1874), in which he 
presented the idea that, as a consequence of racial 
degeneration, an elite must be sought among indi- 
viduals—individuals who stand out by virtue of 
their love of liberty, their sense of honor, and their 
energy—and he went back to work on his poem 
Amadis (1887), into which he poured his feudal 
ideas, his hatreds, and his apocalyptic vision of a 
degenerate world. There too he worked on Ottar 
Jarl (1879), the story of a Norwegian pirate, the
imaginary founder of his own family of which he
claimed to be the last offshoot; the work is a rather 
pathetic attempt to prove that he was an elite being. 
In Stockholm, too, he embraced the pagan spirit, 
leaving behind not only the Catholicism of his 
family but also many other forms of religious and 
philosophical thought that he had accepted succes- 
sively since childhood. 

His growing hostility toward a society that failed 
to appreciate him led to his being retired prema- 
turely in 1876. He lived out his last years in Italy, 
full of bitterness and pessimism, separated from 
his wife, and in financial stringency. 

R. THENEN 


[For the historical context of Gobineau’s work, see 
RACE.]

GÖKALP, ZIYA


Ziya Gökalp (1876-1924) was primarily respon- 
sible for introducing the study of sociology into 
Turkey, and he drew from sociology the intellectual 
basis for his ardent Turkish nationalism. 

Born in Diyarbakir, the son of a civil servant 
who edited the official local newspaper, Mehmed
Ziya (later Gökalp) attended secular schools there 
and also learned traditional Islamic lore from his 
uncle, a Muslim lawyer. At 18 he attempted sui- 
cide. By the following year, however, he was able to 
go to Istanbul and enroll at the Veterinary College. 

He had already been influenced by the ideas of 
the Young Turks, and in 1895 he became a mem- 
ber of the secret society of Union and Progress in 
Istanbul. In 1898 he was arrested; after a year’s 
imprisonment he was banished to his native town, 
where he devoted all his time to study. In those 
years the Young Turks who were in exile in Paris 
were strongly influenced by French sociology. One 
of them, Prince Sabaheddin, a follower of Le Play, 
went so far as to declare that only through socio- 
logical studies could the Ottomans introduce social
change and thus find a way to achieve harmony 
among the various elements in the empire, a view 
later supported by Gökalp (in the first issue of his 
newspaper, Peyman, August 28, 1909). 

In 1908, after the Young Turk revolution, Gökalp 
became the representative of the Union and Prog- 
ress party in Diyarbakir. The next year he was 
elected a member of the central council of the 
party at Salonica and given the task of expounding 
its doctrine and attracting young people to its 
ranks. In 1910 he received an appointment to 
teach sociology in Salonica, the first such appoint- 
ment in Turkey, and five years later he became the 
first professor of sociology at the University of Is- 
tanbul. He taught in the faculty of letters until 
1919, making it a center for studies of Turkey as 
a nation. Exiled to Malta after World War I, he
returned to Diyarbakir in 1921 as a wholehearted 
supporter of Atatürk and edited the Küçük mecmua 
(“Little Review”), in which he wrote a series of 
sociological essays designed to instruct the national 
leaders. In 1922 he was appointed director of Cul- 
tural Publications in Ankara, a department of the 
Ministry of Public Education, and there published 
his famous Türkçülüğün esasları (1923; “Founda-
tions of Turkism”).

Gökalp believed that the political revolution of 
the Young Turks needed to be completed by a social 
revolution that would create a “new life” in such 
areas as economics, the family, fine arts, morality, 
and law. A new Turkish civilization could be cre- 
ated only by gaining knowledge of Turkey's genuine 
national values. As late as 1911 he had believed 
that values are nothing but idées-forces based on 
philosophical considerations, but after 1912 he ac- 
cepted the Durkheimian interpretation of values as 
collective representations. (He considered Durk- 
heim to be the most penetrating sociologist and the 
founder of scientific sociology.) According to Gö- 
kalp, collective representations are realizations in 
the “collective consciousness,” which, when they 
become fully articulated, are called ideals. “The 
only source of values is society itself and the expe- 
rience of collective sentiments by individuals con- 
stitutes collective conscience” ([1911-1923] 1959, 
pp. 62-64). 

After her defeat in the Balkan War, a critical 
period began for Turkey. Discussions of reforms 
were accompanied by conflicts between the Islam- 
ists, Westernists, and Turkists. Gökalp, who had 
come to Istanbul in 1912, felt that these conflicts 
had to be reconciled in a broader outlook. He argued 
that humanity was composed of culture groups, 
each with its own value system, and of civilization 
groups, with rules and techniques capable of inter-
cultural diffusion and universal acceptance ([1911-
1923] 1959, pp. 97-101). It was sociologically 
valid that Turks belonged at the same time to the 
Turkish nation, to the Muslim religious commu- 
nity, and to European civilization (Gökalp [1911— 
1923] 1959, pp. 71-76; Heyd 1950, pp. 149-151). 
Increasingly Gökalp stressed nationalism as the 
most powerful ideal of the modern age and nations 
as the most highly developed species on the scale 
of culture groups. In the nation he thought it was 
possible to integrate Turkish culture, Islam, and 
Western techniques. He later came to identify col- 
lective representations with national mores and 
asserted that “the . . . [discipline] which studies 
how the culture of a nation is distinguished from 
the civilization to which it belongs is called cultural 
sociology” ([1911-1923] 1959, pp. 172-173).

Following his belief that the task of the sociolo- 
gist is to discover the elements of national culture, 
he embarked on a series of studies on the evolution 
of the Turkish family and on the pre-Islamic Turk- 
ish religion and state. His idea of a modernized 
Islam was predicated on the theory that the part 
of the religious law of Islam that is based on con- 
sensus is of social rather than divine origin and 
may therefore change in accordance with secular 
change ([1911—1923] 1959, pp. 193-196). He was 
convinced that a national state must be a secular 
one, and he strongly advocated a national system 
of education and economy. His programs for secu- 
larizing both education and the judiciary and for 
introducing equal rights for women were partly put 
into effect in 1917-1918. 

Opinion on Gökalp is divided. He himself thought 
that what was original about his work was his 
testing of Durkheim’s sociological method by apply-
ing it to Turkish civilization. His supporters agree
that his conceptualizations of the nature of culture 
and nation are original and that his work repre- 
sents scientific sociology in the Durkheim tradi- 
tion; his critics stress that he had a dogmatic and 
deductive mind with strong collectivist ideas. Above 
all, he was an impassioned nationalist, and there 
is no doubt that his teachings provided an intel- 
lectual foundation for the modernization of Turkey. 


HALIL İNALCIK 


[For the historical context of Gökalp’s work, see ISLAM; 
NATIONALISM; PAN MOVEMENTS; and the biographies 
of DURKHEIM; LE PLAY.]

GOLDENWEISER, ALEXANDER A.


Alexander Alexandrovich Goldenweiser (1880— 
1940), American anthropologist, was born in Kiev, 
Russia, the son of a lawyer with broad intellectual 
interests. He was educated in Kiev and at Harvard 
and Columbia, where he earned his Ph.D. in 1910.
A cosmopolitan figure, he achieved few of the re- 
wards his ability might have earned him. He never 
held a full-time, permanent academic position, but 
he taught at many schools, including Columbia, the 
New School for Social Research, Portland Exten- 
sion Center, Reed College, and the University of 
Wisconsin. His appointments were in anthropology, 
psychology, and sociology, and once he was even 
professor of thought and culture. He was an excit- 
ing teacher; although few of his students went on 
to become anthropologists, few forgot his lectures. 
He was also one of the first supporters of, and a 
contributor to, the Encyclopaedia of the Social 
Sciences. 

An early student of Franz Boas, Goldenweiser 
was one of the most gifted American anthropol- 
ogists of his generation. Yet his works are now 
often neglected. One reason for this may be that 
Boas and his students devoted so much of their 
time to critical attacks on the then prevalent theo- 
retical positions: early in the twentieth century a 
simplistic, unilinear evolutionism was dominant;
geographic determinism and, to a lesser extent, 
even racialism and other naively biological inter-
pretations of behavior were also respectable posi-
tions. Armchair theorists were writing speculative
works, many dealing with the “origins” of institu- 
tions. All of this was attacked by Boas and his 
group. Furthermore, although the group was pri- 
marily historical in outlook, it also opposed the 
extreme position of some historical anthropologists 
who were active particularly in Germany, Austria, 
and England and who argued against evolutionism 
on the grounds that cultural change results from 
migration or diffusion from one or a few original 
centers. 

Theoretical polemics aside, Goldenweiser’s writ- 
ings might be more widely read had he published 
the kinds of field reports produced in such quantity 
by Alfred L. Kroeber, Robert H. Lowie, and others. 
Although he was popular with his Iroquois inform- 
ants, he evidently did not enjoy field research and 
engaged in this activity for only a relatively few 
months. He preferred to concern himself with an- 
thropological theory and with the broader problems 
of “social theory.” He once told a friend that he 
would rather read bad theory than none at all. 
Moreover, he knew a number of languages well 
and did not limit his reading to anthropology. 

Goldenweiser made theoretical contributions in 
more than one area. His article “The Principle of 
Limited Possibilities in the Development of Cul- 
ture” (1913) is concerned with the issue of diffu- 
sion versus independent invention; it suggests that 
objects and acts can take only a limited number 
of forms and that this can lead to convergences 
among the traits of disparate cultures. Other early 
writings centered on totemism—for example, his 
1910 article “Totemism: An Analytical Study.” 
Ironically, his discussion of totemism contributed 
to a decreased interest in it among his colleagues, 
since it led them to conclude that there is no single, 
definable category of totemic practices but only di- 
verse, if partially similar, phenomena that may be 
grouped under this term. 

In his writings Goldenweiser made extensive use 
of the concept of culture. On some occasions he 
took culture for granted, but on others he advanced 
cultural, as against alternative, interpretations of 
behavior, as well as exploring particular cultural 
concepts, such as “patterning” (1936). This con- 
cept served him in his consistent attempts to see 
mankind in the round. 

He may have overestimated the extent to which 
his own analyses and those of his colleagues could 
properly be characterized as psychological. Despite 
the fact that he had read a great deal of Wundt, 
some Freud, and the works of certain social psy-
chologists, it is doubtful whether he made any at-
tempt to keep abreast of developments in the main
stream of psychology. He was particularly inter- 
ested in the relationship between individuals and 
their culture, but he was not really one of the archi- 
tects of the “culture and personality” movement. 
He was curious about the conceptual worlds of 
nonliterate peoples—what was then frequently 
called the primitive mind—and he did not think 
these conceptual worlds differed fundamentally 
from that of modern men. He was also curious 
about philosophy, the arts, and culture history, in- 
cluding various aspects of the history of European 
civilization. 

Goldenweiser was not at home in the world as 
he knew it. Sometimes he was irresponsible in his 
relations with publishers or with academic sys- 
tems. He appears to have lacked a substantial theo- 
retical basis for studying the regularities of human 
life; the writings of his contemporaries did not 
readily offer such a basis, and even had it been 
there, his personal anarchistic predilections might 
have militated against his finding it. While he made 
few lasting contributions to theory, he did think 
well and write well on the questions of concern to 
him. 

DAVID H. FRENCH 


[For the historical context of Goldenweiser’s work, see 
the biographies of BOAS and KROEBER.] 
GOLDSTEIN, KURT 


Kurt Goldstein, American neurologist, psychia- 
trist, and psychologist, was born in 1878 in Katto- 
witz, Germany, and died in 1965 in New York City. 

Goldstein entered medicine with the intention 
of devoting his life to the treatment of patients 
with mental diseases. In 1903 he received his med- 
ical degree at Breslau, Germany. He studied psy- 
chiatry with Karl Wernicke and neuroanatomy 
with Ludwig Edinger. His first ten papers, all pub- 
lished between 1903 and 1904, are studies of the 
anatomy of spinal pathways, of the embryology 
of the brain, and of comparative neuroanatomy. 
Clinical papers soon followed: one, in 1906, on 
memory problems and deficits created by neuro- 
logical and psychiatric disturbances, foreshadowed 
important distinctions that underlie present-day 
differential diagnostic testing. In 1906 he joined 
the staff of the psychiatric clinic at the University 
of Königsberg, where he found his high expecta- 
tions thwarted: mental patients were relegated to 
custodial care without any attempt at treatment. 
Nor did the prevailing Kraepelinian nosology indi- 
cate any conceivable direction for therapy. Gold- 
stein used the opportunity for careful examination 
and observation of patients and published papers 
on neurological and psychiatric topics—motor dis- 
turbances, sensory disturbances, the nature of hal- 
lucinations, alcoholism, manic-depressive states, 
schizophrenia—and comparisons of clinical symp- 
toms with post-mortem findings. 

In 1919 he was appointed professor of neurology 
and director of the Neurological Institute at the 
University of Frankfurt. There he established the 
Institut zur Erforschung der Folgeerscheinungen 
von Hirnverletzungen, where, under his aegis and 
in collaboration with the psychologist Adhémar 
Gelb and many students, a large number of brain- 
injured soldiers were studied intensively and for 
long periods of time. These studies led to entirely 
novel conceptions of such problems as aphasia, 
agnosia, and tonus disturbances and generally to 
systematic descriptions of behavioral changes 
wrought by brain injury. In 1930 Goldstein became 
professor of neurology at the University of Berlin 
and director of the Neurological Hospital Moabit, 
a post he held until Hitler’s advent in 1933. After 
a year’s writing in Amsterdam he came to New 
York City in 1935; until 1940 he was clinical pro- 
fessor of neurology at Columbia University and 
head physician at Montefiore Hospital. In 1939 
Harvard invited him to deliver the William James 
lectures, which were published under the title 
Human Nature in the Light of Psychopathology 
(1940). From 1940 to 1945 he continued research 
and teaching as professor of neurology at Tufts 
Medical School, Boston. He then returned to New 
York, where he practiced privately and taught at 
the College of the City of New York, Columbia 
University, and the New School for Social Re- 
search. For many years he was also a guest pro- 
fessor at Brandeis University. 

Goldstein’s experimental papers and monographs 
provoked lively discussions among neurologists and 
psychologists; yet his greatest impact came with 
the publication of The Organism: A Holistic Ap- 
proach to Biology Derived From Pathological Data 
in Man (1934), written during his year’s exile in 
Amsterdam. Goldstein viewed behavior as the uni- 
fied activity of the whole organism, whose basic 
“motive” is optimum self-actualization in a given 
environment. Reflexes, clinical symptoms, or func- 
tions subjected to laboratory investigation are part 
processes, and failure to consider that their isola- 
tion is artificial results in erroneous interpretation. 
Study of brain-injured patients reveals the general 
change underlying specific symptoms and manifes- 
tations, namely, an impairment of the patients’ “ab- 
stract attitude” and a preponderance of “concrete” 
modes of behavior, affecting all performance fields. 
The “abstract attitude” is defined, basically, as 
man’s capacity to reason deliberately, to plan and 
account for his actions, to view particular objects 
or events as instances of a class. Upon loss of these 
capacities the individual is at the mercy of the 
immediate, concrete sensory or mnemonic stimu- 
lus situation and is unable to transcend it. The 
aphasic patient cannot name an object, that is, 
supply the class symbol, yet he may spontaneously 
produce the word within a sentence or an action. 
Thus, aphasia is not a loss of words or word 
images. Rather, it is a dedifferentiation of lan- 
guage, a loss of concepts and their symbols; in 
other words, it is the manifestation in language of 
impaired abstraction. This view has important im- 
plications for the problem of cerebral localization. 
Functions cannot be localized, although defects 
can be, if we keep in mind the problem of isolated 
part processes and integrated organismic activity. 

Impairment of abstraction is not limited to brain- 
injured patients. It manifests itself also in psy- 
chotic processes. Whatever its cause, impaired 
abstraction restricts the patient’s opportunities for 
self-realization. Consequently, his conduct becomes 
rigid; he is vulnerable to anxiety, to the “cata- 
strophic reaction,” that is, to the threat of being 
unable to actualize himself, and to the danger of 
“losing existence.” 


Goldstein was only peripherally associated with 
the gestalt psychologists Wertheimer, Köhler, and 
Koffka; however, he was an editor of Psychologische 
Forschung, the house organ of gestalt psychology. 
While he never established a school, his views on 
aphasia profoundly influenced the English neurolo- 
gist Sir Henry Head, as well as philosophers con- 
cerned with problems of normal language, par- 
ticularly Goldstein’s cousin Ernst Cassirer and the 
followers of Edmund Husserl. Generally critical of 
psychoanalytic theory, Goldstein acknowledged the 
importance of some of Freud’s views, especially 
his broad emphasis on motivation and his discus- 
sion of the nature of symptoms. Existential psy- 
chology has claimed Goldstein somewhat unsuc- 
cessfully. Although phenomenological observation 
and analysis were fundamental to his work, he did 
not share the metaphysics of existential psychology 
and psychiatry. 

MARIANNE L. SIMMEL 


[For the historical context of Goldstein’s work, see 
GESTALT THEORY and the biographies of KOFFKA; 
KÖHLER; WERTHEIMER. For discussion of subsequent 
development of his ideas, see LANGUAGE, article 
on SPEECH PATHOLOGY; MENTAL DISORDERS, article 
on ORGANIC ASPECTS; NERVOUS SYSTEM; SYSTEMS 
ANALYSIS, article on PSYCHOLOGICAL SYSTEMS; and 
the biographies of CASSIRER and HUSSERL.] 
GOODNESS OF FIT 


A goodness of fit procedure is a statistical test 
of a hypothesis that the sampled population is dis- 
tributed in a specific way, for example, normally 
with mean 100 and standard deviation 15. Corre- 
sponding confidence procedures for a population 
distribution also fall under this topic. Related tests 
are for broader hypotheses, for example, that the 
sampled population is normal (without further 
specification). Others test hypotheses that two or 
more population distributions are the same. 

Populations arise because of variability, of which 
various sources (sometimes acting together) can 
be distinguished. First, there is inherent varia- 
bility among experimental units; for example, the 
heights, IQs, or ages of the students in a class 
each vary among themselves. Then there is meas- 
urement error, a more abstract or conceptual no- 
tion. The age of a student may have negligible 
measurement error, but his IQ does not; it depends 
on a host of accidental factors: how the student 
slept, the particular questions chosen for the test, 
and so on. There are also other conceptual popula- 
tions, not properly thought of in terms of measure- 
ment error—the population of subject responses, 
for example, in the learning experiment below. 

The distribution of a numerical population trait 
is often portrayed by a histogram, a density func- 
tion, or some other device that shows the propor- 
tion of cases for which a particular value of the 
numerical trait is achieved (or the proportion 
within a small interval around a particular value). 
The shape of the histogram or density function is 
important; it may or may not be symmetrical. If it 
is not, it is said to be skew. If it is symmetrical, 
it may have a special kind of shape called normal. 
For example, populations of scores on intelligence 
tests are often assumed normally distributed by 
psychologists. Indeed, the construction of the test 
may aim at normality, at least for some group of 
individuals. Again, lifetimes of machines may be 
assumed to have negative exponential distributions, 
meaning that expected remaining life does not 
vary with age. [See DISTRIBUTIONS, STATISTICAL, 
article on SPECIAL CONTINUOUS DISTRIBUTIONS; 
PROBABILITY; STATISTICS, DESCRIPTIVE.] 

It is technically often convenient, especially in 
connection with goodness of fit tests, to deal with 
the cumulative distribution function (c.d.f.) rather 
than with the density function. The c.d.f. evaluated 
at x is the proportion of cases with numerical 
values less than or equal to x; thus, if f(x) is a 
density function, the corresponding c.d.f. is 


$$F(x) = \int_{-\infty}^{x} f(t)\,dt.$$


For explicitness, a subscript will be added to F, 
indicating the population, distribution, or random 
variable to which it applies. It is a matter of con- 
vention that cumulation is from the left and that 
it is based on “less than or equal to” rather than 
just “less than.” 

The sample c.d.f. is the steplike function whose 
value at x is the proportion of observations less 
than or equal to x. Many goodness of fit procedures 
are based on geometrically suggested measures of 
discrepancy between sample and hypothetical 
population c.d.f.’s. Some informal procedures use 
“probability” graph paper, especially normal paper 
(on which a normal c.d.f. becomes a straight line). 
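
The definition of the sample c.d.f. can be made concrete with a short sketch. The function name `sample_cdf` and the five data values are assumptions introduced only for illustration; they do not come from the article.

```python
from bisect import bisect_right

def sample_cdf(observations):
    """Return a function F_n with F_n(x) = proportion of observations <= x."""
    ordered = sorted(observations)
    n = len(ordered)

    def F_n(x):
        # bisect_right counts the observations less than or equal to x
        return bisect_right(ordered, x) / n

    return F_n

# Hypothetical sample, invented for illustration only
F_n = sample_cdf([0.3, 1.2, 0.7, 2.5, 0.7])
print(F_n(0.7))   # 0.6 (three of the five observations are <= 0.7)
print(F_n(0.0))   # 0.0 (no observation is <= 0.0)
```

Because cumulation is based on "less than or equal to," tied observations are all counted at the step where they occur.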

For nominal populations (for example, propor- 
tions of people expressing allegiance to different 
religions or to none) there is no concept corre- 
sponding to the c.d.f. The main emphasis of this 
article is on numerical populations. 

Although goodness of fit procedures address 
themselves principally to the shape of population 
c.d.f.’s, the term “goodness of fit” is sometimes 
applied more generally than in this article. In par- 
ticular, some authors write of goodness of fit of 
observed regressions to hypothetical forms, for ex- 
ample, to a straight line. [This topic is dealt with 
in LINEAR HYPOTHESES, article on REGRESSION.] 

Hypotheses—simple, composite, approximate. A 
test of goodness of fit, based on a sample from a 
population, assesses the plausibility that the popu- 
lation distribution has specified form; in brief, it 
tests the hypothesis that F_X has shape F_0. The 
specification may be complete, that is, the popula- 
tion distribution may be specified completely, in 
which case the hypothesis is called simple. Alter- 
natively, the form may be specified only up to 
certain unknown parameters, which often are the 
parameters of location and scale. In this case the 
hypothesis is called composite. Still another type 
of hypothesis is an approximate one, which is com- 
posite in a certain sense. Here one specifies first 
what one would consider a material departure from 
a hypothesized shape (Hodges & Lehmann 1954). 
For example, in the case of a simple approximate 
hypothesis, one might agree that F_X departs ma- 
terially from F_0 if the maximum vertical deviation 
between the actual and hypothesized cumulative 
distribution functions exceeds .07. The approxi- 
mate hypothesis then states that the actual and 
hypothesized distributions do not differ materially 
in this sense. 

Approximate hypotheses specialize to the others, 
so that a complete theory of testing for the former 
would be desirable. This is especially true since, 
as has been pointed out by Karl Pearson (1900) 
and Joseph Berkson (1938), tests of “exact” hy- 
potheses, being as a rule consistent, have prob- 
lematical logical status: unless the exact hypothe- 
sis is exactly correct and all of the sampling 
assumptions are exactly met, rejection of the hy- 
pothesis is assured (for fixed significance level) 
when sample size is large. Unfortunately, such a 
complete theory does not now exist, but the strong 
early interest in “exact” hypotheses was not mis- 
spent: The testing and “acceptance” of “exact” 
hypotheses concerning F_X seems to have much the 
same status as the provisional adoption of physical 
or other “laws.” If the latter has helped the ad- 
vancement of science, so has no doubt the former; 
this is true notwithstanding that old hypotheses or 
theories will almost surely be discarded as addi- 
tional data become available. This point has been 
made by Cochran (1952) and Chapman (1958). 
Cochran also suggests that the tests of “exact” hy- 
potheses are “invertible” into confidence sets, in the 
usual manner, thus providing statistical procedures 
somewhat similar in intent to tests of approximate 
hypotheses [see ESTIMATION, article on CONFI- 
DENCE INTERVALS AND REGIONS]. 

Conducting a test of goodness of fit. Many tests 
of goodness of fit have been developed; as with 
statistical tests generally, a test of goodness of fit 
is conveniently conducted by computing from the 
sample a statistic and its sample significance level 
[see HYPOTHESIS TESTING]. In the case of a test of 
goodness of fit, the statistic will measure the dis- 
crepancy between what the sample in fact is and 
what a sample from a population of hypothesized 
form ought to be. The sample significance level of 
an observed measure of discrepancy, d_0, is, at least 
for all the standard goodness of fit procedures, the 
probability, Pr{d > d_0}, that d exceeds d_0 under 
random sampling from a population of hypothe- 
sized form. In other words, it is the proportion of 
like discrepancy measures, d, exceeding d_0, com- 
puted on the basis of many successive hypothetical 


random samples of the same size from a popula- 
tion of hypothesized form. For many tests of good- 
ness of fit, there exist tables (for extensive bibliog- 
raphy see Greenwood & Hartley 1962) that give 
those values of d_0 corresponding to given signifi- 
cance level and sample size (n). Many of these 
standard tests are nonparametric, which means 
that Pr{d > d_0} is the same for a very large class 
of hypotheses F_0, so that only one such tabulation 
is required [see NONPARAMETRIC STATISTICS]. 
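
The description of the sample significance level as a proportion over many hypothetical samples can be imitated directly by simulation. The sketch below is only an illustration under stated assumptions: the discrepancy is taken to be the maximum absolute deviation between sample and hypothesized c.d.f.'s, the hypothesized population is uniform on the interval from zero to one, and the function names, the number of trials, and the data are all invented here.

```python
import random

def max_abs_deviation(sample, cdf):
    """Maximum absolute vertical deviation between sample c.d.f. and cdf."""
    ordered = sorted(sample)
    n = len(ordered)
    worst = 0.0
    for i, x in enumerate(ordered, start=1):
        # the sample c.d.f. jumps from (i-1)/n to i/n at the i-th ordered value
        worst = max(worst, i / n - cdf(x), cdf(x) - (i - 1) / n)
    return worst

def simulated_significance(sample, cdf, draw, trials=2000, seed=0):
    """Proportion of simulated discrepancies d that exceed the observed d_0."""
    rng = random.Random(seed)
    d_0 = max_abs_deviation(sample, cdf)
    n = len(sample)
    exceed = 0
    for _ in range(trials):
        simulated = [draw(rng) for _ in range(n)]
        if max_abs_deviation(simulated, cdf) > d_0:
            exceed += 1
    return exceed / trials

def uniform_cdf(x):
    """Hypothesized c.d.f.: uniform on the interval from 0 to 1."""
    return min(1.0, max(0.0, x))

def uniform_draw(rng):
    """One observation from the hypothesized uniform population."""
    return rng.random()

# Hypothetical observations crowded near 1; the estimate should be small
observed = [0.81, 0.85, 0.88, 0.90, 0.92, 0.93, 0.95, 0.97, 0.98, 0.99]
print(simulated_significance(observed, uniform_cdf, uniform_draw))
```

A simulation of this kind replaces a published table only approximately; its accuracy depends on the number of trials.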

If, as is usual, the relevant alternative popula- 
tion distributions (more generally, alternative prob- 
abilistic models for the generation of the sample 
at hand) tend to encourage large values of d_0, the 
hypothesized population distribution will be judged 
implausible if the sample significance level is small 
(conventionally .05 or less). If the sample signifi- 
cance level is not small, it means that the statistic 
has a value unsurprising under the null hypothesis, 
so that the test gives no reason to reject the null 
hypothesis. If, however, the sample significance 
level is very large, say .95 or more, one may con- 
strue this as a warning of possible trouble, say, that 
an overzealous proponent of the hypothesis has 
slanted the data or that the sampling was not ran- 
dom. Note here an awkward usage prevalent in 
statistics generally: an observed measure of dis- 
crepancy d_0 with low probability Pr{d > d_0} usually 
is described as highly significant. 

Choosing a test of goodness of fit. Choosing a 
test of goodness of fit amounts to deciding in what 
sense the discrepancy between the hypothesized 
population distribution and the sample is to be 
measured: The sample c.d.f. may be compared 
directly with the hypothesized population c.d.f., as 
is done in the case of tests of the Kolmogorov- 
Smirnov type. For example, the original Kolmo- 
gorov-Smirnov test itself, as described below, 
summarizes the discrepancy by the maximum abso- 
lute deviation between the hypothesized population 
c.d.f., F_0, and the sample c.d.f. Alternatively, one 
may compare uncumulated frequencies, as for the 
χ² test. Again, a standard shape parameter, such 
as skewness, may be computed for the sample and 
for the hypothesized population and the two 
compared. 

Any reasonable measure of discrepancy will of 
course tend to be small if the population yielding 
the sample conforms to the null hypothesis. A good 
measure of discrepancy will, in addition, tend to 
be large under the likely alternative forms of the 
population distribution, a property designated tech- 
nically by the term power. For example, the sample 
skewness coefficient might have good power if the 
hypothesized population distribution were normal 


(zero population skewness coefficient) and the rel- 
evant alternative distributional forms were appre- 
ciably skew. 

Two general considerations. Two general con- 
siderations should be kept in mind. First it is im- 
portant that the particular goodness of fit test used 
be selected without consideration of the sample at 
hand, at least if the calculated significance level 
is to be meaningful. This is because a measure of 
discrepancy chosen in the light of an observed 
sample anomaly will tend to be inordinately large. 
Receiving license plate 437918 hardly warrants the 
inference that, this year, the first and second digits 
add to the third, and the fifth and sixth to the 
fourth. It may of course be true, in special in- 
stances, that some adjustment of the test proce- 
dure in the light of the data does not affect the 
significance computations appreciably—as, for ex- 
ample, when choosing category intervals, based on 
the sample mean and variance, for the χ² test 
(Watson 1957). 

Second, a goodness of fit test, like any other 
statistical test, leads to an inference from a sample 
to the population sampled. Indeed, the usual hy- 
pothesis under test is that the sample is in fact a 
random sample from an infinite population of 
hypothesized form, and the tabulated probabilities, 
Pr{d > d_0}, almost always presuppose this. (In 
principle, one could obtain goodness of fit tests for 
more complex kinds of probability samples than 
random ones, but little seems to be known about 
such possibilities.) It is therefore essential that the 
sample to which a standard test is applied can be 
thought of as a random sample. If it cannot, then 
one must be prepared either to do one’s own non- 
standard significance probability computations or 
to defend the adequacy of the approximation in- 
volved in using the standard tabulations. Consider, 
for example, starting with a random sample in- 
volving considerable repetition, say the sample of 
response words obtained from a panel of subjects 
taking a psychological word association test or the 
sample of nationalities obtained from a survey of 
the United Nations. Suppose now that one tal- 
lies the number of items in the sample (response 
words, nationalities) appearing exactly once, ex- 
actly twice, etc. There results a new set of data, 
consisting of a certain number of one’s, a certain 
number of two’s, etc. This collection of integers has 
the outward appearance of a random sample, and 
the literature contains instances of the application 
of the standard tests of goodness of fit to such 
observed frequencies. Yet the probability mech- 
anism that generates these integers has no resem- 
blance whatever to random sampling, and the 
standard probability tabulations cannot be assumed 
to apply. Other examples arise when the data are 
generated by time series; for some of these the 
requisite nonstandard probability computations 
have been done (Patankar 1954), while, in other 
cases, special devices have made the standard com- 
putations apply. For example, in the case of the 
learning experiment by Suppes and his associates 
(1964), the sample consists of the time series of 
a subject’s responses to successive stimuli. Certain 
theories of learning predict a particular bimodal 
long-run response population distribution; but the 
goodness of fit test of this hypothesized shape, on 
the basis of a series of subject responses, is ham- 
pered by the statistical dependence of neighboring 
responses. However, theory suggests, and a test of 
randomness confirms, that the subsample consist- 
ing of every fifth response is effectively random, 
enabling a standard χ² test of goodness of fit to be 
carried out on the basis of this subsample. Whether 
four-fifths of the sample is a reasonable price to 
pay for validly carrying out a standard procedure 
is of course a matter of debate. 


Tests of simple hypotheses 


The χ² test. The χ² test was first proposed in 
1900 by Karl Pearson. To apply the test, one first 
divides the possible range of numbers (number 
pairs in the bivariate case) into k regions. For 
example, if only nonnegative numbers are possible, 
one might use the categories 0 to .2, .2 to .5, 
.5 to .7, and .7 and beyond. Next, one computes 
the probabilities, p_i, associated with each of these 
regions (intervals in the example just given) under 
the hypothesized F_0. This is often done by subtract- 
ing values of F_0 from each other: for example, 
when F_0 is the exponential cumulative distribution 
function 1 − e^{−x}, 


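$$\begin{aligned}
p_1 &= F_0(.2) - F_0(0) = F_0(.2) = 1 - e^{-.2} = .18,\\
p_2 &= F_0(.5) - F_0(.2) = e^{-.2} - e^{-.5} = .21,\\
p_3 &= F_0(.7) - F_0(.5) = e^{-.5} - e^{-.7} = .11,\\
p_4 &= F_0(\infty) - F_0(.7) = e^{-.7} = .50.
\end{aligned}$$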


The expected numbers E_i of observations in each 
category are (under the null hypothesis) E_i = np_i, 
where n is the size of the random sample. 
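
The category probabilities for the exponential example, and the corresponding expected numbers, can be checked with a few lines of code. This is a minimal sketch: the function names and the sample size n = 100 are assumptions made for illustration, and F_0 denotes the hypothesized c.d.f. 1 − e^{−x}.

```python
import math

def exponential_cdf(x):
    """Hypothesized c.d.f. F_0(x) = 1 - exp(-x) for x >= 0."""
    return 1.0 - math.exp(-x) if x > 0 else 0.0

def category_probabilities(cdf, boundaries):
    """Probability of each interval between successive boundaries."""
    values = [cdf(b) for b in boundaries]
    return [upper - lower for lower, upper in zip(values, values[1:])]

# Categories 0 to .2, .2 to .5, .5 to .7, and .7 and beyond
boundaries = [0.0, 0.2, 0.5, 0.7, math.inf]
p = category_probabilities(exponential_cdf, boundaries)
print([round(p_i, 2) for p_i in p])   # [0.18, 0.21, 0.11, 0.5]

n = 100                               # hypothetical sample size
expected = [n * p_i for p_i in p]     # E_i = n * p_i
print(expected)
```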

After the sample has been collected, there also 
will be observed numbers, O_i, of sample members 
in each category. The chi-square measure of dis- 
crepancy d_{χ²} is then computed by summing squared 
differences of class frequencies, weighted in such 
a way as to bring to bear standard distribution 
theory, 

$$d_{\chi^2,0} = \sum_{i=1}^{k} \frac{(O_i - E_i)^2}{E_i},$$

where the subscript 0 indicates the specific sample 
value of d_{χ²}. (Often “X²” or “χ²” is used to denote 
this statistic.) 

As is shown, for example, by Cochran (1952), 
the probability distribution of d_{χ²}, when F_X = F_0, 
can be approximated by the chi-square distribution 
with k − 1 degrees of freedom, χ²_{k−1}. This fact, to 
which the test owes its name, was first demon- 
strated by Karl Pearson. The larger the expecta- 
tions E_i, the better is the approximation; this has 
been pointed out, for example, by Mann and Wald 
(1942). Hence, the significance, Pr{d_{χ²} > d_{χ²,0}}, is 
evaluated to a good approximation by consulting a 
tabulation of the χ²_{k−1} distribution. For example, if 
k, as above, equals 4, and d_{χ²,0} had happened to be 
4.6, then Pr{d_{χ²} > d_{χ²,0}} = .20. With a sample sig- 
nificance level of .20, most statisticians would not 
question the plausibility of F_0. However, were 
d_{χ²,0} larger, and the corresponding significance 
equal to .05 or less, the consensus would be 
equal to .05 or less, the consensus would be 
reversed. 
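
In place of a printed table, the chi-square discrepancy and its approximate significance can be computed directly. The sketch below is an illustration, not the procedure of any author cited here: the tail probability is obtained from a standard recurrence for the incomplete gamma function, and the function names and the observed counts are assumptions invented for the example.

```python
import math

def chi_square_discrepancy(observed, expected):
    """Sum over categories of (O_i - E_i)^2 / E_i."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def chi_square_tail(x, df):
    """Pr{chi-square with df degrees of freedom > x}, for integer df >= 1."""
    if x <= 0:
        return 1.0
    y = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-y)              # Q(1, y) = exp(-y)
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))   # Q(1/2, y) = erfc(sqrt(y))
    # recurrence Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1)
    while a < df / 2.0:
        q += math.exp(a * math.log(y) - y - math.lgamma(a + 1.0))
        a += 1.0
    return q

# Category probabilities of the exponential example (k = 4 categories)
p = [1 - math.exp(-0.2),
     math.exp(-0.2) - math.exp(-0.5),
     math.exp(-0.5) - math.exp(-0.7),
     math.exp(-0.7)]
observed = [25, 20, 12, 43]           # hypothetical counts, n = 100
expected = [100 * p_i for p_i in p]
d_0 = chi_square_discrepancy(observed, expected)
print(d_0, chi_square_tail(d_0, df=3))

print(round(chi_square_tail(4.6, 3), 2))   # 0.2, the value quoted in the text
```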

At what point is the distributional approximation 
endangered by small E_i? An early study of this 
problem, performed by Cochran in 1942 (referred 
to in Cochran 1952), shows that a few E_i near 1 
among several large ones do not materially affect 
the approximation. Recent studies, by Kempthorne 
(1966) and by Slakter (1965), show that this is 
true as well when all E_i are near 1. 

These and other studies indicate that, although 
some care must be taken to avoid very small E_i, 
much latitude remains for choosing categories. 
How is this to be done? To begin with, in keeping 
with the spirit of remarks by Birnbaum (1953), 
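if the relevant alternatives F* to F_0 are such that 

$$d_{\chi^2}(F^*, F_0) = \sum_{i=1}^{k} \frac{(E_i^* - E_i)^2}{E_i}$$

is large for a certain choice of k categories, it is 
these categories that should be selected. (Here E_i* 
is taken to denote the expected number of observa- 
tions in category i under F*.) Among 
various sets of k categories, those yielding large 
d_{χ²}(F*, F_0) are preferred. 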

In the absence of detailed knowledge of the alter- 
natives, the usual recommendation, at least in the 
one-dimensional case, is to use intervals of equal 
E_i. There remains the question of how many such 
intervals there should be. The typical statistical cri- 
terion for this is power, that is, the likelihood that 
the value of d_{χ²} will be large enough to warrant 
rejection of the hypothesis F_0 when the population 
is in fact a relevant alternative one. If large power 
is desired for all alternative population c.d.f.’s de- 
parting from F_0 at some x by at least a given fixed 
amount, Mann and Wald (1942) recommend a 
number of categories of the order of 4n^{2/5}. Williams 
(1950) has shown that this figure can easily be 
halved. 
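
Intervals of equal expectation are obtained by placing the category boundaries at equally spaced quantiles of the hypothesized distribution. The sketch below assumes the exponential c.d.f. 1 − e^{−x} as the hypothesized F_0 and treats 4n^{2/5} purely as a rough order of magnitude, ignoring the further constants of the original recommendation; the function names are illustrative.

```python
import math

def suggested_number_of_categories(n):
    """Rough order of magnitude 4 * n**(2/5); other constants are ignored."""
    return max(2, round(4 * n ** 0.4))

def equal_probability_boundaries(inverse_cdf, k):
    """Interior boundaries giving each of k intervals probability 1/k."""
    return [inverse_cdf(j / k) for j in range(1, k)]

def exponential_inverse_cdf(u):
    """Inverse of the hypothesized c.d.f. F_0(x) = 1 - exp(-x)."""
    return -math.log(1.0 - u)

n = 100                                    # hypothetical sample size
k = suggested_number_of_categories(n)
print(k)                                   # 25
print(equal_probability_boundaries(exponential_inverse_cdf, 4))
```

With n = 100 the rough rule gives about 25 categories, so each expectation E_i would be about 4; halving the number of categories roughly doubles each expectation.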

The χ² test is versatile; it is readily adapted to 
problems involving nominal rather than numerical 
populations [see COUNTED DATA]. It can also be 
adapted to bivariate and multivariate problems, as, 
for example, by Keats and Lord (1962), where 
the joint distribution of two types of mental test 
scores is considered. As opposed to many of its 
competitors, the χ² test is not biased, in the sense 
that there are no alternatives F* to F_0 under which 
acceptance of F_0 is more likely than it is under F_0 
itself. It is readily adapted to composite and approx- 
imate testing problems. Also, it seems to be true 
that the χ² test is in the best position among its 
competitors with regard to the practical computa- 
tion of power. As is pointed out by Cochran (1952), 
such computations are performed by means of 
the noncentral chi-square distribution with k —1 
degrees of freedom. 

Modifications of the χ² test. Important modifi- 
cations of the χ² test, intended to increase its power 
against specific families ℱ of distributions alterna- 
tive to F_0, are given by Neyman (1949) and by 
Fix, Hodges, and Lehmann (1954). Here ℱ is as- 
sumed to include F_0 and to allow differentiable 
parametric representation of the category expecta- 
tions E_i. Note that the inclusion of F_0 in ℱ differs 
from the point of view adopted, for example, by 
Mann and Wald (1942). These modifications are 
essentially likelihood ratio tests of F_0 versus ℱ and 
are similar to procedures used to test composite 
and approximate hypotheses. 

Another modification, capable of orientation 
against specific “smooth” alternatives, Neyman’s 
ψ² test, was introduced in 1937. Other important 
modifications are described in detail in Cochran 
(1954). 

Other procedures. When (X_1, ···, X_n) is a 
random sample from a population distributed ac- 
cording to a continuous c.d.f. F_0, then (U_1, ···, 
U_n) = (F_0(X_1), ···, F_0(X_n)) has all the probabilis- 
tic properties of a random sample from a population 
distributed uniformly over the numbers between 
zero and one. (If the population has a density func- 
tion, the c.d.f. is continuous.) No matter what the 
hypothesized F_0, the initial application of this 
probability integral transformation thus reduces all 
probability computations to the case of this uniform 
population distribution and gives a nonparametric 
character to any procedure based on the trans- 
formed sample (U_1, ···, U_n). Most goodness of fit 
tests of simple hypotheses are nonparametric in 
this sense, including the χ² test itself, when cate- 
gories are chosen so as to assign specified values, 
for example, the constant value 1/k, to the cate- 
gory probabilities p_i. 
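
The probability integral transformation is easily illustrated by simulation. In the sketch below a sample is drawn from an exponential population, taken here as the hypothesized F_0, and each observation is replaced by its c.d.f. value; the seed, the sample size, and the function names are assumptions made for the illustration.

```python
import math
import random

def probability_integral_transform(sample, cdf):
    """Map each observation x to u = F_0(x)."""
    return [cdf(x) for x in sample]

def exponential_cdf(x):
    """Hypothesized c.d.f. F_0(x) = 1 - exp(-x) for x >= 0."""
    return 1.0 - math.exp(-x) if x > 0 else 0.0

rng = random.Random(1)
sample = [rng.expovariate(1.0) for _ in range(5000)]   # simulated data
u = probability_integral_transform(sample, exponential_cdf)

# If the hypothesis holds, the u values should look uniform on (0, 1):
# their mean should be near 1/2 and about a quarter should fall below 1/4.
print(sum(u) / len(u))
print(sum(1 for v in u if v < 0.25) / len(u))
```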

Another common test making use of the trans- 
formation U = F_0(X) is the Kolmogorov-Smirnov 
test, first suggested by Kolmogorov (1933) and 
explained in detail by Goodman (1954) and Massey 
(1951). The test bears Smirnov’s name, as well as 
Kolmogorov’s, presumably because Smirnov (as 
Doob and Donsker did later) gave an alternate 
derivation of its asymptotic null distribution, tabu- 
lated this distribution, and also extended the test 
to the two-sample case discussed below (1939a). 
Denote by F_n(x) the sample c.d.f., that is, F_n(x) is 
the proportion of sample values less than or equal 
to x. The test is based on the maximum absolute 
vertical deviation between F_n(x) and F_0(x), 

$$d_K = d_K(F_n, F_0) = \sqrt{n}\,\Bigl(\max_x \bigl|F_n(x) - F_0(x)\bigr|\Bigr),$$

the dependence of d_K on the quantities U_i = F_0(X_i) 
being best brought out by the alternate formula 

$$d_K = \sqrt{n}\,\max\Bigl[\max_i\Bigl(\frac{i}{n} - u_i\Bigr),\ \max_i\Bigl(u_i - \frac{i-1}{n}\Bigr)\Bigr],$$

where u_1 is the smallest U_i, u_2 is the next to small- 
est, etc.; the equivalence of the two formulas is 
made clear by a sketch. As Kolmogorov noted in 
his original paper, the probabilities tabulated for 
d_K are conservative when F_0 is not continuous, in 
the sense that, for discontinuous F_0, actual prob- 
abilities of d_K > d_{K,0} will tend to be less than those 
obtained from the tabulations, leading to occasional 
unwarranted acceptance of F_0. 
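
The order-statistic form of the Kolmogorov-Smirnov discrepancy lends itself to direct computation. The following is a minimal sketch, with the hypothesized c.d.f. (written F_0 in the comments) passed in as a function; the function name and the four data values are invented for illustration.

```python
import math

def kolmogorov_smirnov_discrepancy(sample, cdf):
    """sqrt(n) times the maximum absolute deviation between F_n and F_0."""
    u = sorted(cdf(x) for x in sample)      # u_1 <= u_2 <= ... <= u_n
    n = len(u)
    # largest amount by which the sample c.d.f. lies above F_0
    above = max(i / n - u_i for i, u_i in enumerate(u, start=1))
    # largest amount by which the sample c.d.f. lies below F_0
    below = max(u_i - (i - 1) / n for i, u_i in enumerate(u, start=1))
    return math.sqrt(n) * max(above, below)

def uniform_cdf(x):
    """Hypothesized c.d.f.: uniform on the interval from 0 to 1."""
    return min(1.0, max(0.0, x))

# Hypothetical sample of size 4: the largest deviation is .7 - 2/4 = .2,
# so the discrepancy is about sqrt(4) * .2 = .4
print(kolmogorov_smirnov_discrepancy([0.1, 0.4, 0.7, 0.9], uniform_cdf))
```

The two inner maxima correspond to the two sides of each step of the sample c.d.f., which is why both must be examined.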

Computations (Shapiro & Wilk 1965) suggest 
that this test has low power against alternatives 
with mean and variance equal to those of the 
hypothesized distribution. It has, however, been 
argued, for example, by Birnbaum (1953) and 
Kac, Kiefer, and Wolfowitz (1955), that the test 
yields good minimum power over classes of alter- 
natives F* satisfying d_K(F*, F_0) > δ; these, as the 
reader will note, are precisely the classes of alter- 
natives envisaged by Mann and Wald (1942) in 
optimizing the number of categories used in the 
χ² test. A detrimental feature of the Kolmogorov- 
Smirnov test is its bias, pointed out in Massey 
(1951). 

An important feature of the test is that it can 
be “inverted” in keeping with the usual method to 
provide a confidence band for F_X(x) centered on 
F_n(x), which, except for the narrowing caused by 
the restriction 0 ≤ F_X(x) ≤ 1, has constant width 
[see ESTIMATION, article on CONFIDENCE INTER- 
VALS AND REGIONS]. The construction of such a 
band has been suggested by Wald and Wolfowitz 
and is described by Goodman (1954). Attaching a 
significance probability to an observed d_{K,0} amounts 
to ascertaining the band width required in order 
just to include wholly the hypothesized F_0 in the 
confidence band. 

The Kolmogorov-Smirnov test has been modi- 
fied in several ways; the first of these converts 
the test into a “one-sided” procedure based on the 
discrepancy 


$$d_{K^+} = \sqrt{n}\,\Bigl[\max_x \bigl(F_n(x) - F_0(x)\bigr)\Bigr] = \sqrt{n}\,\max_i\Bigl(\frac{i}{n} - u_i\Bigr).$$

A useful feature of this modification is the sim- 
plicity of the large sample computation of sig- 
nificance probabilities associated with observed 
discrepancies d_{K+}; abbreviating the latter to d, 
one has Pr{d_{K+} > d} = e^{−2d²}. It is verified by Chap- 
man (1958) that d_{K+} yields good minimum power 
over those classes of alternatives F* that satisfy 
d_{K+}(F*, F_0) = √n max_x (F*(x) − F_0(x)) > δ. 
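
The one-sided discrepancy and its large sample significance probability can be sketched as follows. The exponential formula is the large sample approximation and is used here without any small sample correction; the function names and the data are assumptions for illustration, and F_0 denotes the hypothesized c.d.f.

```python
import math

def one_sided_discrepancy(sample, cdf):
    """sqrt(n) times the largest amount by which F_n lies above F_0."""
    u = sorted(cdf(x) for x in sample)
    n = len(u)
    return math.sqrt(n) * max(i / n - u_i for i, u_i in enumerate(u, start=1))

def large_sample_significance(d):
    """Approximate Pr{one-sided discrepancy > d} = exp(-2 d^2) for d >= 0."""
    return math.exp(-2.0 * d * d) if d > 0 else 1.0

def uniform_cdf(x):
    """Hypothesized c.d.f.: uniform on the interval from 0 to 1."""
    return min(1.0, max(0.0, x))

print(round(large_sample_significance(1.0), 4))   # 0.1353

# Hypothetical sample; with n = 4 the large sample formula is only a rough guide
d = one_sided_discrepancy([0.1, 0.4, 0.7, 0.9], uniform_cdf)
print(d, large_sample_significance(d))
```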

Other, more complex modifications provide 
greater power against special alternatives, as in 
the weight function modifications (Darling 1957), 
which provide greater power against discrepancies 
from $F_0$ in the tails. Another sort of modification,
introduced and tabulated by Kuiper in 1960, calls
for a measure of discrepancy $d_V$ that is especially
suited to testing goodness of fit to hypothesized
circular distributions, being invariant under arbitrary
choices of the angular origin. This property
could be important, for example, in psychological
studies involving responses to the color wheel, or
in the learning experiment mentioned above. The
measure $d_V$ also has been singled out by E. S.
Pearson (1963) as the generally most attractive in
competition with $d_K$ and the discrepancy measures
$d_{\omega^2}$ and $d_U$ mentioned below.

A second general class of procedures also making
use of the transformation $U = F_0(X)$ springs
from the discrepancy measure

$$d_{\omega^2} = n\int_{-\infty}^{\infty} [F_n(x) - F_0(x)]^2\,dF_0(x) = \frac{1}{12n} + \sum_{i=1}^{n}\left(\frac{2i-1}{2n} - u_{(i)}\right)^2,$$
first proposed by Cramér in 1928 and also by 
Von Mises in 1931 (see Darling 1957). Marshall 
(1958) has verified a startling agreement between
the asymptotic and small sample distributions of
$d_{\omega^2}$ for sample sizes $n$ as low as 3. Power considerations
for $d_{\omega^2}$ are similar to those expressed for $d_K$,
and are discussed also in the sources cited by
Marshall; the test based on $d_{\omega^2}$ can be expected
to have good minimum power over classes of alternatives
$F^*$ satisfying the conditions
$d_{\omega^2}(F^*, F_0) = n\int_{-\infty}^{\infty} [F^*(x) - F_0(x)]^2\,dF_0(x) > \delta$.
However, the test is biased (as is that based on $d_K$).
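
A minimal Python sketch of the Cramér–von Mises discrepancy follows, using the standard computing form in terms of the ordered transforms, $1/(12n) + \sum_i [(2i-1)/(2n) - u_{(i)}]^2$. The name `cramer_von_mises` and the sample values are illustrative assumptions; significance points would still have to be taken from published tables.

```python
def cramer_von_mises(sample, cdf0):
    """Cramer-von Mises discrepancy via the usual computing formula."""
    # Ordered transforms under the hypothesized c.d.f.
    u = sorted(cdf0(x) for x in sample)
    n = len(u)
    return 1.0 / (12 * n) + sum(
        ((2 * i - 1) / (2 * n) - u_i) ** 2 for i, u_i in enumerate(u, start=1)
    )

# Transforms sitting exactly at (2i - 1) / (2n) give the smallest possible
# value, 1 / (12 n).
best = cramer_von_mises([0.1, 0.3, 0.5, 0.7, 0.9], lambda x: x)
# Two observations piled up at 0.5 give a larger discrepancy.
piled = cramer_von_mises([0.5, 0.5], lambda x: x)
```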

As in the case of $d_K$, $d_{\omega^2}$ has weight function
modifications for greater power selectivity, and also
a modification $d_U$, analogous to the modification
$d_V$ of $d_K$ and introduced by Watson (1961), which
does not depend on the choice of angular origin 
and is thus also suited for testing the goodness of 
fit to hypothesized circular distributions. 

Other procedures include those based on the
Fisher–Pearson measures $d_P^{(1)} = -2\sum_{i=1}^{n} \ln u_i$ and
$d_P^{(2)} = -2\sum_{i=1}^{n} \ln(1 - u_i)$, apparently first suggested
in connection with goodness of fit in 1938 by E. S.
Pearson. As pointed out by Chapman (1958), the
tests based on $d_P^{(1)}$ and $d_P^{(2)}$ are uniformly most powerful
against polynomial alternatives to $F_0(x) = x$
of form $x^k$ and $(1 - x)^k$, $k > 1$, and hence are
“smooth” in the sense of Neyman’s $\psi^2$ test. Computations
by Chapman suggest that, dually to $d_K$,
$d_P^{(1)}$ has good maximum power over classes of alternatives
$F^*$ satisfying $d_K(F^*, F_0) < \delta$.
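
The two Fisher–Pearson measures are simple to compute from the transforms, as the following Python sketch suggests; the function name `fisher_pearson` is hypothetical. Under the simple hypothesis the transforms are uniform, and it is a standard result (not stated in the text above) that each measure is then distributed as $\chi^2$ with $2n$ degrees of freedom.

```python
import math

def fisher_pearson(u):
    """Return (-2 * sum ln u_i, -2 * sum ln(1 - u_i)) for transforms u_i."""
    d1 = -2.0 * sum(math.log(u_i) for u_i in u)        # sensitive to small u_i
    d2 = -2.0 * sum(math.log(1.0 - u_i) for u_i in u)  # sensitive to large u_i
    return d1, d2

# Hypothetical transforms; refer each value to chi-square with 2 * len(u)
# degrees of freedom.
d1, d2 = fisher_pearson([0.5, 0.5])
```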

Another set of procedures, discussed and defend- 
ed by Pyke (1965) and extensively studied by 
Weiss (1958), are based on functions of the spacings,
$u_{(i+1)} - u_{(i)}$ or $u_{(i)} - i/(n+1)$, of the $u$’s, from
each other or from their expected locations under
$F_0$. Still another criterion (Smirnov 1939b) examines
the number of crossings of $F_n(x)$ and $F_0(x)$.

An important modification, applicable to all of 
the procedures in this section, is suggested in Durbin
(1961). This modification is intended to increase
the power of any procedure based on the
transforms $U_i$ against a certain class of alternatives
described in that paper.

Since there are multivariate probability integral 
transformations, applying an initial “uniformizing” 
transformation is possible in the multivariate case 
as well. However, one of several possible trans- 
formations must now be chosen, and, related to 
this nonuniqueness, the direct analogues of the 
univariate discrepancy measures are no longer 
functions of uniformly distributed transforms and
do not lead to nonparametric tests (Rosenblatt
1952).


Tests of composite hypotheses 


The $\chi^2$ test. In the composite case, the null
hypothesis specifies only that $F_X(x)$ is a member
of a certain parametric class $\{F_\theta(x)\}$. Typically,
but not necessarily, $\theta$ is the pair $(\mu, \sigma)$, $\mu$ a parameter
of location, and $\sigma$ a parameter of scale, in
which case $F_\theta(x)$ may be written $F_0[(x - \mu)/\sigma]$. In
any event, there arises the question of modifying
the measure $d_{\chi^2}$ of discrepancy between the sample
and a particular cumulative distribution function
into a measure $D_{\chi^2}$ of discrepancy between the
sample and the class $\{F_\theta(x)\}$. A natural approach
is to set

$$D_{\chi^2} = \min_{\theta} d_{\chi^2}.$$


If $\theta$ is composed of $m$ parameters, it can be shown
that, under quite general conditions, $D_{\chi^2}$ is approximately
distributed according to the $\chi^2_{k-1-m}$ distribution
when $F_X(x)$ equals any one of the $F_\theta(x)$.
Hence significance probability computations can
once again be referred to tabulations of the $\chi^2$ distribution.
The requisite minimization with respect
to $\theta$ can be cumbersome, and several modifications
have been proposed, for example, the following by
Neyman (1949):

Suppose that one defines $d_{\chi^2}(\theta)$ as the discrepancy
$d_{\chi^2}$ between the observed sample and the particular
distribution $F_\theta(x)$. Then $D_{\chi^2}$ is defined also by

$$D_{\chi^2} = d_{\chi^2}(\hat\theta),$$

with the estimator $\hat\theta$ computed from

$$d_{\chi^2}(\hat\theta) = \min_{\theta} d_{\chi^2}(\theta),$$

that is, with $\hat\theta$ the minimum chi-square estimator of
$\theta$. The suggested modifications involve using estimators
of $\theta$ alternate to $\hat\theta$ in this last definition of
$D_{\chi^2}$, that is, estimators that “essentially” minimize
$d_{\chi^2}(\theta)$; among these are the so-called grouped-data
or partial-information maximum likelihood estimators.

Frequently used but not equivalent estimators
are the ordinary “full-information” maximum likelihood
estimators $\tilde\theta$ of $\theta$, for example, $(\bar{x}, s)$ for
$(\mu, \sigma)$ in the normal case. These do not “essentially”
minimize $d_{\chi^2}$ and consequently tend to inflate
$D_{\chi^2}$ beyond values predicted by the $\chi^2_{k-1-m}$ distribution,
leading to some unwarranted rejections of the
composite hypothesis. However, it is indicated by
Chernoff and Lehmann (1954), and also by Watson
(1957), that no serious distortion will result
if the number of categories is ten or more.
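
To make the minimization concrete, here is a small Python sketch for one assumed case: a Poisson family with a single parameter (called `lam` here), grouped into $k$ cells of which the last is open-ended. The cell counts, the function names and the use of a golden-section search are all choices made for the illustration; they are not taken from Neyman (1949) or the other sources cited.

```python
import math

def poisson_cell_probs(lam, k):
    """Probabilities of the cells 0, 1, ..., k-2 and 'k-1 or more'."""
    probs = [math.exp(-lam) * lam ** j / math.factorial(j) for j in range(k - 1)]
    probs.append(1.0 - sum(probs))
    return probs

def pearson_discrepancy(counts, lam):
    """Sum over cells of (observed - expected)^2 / expected."""
    n = sum(counts)
    probs = poisson_cell_probs(lam, len(counts))
    return sum((o - n * p) ** 2 / (n * p) for o, p in zip(counts, probs))

def minimum_chi_square(counts, lo=0.01, hi=10.0, tol=1e-8):
    """Golden-section search for the lam that minimizes the discrepancy.

    Assumes the discrepancy has a single minimum in lam on [lo, hi].
    """
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    while b - a > tol:
        c = b - ratio * (b - a)
        d = a + ratio * (b - a)
        if pearson_discrepancy(counts, c) < pearson_discrepancy(counts, d):
            b = d
        else:
            a = c
    lam_hat = (a + b) / 2.0
    return lam_hat, pearson_discrepancy(counts, lam_hat)

counts = [22, 35, 25, 12, 6]   # hypothetical counts for 0, 1, 2, 3, 4 or more
lam_hat, big_d = minimum_chi_square(counts)
dof = len(counts) - 1 - 1      # k - 1 - m with k = 5 cells and m = 1 parameter
```

The minimized value `big_d` would then be referred to the $\chi^2$ distribution with `dof` degrees of freedom. Substituting the ungrouped sample mean for `lam_hat` corresponds to the full-information estimator discussed above and can only give a value at least as large.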

Composite analogues of other tests. Adaptation
of the tests based on the probability integral transformation
to the composite case proceeds much as
in the case of $\chi^2$. With definitions of $d_{\omega^2}(\theta)$ and
$d_K(\theta)$ analogous to that of $d_{\chi^2}(\theta)$, Darling (1955)
has investigated the large sample probability distribution
of $D_{\omega^2} = d_{\omega^2}(\hat\theta)$ and of $D_K = d_K(\hat\theta)$ for
efficient estimators $\hat\theta$ of $\theta$ analogous to the estimators
$\hat\theta$ for $\chi^2$. Note that in the absence of any $\chi^2$-like
categories, the ordinary full-information maximum
likelihood estimators now do qualify as estimators
$\hat\theta$.

A major problem now is, however, that the 
modified procedures are no longer nonparametric. 
Thus a special investigation is required for every 
composite hypothesis. This is done by Kac, Kiefer,
and Wolfowitz (1955) for the normal scale-location 
family, and the resulting large sample distribution 
is partly tabulated. 

Tests based on special characteristics. The alter- 
natives of concern sometimes differ from a com- 
posite null hypothesis in a manner easily described 
by a standard shape parameter. Special tests have 
been proposed for such cases. For example, the 
sample skewness coefficient has been suggested 
(Geary 1947) for testing normality against skew 
alternatives. Again, for testing Poissonness against
alternatives with variance unequal to mean, R. A.
Fisher has recommended the variance-to-mean ratio
$\sum_{i=1}^{n} (X_i - \bar{X})^2/\bar{X}$ (see Cochran 1954). This measure
is approximately distributed as $\chi^2_{n-1}$ when Poissonness
in fact obtains, for $\lambda > 1$ and $n > 15$
(Sukhatme 1938), which follows from the fact
that the denominator is then a high-precision
estimate of $\lambda$, and the numerator is approximately
distributed as $\lambda\chi^2_{n-1}$. Analogous recommendations
apply to testing binomiality. Essentially the same
point of view underlies tests of normality based on
the ratio of mean deviation, or of the range, to the
standard deviation.
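
The variance-to-mean criterion for Poissonness is easily computed, as in the following Python sketch. The function name `dispersion_index` and the sixteen counts are hypothetical; the statistic is the sum of squared deviations from the sample mean divided by that mean, to be referred to the $\chi^2$ distribution with $n - 1$ degrees of freedom.

```python
def dispersion_index(counts):
    """Return (statistic, degrees of freedom) for the variance-to-mean test."""
    n = len(counts)
    mean = sum(counts) / n
    statistic = sum((x - mean) ** 2 for x in counts) / mean
    return statistic, n - 1

# Sixteen invented counts, so that n exceeds 15 and the mean exceeds 1.
observed = [3, 1, 4, 2, 2, 5, 3, 0, 2, 4, 3, 1, 2, 3, 6, 1]
statistic, dof = dispersion_index(observed)
```

Values of the statistic far above the degrees of freedom point to a variance larger than the mean, and values far below to a variance smaller than the mean.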

Transforming into simple hypotheses. Another 
interesting approach to the composite problem, 
advocated by Hogg and also by Sarkadi (1960), is 
to transform certain composite hypotheses into 
equivalent simple ones. 

Specifically, there are location-scale parametric
families $\{F_0[(x - \mu)/\sigma]\}$ with the following property:
A random sample from any particular
$F_0[(x - \mu)/\sigma]$ is reducible by a transformation $T$
to a new set of random variables, $Y = T(X)$, constituting
in effect a random sample from a distribution
$G(y)$ involving no unknown parameters at all.
Moreover, only random samples from distributions
$F_0[(x - \mu)/\sigma]$ lead to $G(y)$ when operated on by $T$.

It then follows that testing the composite hypothesis
$H$ that $(X_1, \cdots, X_n)$ is a random sample
from a distribution $F_0[(x - \mu)/\sigma]$ with some $\mu$ and
some $\sigma$ is equivalent to testing the hypothesis $H'$
that $(Y_1, \cdots, Y_{n'})$ is a random sample from the
distribution $G(y)$. Any of the tests for simple hypotheses
is then available for testing $H'$. An example
is provided by a negative exponential $F_0$ and
uniform $G$, in which case the ordered exponential
random sample $(X_{(1)}, \cdots, X_{(n)})$ is transformed
into an ordered uniform random sample
$(Y_{(1)}, \cdots, Y_{(n-2)})$ by the transformation

$$Y_{(i)} = \frac{(X_{(n)} - X_{(n-1)}) + \cdots + i\,(X_{(n-i+1)} - X_{(n-i)})}{(X_{(n)} - X_{(n-1)}) + \cdots + (n-1)(X_{(2)} - X_{(1)})}.$$
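
The exponential example can be sketched in Python as follows. The sketch takes the weighted gaps between successive order statistics, starting from the largest observation, and divides their running sums by the total; this is an assumed reading of the transformation, and the function name `exponential_to_uniform` and the data are hypothetical. Because only differences and ratios enter, the result does not depend on the unknown location or scale.

```python
def exponential_to_uniform(sample):
    """Map an exponential sample of size n to n - 2 ordered values in (0, 1)."""
    x = sorted(sample)
    n = len(x)
    # Weighted gaps taken from the top: j * (X_(n-j+1) - X_(n-j)), j = 1..n-1.
    spacings = [j * (x[n - j] - x[n - j - 1]) for j in range(1, n)]
    total = sum(spacings)
    transformed, running = [], 0.0
    for gap in spacings[: n - 2]:
        running += gap
        transformed.append(running / total)
    return transformed

sample = [0.2, 1.5, 0.7, 3.1, 0.4]           # invented observations
y = exponential_to_uniform(sample)
# A change of location and scale leaves the transformed values unchanged.
y_shifted = exponential_to_uniform([3.0 + 2.0 * v for v in sample])
```

The values in `y` could then be submitted to any of the tests for a simple uniform hypothesis.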

Conditioning. Another way of neatly doing 
away with the unknown parameter is to consider 
the conditional distribution of the sample, given a 
sufficient estimate of it. This method is advocated, 
at least for testing Poissonness, in Fisher (1950). 

Tests related to probability plots. S. S. Shapiro 
and M. B. Wilk have quantified in various ways the 
departure from linearity of the sorts of probabil- 
ity plots mentioned above, in particular of the 
plot of the ordered sample values against the ex- 
pected values of the standardized order statistics 
[see NONPARAMETRIC STATISTICS, article on ORDER
STATISTICS]. This new approach bears some similarity
to one given in Darling (1955), which is
based on the measure $d_{\omega^2}$ modified for the composite
case. Both approaches, in a sense, compare
adjusted observed order statistics with standardized 
order statistic expectations. But the approach of 
Shapiro and Wilk is tailored more explicitly to par- 
ticular scale-location families, by using their par- 
ticular order statistic variances and covariances. It 
is no wonder that preliminary evaluations of this 
sort of approach (for example, by Shapiro & Wilk 
1965) have shown exceptional promise. As an 
added bonus, the procedure is similar over the 
entire scale-location family; that is, its probability 
distribution is independent of location and scale. 




Approximate hypotheses 


The first, and seemingly most practically developed,
attempt to provide the requisite tests of approximate
hypotheses is found in Hodges and Lehmann
(1954). Hodges and Lehmann assume the
$k$ typical categories of the $\chi^2$ test and formulate the
approximate simple hypothesis in terms of the discrepancy
$d(p, p_0)$ between the category probabilities
$p_i$ under $F_X$ and the category probabilities
$p_{0i}$ under a simple hypothesis $F_0$. A very tractable
discrepancy measure of this type is ordinary distance,
for which the approximate hypothesis takes
the form

$$d(p, p_0) = \sum_{i=1}^{k} (p_i - p_{0i})^2 \le \delta.$$


Denoting $O_i/n$ by $o_i$, the suggested test reduces,
essentially, to the one-sided test of the hypothesis
$d(p, p_0) = \delta$ based on the approximately normal
statistic $[d(o, p_0) - \delta]/\hat\sigma$, where $\hat\sigma$ is the standard
deviation, estimated from the sample $o_i$, of $d(o, p_0)$.
For example, when $F_0$ specifies $k$ categories with
$p_{0i} = 1/k$, one treats as unit normal (under the
null hypothesis) the statistic

$$S = \frac{\sqrt{n}\,[d(o, p_0) - \delta]}{2\sqrt{\sum o_i^3 - \left(\sum o_i^2\right)^2}}$$

and uses an upper-tail test. Thus a value of $S$ of
1.645 leads to a sample significance level of .05.
This approach lends itself easily to the computation
of power and is extended as well by Hodges and
Lehmann to the testing of approximate composite
hypotheses.
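
For the equal-probability case just described, the statistic can be computed along the following lines in Python. The function name `approximate_fit_statistic`, the counts and the tolerance `delta` are hypothetical. The squared standard deviation used in the denominator is four times the sum of cubed proportions less the squared sum of squared proportions, divided by $n$; the sketch fails if all observed proportions are exactly equal, since that quantity is then zero.

```python
import math

def approximate_fit_statistic(counts, delta):
    """Approximately unit normal statistic for the hypothesis d(p, p0) = delta,
    with p0 assigning probability 1/k to each of the k categories."""
    n = sum(counts)
    k = len(counts)
    o = [c / n for c in counts]                    # observed proportions
    d = sum((o_i - 1.0 / k) ** 2 for o_i in o)     # squared distance from p0
    sum_sq = sum(o_i ** 2 for o_i in o)
    sum_cu = sum(o_i ** 3 for o_i in o)
    return math.sqrt(n) * (d - delta) / (2.0 * math.sqrt(sum_cu - sum_sq ** 2))

# Invented counts in four categories and an invented tolerance.
s_value = approximate_fit_statistic([30, 20, 25, 25], delta=0.001)
reject = s_value > 1.645   # upper-tail test at the .05 level
```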

Extension of other tests for simple hypotheses 
to the testing of approximate hypotheses has been 
considered by J. Rosenblatt (1962) and by Kac, 
Kiefer, and Wolfowitz (1955). 


Further topics 


That the sample is random may itself be in 
doubt, and tests have been designed to have power 
against specific sorts of departure from random- 
ness. For example, tests of the hypothesis of ran- 
domness against the alternative hypothesis that the 
data are subject to a Markov structure are given by 
Billingsley (1961) and Goodman (1959); the latter 
work also covers testing that the data have Markov 
structure of a given order against the alternative 
that the data have Markov structure of higher or- 
der, and the testing of hypothesized values of tran- 
sition probabilities when a Markov structure of 
given order is assumed [see MARKOV CHAINS]. 

Many of the tests described in this article can be
extended to several-sample procedures for testing
the hypothesis that several populations are in fact
distributed identically; thus, as first suggested in
Smirnov (1939a), if $G_m(x)$ denotes the proportion
of values less than or equal to $x$, in an independent
random sample $(Y_1, \cdots, Y_m)$ from a second population,
$d_K(F_n, G_m)$ provides a natural test of the
hypothesis that the two continuous population distribution
functions $F_X$ and $G_Y$ coincide. Many of
these extensions are functions only of the relative
ranks of the two samples and, as such, are nonparametric,
that is, their null probability distributions
do not depend on the common functional
form of $F_X$ and $G_Y$. [Several-sample nonparametric
procedures are discussed in NONPARAMETRIC
STATISTICS.]
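
A Python sketch of the two-sample comparison follows. It returns the largest absolute difference between the two sample distribution functions, without any scaling constant, since none is specified here; the function name `two_sample_max_gap` and the data are hypothetical. The second call illustrates the dependence on relative ranks alone: an increasing transformation applied to both samples leaves the result unchanged.

```python
import math

def two_sample_max_gap(xs, ys):
    """Largest absolute difference between the two sample c.d.f.'s."""
    n, m = len(xs), len(ys)
    gap = 0.0
    # The difference can change only at observed values, so it suffices
    # to examine the pooled sample points.
    for t in list(xs) + list(ys):
        f = sum(1 for x in xs if x <= t) / n
        g = sum(1 for y in ys if y <= t) / m
        gap = max(gap, abs(f - g))
    return gap

first = [0.2, 1.4, 0.9, 2.2]
second = [0.5, 1.1, 3.0]
gap = two_sample_max_gap(first, second)
gap_after_exp = two_sample_max_gap(
    [math.exp(v) for v in first], [math.exp(v) for v in second]
)
```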

Another topic is that of tests of goodness of fit
as preliminary tests of significance, in a sense discussed,
for example, by Bancroft (1964). That
tests of goodness of fit are typically applied in this
sense is recognized by Chapman (1958), and the
probabilistic properties of certain “nested” sequences
of tests beginning with a test of goodness of fit
have been considered by Hogg (1965). The Bayes
and information theory approaches to $\chi^2$ tests of
goodness of fit are also important (see Lindley
1965; Kullback 1959).

H. T. DAVID


[Directly related are the entries HYPOTHESIS TESTING; 
SIGNIFICANCE, TESTS OF. Other relevant material 
may be found in COUNTED DATA; ESTIMATION; NON- 
PARAMETRIC STATISTICS.] 
GOODNOW, FRANK J.


The impact of Frank J. Goodnow (1859-1939)
upon the study of public administration in the 
United States has been quite durable. Goodnow’s 
classic treatise Politics and Administration (1900) 
was the intellectual point of departure, along with 
Woodrow Wilson's “Study of Public Administration” 
(1887), for much of the work in the field up to 
World War II. It is a book still frequently cited today,
although not always praised.

Goodnow began teaching administrative law at 
Columbia in 1883 and remained there throughout 
his professorial career. In 1914 he resigned from 
the Columbia faculty to become president of Johns 
Hopkins University, a position he held until his re- 
tirement in 1929. Goodnow was one of the principal 
founders of the American Political Science Associa- 
tion and became its first president in 1903. He 
helped to redraft the New York City charter in
1900, served on President Taft’s commission on
economy and efficiency in 1911—1912, and in 1913 
went to China as legal adviser to its president. 

Much of Goodnow’s scholarly work was in the 
field of municipal government. Although his ap- 
proach to the study of municipal institutions was 
primarily legal in character, his works also reveal 
a keen awareness of the realities of city politics 
around the beginning of the twentieth century. 
They remain quite useful to students of urban po- 
litical development in the United States, since cur- 
rent trends and problems in urban government and 
politics are still structured by the cleavage between 
bossism and reformism, which was so prominent 
an issue in Goodnow’s day. 

A focal point of controversy in political science 
has been the distinction Goodnow delineated in 
Politics and Administration between politics, as the 
sphere in which the will of the state is articulated, 
and administration, as the range of methods and 
techniques through which the state’s purposes are 
carried out. This functional separation of powers 
had a great deal of practical utility in the early 
movement to reform the organization and operation 
of public administration in the United States, since 
it justified the introduction into the public service 
of practices and values—e.g., efficiency, hierarchy, 
and discipline—altogether alien to the egalitarian 
ethos of American politics itself. The distinction 
also helped carve out a sphere of autonomy for 
public agencies at a time when these administra- 
tive units were hard pressed to maintain profes- 
sional standards in the face of pressures from 
spoils-oriented and sometimes corrupt party organ- 
izations. When it was first formulated, the separa- 
tion between politics and administration was to a 
large extent an effort to free administrators from 
political harassment. 

Intellectually, however, the separating of these
two segments of governmental activity has not
fared very well at the hands of recent critics. Since
World War II the study of public administration
has been largely “politicized”—in the sense at least 
that the role of bureaucracy in modern government 
has increasingly been studied from the point of 
view of the tactics that agencies follow in securing 
resources upon which their survival depends, in- 
cluding appropriations, constituency support, and 
statutory authority, or with a focus on the involvement
of administrators in the framing of public
policy. The modern tendency has been to point up 
and to some extent justify the role of administrators 
in the political process. 

However, although Goodnow’s distinction between
politics and administration was to serve more
often as a target for criticism than as a model, the
issues he first raised in a systematic way are still
very much at the center of scholarly concern.
Dwight Waldo, for example, has noted that the
dichotomy between facts and values in the work
of Herbert Simon, since World War II one of the
central figures in the study of public administra- 
tion, is closely analogous to Goodnow’s own dis- 
tinction between administration and politics (Waldo 
1963, pp. 187-188). The entire effort to create a 
“science” of administration in modern times does 
in fact ultimately turn upon the possibility of 
singling out a sphere of managerial expertise which 
is distinct from and independent of political pref- 
erences and policy goals. Thus, Goodnow’s work 
retains its intellectual relevance today, even though 
it no longer has the authority it once commanded. 

From the point of view of governmental struc- 
ture, there can be no disputing the fact that the 
distinction has been an important force in the development
of American institutions. Merit systems
and a variety of other devices have been established 
at all levels of government to provide administra- 
tive agencies with some degree of legal protection 
against the grosser forms of political interference. 
At the same time, there has been a continued effort 
to limit the political role of executive agencies, for 
example, by statutes designed to prevent them 
from engaging in “propaganda” or lobbying activity. 
Public policy in the United States has thus sought 
to separate politics and administration, however 
much the two spheres may overlap in the actual 
day-to-day work of government. 


Francis E. ROURKE 


[See also ADMINISTRATIVE LAW; CIVIL SERVICE; COM- 
MISSIONS, GOVERNMENT; POLITICAL PROCESS; PUB- 
LIC ADMINISTRATION; PUBLIC LAW.] 
GOSSEN, HERMANN HEINRICH


Hermann Heinrich Gossen, German writer on 
economics, was born at Düren in the Rhineland
in 1810 and died at Cologne in 1858. His claim 
to fame is based on a single publication: in 1854 
he published a book that develops, with the help 
of mathematics, a comprehensive theory of the 
hedonistic calculus. In this work Gossen postulated 
the principle of diminishing marginal utility and 
from this derived the following theorem: to maxi- 
mize utility, a given quantity of a good must be 
divided among different uses in such a manner 
that the marginal utilities are equal in all uses. 
In the Continental literature the postulate is usu- 
ally referred to as Gossen’s First Law and the 
theorem as his Second Law. 

Gossen pioneered in the development of the 
subjective theory of value on the basis of the mar- 
ginal principle, but his work was neglected during 
his life, and he died a disappointed man, having 
withdrawn the unsold copies of his book from the 
publisher. His work was briefly mentioned in a 
history of economic thought published the year of 
his death and, more appreciatively, in 1870 in the 
second edition of a book on the labor problem by 
F. A. Lange, the famed historian of materialism. 
Gossen’s importance in the development of eco- 
nomic thought was first recognized in the late 
1870s, when Jevons, Menger, and Walras had 
already published their own versions of the new 
theory of value. Gossen’s book was brought to the
attention of Jevons by a colleague in August 1878, 
causing him to complain: “I am, therefore, in the 
unfortunate position that the greater number of 
people think the theory nonsense, and do not 
understand it, and the rest discover that it is not 
new” (Jevons 1886, pp. 387-388). Jevons found 
comfort, however, in the thought that “the theory 
in question has in fact been independently discov- 
ered three or four times over, and must be true” 
(ibid., p. 389). 

The next month Jevons advised Walras of Gos- 
sen’s anticipation. Both men publicly recognized 
Gossen’s priority and did much to save his work 
from oblivion, Jevons in the preface to the second 
edition of his Theory of Political Economy in 1879 
and Walras in an article in the Journal des éco- 
nomistes (1885). Walras even prepared a French 
translation of Gossen’s book, but this was never 
published. Much later, in 1950, an Italian transla- 
tion was brought out. 

An estranged Catholic, Gossen offered his work 
with the messianic fervor of the founder of a new 
religion: the laws of science were the dogma, the 
hedonistic calculus was the moral principle, in- 
struction in the laws of science was the cult, ex- 
periments the sacraments, and scientists the 
priests. Some of Gossen’s thoughts echo ideas of 
Bentham, Saint-Simon, and Comte, but since Gos- 
sen did not cite authorities, it is uncertain whether, 
or to what extent, he was indebted to these writers. 
German translations of two of Bentham’s juridical 
works had appeared in the 1830s, and one of these 
had been published in Cologne, where Gossen spent 
many years of his life. He may have come under 
the influence of Bentham’s hedonistic philosophy, 
but the mathematical treatment was his own work, 
as probably was his statement of the principle of 
diminishing marginal utility. There had been other 
economists who developed the subjective theory 
of value and the marginal principle, but none of 
them carried either idea as far as Gossen did. 

In matters of economic policy, Gossen’s pro- 
nounced individualism led him to a modified 
laissez-faire point of view. All that exists, he held, 
must by itself create the means to further exist- 
ence; otherwise it does not deserve to continue to 
exist. On this basis he rejected government support 
of religion, art, and science. Public relief of the 
poor was to be given in the form of loans. A general
system of government loans would be established
to enable everyone to make the most of his
opportunities. Private property was to be protected 
and freed from restrictions that hamper individual 
initiative, but land was to be nationalized (purchased
by the government) and then auctioned off
to the highest bidder in the form of a lifetime lease 
to facilitate its most productive use. 

The reasons for the almost complete neglect of 
Gossen’s work for a quarter of a century are not 
difficult to trace. His book was the work of a lone 
outsider, unknown in academic circles. Obedient 
to his father’s wishes, but with great reluctance, 
Gossen had studied law to prepare himself for a 
career in the Prussian civil service. After a few 
years he had relinquished government service and 
started an unsuccessful venture in the insurance 
business. When he published his book, with high 
hopes of recognition, he gave it a forbiddingly 
pretentious title, which in English translation reads 
“Development of the Laws of Human Relations, 
and of the Rules of Human Action Derived Therefrom”
(1854). The presentation and style of the
work are bizarre and cumbersome. The book is not
divided into parts or chapters, and simple dash 
lines, rather than headings, separate the various 
topics. Gossen was presumptuous enough to make 
a favorable comparison, on the first page, of his 
own study of society with Copernicus’ study of the 
cosmos, and on the penultimate page to claim in 
enlarged print that the adoption of his system of 
thought would turn the earth into a paradise. Such 
claims were bound to appear preposterous to the 
followers of the historical school, who were begin- 
ning to fill the chairs of economics in Germany in 
Gossen’s time, and as late as 1929 Sombart re- 
ferred to Gossen as a “brilliant idiot.” Moreover, 
Gossen’s book abounds with diagrams, formulas, 
and lengthy arithmetical illustrations. Half a century
after its publication, Marshall, under much
more favorable circumstances, still deemed it wise
to take infinite care to make the mathematical
treatment of economics palatable to his readers.
Gossen’s book might have found some favor be- 
cause of its polemics against communists and so- 
cialists and because the subjective theory of value 
could be used as a critique of the foundations of 
socialist economics. The German historical econ- 
omists, however, had their own ways of coping 
with the threat of socialism, and several decades
after the publication of Gossen’s book they were 
still inclined to deprecate rather than to make use 
of the ideological weapons available in the armory 
of the Austrian branch of the new theory of value. 
The Austrians themselves began to pay attention to 
Gossen only in 1889. 

It is significant also that when Gossen was born, 
the Rhineland was part of the France of the Emperor
Napoleon, in whose revenue service Gossen’s
father was employed. Gossen’s thought was fundamentally
un-German, and his fellow countrymen
responded as little to his hedonism and utilitarianism
as to the idea of natural law. It is thus no
accident that recognition eventually came from
foreigners.

HENRY W. SPIEGEL 


[For discussion of the subsequent development of 
Gossen’s ideas, see UTILITY and biographies of 
JEVONS; MENGER; WALRAS.]
The impact of W. S. Gosset (1876-1937) on the
social sciences was entirely indirect. He was, how- 
ever, one of the pioneers in the development of 
modern statistical method and its application to the 
design and analysis of experiments. He is far better 
known to the scientific world under the pseudonym 
of “Student” than under his own name. Indeed 
all his papers except one appeared under the 
pseudonym. 

He was the son of Colonel Frederic Gosset of 
the Royal Engineers, the descendant of an old 
Huguenot family that left France after the revoca- 
tion of the Edict of Nantes. Gosset was a scholar 
of Winchester—that is, a boy who was awarded a 
prize on the basis of a competitive examination to 
pay for part or all of his education—which shows 
that his exceptional mental powers had devel- 
oped early. From Winchester he went, again as 
a scholar, to New College, Oxford, where he ob- 
tained first class degrees in mathematics and 
natural science. 

On leaving Oxford in the autumn of 1899 he 
joined the famous brewing firm of Guinness in 
Dublin. He remained with Guinness all his life, 
ultimately becoming, in 1935, chief brewer at Park 
Royal, the firm’s newly established brewery in 
London. 

At that time scientific methods and laboratory 
determinations were beginning to be seriously ap- 
plied to brewing, and this naturally led Gosset to 
study error functions and to see the need for ade- 
quate methods to deal with small samples in exam- 
ining the relations between the quality of the raw 
materials of beer, such as barley and hops, the 
conditions of production, and the finished article. 
The importance of controlling the quality of barley 
ultimately led him to study the design of agricul- 
tural field trials. 

In 1904 he drew up for the directors the first 
report on “The Application of the Law of Error.” 
This emphasized the importance of the theory of 
probability in setting “an exact value on the results 
of our experiments; many of which lead to results 
which are probable but not certain.” He used only 
the classical theory of errors, such as is found in 
G. B. Airy’s On the Algebraical, and Numerical 
Theory of Errors of Observation (1861) and M. 
Merriman’s A Text-book on the Method of Least 
Squares (1884). But he observed that if X and Y 
are both measured from their mean, there are often 
considerable differences between $\sum (X + Y)^2$ and
$\sum (X - Y)^2$; in other words, he was feeling his way
toward the notion of correlation, although he had
not yet heard of the correlation coefficient. 

His first meeting with Karl Pearson took place 
in 1905, and in 1906/1907 he was sent for a year’s 
specialized study in London, where he worked at, 
or in close contact with, the biometric laboratory 
at University College. 

Mathematical statistics. Gosset was once de- 
scribed by Sir Ronald Fisher as the “Faraday of 
statistics.” The comparison is apt, for he was 
not a profound mathematician but had a superb 
intuitive faculty that enabled him to grasp general 
principles and see their relevance to practical ends. 

His first mathematical paper was “On the Error 
of Counting With a Haemacytometer” ([1907] 
1943, pp. 1-10); here he derived afresh the Pois- 
son distribution as a limiting form of the binomial 
and fitted it to four series of counts of yeast cells. 
The derivation presented no particular difficulty 
(it had in fact been obtained before by several 
investigators), but it was characteristic of him to 
see immediately the correct method of dealing with 
a practical problem. One of these series has become 
world famous owing to its inclusion as an example 
in Fisher's Statistical Methods for Research Workers 
(1925). 

His next paper, “The Probable Error of a Mean” 
([1908] 1943, pp. 11-34), brought him more fame, 
in the course of time, than any other work that he 
did, for it provided the basis of Student's t-test. 

In his work at the brewery he had been struck 
by the importance of knowing the accuracy of the 
mean of a small sample. The usual procedure at 
the time was to compute the sample average and 
standard deviation, $\bar{x}$ and $s$, and to proceed as if $\bar{x}$
were normally distributed with the same mean as
that of the population and with standard deviation
$s/\sqrt{n}$, where $n$ is sample size. The difficulty here
is that s is a fallible estimate of the true population 
standard deviation. Gosset’s intuition told him that 
the usual procedure, based on large sample con- 
siderations, would, for small samples, give a spuri- 
ously high impression of how accurately the popu- 
lation mean is estimated. 
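
The point can be illustrated with a small simulation. The sketch below is not Gosset's own calculation: it assumes a standard normal population, uses the sample standard deviation with divisor n − 1, and counts how often the large-sample interval x̄ ± 1.96 s/√n contains the true mean. The function name and the seed are illustrative choices.

```python
import math
import random
import statistics


def normal_theory_coverage(n, trials=20000, z=1.96, seed=1908):
    """Fraction of simulated samples whose interval mean +/- z*s/sqrt(n)
    contains the true mean (0) of a standard normal population."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        mean = statistics.fmean(sample)
        s = statistics.stdev(sample)  # divisor n - 1 (an assumption of this sketch)
        half_width = z * s / math.sqrt(n)
        if abs(mean) <= half_width:
            hits += 1
    return hits / trials


# Nominal coverage is 95 per cent; small samples fall well short of it.
coverage_small = normal_theory_coverage(4)
coverage_large = normal_theory_coverage(100, trials=5000)
print(f"n = 4:   {coverage_small:.3f}")
print(f"n = 100: {coverage_large:.3f}")
```

With samples of 4 the interval covers the true mean noticeably less often than the nominal 95 per cent, which is the spuriously high impression of accuracy described above; with samples of 100 the shortfall nearly disappears.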

By a combination of exceptional clearheadedness 
and simple algebra, he obtained the first four 
moments of the distribution of s². He then proceeded
to fit the Pearson curve that has these
moments. His results showed that the curve has to
be of Type III (essentially the gamma or χ² distribution),
and he found the distribution of s² to be
C(s²/σ²)^((n−3)/2) exp[−ns²/(2σ²)] d(s²/σ²). He then
showed that the correlation coefficient between x̄²
and s² was zero, and assuming absolute independence
(which does not necessarily follow but was
true in this case), he deduced the probability distribution
of z = (x̄ − μ)/s, where μ is the true mean.
With a mere change of notation this is the t-distribution.
Here s is defined to be √[Σ(x − x̄)²/n],
so that

t = (x̄ − μ)/√{Σ(x − x̄)²/[n(n − 1)]} = z√(n − 1).


He then checked the adequacy of this distribu- 
tion by drawing 750 samples of 4 from W. R. Mac- 
donell’s data on the height and middle-finger length 
of 3,000 criminals and by working out the standard 
deviations of both variates in each sample (see 
Macdonell 1902). This he did by shuffling 3,000 
pieces of cardboard on which the results had been 
written, possibly the earliest work in statistical 
research that led to the development of the Monte 
Carlo method. 
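
A rough modern analogue of the card-shuffling experiment can be sketched as follows. The 3,000 values are simulated normal "heights" with an arbitrary mean and spread, not Macdonell's data; s is taken with divisor n, so that t = z√(n − 1), and the check is simply whether about 5 per cent of the 750 values of t fall beyond the two-sided 5 per cent point of t with 3 degrees of freedom (3.182).

```python
import math
import random
import statistics

rng = random.Random(1908)

# Hypothetical stand-in for the 3,000 recorded heights (not Macdonell's data).
cards = [rng.gauss(166.0, 6.5) for _ in range(3000)]
mu = statistics.fmean(cards)  # mean of the finite population of cards

# Shuffle the "cards" and deal them into 750 samples of 4.
rng.shuffle(cards)
n = 4
samples = [cards[i:i + n] for i in range(0, len(cards), n)]


def student_z(sample, true_mean):
    """z = (sample mean - true mean) / s, with s computed using divisor n."""
    xbar = statistics.fmean(sample)
    s = statistics.pstdev(sample)  # divisor n (assumed here)
    return (xbar - true_mean) / s


t_values = [student_z(sample, mu) * math.sqrt(n - 1) for sample in samples]

T_CRIT = 3.182  # two-sided 5 per cent point of t with 3 degrees of freedom
tail_fraction = sum(abs(t) > T_CRIT for t in t_values) / len(t_values)
print(len(samples), round(tail_fraction, 3))
```

Because only 750 samples are drawn, the observed tail fraction fluctuates around 0.05 by roughly one percentage point from one shuffle to the next.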

Later in his paper on “Probable Error of a Corre- 
lation Coefficient” ([1908] 1943, pp. 35-42), Gosset 
used the 750 correlation coefficients of the two 
variables. Here his remarkable intuition again led 
him to a correct answer. By correlating the height 
measurements of one sample with the middle- 
finger lengths of the next, he was able to obtain 
750 values of r, the sample correlation coefficient, 
for which ρ, the true population correlation coefficient,
was presumably zero. He noticed that the
observed distribution of r was approximately rectangular.
If it were a Pearson curve it would have
to be Type II, that is, C(1 − r²)^λ, and his result
from the 750 samples suggested λ = k(n − 4). He
guessed that k = ½ and confirmed the result by
taking 750 samples of 8 to which C(1 − r²)² gave
an excellent fit. Six years later Fisher proved that
all these brilliant conjectures for the distribution
of s², t, and r when ρ = 0 were indeed correct.
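
The conjectured form, a density proportional to (1 − r²)^((n−4)/2) when the true correlation is zero, can be checked by simulation. The sketch below assumes independent standard normal variates rather than the criminal measurements. For samples of 4 the exponent is zero, so r should be spread evenly over (−1, 1); for samples of 8 the density is (15/16)(1 − r²)², which puts a fraction of about 0.793 of the values within |r| < 0.5.

```python
import math
import random


def pearson_r(xs, ys):
    """Sample product-moment correlation coefficient."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate_r(n, trials, rng):
    """r from `trials` samples of n independent pairs (true correlation 0)."""
    values = []
    for _ in range(trials):
        xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
        ys = [rng.gauss(0.0, 1.0) for _ in range(n)]
        values.append(pearson_r(xs, ys))
    return values


rng = random.Random(1915)

# Samples of 4: a rectangular distribution puts a quarter of the r values
# in each of the four intervals of width 0.5 between -1 and 1.
r4 = simulate_r(4, 20000, rng)
counts = [0, 0, 0, 0]
for r in r4:
    counts[min(int((r + 1.0) / 0.5), 3)] += 1
quarter_fractions = [c / len(r4) for c in counts]

# Samples of 8: compare with the integral of (15/16)(1 - r^2)^2 over |r| < 0.5.
r8 = simulate_r(8, 20000, rng)
inner_fraction_8 = sum(abs(r) < 0.5 for r in r8) / len(r8)
expected_inner_8 = (15 / 8) * (0.5 - 2 * 0.5 ** 3 / 3 + 0.5 ** 5 / 5)

print([round(f, 3) for f in quarter_fractions])
print(round(inner_fraction_8, 3), round(expected_inner_8, 3))
```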

The correlation coefficient between the two 
measurements in the 3,000 criminals was 0.66. 
Gosset also examined two sets of 750 samples of 
sizes 4 and 8 and one set of 100 samples of 30, 
for which the true value must have been close to 
0.66. He could see from his results that the standard
deviation given by (1 − ρ²)/√n was too small
and that the distribution could not be of Pearson
type except when ρ = 0. He succeeded in obtaining
the exact distribution of r for any ρ in samples of 2,
but the general solution for ρ ≠ 0 had to await the
publication of Fisher’s famous paper in 1915.

Gosset’s fourth paper ([1909] 1943, pp. 43-48) 
dealt with the distribution of the means of samples 
not drawn at random. His brewing experience had 
repeatedly drawn his attention to the fact that 
successive observations were not uncorrelated. Here
he supposed a sample of n values to be drawn in
such a way that the correlation between every pair
of observations is the same (say ρ), so that ρ is
effectively an intraclass correlation. He used the 
algebraical methods of his second paper to deter- 
mine the first four moments of the mean, in this 
case employing as an illustration some data 
published by Greenwood and White (1909) in 
which 2,000 phagocytic counts had been grouped 
in samples of 25. Both the original counts and the 
distribution of means could be fitted by Pearson 
Type I (beta) curves. However, the observed values
of β₁ and (β₂ − 3) for the distribution of means
were bigger than would have been anticipated if 
the usual theory for independent observations had 
been valid. The modified theory produced much 
better agreement. 
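
The second moment shows why the usual theory fails in this setting. As a sketch under the stated assumption that every observation has variance σ² and every pair has the same correlation ρ (this is the standard result for equicorrelated observations, not a transcription of Gosset's own algebra), the variance of the mean is

```latex
\operatorname{Var}(\bar{x})
  = \frac{1}{n^{2}}\Bigl[\sum_{i=1}^{n}\operatorname{Var}(x_i)
      + \sum_{i \neq j}\operatorname{Cov}(x_i, x_j)\Bigr]
  = \frac{1}{n^{2}}\bigl[n\sigma^{2} + n(n-1)\rho\sigma^{2}\bigr]
  = \frac{\sigma^{2}}{n}\bigl[1 + (n-1)\rho\bigr].
```

For ρ = 0 this reduces to the familiar σ²/n. With samples of 25, even a hypothetical ρ of 0.1 multiplies the variance of the mean by 1 + 24(0.1) = 3.4.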

Gosset published five more mathematical papers 
between 1909 and 1921 ([1913; 1914; 1917; 1919;
1921] 1943, pp. 53-89). With the possible excep- 
tion of the first of these, they are still of interest. 
The 1921 paper gave for the first time the correc- 
tion for ties in calculating Spearman's rank corre- 
lation coefficient. 

Agricultural and other biometric studies. It was 
natural, owing to the high importance of barley 
quality in brewing, that Gosset should have become 
interested in agricultural problems. His active interest
seems to have started in 1905 when he was
first asked for advice by E. S. Beaven, a maltster 
who had started experimental work in the 1890s. 
From then onward there was a constant inter- 
change of correspondence and ideas between them, 
in which the mathematical insight of the younger 
man supplemented the experimental experience of 
the older. 

Gosset’s first meeting with Fisher was at Roth- 
amsted in August 1922; each had the greatest 
admiration for the other’s work and doubtless each 
had considerable influence on the development of 
the other's ideas on experimental design. Toward 
the end of Gosset’s life they had a difference of
opinion about the relative merits of random and
systematic arrangements, but this did not affect
the high regard that they always had for one
another.

In 1911 Gosset examined the results of some 
uniformity trials carried out by Mercer and Hall 
at Rothamsted ([1911] 1943, pp. 49-52). In the 
most important of these, an acre of wheat had been 
harvested in 1/500-acre plots. Gosset showed from 
the results how advantage could be taken of the 
correlation between the yields of adjacent plots to 
increase the accuracy of varietal comparisons, and 
he showed that for a given acreage greater accuracy
could be obtained with smaller plots rather
than with larger plots. 

As early as 1912 and 1913 Beaven had invented 
the “chessboard” design, and experiments had been 
laid down, each with eight varieties of barley on 
yard-square plots, in three centers. These were 
essentially “block designs,” with each variety occur- 
ring once in each block; but within the block, the 
arrangement was balanced rather than random. At 
this time Gosset discovered the correct estimate of 
error per plot for the varietal comparisons, pre- 
cisely the same result as would be obtained from 
an analysis of variance. He compared every pos- 
sible pair of varieties and calculated for each pair 
Σ(d − d̄)², d being the difference in one block.
He added these results together for all n varieties
and divided by ½n(n − 1)(m − 1), where m is the
number of blocks. These experiments were discontinued
during World War I, but in 1923 Gosset and
Fisher discovered, independently, the analysis of 
variance method of obtaining the result. In a letter 
to Gosset, Fisher proved the algebraical equivalence 
of Gosset’s original method and the new one. 
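
The equivalence can be checked numerically. The sketch below uses made-up yields for 8 varieties in 5 blocks and reads the divisor of the pairwise method as ½n(n − 1)(m − 1), which is an assumption about the source's notation. The sum of Σ(d − d̄)² over all pairs of varieties equals n times the residual sum of squares of the two-way analysis of variance, so the pairwise estimate is exactly twice the residual mean square, that is, the error variance of a difference between two plots.

```python
import itertools
import random


def residual_mean_square(y):
    """Residual mean square of a varieties-by-blocks table y[variety][block]."""
    n, m = len(y), len(y[0])
    grand = sum(sum(row) for row in y) / (n * m)
    variety_means = [sum(row) / m for row in y]
    block_means = [sum(y[i][b] for i in range(n)) / n for b in range(m)]
    rss = sum(
        (y[i][b] - variety_means[i] - block_means[b] + grand) ** 2
        for i in range(n)
        for b in range(m)
    )
    return rss / ((n - 1) * (m - 1))


def pairwise_sum_of_squares(y):
    """Sum over all pairs of varieties of sum_b (d_b - dbar)^2, where d_b is
    the difference between the two varieties in block b."""
    n, m = len(y), len(y[0])
    total = 0.0
    for i, j in itertools.combinations(range(n), 2):
        d = [y[i][b] - y[j][b] for b in range(m)]
        dbar = sum(d) / m
        total += sum((x - dbar) ** 2 for x in d)
    return total


# Hypothetical yields: 8 varieties (rows) in 5 blocks (columns).
rng = random.Random(1923)
n_varieties, n_blocks = 8, 5
yields = [
    [rng.gauss(100.0 + 2.0 * i, 5.0) for _ in range(n_blocks)]
    for i in range(n_varieties)
]

pairwise_estimate = pairwise_sum_of_squares(yields) / (
    0.5 * n_varieties * (n_varieties - 1) * (n_blocks - 1)
)
anova_rms = residual_mean_square(yields)
print(pairwise_estimate, 2 * anova_rms)  # the two numbers agree
```

Dividing the pairwise total by n(n − 1)(m − 1) instead would reproduce the residual mean square itself, the error variance of a single plot.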

These chessboard designs were small-scale work. 
For field trials, Gosset and Beaven favored the 
“half-drill strip method,” in which two varieties 
were compared on an area of about an acre. In this 
method, the two varieties are sown in long strips— 
CAACCAAC, etc.—there being an integral number 
of “sandwiches” (such as CAAC). 

The error of the varietal comparison was ob- 
tained from the variances of the differences (C — A) 
either in individual strips or in sandwiches. In one 
such experiment, described by Gosset, on some- 
thing more than an acre, the standard error of a 
varietal mean was found to be about 0.6 per cent. 
Gosset was later criticized by Fisher for preferring 
this method to randomized strips or randomized 
sandwiches. Gosset welcomed the advances in the 
science of agricultural experimentation that came 
from Fisher and his school. His own attitude was a 
very practical one, based on his extensive experi- 
ence in Ireland experimenting with barley. 

A good account of much of this kind of work is 
given in Gosset’s most important paper on agri- 
cultural experimentation, “On Testing Varieties of 
Cereals” ([1923] 1943, pp. 90-114). The paper
also describes some large-scale work carried out by 
the department of agriculture in Ireland during 
1901-1906 to find the best variety of barley to grow
in that country. Here two varieties, Archer and 
Goldthorpe, were carried right through the whole 
period and each tested on two-acre plots in a large 
number of centers. With 50 pairs of plots of this 
size, the standard error of the comparison was still 
about 10 per cent to 15 per cent. However, the
result was based on wide experience. In the half- 
drill strip experiment the corresponding standard 
error was only 1 per cent, but the result applied 
only to an acre, in one place, under very particular 
conditions of soil and season. 

While it was important to plan yield trials in 
such a way as to reduce experimental error and to 
obtain an accurate estimate of it, it was only by 
comparison and analysis of the results from a num- 
ber of soils, seasons, and climates that one could 
judge the relative value of different varieties or 
different treatments. Further, products must also 
be subjected to tests of quality. Conclusions drawn 
in one center could in any case be applicable only 
to the particular conditions under which the trials 
were carried out. While he insisted that “experi- 
ments must be capable of being considered to be a 
random sample of the population to which the con- 
clusions are to be applied,” in an individual center 
he often preferred balanced (that is, systematic) 
arrangements to randomized ones. He liked the 
Latin square, because of its combination of bal- 
ance (to eliminate soil heterogeneity) with a ran- 
dom element, thus conforming to all the principles 
of allowed witchcraft ([1926] 1943, pp. 199-215). 
He was less happy about randomized blocks be- 
cause he felt that a balanced arrangement within 
the blocks often gave a greater accuracy than did 
a random one. Further, he was unwilling to accept 
the result of the toss of a coin, or its equivalent, 
if the arrangement so obtained was biased in rela- 
tion to already available knowledge of the fertility 
gradients of the experimental area. In his last 
paper, “Comparison Between Balanced and Random
Arrangements of Field Plots” ([1938] 1943,
pp. 193-215), he wrote: 


It is of course perfectly true that in the long run, tak- 
ing all possible arrangements, exactly as many mis- 
leading conclusions will be drawn as are allowed for 
in the tables, and anyone prepared to spend a blame- 
less life in repeating an experiment would doubtless 
confirm this; nevertheless it would be pedantic to con- 
tinue with an arrangement of plots known before- 
hand to be likely to lead to a misleading conclusion. 
(p. 202) 


He thought that an experimenter with a knowledge
of his job could arrange the treatments within
a block so that real error, that is, the variance of 
the different treatment means that would be ob- 
tained with dummy treatments in a uniformity 
trial, would be less than if the treatments had been 
randomized. This statement was no doubt often 
true in the domain in which he worked, but its 
general validity has often been questioned. He
distinguished between the real error as here defined
and the calculated error, that is, the error variance 
of the treatment mean, that would be obtained 
from usual analysis of variance procedures. He 
maintained, perfectly correctly, that if the real error 
were reduced by balancing, the calculated error 
would be too high. In his last paper, he showed, in 
addition, that in this situation experiments that 
have a real error less than the calculated one fail 
to give as many “significant” results as those that 
have a greater error, if the real treatment differ- 
ences are small. When, however, the real treatment 
differences are large, the reverse is the case. There- 
fore, if balanced arrangements have a small real 
error, they will less often miss large real differences
and more often miss small ones. He regarded
this as a positive advantage; where real differences 
in a particular center were small, he was satisfied 
to have an upper limit to his error because he 
thought that only by collating results from different 
centers could he arrive at the truth. Where real 
differences were small, even if statistically significant,
the results at different centers were likely to
be conflicting. 

This last paper was written in reply to one by 
Barbacki and Fisher (1936), which purported to 
show that the half-drill strip method is less accurate
than the corresponding randomized arrangement.
Gosset was right in maintaining that these
authors were in error, for they had not compared 
like with like in the actual data they had examined,
a uniformity trial carried out by Wiebe (1935).
However, the data were not very good for deciding
the question, for as subsequently shown by Yates 
(1939), owing to defective drilling they contained 
a periodic fluctuation, two drill-widths wide. Gosset 
would almost certainly have welcomed the com- 
bination of balance and randomization achieved by 
some of the designs invented since his day, which 
are likely to give a gain in accuracy similar to that 
obtained by his systematic designs over randomized
blocks and at the same time are free from difficulties
in error estimation.

In an article on Gosset, Sir Ronald Fisher praised 
“Student’s” work on genetical evolutionary theory 
(see Gosset [1907-1938] 1943, pp. 181-191). He
concluded: “In spite of his many activities it is the 
‘Student’ of ‘Student's’ test of significance who has 
won, and deserved to win, a unique place in the 
history of scientific method” (Fisher 1939, p. 8).


J. O. IRWIN 
[For the historical context of Gosset’s work, see
DISTRIBUTIONS, STATISTICAL; STATISTICS, article on
THE HISTORY OF STATISTICAL METHOD; and the
biographies of FISHER, R. A., and PEARSON. For
discussion of the subsequent development of his ideas,
see ESTIMATION; EXPERIMENTAL DESIGN; HYPOTHESIS
TESTING.]
by E. S. Pearson and John Wishart. London: Univer-
wrote under the pseudonym “Student.” The 1943 edi-
tion contains all the articles cited in the text,
GOVERNMENT


The study of government has followed several 
main lines of inquiry. These include both an exam- 
ination of the source and distribution of authority 
and the classification of types of government (such 
as presidential systems and monarchies), as well 
as analysis of levels of government (including such 
units as national societies, clubs, churches, and 
trade unions). Although a thorough review is im- 
possible here, we can examine the main themes 
which unite these various approaches and evaluate 
their theoretical status.

At the most general level, government consists 
of a group of individuals sharing a defined respon- 
sibility for exercising power. At this level the defini- 
tion applies to cases where government is sovereign,
as well as to cases where it is not. Sovereign gov-
ernment, the most important type, consists of a 
group of individuals sharing a defined responsibility 
for the maintenance and adaptation of a national 
autonomous community, on behalf of which it exer- 
cises a practical monopoly of coercive powers. If by 
“a defined responsibility” we mean its legitimacy 
(the sanctified right to exercise power on behalf 
of others by means of decision making), then the 
characteristics of sovereign government are as fol- 
lows: a government is a group of individuals 
exercising legitimate authority, and protecting and 
adapting the community by making and carrying 
out decisions. [See SOVEREIGNTY and STATE.] 

These characteristics impose certain limits of 
variation upon government. One limit is efficacy, 
i.e., the capacity of a government to cater to com- 
munity needs. A second refers to the internal struc- 
ture of a given type of government, i.e., its form. 
Changes in form which occur when one type of 
government is transformed into another are ordi- 
narily related to the efficacy, or performance, of a 
particular government. Hence, the limit upon gov- 
ernment is observable when for any reason (in- 
ability to make decisions, failures to comply with 
widely distributed but central values) it can no 
longer function. If this is not merely a matter which 
can be remedied by changing the incumbents of 
political office, i.e., if the political roles and sup- 
porting offices are no longer acceptable, then the 
withdrawal of legitimacy denotes that the system 
of government is no longer regarded as appropriate 
by the public; its limits have been breached. At 
that point, change from one type of government to 
another is likely. 

The most common distinctions that have been 
made between types of governments include the 
following: is the government competitive or mo- 
nopolistic? democratic or totalitarian? pluralistic 
or monistic? presidential or monarchical? Of course, 
these well-known categories overlap considerably. 
For example, it is possible to have a totalitarian 
presidential system. As is the case with all dichoto- 
mous variables, these distinctions force the observer 
to put a particular government in one category or 
the other, even when it demonstrates characteris- 
tics of both. Such distinctions are based on two 
criteria: the organization of government and the 
degree of control it exercises over the community. 
These criteria combine the moral, or normative, 
dimension of politics with the dimensions relating 
to governmental structure and political behavior. 
These three analytically distinct elements, i.e., the 
normative, structural, and behavioral aspects of 
government, will now be separately examined. 

Normative aspects of government deal with such
abstract questions as justice, equity, equality. 
Through these, men define their lasting values, 
their ideas of right and wrong. Normative theory 
represents, therefore, certain speculations about 
those aims and activities of government which 
embody the central values and ultimate ends of a 
political community; it defines political legitimacy. 

In contrast, structural principles are those which 
deal with the arrangements and instruments in- 
volved in governmental decision making. Of course, 
they are related to normative issues insofar as the 
form of a government is seen as a means of attain- 
ing the ends of society. Preoccupation with the 
structural dimension leads directly to the analysis 
of alternative forms of government, with normative 
considerations employed in order to evaluate those 
most suitable for realizing the goals of the com- 
munity by means of governmental decision making. 
In the past, structural analysis has been mainly 
concerned with the distribution of political power, 
describing various types of government in terms of 
how widely power is shared by the members of a 
political community. 

Classical writers were particularly interested in 
normative issues and structuralists have been pre- 
occupied with governmental forms. Both groups, 
to the extent that they considered it at all, assumed 
behavior to be a condition of conflict. For both, a 
propensity to conflict is regarded as the normal 
political expression of human nature, much as 
economists assumed that man has a natural pro- 
pensity “to truck, barter, and exchange.” Hobbes, 
for example, put this assumption very sharply as 
“the war of all against all.” Such assumptions led 
both Plato and Hobbes to seek authoritarian gov- 
ernments as the best means of regulating the 
condition of conflict. Other theorists have seen a 
division of powers as the best method to control 
conflict. 

Certain combinations of normative and structu- 
ral approaches have been called institutionalist 
theories. Institutionalists, such as Carl Friedrich 
(1937), Harold Laski (1925; 1935), and Herman 
Finer (1932), concerned themselves with the nor- 
mative and structural relations between law, con- 
stitutional forms, and governmental procedures in 
wide historical, religious, and economic contexts. 
Their considerations included an interest in prac- 
tical reform as well as theory, and they consciously 
built on the formulations of Ostrogorskii (1902), 
Bryce (1921), Graham Wallas (1908), and others. 

Emphasis on the behavioral dimension originated 
largely in the 1930s, with the “Chicago school” and, 
in particular, with Harold Lasswell, who sought to 
introduce behavioral explanations into political
affairs. This is most explicit in his pioneering work 
Psychopathology and Politics (1930). However, 
until recently, few theorists followed this lead. It is, 
therefore, not surprising that the study of social- 
ization processes, motivation, and political culture 
has been handled mainly by political sociologists. 
[See POLITICAL SOCIOLOGY. ] 

Analysis today is characterized by the refinement 
of structural theories of government into a system, 
i.e., a set of interrelated elements which can be 
integrated with behavioral theories. The resulting 
analysis has taken many different forms and has 
made some advances, but it is particularly weak in 
its treatment of normative theory. [See POLITICAL 
SCIENCE and POLITICAL THEORY. ] 


Theories of government 


Historically, most theories of government have 
fallen into one or the other of two main analytical 
sets: mechanistic and organic. 

Mechanistic theories. The first set reflects the 
view that society is composed of competing and 
interacting interests (both individual and group), 
that these generate conflict, and that it is the job 
of government to ameliorate or resolve such con- 
flict. Government is thus a device for finding ways 
to relax tension in the political system. These the- 
ories, relying heavily on the free exchange of infor- 
mation, see government as a point in a flow of 
activity which is initiated by the political com- 
munity. Since government has a decision-making 
apparatus, which responds to tension points in the 
system, the appropriate actions will be forthcoming.
Behavioral tensions represent “inputs,” or stimuli 
affecting political leaders, who by responding to 
them generate decisions, or “outputs” (see Figure 1). 


INPUTS                                   OUTPUTS

Social system --> Government --> Policy decisions

Figure 1. Government as a dependent variable


In Figure 1, assume that the social system is a 
national society. Government responds to a variety 
of societally generated inputs, including customs 
and beliefs (normative characteristics), classes and
interests (structural characteristics), and preferences
and perceptions (behavioral characteristics).
Democratic theories of government are based on 
this model. This is why they have been particularly 
concerned with the establishment of a useful set 
of intervening structural variables between the
social system and government itself. Hence the preoccupation
with the analysis of political parties,
electoral systems, and the like, which are seen as 
devices for improving the relationship between the 
social system and government in order to raise the 
quality and appropriateness of policy decisions and 
increase the efficacy of government. 

It is the liberal democratic approach which ac- 
cepts this view, with government cast in the role 
of mediator and judge in conflicts between contend- 
ing parties. The principles of structural organiza- 
tion are embodied in law, which serves as a frame- 
work for all other forms of organization. Normative 
consensus centers around the maintenance of the 
legal framework itself. Government strengthens 
consensus by ensuring the widest realization of 
norms already held. Such liberal theories are con- 
tained in the ideas of Diderot and d’Alembert, 
Holbach and Helvétius, Condillac and Locke, Rous- 
seau and Hume. It is a tradition which includes 
Voltaire’s innocent rationalism and Bentham’s equi- 
table utilitarianism. What these thinkers had in 
common was an emphasis on individual knowledge 
and shared reason, a position which elevates the 
individual to the center of the political stage. Ration- 
ality is a norm, and it requires a framework in 
which free ideas and competitive views can be put 
forward. 

The Western democratic governments reflect 
variants of this model, but it was also accepted, at 
least in principle, in other areas. For example, the 
constitutions of many of the new Afro-Asian nations
were drawn with this general approach in
mind, even where the actual practice of govern- 
ment is wholly different from its normative and 
structural theory. 

To sum up, the normative assumption underly- 
ing the model is that a social system is composed 
of individuals or groups with an equal right to be 
represented. Structurally, it is assumed that a government
must reflect proper representation; the
behavioral assumption is that competitive conflict 
between the members of the social system renders 
representative forms necessary. The model there- 
fore displays the following characteristics: the unit 
of which the social system is composed is the indi- 
vidual; the ends of the individual are maximized; 
the structure of government is organized in such 
a way that a plurality of ends is maximized; the 
decisions of government, by maximizing a plurality 
of ends, maintain balance or harmony in the social 
system; and the principle of legitimacy is equity. 
The concern of political theorists following this 
tradition is with the improvement of devices that 
government can use to maximize a plurality of
ends. Certain structural procedures have, therefore,
become endowed with the quality of norms.
Moreover, underlying this view is an assumption, 
rather mechanistic, that government is a contriv- 
ance. It does not grow organically. It must be 
established, with each structural principle becom- 
ing endowed with a predictable consequence. Gov- 
ernment is, first, a kind of social physics, with 
particular devices having predictable results. 

Organic theories. The classical view is different 
(as are many contemporary ones). For example, 
both Plato and Aristotle related government to the 
evolution of human society from lower to higher 
forms. Therefore, government was essentially an 
educational body, embodying a set of ideals and 
perfecting rationality, thereby directing the state 
toward a new golden age. Moreover, this conception 
of government has had as durable a tradition as 
the liberal democratic one. Although such views 
were widely accepted in medieval Europe (Gierke 
1881), it was Hegel who gave the conception its 
most powerful rationale and Marx who brought it 
into popular currency. Marx accepted nineteenth- 
century notions of progress but saw in the evolution 
of man’s higher purposes a relationship between 
change in the material world and the unfolding 
human consciousness. Government is an instru- 
ment of this relationship. As such, it has its own 
cycle. It becomes an instrument of revolutionary 
action, of insurrection, which, if successful, rep- 
resents the most dynamic class. It comes to power 
as the instrument which must transform revolu- 
tionary impetus into practical accomplishment. 
Having accomplished this, it will in turn be 
rendered anachronistic and vulnerable until the 
final stage, when government is itself no longer 
necessary. 

Nor was Marx alone in accepting an organismic 
conception of government. More liberal-minded pro- 
ponents include Thomas Hill Green, who saw gov- 
ernment as an instrument of morality. Herder, a 
philosophical romantic, also shared the view that 
government was a transitional phenomenon by 
means of which an “aristo-democracy” would edu- 
cate the public and develop a sufficient level of 
political consciousness to render government super- 
fluous. “The ultimate aim of aristo-democracy, 
Herder saw in the disappearance of the State as 
an administrative ‘machine’ of government, and its 
replacement by an ‘organic’ way of ordering social 
life, in which active cooperation would render all 
forms of subordination obsolete and superfluous” 
(Barnard 1965, p. 77). Similar views were ex- 
pressed by Fichte, Schelling, and Bosanquet. Today 
this approach is particularly attractive in developing
areas, where government is seen as the instrument
of an evolutionary ideal.

The organic evolutionary concept remains an al- 
ternative to the liberal democratic one. It implies 
a role for government, which directs society toward 
higher ends. Evolutionary in conception, this tradi- 
tion is often enriched by ecclesiastic and theocratic 
ideals. It stresses the role of the community over 
and against the role of the individual. Although 
modern organic conceptions elevate man to a cen- 
tral position, they emphasize that the community 
is the instrument of his perfection; such views are 
endemic in revolutionary governments, which see 
themselves as the instruments of social transforma- 
tion. Where the role of government is so central, 
we can say that it becomes the independent variable 
(Figure 2). 


INPUTS                  OUTPUTS

Government --> Changes in the social system

Figure 2. Government as an independent variable


Government is the instrument by means of which 
change is produced in the social system. The pur- 
poses and objects of such change (normative char-
acteristics) are defined by government. The organ- 
ization of government (structural characteristics) 
will depend on the best means to accomplish these 
purposes. The activities of government will include 
whatever symbolic manipulation through educa- 
tion, communications media, etc., is necessary to 
affect both the content and manner of policy deci- 
sion making pursuant to changes in the social 
system. 

In the first model (Figure 1) government is the 
dependent variable and social system or com- 
munity is the independent variable. Power is seen 
to inhere in the public, which creates the inputs of 
stimuli to which government must cater (Eas- 
ton 1957; 1965). In the second model (Figure 2) 
government is the independent variable and social 
system the dependent one. Such systems tend to be 
centralized in the form of their authority. They 
tend to elevate the goals of the government into 
norms and make them sacred and ethical precepts, 
through which legitimacy is defined. Governments 
resembling the first model tend to be competitive, 
pluralistic, and democratic; those resembling the 
second tend to be monopolistic, monistic, and to- 
talitarian. Normatively, the first are more secular 
than the second. Structurally, they are less hierarchically
organized. Behaviorally, they rely heavily
on internalized norms and self-control, rather than 
on external authority. As opposing paradigms, 
these two generalized types, in their various con- 
crete formulations, are perpetually vulnerable to 
each other. Indeed, one can see over time that they 
form a permanent dialogue of conflict. They repre- 
sent two fundamentally different approaches to 
government. 


Political norms and forms of authority 


We can now begin to explore some of the impli- 
cations of that dialogue of conflict. One way of de- 
scribing it is in terms of the difference between 
sacred and secular norms. This difference is im- 
portant because it draws our attention to the norma- 
tive basis of a government and, therefore, to its 
legitimacy. Normative conflict generally takes place 
between sacred and secular beliefs. [See LEGITIMACY.]

Sacred political norms. Governments based on 
sacred norms cover a wide range of cases, ancient 
and modern, including ancient China and many of 
the early Semitic kingdoms. Consider the following 
description: “The Egyptian of historic times did 
not have our doubts and difficulties. To him the 
kingship was not merely part, but the kernel of the 
static order of the world, an order that was divine 
just as much as the kingship was divine. . . . From
the earliest historic times, therefore, the dominant 
element in the Egyptian conception of kingship was 
that the king was a god—not merely godlike, but 
very god” (Fairman 1958, p. 75). Or take the case 
of ancient Greece. There the principles of patri- 
lineal authority and ascribed status were linked to 
an ancestral and religious source, not only for 
kings but for every citizen (Hignett 1952, p. 63). 
Ascribed status applied particularly to priests and 
to the distinction between nobles and commoners.
Hence, political norms represented an explicit
validation of the structure of authority.

Sacredness does not apply only to theocratic or 
primitive societies. It applies as well to many modern
states. The sacred qualities of Marxism-Leninism in
the Soviet Union are today enshrined
in an elaborate philosophical system. Many of its 
sacred attributes were cruelly visible in the various 
purges and trials of the 1930s. It appears, even 
today, in the controversies over the political role 
of literature and the arts. Clearer still is the case of 
modern China. Mao’s prescriptive sayings assume 
such a sacred characteristic as to define the basis 
of political legitimacy. Many new nations share 
this characteristic (although to a lesser degree), 
particularly in cases where an attempt is made to
ritualize the authority of a charismatic or highly
personalistic leader by endowing his words and 
teachings with special insight. 

No government is entirely free from sacred 
qualities, whether these be elaborate and ideo- 
logical or be token symbols, such as a flag, or a 
constitutional document. These aspects of govern- 
ment may be merely ceremonial, or what Bagehot 
called the “dignified part,” or they may represent 
high drama in which the solidarity and unity of a 
community may be expressed. 

It is possible to distinguish three main varieties 
of sacred attachments, which, even if they overlap, 
are analytically separable. Ranking them in ascend- 
ing order of sacredness, they are (1) primordial 
attachments to or beliefs about race, language, and 
nationality (a typical expression of primordial 
attachment is nationalism); (2) philosophical 
attachments (the most generalized moral and 
philosophical ideal in which a total synthesis is 
expressed relating man to his environment and 
specifying the way of the future is to be found 
today in socialism); and (3) religious attachments 
(this refers to religious beliefs in which the origin 
of the society, moral purpose, and a particular pat- 
tern of transcendental beliefs are associated in a 
universal religious doctrine, such as Christianity).

In practice all three of these may be blended. 
Modern populist and totalitarian regimes, as in 
Nazi Germany, mixed primordial attachments of 
race and theological attachments of religion. In 
the Soviet Union during World War II, the symbols
of government were more and more primordial,
i.e., nationalist, and less and less philosophical, 
i.e., Marxist. 

Secular political norms. Secular norms rely on 
a framework of rules rather than on some higher 
purposes of state. The most common cases of secu- 
lar systems are those in which the sacred elements 
have declined through institutionalization. They do 
not disappear, but they become so completely a 
part of the accepted pattern of right and wrong 
that it is not necessary to do more than refer to 
them on ritual occasions. Thus, ceremony rather 
than the substance of belief is characteristic of 
these systems. 

By tacit agreement secular systems reserve the 
“higher” goals to the individual, and these goals 
inhere in his body of private beliefs. If governments 
should violate these norms, they run the risk of 
overstepping their limits of variation and of being 
eliminated. 

We can now cut across the sacred-secular 
normative distinction with a structural variable, the 
pattern of centralized or decentralized authority
[see AUTHORITY]. Once again we must employ a
caveat and remind ourselves that it is always dif- 
ficult to use dichotomous variables to divide con- 
crete cases. What is in theory decentralized may be 
quite the opposite in practice. Or highly centralized 
systems may show informal patterns of consulta- 
tion and accountability to various groups in the 
community. Indeed, at any given time even the 
most highly centralized system may act on certain 
issues in a highly decentralized manner. Moreover, 
centralized government includes monarchical and 
bureaucratic systems, represented by ancient em- 
pires, that can combine monarchy with decentral- 
ized administration [see EMPIRES]. This config- 
uration includes different types of government: 
systems where the hierarchy is based on a king who 
is a father of his people, with authority deriving 
from a totemic ancestor, as in many tribal govern- 
ments, or systems where authority lodges in a patri- 
monial figure and the relationship between ruler 
and ruled is that of patron and client [see KINGSHIP;
see also Fallers 1956].

Centralized authority. Let us ignore all these
variations in form and say boldly that centralized 
power begins from the top and is applied down- 
ward through a specific delegation of authority. A 
military organization or a bureaucracy represents 
a clear-cut “command” case, with autocratic and 
totalitarian governments defined as those which 
employ this system of hierarchy. Government may 
then be represented by a single figure, a king or 
dictator, or by an oligarchy or junta (Friedrich & 
Brzezinski 1956). Such highly centralized systems 
show the following characteristics: concentrated 
power is subject to few checks; power inheres at 
the top; subordinate authority is derivative; and 
there is strong reliance on the personality of a particular
leader. [See AUTOCRACY; OLIGARCHY.]

Decentralized authority. Decentralized author- 
ity represents an opposite conception of power: 
power is generated by the public through the aggre- 
gation of their political wants, is expressed through 
various groups, and is regulated by an abstract sys- 
tem of rules. (Its usual normative expressions in- 
clude the acceptance of the principle of majority 
rule, protection of rights, and representation.)
This is what we mean by a democratic government. 
It is characterized by checks and balances, parlia- 
mentary control over the executive, and some form 
of election as the method of political recruitment 
to sensitive positions. Of course, such practices do 
not exhaust the forms of decentralization. Decen- 
tralization may be functional, based on the allo- 
cation of the economic power in society among 
various groupings, such as guilds, protective associations,
professional associations, and other interest groups.
[See CONSTITUTIONS AND CONSTITUTIONALISM.]

Table 1. The derivation of political typologies

                                     CONCENTRATION OF POWER
                                   Centralized    Decentralized
PREDOMINANT            Sacred           A               C
POLITICAL NORMS        Secular          B               D

The distinctions between sacred and secular 
and between centralized and decentralized types of 
government can form the basis of a more general 
model by means of which to analyze government. 
Table 1 pinpoints the four combinations that will 
be examined in detail. To summarize the possibili- 
ties, the highly centralized and sacred system, A, 
represents modern populist totalitarian govern- 
ments. The centralized and secular system, B, rep- 
resents many autocratic forms of government. The 
sacred and decentralized system, C, includes many 
early forms of theocratic society, from the feudal- 
ism of the High Middle Ages in Europe to religious 
or theocratic governments in America, such as the 
Puritan colonies of New England. And modern 
democratic governments fall into the secular and 
decentralized category, D. 


Sacred and centralized governments 


The modern sacred-centralized type of system is 
likely to be associated with the establishment of a 
new political system. Government is the independ- 
ent variable and is associated with a new moral 
framework. Such conditions commonly apply after 
a major revolution or in territories that have re- 
cently gained independence. [See DICTATORSHIP; 
TOTALITARIANISM.]

Communist governments. The distinguishing 
feature of the sacred-centralized communist government
is the high degree of centralization encompassing
the total community. The sacred object of
government is to transform the material conditions 
of life and the consciousness of the people at the 
same time. The evolution of the community be- 
comes a moral goal, to be sought under the leader- 
ship of a militant vanguard—the Communist party 
—serving as the spearhead of government. In the 
classic, Leninist form of the communist regime, no 
competitive sources of power can be tolerated. In 
recent times, however, a slight trend toward secu- 
larization and decentralization can be seen in the 
Soviet Union (Brzezinski 1962). 

Historically, the Soviet Union is an interesting
case of external beliefs influencing internal social
groups to revolt against a highly autocratic monar- 
chy, a weak parliament (the Duma was only 
founded in 1905), and a centralized bureaucracy 
liberally sprinkled with foreign, particularly Ger- 
man, immigrants (Pipes 1954). Not only was the 
revolutionary instrument based on a small but dy- 
namic working-class movement; Marxism itself 
was largely restricted to Russian middle-class intel- 
lectuals. It was essentially an alien doctrine (trans- 
formed by Lenin to meet Russian conditions) lead- 
ing to a revolutionary organization which later 
became the centralizing mechanism of state power. 
When the religious beliefs of the Greek Orthodox 
church were replaced by the secular ideology of 
Marxism-Leninism, the goals of political develop- 
ment became sacred and formed the new basis of 
the legitimacy of government. Of course, a wide 
discrepancy existed between the theory and prac- 
tice of government. Power was in fact centralized 
in the hands of the first secretary of the Politburo 
of the Central Committee of the Communist party, 
while constitutionally the Soviet Union was a fed- 
eral system with an elected “supreme organ of state 
power,” the Supreme Soviet, which had, in theory, 
the exclusive right of legislation. [See COMMUNISM,
article on SOVIET COMMUNISM.]

This system formed the model of state organiza- 
tion for all other communist systems until relatively 
recently. Since the death of Stalin, however, two 
interesting features may be noted. The sacred 
quality of Marxist-Leninist ideology has declined, 
particularly as younger generations find it less sig- 
nificant as a doctrine than as a ritual; and a trend 
toward decentralization has begun. A struggle is on 
between the communist political leaders and the 
technical specialists, economists, scientists, and the 
like. Moreover, as “polycentricism” on an interna- 
tional level becomes more accepted, the necessity 
for a more “liberal” approach to Marxism-Leninism 
reduces its orthodoxy. Alternative structural ex- 
periments are increasingly common, such as those 
in Hungary, Yugoslavia, and Poland, in which “cultural”
decentralization (in the case of Hungary
and Poland) and economic decentralization (in the 
case of Yugoslavia) represent experiments in great- 
er freedom. [See COMMUNISM, article on NATIONAL 
COMMUNISM; see also Laqueur & Labedz 1962.] 

The communist examples are of particular rele- 
vance because they have become attractive models 
to governments of developing areas bent on follow- 
ing the Soviet pattern of rapid industrialization 
(Ulam 1960).

Fascist governments. Fascist governments were 
more secular in their orientations. The developmental
or evolutionary sacred ideology around
which communist governments try to organize 
their societies embodies certain universalized moral 
aims. In contrast, the sacred attachments of fascist 
governments showed greater attraction to primor- 
dial sentiments, including race and nationality. 
Although there are structural similarities between 
communist and fascist governments, particularly 
with respect to the roles of a powerful totalitarian 
political leader and a weak set of parliamentary 
institutions, one important difference should be 
pointed out. In the communist case, government is 
monolithic, emphasizing the evolution of the entire 
community. Fascist governments, in contrast, toler- 
ated certain corporate groupings. 

Three fascist governments are of interest here: 
Germany, Italy, and Spain. All were highly central- 
ized, but they varied considerably with respect to 
the sacredness and secularity of political norms. 

The strongest attachment to sacred primordial 
political norms was exhibited in Nazi Germany. 
The supremacy of one race and the liberating 
effects of war and conflict were embodied in the 
revived Nordic myths (“Odinism”) and blended 
into a set of nationalist political norms. Structur- 
ally, although the government was highly central- 
ized under a personal dictatorship, four main 
groupings were given exceptional attention: army 
and secret police, large-scale industrial enterprise, 
labor organized into fronts and battalions, and 
military scientists and technicians. The Nazi case 
also shows that even under a highly centralized 
form of government, economic control can be kept 
separate from ownership, with private industry 
continuing to operate under government regula- 
tions. Unlike the situation in communist systems, 
in Nazi Germany a market system of economic allo- 
cation coexisted with government-organized fiscal 
and credit manipulations. Each corporate group
obtained special conditions of privilege. [See NATIONAL
SOCIALISM.]

Italian fascism showed less commitment to pri- 
mordial political norms than Nazism, as well as a 
somewhat less centralized governmental structure. 
The norms themselves were composed of ambigu- 
ous combinations of primordial sentiments, ap- 
peals to historical precedent, and claims to philo- 
sophical universality. Primordial claims were mixed 
with the corporate organization of the state, under
the inspiration of the collegia of the late Roman 
Empire. The “corporation” was thus associated with 
the great period of Italian imperial and cultural 
achievement and became the legitimizing basis of 
the regime. 

A second claim to universality, which was of
minor importance during the period of Italian
fascism, may yet prove to be highly significant. 
This is the view that the proper way to organize 
the state is in corporate groupings functional to 
development and industrialization. In arguing the 
case of corporate government, it was pointed out 
that fascism as a form of government, although 
totalitarian, emphasized the role of the corporation 
as both the point of reconciliation between state 
and individual and the instrument of individual 
expression (see Barker 1942, pp. 328-366). This 
possibility remains as an important structural device,
midway between highly centralized and decentralized
systems. [See FASCISM.]

Both the German and Italian systems contained 
important normative ambiguities, which they at- 
tempted to resolve in the apotheosis of violence. 
This was most apparent in their total repudiation 
of democratic, decentralized forms of government 
(which were regarded as catering to human weak- 
ness). Italian and German fascism both repre- 
sented an authentic totalitarian populism (a mod- 
ern form of tribalism), in which medieval ideas of 
corporation, organic concepts of the community, 
and primordial sentiments were intertwined within 
a highly centralized system of administrative gov- 
ernment. 

The Spanish case has been less ideological and 
less centralized. Despite the exceptional power of 
Generalissimo Franco and the concentration of au- 
thority in the national cabinet, the Falange, as a 
political party, plays a lesser role in government 
than the National Socialist party did in Germany, 
or the Fascist party in Italy (Payne 1961). One 
reason is that within six years of Franco's acces- 
sion to power, Italy and Germany were defeated. 
Their systems no longer served as models of suc- 
cessful dictatorship. Even more important is the 
Catholic tradition, to which the right wing of the 
Falange and significant proportions of the popula- 
tion generally subscribe. [See FALANGISM.] 

More decentralized than the others, and there- 
fore more autocratic than totalitarian, the Spanish 
system remains an extension of an old and estab- 
lished bureaucratic system which traces its roots 
to the imperial Spanish tradition, the Inquisition, 
and a centralized monarchy. Even today the Spanish
government tries to preserve a vague commitment
to monarchy as a traditional form of legitimacy.
This allows the government to revive memories of
Spanish grandeur, treating communism, secularism,
and socialism much as Philip II and Archbishop
Carranza of Toledo treated Protestantism, Islam,
and Judaism (Davies 1937). This fervor would
indicate the presence of sacred political norms,
derived from Catholicism and more or
less indifferent to the structure of authority. Many 
of the same norms served equally well in Perón’s
Argentina, and the more socialist forms of Catholic 
corporatism are sometimes embodied in modern 
and decentralized socialist or democratic govern- 
ments, for example, in Chile. [See CAUDILLISMO.] 


Secular and centralized governments 


The most pronounced characteristics of a secular-centralized
system of government are autonomous
power in the hands of a president or monarch
(or perhaps a presidential monarch); a single po- 
litical party, whether in the form of an elite (a 
communist party) or a populist mass party with 
an elite center (most nationalist parties); a trun- 
cated or largely ritualistic parliament, which does 
not have a real veto power over the executive; and 
an elections system which does not allow effective 
competition between candidates for political office. 

Such centralized systems show several character- 
istic problems common to all forms of centralized 
government with the exception of institutionalized 
monarchies. The most important of these are, first, 
succession to high public office (which is usually 
accompanied by severe struggles for power) and, 
second, the institutionalization of disagreement. 

The normative content of both the communist 
and fascist forms of government gives direction 
and shape to the entire society. Historically, how- 
ever, there have been many cases where the 
normative content is relatively low (or largely 
ceremonial and ritualized), while power remains 
centralized. These include most nineteenth-century 
monarchical forms of government. Indeed, pre- 
cisely because their sacred characteristics were 
emptied of content (while retained in form) they 
were unable to survive as types and were either 
transformed or removed. In France the monarchical 
form, attempted periodically during the nineteenth 
century, had been effectively destroyed by the 
French Revolution. Only Bonapartism had any 
genuine normative success. In Britain the secular- 
ization process began with the transformation of 
the monarchy or, symbolically, with the beheading 
of Charles I. The Act of Settlement of 1701, whereby
the sovereign occupies the throne under a parlia- 
mentary title, established parliamentary supremacy, 
although it took many generations before the full 
implications were worked out (Dicey 1885). Real 
structural changes in the form of decentralization 
were embodied in the widening of parliamentary 
control over the executive and in popular repre- 
sentation from 1832 onward (Gash 1953). If the 
record of historical cases of secular-centralized
government is any guide, then one useful proposition
can be stated as follows: as sacred political
norms become secularized through ritualization, 
government must decentralize, since its legitimacy 
disappears. [See MONARCHY.]

Examples of secular-centralized governments include
czarist Russia (although there were important
theocratic elements incorporated in the role
of the czar), and Bismarck’s Germany. More recent 
cases are colonial administrative governments in 
British and French Africa, south Asia, the Netherlands
East Indies, and the Belgian Congo.

Many of the new nations have gone from one 
form of highly centralized system, under colonial- 
ism, to another, in the form of one-party govern- 
ment, but with a change in the quality of political 
norms. The norms often become endowed with 
intense attachment to primordial loyalties associ- 
ated with race and nation and, to a lesser degree, 
with some aspects of socialism and public owner- 
ship, all wrapped up in a particular ideological 
message, such as Nasserism in Egypt, Nkrumahism 
in Ghana, or “Communocracy” in Guinea. 


Sacred and decentralized governments 


Where new constitutional governments have been 
most successful, they have evolved a shared set of 
political norms deriving from a previous period 
when such norms were explicitly sacred, either in 
an ecclesiastical form, as in the case of the Puritan 
commonwealths, or in a more directly political 
form. This suggests the following proposition: 
decentralized-secular governments are most stable 
and effective when they have developed out of an 
earlier centralized-sacred or decentralized-sacred
form, with norms of self-control becoming behav- 
iorally widespread. Modern democratic government 
emerged when the decentralization of authority and 
secularization of political norms proceeded more 
or less simultaneously with a corresponding in- 
crease in the standard of individual civic obligation 
(Almond & Verba 1963). 

The origins of Western democratic governments 
derive from a synthesis of generally agreed religious 
values, which is associated with a generalized Chris- 
tian ethic. The theocratic origins of democratic gov- 
ernment are not to be taken lightly. Even the 
American experience assumed a unified set of 
Christian (mainly variants of deistic and Protestant)
theological precepts. Law was based on the
prior conception of agreed principles of political 
propriety. The formation of these principles can be 
found in many different theologically articulated 
forms, including the religious wars between Catholics
and Protestants and between various Protestant
groups as well. Nor was the body of precept
within the Catholic church much more unified.
Certainly the conflicts over conciliarism and the 
role of the church councils, not to speak of the 
nationalization of the church itself within each 
country, all testify to the explicitly political conse- 
quences of religion. These issues were so important 
to politics that much of the process of seculariza- 
tion can itself be traced to the search for some 
mutually agreeable and satisfactory common de- 
nominator of precept in order to render politics 
more secular. [See CHRISTIANITY.] 

In the United States this was most clearly recog- 
nized in the works of Brooks and Henry Adams, 
both of whom saw the modern economic state, with 
its emphasis on instrumental values, economic ex- 
change, and corporate finance, as destroying the 
implicit basis of the original Christian values. 
Nowhere is this more explicit than in Henry 
Adams’ essay on Mont-Saint-Michel (1904; see also 
Brooks Adams 1898). Such views, which tended to 
idealize the classical and medieval civilizations, 
were romantic expressions of this religious ideal. 
But in addition to energy, there was doctrine and 
creed. The church militant was not always com- 
posed of simple stuff. Even in Catholic Spain during 
the “golden century,” the conquistadors, who com- 
bined the adoration of the Virgin with great 
greed in plundering the New World and founding 
an empire, were vastly different in their political 
aims from the various religious orders, Jesuit, Ben- 
edictine, and Dominican, and these in turn had 
their constitutionalists, such as Juan de Mariana, 
Francisco Suárez, and Bellarmine (Lewis 1954). 

The secularization of political norms can be seen 
as occurring in three historic steps. The first was 
nationalization of the church. By this means the 
political universalism of the church, symbolized in 
the term “Holy Roman Empire,” was restricted, 
and various national churches arose. 

The second step was the extension of that proc- 
ess to government. It was symbolized in the expres- 
sion “divine right of kings,” whereby authority was 
traced to the Deity through the principle of royal 
inheritance and kinship. This established the idea 
of a sovereign government as a legitimate unit with
rights to protect itself against external sacerdotal 
power. 

The third step was the growth of Protestantism, 
associated first with the unfolding of Christian 
principles through equity and with the radicaliza- 
tion of instrumental values. The transition was par- 
ticularly significant because Protestantism emerged 
as a particular religious ideology with a mutually 
reinforcing synthesis between sacred values and
instrumental objects germane to industrialization.
In this sense, Protestantism was the mode of transi- 
tion from the more explicitly religious form of gov- 
ernment to a more secular one, which merely 
reflected religious values. This is why the roots of 
modern Western constitutional ideas are so deeply 
embedded in the Protestant ideal of the community. 
[See PROTESTANT POLITICAL THOUGHT.] 

In a sense, the secularization of religion emerged 
as a result of the utter loneliness of Protestantism, 
which in Calvin’s doctrine excluded even the church 
from participation in individual salvation. Weber 
makes the point that this is the singular difference 
between Catholic and Protestant doctrine, and the 
result was an emphasis on an individualism held 
in check by the concept of a calling embodying 
good works and sobriety. Rationality was reflected
in a political community of individuals. Thus, self- 
control became the founding ethic of representa- 
tive government, in conjunction with the economic 
doctrine of capitalism. Catholic doctrine, in sharp 
contrast, “punishing the heretic but indulgent to the 
sinner,” retained a conception of the organic com- 
munity that, although not necessarily antagonistic 
to decentralized government, did not support its 
basis in individualism and the doctrine of individ- 
ual representation (Weber 1904-1905; McNeil 
1954). 

The consequences of Protestantism and Dutch, 
British, and American capitalism helped to create 
the conditions of secularization, with a greater 
degree of emphasis on legal and constitutional 
political devices. Weber quotes John Wesley: “I fear 
wherever riches have increased, the essence of re- 
ligion has decreased in the same proportion” (Weber 
1904-1905, p. 175 in the 1958 edition). This view 
is central to modern secular democratic govern- 
ment, where law has replaced religion as the 
foundation of the community. Thus, secularization
in political terms is important in the West because, 
as a process containing a constitutional element 
the object of which is to establish a framework of 
government responsive to change, it leads to an 
explicit acceptance of the idea of the sovereignty 
of the people. Secularization paid particular at- 
tention to the accumulation of wealth as a duty, 
which favored rapid economic growth. The process 
is its own problem, however. Secular and decen- 
tralized government has wrestled with the question 
of how to retain the idea of obligation and re- 
sponsibility in the face of continuous radical 
secularization. 

It should not be assumed that there is a linear 
progression from centralized to decentralized or 
from sacred to secular systems. The opposite occurred
(and in a peaceful manner) in Weimar
Germany. Legitimacy was withdrawn from the con- 
stitutional government when the voters freely chose 
the Nazi party. This implies that the norms of a 
secular and decentralized system were relatively 
weak and insufficiently institutionalized. It also 
means that such a system can operate only when 
self-control and nonpolitical restraints on behavior 
predominate. 


Secular and decentralized governments 


As secularization occurred in Western societies, 
theological obligations were changed into codes of 
civic responsibility. Law replaced religion as the 
basis of political norms. With the rising prosperity 
of Europe, there developed a general belief that 
free, democratic governments with maximum polit- 
ical participation for all would provide a beneficial 
political condition. Indeed, struggles during the 
nineteenth century were over the speed and thoroughness
with which constitutional democracy
would incorporate the entire membership of a sys- 
tem rather than over structural principles of gov- 
ernment. [See DEMOCRACY.] 

A view of government analogous to the approach 
of classical economics was widely accepted. The 
community is composed of voters, who are like con- 
sumers, and their choice is tantamount to consumer 
sovereignty. The election system represents the 
market, in which voters choose their representatives 
on the basis of stated preferences. Government, 
consisting of parliament and cabinet as well as 
administrative cadres, is similar to the productive 
unit and manufactures decisions, which the public 
evaluates through the electoral mechanism. The 
courts are present to ensure that the rules of the 
system are not violated. 

The principles on which the system works in- 
clude a high level of information, rationality as an 
attribute of voting and decision making, and equal 
representation (Downs 1957; Easton 1966). Such 
principles underlie the American form of presiden- 
tial government and the utilitarian systems advo- 
cated by John Stuart Mill and the Benthamites in 
England. Advocates of this form emphasize the 
improvement of information, the uses of education
in order to reach rationality (only the informed 
voter can be rational), and, in particular, the im- 
provement of electoral systems in order to achieve 
the maximum reproduction of public wants in a 
representative chamber. Hence, for example, one 
of the problems considered most important is 
whether proportional representation is preferable 
to simple plurality voting or some combination of 
weighted balloting or lists. [See ELECTIONS, article
on ELECTORAL SYSTEMS.] Important questions also
arise about the role of political parties in govern- 
ment. How do parties, acting as agents by which 
public desires are transformed into government 
cognizance, facilitate the political process? In this 
respect political parties are designed to emphasize 
certain publicly held priorities and make them ex- 
plicit, so that as the problems confronting govern- 
ment become more complex and individuals cannot 
make their views known on all of them, politicians 
stand for some symposium of priorities and on this 
basis are accepted or rejected by the voters. 

The principle of majority rule implies in effect 
that the rightness of a doctrine is measured by the 
degree of support it obtains and that support cre- 
ates power. Hence, majority rule is a principle of 
power which credits the rationality of the majority 
and elevates reason, plus numbers, over abstract 
morality. It is because of this that instrumentalities 
begin to take on their own moral proprieties. [See 
MAJORITY RULE.] 

Not all democratic polities accepted this highly 
individualistic form of government. Two alternative 
forms, one older and one very modern, have stressed 
the idea of the organic community rather than the 
more mechanistic doctrine of individualism. The 
first of these forms, an extension of medieval doc- 
trine, incorporates Catholic beliefs in the context 
of a decentralized state. This includes various spe- 
cific approaches to democratic government, such as 
Christian socialism and Christian democracy. The 
second form is democratic socialism, which em- 
phasizes the democratic state as a means of ful- 
filling conditions of equality and freedom in con- 
junction with the development of the moral and 
material basis of the community. 

Both of these forms see an inadequacy in democ- 
racy, resulting from a contradiction between pri- 
vate ownership of the means of production and the 
maintenance of civic obligation. How can govern- 
ment be secular and decentralized yet retain au- 
thority? If the achievement of democratic govern- 
ment is that it is secular in practice and therefore 
free of formalized commitments to a higher set of 
priorities than those desired by a majority, the prob- 
lem is how to retain that self-control implicit in 
Calvin’s formula. One answer is to study the new 
roles in government, particularly those which pro- 
vide a “calling,” such as the roles of civil servants, 
of members of professions, and of scientists, whose 
sense of responsibility and commitment to the ex- 
change of free ideas is perhaps one of the most 
important characteristics of democratic government 
in highly industrial societies. [See BUREAUCRACY 
and SCIENCE, article on SCIENCE—GOVERNMENT RELATIONS; 
see also Friedrich 1937; Jouvenel 1963; 
Sartori 1962.] 

Types of democratic governments. Democratic 
governments require further differentiation. First, 
they can be divided into unitary and federal forms. 
Unitary governments are based on the position that 
all powers not otherwise reserved belong to a cen- 
tral government. Federal governments take the posi- 
tion that residual powers lie with the component 
geographical units which make up the federation. 
[See FEDERALISM.] 

Second, they can be classified as presidential and 
parliamentary forms. In a true presidential system, 
a president elected by the population is responsible 
to them, rather than to the legislature. The legis- 
lature, in turn, is responsible to the population 
which elects it, and not to the president. This pro- 
vides checks and balances, with the public acting 
as arbiters during periodic elections, held at fixed 
intervals. [See PRESIDENTIAL GOVERNMENT.] The 
parliamentary system shows parliament supreme, 
with a prime minister responsible to it and holding 
office at its pleasure. Through votes of confidence 
and changes in parliamentary party membership, a 
government can fall, in which case a new general 
election to parliament is necessary; the majority or 
plurality of seats won by a political party provides 
a mandate to form a government. Under such a 
system the president (or monarch) is largely a 
figurehead. Where the parliamentary government 
is a constitutional monarchy, the transition from 
earlier forms of monarchy has generally been 
smooth, rather than abrupt, and has been achieved 
by virtue of internal structural changes. Notable 
cases are Holland, Denmark, Sweden, Norway, and 
Great Britain. 

Much of the present concern of democratic 
government lies with the problems associated with 
the growing complexity of modern life and the 
increasingly broad responsibilities which individ- 
uals expect a secular government to take upon 
itself. Whereas the problem in the first half of the 
twentieth century was the improvement of electoral 
and representation methods and of local govern- 
ment and administration, the present emphasis is 
on the work load of parliaments and congresses. 
Both federal and unitary systems today accept the 
principle of one man, one vote, and “popular sov- 
ereignty” tends to mean a form of parliamentary 
representation embodying territorial and demo- 
graphic bases. How to maintain debate on impor- 
tant issues and get through a heavy legislative work 
load is therefore a critical matter. In Great Britain, 
for example, the widening of the franchise was 
accompanied by the decline of the role of the 
private-member bill as the work load of the com- 
mittees increased. Parliament divides according to 
the lines laid down by the party whips, and a free 
vote, each member voting according to his con- 
science, becomes rare (see Wheare 1955; McKen- 
zie 1955). Discipline in parliament has given rise 
to the term “cabinet dictatorship.” [See PARLIAMEN- 
TARY GOVERNMENT.] 

How well parliamentary or cabinet forms of de- 
centralized representative government work de- 
pends a great deal on how political parties carry 
on the work of government. Different structural 
rules obviously affect this. Since all decentralized 
and democratic forms of government have some 
means to control the executive, this may lead, in 
multiparty systems, to cabinet instability, as it did 
in France and does in Italy. The stability of such 
parliamentary governments, therefore, depends on 
the stability of coalitions. Where two parliamentary 
parties are characteristic, instability is rare, partly 
because of rigorous party discipline. [See PARTIES, 
POLITICAL, article on PARTY SYSTEMS; INTEREST 
GROUPS; see also Laski 1951.] 

How to improve decentralized and democratic 
government has posed serious problems of political 
theory. It is not surprising, therefore, that highly 
individualistic conceptions of democracy have been 
replaced by notions of group representation, block 
voting, and the responsiveness of government to 
various groups, such as professional bodies, busi- 
ness lobbies, trade unions, cooperative movements, 
civic and veterans’ organizations, churches, and 
educational and cultural groups. [See PLURALISM 
and POLITICAL GROUP ANALYSIS; see also S. E. 
Finer 1958.] Indeed, so important have group the- 
ories become that interest groups have been called 
in Britain an “anonymous empire.” 

All of these questions relate directly to the re- 
lations between community and government. More 
precisely, they are devoted to an examination of 
intervening variables between community and gov- 
ernment, such as political parties and interest 
groups, and even of subgroups within government 
itself. (See Figure 3.) 


Government in new nations 


Governments formed in new nations—those 
achieving independence after 1945—present some 
of the most interesting and challenging material 
confronting constitutional experts, political the- 
orists, and politicians alike. New governments tend 
to include characteristics of all the forms we have 
discussed (Apter 1965). For example, in Africa 
and parts of Asia there are attachments to primi- 
tive governments, which remain in competition 
with central government for the loyalties of the 
population. Regional clusters associated with a re- 
ligious or linguistic affiliation may represent power- 
ful primordial loyalties, denying legitimacy to a 
central government or countering it with a prefer- 
ence for local primordial attachments. Thus, unity 
and legitimacy within the context of a nation are 
urgent problems facing political leaders. Inasmuch 
as certain aspects of primitive government repre- 
sent a rightful heritage, it may be that traditional 
qualities of legitimacy should be applied to mod- 
ern governmental forms, as has been attempted 
under such ideological forms as “African social- 
ism” or “communocracy.” [See NATION.] 

Furthermore, new nations commonly emphasize 
the positive role of government as the great “engine” 
of social change, actively intervening in all aspects 
of life, from family relationships to educational 
opportunities, from road building to the develop- 
ment of local airways. Government in this respect 
is seen as an independent variable, much as it is, 
for example, in the Soviet Union. 

At the same time, most constitutional patterns 
follow the line of Western parliamentary govern- 
ment. With sufficient control over parliament, 
through the instrumentality of party discipline, it 
is possible to retain popular government, based on 
parliamentary practice and a cabinet system, with 
few formal checks on executive authority. 

In a very real sense, then, the governments of 
new nations tend to become amalgams of the other 
types we have discussed—theocratic, communist, 
fascist, and democratic. Their roots in primitive 
government become identified in normative terms, 
with a mythical past providing a national identity. 
The communist emphasis on puritanism, public 
ownership, and discipline is represented in the re- 
cruitment of a developmental elite and in its method 
of exercising power. The representation of corpo- 
rate functional groups has much in common with 
fascist governments. And the populist democratic 
emphasis and the pattern of parliamentary govern- 
ment represent Western democratic ideals and some 
of its procedures. 

All this is confusing because the various systems 
of government described appear to be so antithetical 
that it would seem impossible to blend them into 
a viable and effective system. This is to a certain 
extent true; hence, virtually all new governments 
are in the process of changing into some more 
stable type. It is not surprising that quite often 
what holds such a government together is allegiance 
to a particular political leader, associated with revo- 
lution or the development of a mass movement. 
Indeed, the outstanding common feature of these 
new governments is their dependence on a person- 
alized leader supported by a dominant political 
party. This tends to be the case whether or not the 
system is formally a single-party state, as long as 
there is a dominant party which is capable of con- 
trolling large regional areas. In the special case of 
new governments, therefore, the crucial factors are 
the relationship between the leader of the govern- 
ment and the leader of the party, and the role of 
the party. Ordinarily the first two are the same 
person, and the party operates government. 

In terms of the various models employed here, 
new governments incline in theory to the position 
that government is an independent variable. In 
practice, however, government is likely to be an 
expression of an elite which manipulates a mass 
party that is itself a reflection of a wide diversity 
of interests. In other words, government and social 
system tend to become incorporated into the broad 
concept of a single party. Party becomes the inde- 
pendent variable, with government the intervening 
one, and social system the dependent one. (Even 
where there is more than one party, the situation 
is not very different. There may be several domi- 
nant regional—tribal or linguistic—parties, rather 
than one party; however, none are deeply com- 
mitted to a single ideology, and in this they differ 
sharply from communist or fascist parties.) 


The situation illustrated in Figure 4 is found in 
its clearest form under conditions of radical trans- 
formation from dependent to independent status 
or directly after a revolution. Since the framework 
within which the parliamentary and cabinet sys- 
tems operate is not entirely eliminated, it intervenes 
between government and changes in the social 
system. In other words, the role of government, 
derived through the formal decision-making proc- 
ess, is the making of technical decisions, while the 
main lines of policy are generally laid down by 
governmental leaders, who are also senior party 
leaders. Classic examples are Nyerere’s Tanzania, 
Ghana under Nkrumah, Nasser’s Egypt, and Al- 
geria under Ben Bella. [See MODERNIZATION.] 


INPUTS -> [Party] -> [Government] -> 


Figure 4 — Government in the pluralistic single-party 
state 

How can new governments be evaluated? Nor- 
mative criteria involve those associated with demo- 
cratic systems and would include the adequacy of 
the protection of individual rights, and the degree 
of pluralism tolerated in government and parlia- 
ment. On this score governments in new nations 
show a mixed record. Some, such as Ghana under 
Nkrumah, have a declaration of fundamental prin- 
ciples to which the president has sworn adherence, 
but these declarations cannot be enforced in the 
courts. Nigeria has a bill of rights. Burma em- 
phasizes the social ownership of the means of 
production. Ghana and India have preventive- 
detention ordinances. It is safe to say that the 
protection of civil rights and liberties varies less 
with governmental form than with the general 
spirit in which government functions. 

Since the legitimacy of new governments tends 
to be rather weak and is often associated with 
highly personalistic leaders (a very vulnerable 
structural condition), another normative criterion 
is how successfully a government develops primor- 
dial loyalties. The quality of such attachments pro- 
vides a basis for evaluation of new governments 
and describes the depth of its emerging political 
culture. [See POLITICAL CULTURE.] 

Primordial attachments tend to become linked 
to problems of economic growth. Planning, tech- 
nical skills, manpower surveys, and the like, are 
important areas of governmental decision making. 
[See ECONOMIC GROWTH.] In addition, they are 
moral or normative concepts associated with the 
objectives which commonly take the form of social- 
ism, since socialism explicitly validates a develop- 
ment ideal and is represented, through party and 
government, in evolutionary ideal terms. Socialism 
also justifies political demands for personal sacri- 
fice and loyalty. How well socialism (or its vari- 
ants) can embody new economic rationality and 
enforce commitment to savings, work, education, 
and development becomes a third evaluative cri- 
terion. [See SOCIALISM.] 

Behavioral consequences of normative beliefs 
thus emerge. Political norms, nationalist primordial 
sentiments, and philosophical ideological expressions 
of socialism combine to 
form the motivational system of the society. Struc- 
turally these elements are organized less frequently 
in parliamentary or representative institutions than 
in functional or corporate bodies within and around 
a political party. Party wings and various related 
interest groups are the devices that link individuals 
to the government. 

Despite all the integrative efforts, the most strik- 
ing feature of new governments is the behavioral 
weakness in their population vis-à-vis government. 
Changing allegiances and ritualization of authority, 
not to speak of the stresses and strains of rapid 
economic change and industrialization, all require 
much study of motivation, of the sources of per- 
sonal identity, and of learning. Indeed, what is now 
called the identity problem, involving examination 
of the conditions under which individuals are able 
to establish a set of personality boundaries com- 
patible with changing normative and structural 
conditions, is a growing concern. Only a few studies 
of the relationship between government and iden- 
tity have been attempted, though many problems 
of governing derive from the search for identity 
(Pye 1962; Erikson 1958; Edelman 1964). 


Contemporary research and theory 


The author has tried to demonstrate a few of the 
emphases associated with the study of government 
and has employed both normative and structural 
variables to differentiate types of systems. Histor- 
ically, the most elaborately studied aspect of gov- 
ernment has been its normative side, which is still 
important in trying to determine how governments 
will evolve because it helps us to relate political 
means and ends. The structural dimension, almost 
as well studied in the literature as the normative, 
has been heavily weighted in favor of the study of 
the constitution as the foundation of government. 
Different constitutional systems have been distin- 
guished through enumeration of their characteris- 
tics. More recently, work on the structural dimen- 
sion, developed along the lines of institutionalist 
analysis and heavily influenced by the work of Max 
Weber, Karl Mannheim, and others, has related 
government to art, religion, philosophy, education, 
and other social institutions. The main concern of 
these scholars has been to study democracy as a 
universal system originating in Western civiliza- 
tion and to contrast it with less evolved forms of 
government, such as monarchy or oligarchy. 

This work led to the development of functional 
analysis in the study of government, with its em- 
phasis on the derivation of more-universalized 
comparative categories. Such studies have by and 
large employed one of two forms of systems theory. 
[See SYSTEMS ANALYSIS.] The first tends to follow 
the organismic analogy. Government is seen in its 
intimate relationships to society. A good set of func- 
tional categories for government will do the follow- 
ing: allow comparison of widely diverse forms and 
actions in terms of their implications for govern- 
ment as a whole; segregate critical activities from 
less critical ones; allow one to observe explicit lev- 
els of explanatory theory (Almond & Coleman 
1960; Apter 1965). 

The second emphasis in contemporary systems 
theory follows a mechanistic tradition. Some of the 
recent work, based on theories of coalitions, is orig- 
inally derived from economic theories and attempts 
to use principles of rational calculation and max- 
imization in order to predict political group be- 
havior [see COALITIONS]. Several of the efforts to 
deal with coalitions begin with the group basis of 
politics and draw their original inspiration from 
the writings of A. F. Bentley [see POLITICAL GROUP 
ANALYSIS and the biography of BENTLEY]. The 
most powerful form of systems analysis following 
this tradition is to be found in cybernetics models 
and game theory. These represent more-generalized 
structural models than the ones used for functional 
analysis. The rules derived for one unit apply to all 
group behavior. In the game-theory approach these 
deal primarily with the consequences for action of 
communications and information. Systems analysis 
of this type involves analysis of attempts, according 
to explicit rules, to maximize gains and minimize 
losses. Formulation of such highly rationalistic 
models can help us to understand political com- 
petition and government actions in priority and 
other settings. [See GAME THEORY; see also Downs 
1957; Snyder 1961.] 

Emphasis on groups has also given rise to an 
important literature with behavioral, as well as 
structural, implications. Behavioral and structural 
aspects of government depend on the analysis of 
government as a group, with reference to its size, 
its patterns of communications, and the ways in 
which motivation and memory are structured 
within it. Work in this area draws on the theories 
of psychologists, e.g., Kurt Lewin, R. Lippitt, and 
Theodore Newcomb, concerning leadership, inter- 
action processes, cohesiveness, control over devi- 
ance, internalization of norms, etc., and tends to 
treat decision making as the main object of analy- 
sis. [See DECISION MAKING; see also Cartwright & 
Zander 1953.] Today such an emphasis can be in- 
tegrated with informational analysis, group theory, 
and game theory in certain cybernetics models 
applied to government. This form of systems theory 
uses the concept of an information grid, in which 
a political system represents the flow of messages 
or of “cues” and government the critical “trans- 
former,” i.e., a coding and decision-making instru- 
ment. It is concerned with adaptability, and em- 
phasizes the capacity of different systems to learn 
and adjust, with government performing an essen- 
tially creative role. One modern political theory 
which attempts to bring together these functional 
and other emphases very systematically is Karl 
Deutsch’s application (1963) of the general cyber- 
netics model, which is an attempt to solve prob- 
lems of learning, creativity, and adaptation in 
politics. 

To account for these new developments, a more 
general way of analyzing governments is required. 
The formulation illustrated in Figure 5 helps to 
move the analysis of government to a highly gen- 
eralized level, incorporating the various approaches, 
new and old, in a single model. In this model politi- 
cal behavior is the independent variable. Political 
norms and structures can therefore be treated as 
intervening variables. Their effectiveness is subject 
to change because political structure is bound and 
limited by the legitimacy pattern established by the 
relationships between political behavior and politi- 
cal norms. Political structure can be seen in terms 
of its consequences for decision making, which, in 
turn, is designed to maintain a sustained pattern of 
political behavior consonant with the maintenance 
of norms. 


It is possible, of course, to enlarge the complexity 
of this model. More important, it is necessary to 
rotate these variables for different purposes and to 
hold each of them independent in turn, in order 
to estimate their effects. Many of the variations in 
approach to the study of government derive from 
holding different ones of these variables independent 
without realizing the specific methodological impli- 
cations of doing so. The selection of variables to be 
held independent is entirely arbitrary. To estimate 
the effectiveness of different forms of government, 
however, it might be useful to treat political struc- 
ture as the independent variable and see how vari- 
ous types—democratic, totalitarian, centralized, 
decentralized, monistic, pluralistic, monopolistic, 
competitive—affect both political norms and politi- 
cal behavior. How does each of these alternative 
types allow political learning to take place? How 
does government preserve continuity? How does it 
affect the course of change? In addition to rotating 
the variables, it is possible to add new intervening 
variables. These may be normative ones, such as 
ideologies, or structural ones, such as political 
parties, administrative organizations, and the like. 


Any general theory of government will require a 
model which is sufficiently explicit to account for 
the limits of variation, sufficiently flexible to handle 
diverse methodological emphases, and empirical 
enough to be fully operationalized. We are still far 
from able to construct such a model, but the fore- 
going discussion shows that at least the foundations 
of the model and the theory have been laid. Still 
needed are improved techniques of gathering data, 
as well as analytical paradigms by means of which 
such data can be related to the appropriate theory. 
These concerns connect the analysis and study of 
government to the philosophy of science by empha- 
sizing logical and epistemological problems, and 
also cause us to speculate about the application 
of highly advanced mathematical and statistical 
techniques and computer programming to the care- 
ful mapping and testing of propositions about gov- 
ernment. The concept of government thus remains 
a critical point of departure for both the evaluation 
of the normative issues of political life and the 
structural and behavioral analysis of politics. 

Figure 5 — A generalized model of government 


DAVID E. APTER 


[Related to the concept of government are the entries 
ADJUDICATION; ADMINISTRATION; ELECTIONS; LEG- 
ISLATION; POLITICAL EXECUTIVE; REPRESENTATION. 
Guides to other relevant material may be found 
under LAW; POLITICAL ANTHROPOLOGY; POLITICAL 
SCIENCE.] 


GOVERNMENT STATISTICS 


Statistical data generated by government sources 
are referred to and cited in many articles in this 
encyclopedia, including CENSUS; ECONOMIC DATA; 
FALLACIES, STATISTICAL; INTERNATIONAL TRADE; 
SAMPLE SURVEYS; VITAL STATISTICS; and the biog- 
raphies of QUETELET and WILLCOX. This article is 
confined to over-all aspects: the relation of statis- 
tics to the establishment of national states; the 
range of substantive matters on which statistics are 
collected; the ways in which that collection is or- 
ganized in different parts of the world; the business 
and professional environment within which gov- 
ernment statisticians work; the problems and diffi- 
culties they face in both developed and underde- 
veloped countries. 


Government statistics in the modern state are an 
essential part of a wider information system. The 
first compilers of statistics did not make a sharp 
distinction between numerical and other facts, and 
it is still true that numerical data complement 
other kinds of information in the process by which 
decisions are made in private and public under- 
takings. These data are as indispensable in a cen- 
tralized as in a pluralistic society; as important in 
the deliberations of government itself as they are 
in the firms, commissariats, or other groupings con- 
cerned with getting out a product. The effectiveness 
of the market mechanism of a free society depends 
on the quality of the information on which its 
entities base their decisions. Western European eco- 
nomic planning consists in large measure in the 
provision of supplementary information to private 
enterprise. A tightly controlled, centralized system 
needs, more than anything else, a feedback—of 
which the statistics it collects are a pivotal element 
—if it is not to be the unwitting victim of its own 
concentration of power. 


Statistics and the state 


The history, the present condition, and the prob- 
lems yet to be solved in government statistics re- 
flect the circumstances of modern national states. 
A governmental administrative apparatus is a prin- 
cipal user of official statistics, and its existence and 
efficiency are a main condition for securing them. 
While sporadic attempts at counting people or goods 
were known in classical times and in the ancient 
empires of the Middle East, there is virtually no 
continuity between these and modern official com- 
pilations. 

Modern history. Like so much else that per- 
tains to the modern state, its statistical system 
emerged about the time of the French Revolution. 
One of the early acts of the Constituent Assembly 
was to see to the publication of a statistical account 
of the resources of France, prepared by Lavoisier. 
In 1800 the Bureau de Statistique was created in 
Paris. The assembly required a census (Articles 1 
and 2 of the Law of July 22, 1791, according to 
Faure 1918, p. 277), which was duly taken in 
1791. François de Neufchâteau, in a circular dated 
the 15th of Fructidor of the year VI (presumably 
September 1, 1799), saw the census—with char- 
acteristic revolutionary exaggeration—as “the meas- 
ure of the strength, the source of the wealth, the 
political thermometer of the power of states” 
(Faure 1918, p. 284). 

Just as British nationality erupted less violently 
than did that of France, so its statistical system 
had a more gradual inception. Early landmarks 
were the Domesday Book of William the Conqueror 
in 1086; the record of customs dues collected in 
the Port of London in the time of Edward I; the 
Tudor counts of men and resources in the face of 
the danger of war; and the registration of deaths, 
initiated by Henry VIII in 1532, when there was 
widespread fear of the plague. 

The dependence of statehood on a statistical sys- 
tem has become more and more clear with the 
passing of time. The mercantilist writers of the 
epoch of the absolute monarchs were concerned 
with the power of the state in peace and war; the 
monarch and his advisers had to have measures of 
the stock of men and other resources [see ECONOMIC 
THOUGHT, article on MERCANTILIST THOUGHT]. 
The British and French colonial administrations in 
North America, and later the British in India, took 
censuses, that of the Canadian province of Quebec 
in 1666 being the first of modern censuses (Linder 
1959, p. 330). With democracy there came to be 
other reasons for statistical compilations. In the 
first of the major federal constitutions, that of the 
United States, a means of determining the political 
weight of the contracting entities was required; 
their relative populations seemed to provide this, 
and the first of the regular series of decennial cen- 
suses of the United States was taken in 1790, 
although a printed schedule was not employed until 
1830. Confederation in Canada in 1867 was based 
on a similar provision for representation by popu- 
lation and was followed by a census in 1871; the 
Australian colonies were united in 1900, and the 
formation of a statistical office followed in 1905. 

Since World War II, dynastic, colonial, tribal, and 
other political forms have been displaced by na- 
tional organizations throughout the world. Along 
with attempts to provide themselves with constitu- 
tions, elections, and the beginnings of modern in- 
dustry, new countries have set up statistical sys- 
tems, both as a precondition for development and 
as a symbol of nationhood. The new statistical style 
is far more ambitious than the colonial model that 
preceded, in proportion as over-all national aims 
are more extended than colonial aims. An impor- 
tant role in promoting the extension and quality 
of statistical work was played by the League of 
Nations, through its Committee of Statistical Ex- 
perts, and subsequently by the United Nations, 
through its statistical and population commissions. 
Under UN auspices a world population census was 
attempted about 1950 and again in 1960, with a 
high degree of national cooperation. In the decade 
of the 1860s censuses were taken in which 17 per 
cent of the estimated population of the planet was 
counted; over ten times as many people were count- 
ed in censuses around 1960, and these constituted 
67 per cent of the population of the globe (Demo- 
graphic Yearbook 1962, p. 1). Results are not uni- 
formly satisfactory; a study of the quality of statis- 
tical organization and statistical output of the 
various countries would undoubtedly show a close 
relation to the quality of governmental administra- 
tion in general. Chile is better organized statis- 
tically than Burma; Burma than Cambodia. 

Government statistics classified. A wide range 
of statistical data has come to be regarded as ap- 
propriate for government compilation and pub- 
lication. 


Classification by source. One way to classify 
government statistics is according to source: house- 
holds for population censuses, family-budget sur- 
veys, employment and unemployment counts; busi- 
ness establishments for production and employment 
data; incorporated and unincorporated firms for 
profit figures; national-government revenue depart- 
ments for foreign trade and income tax accounts. 
The categories are not exclusive; households and 
commercial establishments often provide comple- 
mentary data bearing on the same matter. 

Where economic statistics are essentially a sum- 
mary record of transactions, they are in principle 
available through questions addressed to either of 
the parties to the transaction; retail prices may be 
ascertained through surveys of retail stores with 
respect to goods sold or through household surveys 
with respect to goods bought. If the business estab- 
lishment is more often used, this is a matter of the 
greater availability of records and the larger num- 
ber of transactions implicitly covered by one report; 
in underdeveloped countries, where business is less 
organized, questions addressed to the consumer are 
relatively more favored. For data on employment 
and unemployment, the household seems on the 
whole as satisfactory a source as the business estab- 
lishment. In North America the contest between 
rival figures of unemployment has been an impor- 
tant stimulus to improvement not only of the 
sampling and questioning techniques used in house- 
hold surveys but also of the administration of 
unemployment insurance itself, in respects that go 
far beyond statistics. 

Classification by time of publication. A grosser 
form of classification is by the indication of tem- 
poral trends that series give. Some are issued 
promptly and tell the latest news: weekly railway 
carloadings, department-store sales, stocks of wheat 
in central elevators, stocks of the several metals. 
At the other extreme are full censuses of industry 
and population, taken only at long intervals; their 
results are released over a period of time, beginning 
within a few months after enumeration and con- 
tinuing for some years after the data are collected, 
and they are valuable for the cross-sectional rela- 
tions that they reveal. Because of this configuration 
of prompt summary data and delayed details, the 
national economic picture of a country with respect 
to any moment of time is only gradually filled in, 
over the five or ten years subsequent to that time. 

Government statistical activities include analysis 
and interpretation as well as data collection. Al- 
though knowledge of analysis helps in gathering 
data, and vice versa, the two specializations are 
different. An example of the division of labor on 
this basis is that existing between the United States 
Bureau of the Census and the Department of Labor 
on statistics of employment and unemployment; 
the former agency has the responsibility for gather- 
ing the material, the latter for the official analysis. 

Expansion in modernization. A statistical sys- 
tem seems to start with population censuses and 
foreign trade as the main items; other kinds of 
data—for example, counts of starts and comple- 
tions of residential construction—are added as 
time goes on. As a country develops, the need for 
statistics mounts with the variety and difficulty of 
decisions required by an increased division of labor 
in the economy. A demand for factual justification, 
in terms of which people can explain their de- 
cisions to themselves and to others, seems to have 
characterized the North American continent from 
an early date. General Francis A. Walker, speaking 
before the International Statistical Institute, point- 
ed this out in 1893: “A strong passion for statistics 
early developed itself in the life of our people, and 
such statesmen and publicists as Hamilton . . . be- 
came working statisticians. . . . No government in 
the world has ever lavished money and labor . . . 
more cheerfully and patiently in this respect” 
(quoted in Cummings 1918, p. 573). But this cul- 
ture of facts is no longer confined to the United 
States, to those of English speech, or to Europeans; 
it is becoming world-wide. 

The proposition that the statistics collected by 
any government are a function of its interests and 
responsibilities can be exemplified in the process of 
decolonization. When much of the world was under 
the hegemony of the states of western Europe, the 
statistical system centered on foreign trade. What 
was important for British administrators was the 
amount of rice exported from Burma, of jute ex- 
ported from Bengal; the Netherlands wanted to 
know the amounts of coffee, sugar, and rubber 
exported from Java in the periods when each of 
these commodities was at its height. Another phase 
of colonial development was the land tax; the land 
tax made it both necessary and possible to have 
figures on acreages and production of the main 
crops. A further stage was some rudimentary con- 
cern with people, expressed by population censuses. 
The building of railways in India and Java, for in- 
stance, was followed by the collection of transport 
statistics—data that no one would have collected 
when oxcarts, proas, and the backs of men were 
the principal means of shipping commodities. And 
the evolution that occurred in the colonies in Asia 
was paralleled in Latin America, where the several 
governments, although independent, had interests 
nearly as restricted as those of the colonial powers. 


The advent of independence in Asia and of the 
welfare state everywhere has increased the range 
of government statistics. Every country concerned 
with development is trying to expand its educa- 
tional system, and statistics of schools, teachers, 
and pupils are nearly universal. In England and 
Wales educational statistics of a kind date back to 
the 1820s and 1830s, when public money was first 
given to the schools (Baines 1918, p. 377). Vital 
statistics are being gradually improved, a process 
that requires the inculcation of the habit of record- 
ing births and deaths not only on the part of the 
hierarchy of the civil registration system but also 
on the part of the medical profession and the public 
at large; such institutionalization will take at least 
a generation or two in the new countries. Mean- 
while, sample surveys are being introduced, to ob- 
serve the birth rate and its changes, as well as 
many other population and economic phenomena. 
Family-budget surveys, which had been foreshad- 
owed by colonial governments in their moments of 
welfare-mindedness (a coolie-budget survey was 
made in the Netherlands Indies in the 1930s), are 
becoming more frequent; the government of India, 
through the Indian Statistical Institute in Calcutta, 
has been particularly active in this and other types 
of household survey. 


Organization of statistical services 


Every government has sooner or later to consider 
the organizational framework within which its sta- 
tistics will be collected. When left to themselves, 
individual departments of the central government 
tend to collect whatever data their administration 
generates, and they hardly separate the collection 
of such information from administrative control of 
their operations. When the act regulating the col- 
lection of duties in the United States was passed in 
1789 (Cummings 1918, p. 579), it required the 
collector of customs to record ships’ manifests and 
other information connected with trade; the series 
of foreign trade statistics for the United States ac- 
cordingly dates from 1789. The British Post Office 
collects and publishes the returns of the postal 
service; the British Department of Inland Revenues, 
established in 1849, collects the estate and stamp 
duties, land taxes, and income taxes. But with increase 
of scale, such departments come to separate 
their statistical from their administrative work; the 
British Board of Trade formed within itself a statistical 
branch as early as 1833 (Baines 1918, p. 
374). Subsequently, labor, local departments of 
agriculture, and many other departments or boards 
came to have their own statistical units, not only 
in the United Kingdom but in the United States, 
the British dominions, and the countries of western 
Europe. The Interstate Commerce Commission 
(ICC) of the United States was from the start in- 
sistent on securing data from the railways it regu- 
lated, and ICC publications constitute an important 
series. 

Sometimes one of these agencies would be as- 
signed the taking of a decennial census, a periodic 
task required of the registrar-general in England 
and Wales, whose continuing work was the civil 
register and vital statistics. In Canada the decen- 
nial census provided for in the British North Amer- 
ica Act was the charge of an office established in 
the Department of Agriculture in 1905. But in all 
countries, increasing attention to statistics led to 
proposals for centralization in a specialized agency. 

Degree of centralization. With the establish- 
ment of even small statistical offices within gov- 
ernment departments, the work of statistics began 
to benefit from some degree of separation from the 
day-to-day exigencies of administration. 

Centralization in national government. The col- 
lection into a single office of all the statistical 
activities of the national government was con- 
sidered, in order to secure even more of the gains 
of specialization. To the argument of efficiency 
(concentration of specialized personnel and equip- 
ment to deal with the large scale of centralized 
statistical work) was added that of avoidance of 
duplication, a perennial hazard of government work 
and one which appeared in statistical work from 
the earliest days. Separating statistical work from 
the other work of government lessens the pressure 
on the statistician to distort his results to protect 
political or administrative interests—reference is 
made below to the judicial aspect of the statis- 
tician’s function. The better coordination of a cen- 
tral system, moreover, ought to permit the recog- 
nition of gaps in data. It should also make easier 
the adoption of uniform classifications; it is highly 
desirable that production and exports, for instance, 
be recorded on the same commodity classification, 
so that comparisons may be possible. But anyone 
who knows the tendency to autonomy of the di- 
visions of a governmental organization will realize 
that having the several compilations under a single 
roof is no automatic guarantee of uniformity; the 
several sections can pursue different courses. One 
special danger of a centralized organization is that 
it will become so self-contained as to be immune 
to the needs of the users of its data. Centralization 
offers high returns when the central organization 
is well coordinated internally and alert to the 
changing requirements of outside users. 

The administrative argument for centralization 
was early carried into effect, in one degree or another, 
in the Netherlands, Canada, Australia, Germany, 
and Italy. About the middle of the 19th 
century the Netherlands took important steps in 
this direction; in 1848 a statistical bureau was 
established in the Department of Home Affairs 
whose responsibilities were far wider than the tabu- 
lation of data generated in the department. Canada 
and Australia committed themselves to a statistical 
system that was centralized in the double sense that 
statistics were at least as much the affair of the 
central government as of the states or provinces or 
of any lower level of government and that among 
the agencies of the central government there was 
one with pre-eminence in the collection and tabula- 
tion of statistics in a number of different fields. 
The Canadian system has been in principle entirely 
centralized since the founding of the Dominion 
Bureau of Statistics in 1918. A degree of central- 
ization was arranged in Germany with the estab- 
lishment of the Imperial Statistical Office in 1872, 
the year after the establishment of the German 
Empire; in Italy centralization was begun in 1861, 
the year of the constitution of the monarchy, with 
a directorate of statistics that was the ancestor of 
the present Istituto Centrale di Statistica. 

In the United States and the United Kingdom 
practice has evolved closer to the decentralized pole 
of the centralized-decentralized continuum. The 
means of coordination in the United Kingdom is 
an office in the Cabinet Secretariat that operates 
through a series of understandings with the heads 
of statistical divisions in the Board of Trade, the 
Ministry of Food, the Ministry of Agriculture, etc. 
All the statistical divisions in departments are di- 
rected by statistician members of the Professional, 
Scientific, and Technical Class, one of the five 
classes of the British civil service; appointments 
anywhere are open to members of the statistician 
class irrespective of where they are serving. The 
Treasury approves the expenditures of statistical 
departments, and on new expenditures they are 
advised by the Central Statistical Office. 

The American arrangement is more formal; the 
Office of Statistical Standards of the Bureau of the 
Budget not only reviews all requests for funds to 
carry out statistical work within the federal govern- 
ment; its approval is required before any federal 
government questionnaire can be sent to ten or 
more respondents. This requirement dates back to 
the Federal Reports Act of 1942 and was intended 
to reduce the burden on respondents and to elimi- 
nate duplication. Where duplication exists, it not 
only is wasteful of government funds but also 
arouses the fiercest resentment of respondents who 
are required to answer the same questions more 
than once. 

The best coordination does not consist exclusively 
of the negative injunctions of an enforcement 
agency. An example of a more positive kind is the 
collaboration of the U.S. Social Security Adminis- 
tration, essentially an operating agency, and the 
Bureau of the Census in the creation of a publication 
showing county business patterns based on 
Social Security Administration records. The U.S. 
Bureau of the Census has used tax records to elimi- 
nate business-census questionnaires for a million 
retail establishments without employees. By using 
Social Security records as a source of lists, the 
Bureau of the Census is able to take the economic 
censuses at reduced cost. 

Coordination requires fine judgment at a thou- 
sand points. Should both the U.S. Bureau of Mines 
and the Bureau of the Census continue to secure 
mineral statistics? The Bureau of the Census col- 
lects benchmark data on an establishment basis 
every five years from all establishments within the 
scope of the census, assigning to the mining indus- 
try the output of all those that are classifiable as 
mining. The Bureau of Mines collects information 
more frequently, on the output of minerals as well 
as on engineering and technical matters. These 
data are thought to be independently valuable 
and different enough not to seem to constitute 
duplication. 

Division of responsibility. Along quite a differ- 
ent dimension is the division of responsibility be- 
tween the national government and the local gov- 
ernments of states or provinces and cities. In this 
dimension the United Kingdom is highly central- 
ized, while Germany has problems because of the 
division of statistical activities among the Länder, 
and even in the Netherlands there is some devolu- 
tion of statistical activities to organizations in the 
cities. 

Every national statistical system must depend 
on local sources for its information. At the one extreme 
is complete central control, as found in the 
national census of countries such as the United 
States and Canada, in which the local officers are 
appointed and instructed by the center through a 
training ladder that inculcates definitions and pro- 
cedures that are, at least in principle, fully de- 
termined centrally, and which is backed by a cen- 
tral budget from which all participants are paid. 
More decentralized is the census in countries such 
as Argentina, where the plans are made by a na- 
tional statistical office but where the budget does 
not include the funds for paying enumerators, who 
are government employees on the payroll of other 
departments of the center or the provinces (for 
example, schoolteachers), co-opted for one or several 
days’ work on the enumeration. Coordination 
of such an operation requires feats of diplomacy. 
Further down on the scale of decentralization is the 
system of vital registration, used in the United 
States and other countries, in which local registry 
officials report to a state authority and whatever 
national uniformity exists is the result of negotia- 
tions in which the national office can take leader- 
ship more by virtue of its persuasiveness than be- 
cause of any legal or budgetary power. 

Comparability and change. The government 
statistical agency must have both continuity and 
adaptation of its output to changing needs, a com- 
bination of virtues easier to prescribe than to fol- 
low. In some instances an agency is very rigid; the 
disposition to continue collecting the same data in 
the same fashion year after year is both the strength 
and the weakness of government agencies. But at 
other times it changes arbitrarily; A. L. Bowley 
charged that the United Kingdom census of 1921 
was deliberately made noncomparable with any- 
thing that went before, a statement challenged by 
Greenwood et al. (1932, p. 279). Again, judgment 
is required; the elimination of old series and the 
initiation of new ones and the modification of defi- 
nitions that make a given series more useful for the 
future but lessen its comparability with the past 
must be discussed on a more specific level than this 
article can attain. 

Practices that are good in one place and time 
may not be so in a different economy and society. 
Attempts to transfer categories of statistical com- 
pilation from a country in which money economy 
dominates to one in which households produce for 
their own use rather than for exchange may not be 
satisfactory. Some of the greatest difficulties in collecting 
statistics within the developed countries are 
in the sector of subsistence agriculture. The rela- 
tion of statistics to industrial practice is brought 
home especially strikingly by the present situation 
in the United States, where the National Bureau of 
Economic Research and the Bureau of the Census, 
among other agencies, are trying to find how indi- 
vidual series can be improved or replaced. With the 
shift of traffic to trucks and airlines, carloadings 
have become much less useful. Department-store 
sales mean less when many of the lines of mer- 
chandise are also sold by discount houses and 
drugstores. The inventive spirit that discovered 
department-store sales and carloadings as key fig- 
ures in the economy is needed now to go beyond 
these. 

Interactions with environment. The govern- 
ment statistical agency is by no means a free agent 
but depends on the administrative practices of 
other governmental agencies and of business concerns. 
Statistics of imports depend on valuations 
made for customs purposes; comparing the exports 
of newsprint or metals from Canada to the United 
States, for instance, with the imports of nominally 
the same commodities into the United States from 
Canada will show how substantial is the effect of 
varying definitions of commodities as well as of 
sources and destinations. Since definitions are often 
part of procedures for customs valuation, and these 
are embedded in laws around which substantial 
material interests have developed, discrepancies 
between countries are likely to persist despite many 
conventions and attempts at accommodation, from 
that of Brussels in 1910 to those of the United 
Nations Statistical Commission. 

Effects of accounting practices. One important 
network in which the statistical agency is en- 
meshed is the accounting practices of industry, 
themselves in part the consequence of the regula- 
tions of the government income tax department. 
The art of statistical collection includes finding 
modes of definition of the entities about which in- 
quiry is made that are most in conformity with the 
accounting practices of business and provide the 
most useful information to the public. The indus- 
trial establishment, as against the firm, is generally 
taken as the primary element for collection of in- 
dustrial statistics and is defined in practice as the 
smallest production unit that maintains more or 
less complete records. The statistical agency does 
not always play a purely passive role; it is often in 
a position to exercise influence and initiative. In 
Canada the federal agency concerned with finan- 
cial statistics of the 5,000 municipalities of the 
country was able to persuade them to adopt a 
standard accounting procedure, which benefited 
them as well as national statistics. 

National professional associations. The environ- 
ment within which a statistics-producing office of 
government has to work includes, besides the oper- 
ating agencies of the same government and the 
productive units of the country at large, a host of 
professional associations, as well as individuals in- 
terested in statistics. The American Statistical As- 
sociation in 1844 petitioned the Congress of the 
United States to recompile the census of 1840; its 
policy of protesting poor work and supporting good 
in the federal statistical field has operated to the 
present day, although the need for its intervention 
is not as great as when the government service was 
staffed almost exclusively by amateurs (Bowman 
1964). 

International professional associations. To the 
national associations have now been added important 
international ones. First among these was 
the International Statistical Institute, from the nineteenth 
century on a persistent and wholesome influence 
in favor of high standards and comparable 
classifications. The Economic and Social Council 
of the United Nations, through its statistical and 
population commissions, has had an effect in stimu- 
lating countries to take censuses; to adopt a mini- 
mum set of questions in these, for which a serious 
attempt would be made to attain international com- 
parability; to use modern techniques, in the inter- 
est of accuracy and promptness of release of data. 
Sensitive to the desire of nations to be independent 
of outside interference in statistics as in other 
fields, the council has made its chief weapon the 
argument of international comparability in sta- 
tistics of production, as well as of population. Also 
of influence have been the regional agencies of the 
United Nations for Asia, Europe, Latin America, 
and now Africa; the Inter-American Statistical As- 
sociation; the specialized agencies of the United 
Nations, including the United Nations Educational, 
Scientific, and Cultural Organization (UNESCO), 
the Food and Agriculture Organization (FAO), the 
International Labour Organisation (ILO), etc., in 
their respective fields. The resolutions and the 
conventions that these have drawn up often suffer 
from being stated in highly abstract terms, but they 
are supplemented by technical assistance services 
to less developed members, including the sending 
out of advisers and the providing of training facili- 
ties. The publications of international bodies have 
not only been directly useful to scholars and others 
but also have been an incentive to members to 
secure data. FAO conferences of groups of neigh- 
boring countries have constituted an effective 
form of pressure for improvement of agricultural 
statistics. 

Other influences. Series of government statis- 
tics have often been initiated from the outside. The 
Metropolitan Life Insurance Company began col- 
lecting statistics of labor turnover in the United 
States in 1926, and these compilations were later 
taken over by the United States Department of 
Labor (Hauser & Leonard 1946, p. 363). In Canada 
statistics of wage rates collected by the Bell Tele- 
phone Company for its own purposes have influ- 
enced federal government collections. Statistics of 
stocks on hand have often been started by associa- 
tions of manufacturers and then continued by a 
public statistical agency. In the Netherlands one of 
the influences that led to the centralized system 
was the private Union for Statistics, founded in 
1856 (Stuart 1918, p. 435), which published many 
volumes of data, although it did no primary col- 
lecting. Later this private union was subsidized by 
the government, and ultimately all of its activities 
were taken over. Especially well known is the pressure 
that Quetelet, the Belgian statistician, exerted 
through his researches as a private citizen between 
about 1825 and 1841 (when he became a public 
official). His avid interest in the regularity of 
certain social phenomena, e.g., the “budget of the 
scaffold,” finds its monument in the extensive series 
of criminal statistics produced in Belgium and 
many other countries (Julin 1918, p. 128). In 
England the first edition of the essay by Malthus 
that attracted so much attention to population was 
followed within three years by the census of 1801 
[see the biographies of QUETELET and MALTHUS]. 

There are many other examples of the influence 
of men and ideas—as well as outside organizations 
—on government statistics. The collection of eco- 
nomic data during the present generation has been 
altered by the recognition of national accounts as 
a general framework. The concept was developed 
by scholars in universities in the United States and 
Great Britain, following suggestions of J. M. Keynes. 
The notion of national income makes it possible to 
arrange a great variety of existing series in the 
pigeonholes of a national-accounts framework; the 
output of the statistical systems ceases to appear 
arbitrary, for each portion measures a contribution 
to the gross national product; certain gaps become 
visible—for instance, personal services—and there 
is pressure to fill them. Furthermore, the examination 
in this fashion of economic statistics as a 
whole permits the grand total of the country’s eco- 
nomic activities to be calculated in at least three 
ways—by value added, by income received, and by 
expenditure—which have important elements of 
independence as to source of data. The confronta- 
tion of calculations of a total derived from more or 
less independent statistical sources has been an 
incentive to improve the accuracy of all com- 
ponents. 
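
The three-way calculation described above can be illustrated with a minimal sketch. The two-industry economy, all of its figures, and the names `Industry`, `total_by_value_added`, `total_by_income`, `total_by_expenditure`, and `discrepancies` are hypothetical, chosen only to show how totals drawn from different sources can be confronted with one another; no actual national-accounts system is reproduced here.

```python
from dataclasses import dataclass


@dataclass
class Industry:
    """One producing sector of a toy economy (all figures hypothetical)."""
    name: str
    gross_output: float         # value of everything the sector sells
    intermediate_inputs: float  # purchases from other sectors used up in production
    wages: float                # income paid to labor
    profits: float              # income retained by owners


def total_by_value_added(industries):
    # Sum over sectors of output minus the inputs bought from other sectors.
    return sum(i.gross_output - i.intermediate_inputs for i in industries)


def total_by_income(industries):
    # Sum of the incomes generated in production.
    return sum(i.wages + i.profits for i in industries)


def total_by_expenditure(final_purchases):
    # Sum of spending on final goods and services.
    return sum(final_purchases.values())


def discrepancies(estimates):
    # Deviation of each estimate from the mean of all estimates.
    mean = sum(estimates.values()) / len(estimates)
    return {name: value - mean for name, value in estimates.items()}


# A farm sells wheat to a bakery; the bakery sells bread to households.
industries = [
    Industry("farm", gross_output=100.0, intermediate_inputs=0.0,
             wages=60.0, profits=40.0),
    Industry("bakery", gross_output=250.0, intermediate_inputs=100.0,
             wages=90.0, profits=60.0),
]

# Assumed result of an independent household survey that misses some purchases.
final_purchases = {"household consumption": 245.0}

estimates = {
    "value added": total_by_value_added(industries),
    "income": total_by_income(industries),
    "expenditure": total_by_expenditure(final_purchases),
}

for name, gap in discrepancies(estimates).items():
    print(f"{name}: {estimates[name]:.1f} (deviation from mean {gap:+.2f})")
```

In this sketch the production and income figures agree because they come from the same hypothetical establishment records, while the household survey falls short; a gap of that kind is what directs attention to the weaker component.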

The current prestige of economic planning has 
had important effects on government statistical 
work. Planning may be good or bad, effective or 
ineffective, but in all instances it requires data. 
One cannot even go through the motions of plan- 
ning without statistics. Government planning agen- 
cies requiring data are in an especially strong posi- 
tion to see that the necessary resources are allocated 
to their collection. 

The statistical profession and government. Un- 
derlying recent developments is a rapid profession- 
alization of statistics. Times have changed greatly 
since Francis A. Walker lamented, in an address 
before the American Statistical Association in 1896, 
“I do not know of a single man now holding, or 
who has ever held, a position in this country as the 
head of a statistical bureau, or as chief of a statistical 
service, or as a statistician, who had any elementary 
training for his work” (quoted in Cummings 
1918, p. 574). A recent Bureau of the Census 
survey of its staff showed 576 employees in professional 
statistical positions, of whom all but 22 had 
an academic degree, including 36 Ph.D.’s and 102 
M.A.’s (Bowman 1964, p. 14). The change has 
come about through the extension of the field. Sta- 
tistical method is applied to acceptance sampling 
and quality control in industry, to experimental 
design in agricultural trials, to bio-assay, operations 
research, and sample surveys of national popula- 
tions. The interchange among these applications 
and between each application and a rapidly ex- 
panding mathematical statistical theory has had a 
decisive effect. Backed by a theory based on proba- 
bility and relatively well-established methods of 
application of the theory; with a rapidly growing 
body of literature in books and professional jour- 
nals; with departments of statistics in a number of 
universities, and courses in nearly all; there is less 
and less need for the government statistical agency 
to depend on the gifted amateur like Quetelet in 
Belgium, Knibbs in Australia, or Coats in Canada. 
And yet, present-day expansion and progress can 
be matched in quality by some of the early work. 
Florence Nightingale had much effect on the Brit- 
ish War Office through her statistics showing 
deaths from battle and from disease separately. 
Her data on the army hospitals in India led to im- 
portant reforms. It is not that the modern profes- 
sional is better than the old-fashioned gifted ama- 
teur, but the former is more certain to be on hand 
when needed and commands the tools that have 
now been created. 

Sampling in the new professionalization. Among 
the large-scale sample surveys now conducted are 
those on a monthly basis in the United States and 
Canada, quarterly in the Federal Republic of Ger- 
many and other European countries, and virtually 
continuously in India. The topics of survey are 
almost as wide as the topics of statistics itself [see 
SAMPLE SURVEYS]. 

Like any important change in technique, sam- 
pling does not merely attain the earlier objectives 
at lower cost; it brings a radically new viewpoint 
on the whole process of data collection. Instead of 
thinking of himself as charged with passively com- 
piling given data, the statistician orients himself to 
the purpose of the compilation and the degree of 
accuracy required if the purpose is to be served; 
thus, he is led to take account of the nonsampling 
as well as the sampling error to which his work is 
subject. He sees himself at his professional best in 
measuring error, controlling quality, and evaluating 
results. He comes to measure information, 
not by the mass of data he can turn out, but rather 
by the accuracy attained in a single figure. The 
percentage of the labor force unemployed, for ex- 
ample, may be the occasion for a major decision 
in regard to the economy. If there is an allotment 
to improve the figure, he must decide what amount 
should be spent to improve the accuracy of re- 
sponse and what to increase the sample. In other 
words, he must decide if there will be more return 
from applying a given effort to the sampling or to 
the nonsampling error. The changed attitude toward 
error—the view that it is both inevitable and to be 
constantly combated—is seen in the United States 
Bureau of the Census, where the 1950 and subse- 
quent population censuses have been accompanied 
by an official estimate of the degree of under- 
enumeration, an estimate in part made by having 
superior enumerators survey a sample of areas. 
This practice has been applied in Canada since 
1951. It is the mark of the professional statistician 
that he does not assert that his results are exact; 
even when he has done the survey by the best 
means that are available, he is satisfied to con- 
sider his one survey as an arbitrary selection from 
all the surveys that might have been carried out 
at about the same time and with equally accept- 
able definitions [see ERRORS, article on NONSAM- 
PLING ERRORS]. 
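
The choice between spending on a larger sample and spending on more accurate response can be sketched numerically. Everything in the sketch is an assumption made for illustration: the error of an estimated proportion is taken as the root of sampling variance plus squared response bias, the variance is that of a simple random sample, and the bias is supposed to halve for every `bias_halving_cost` units spent on response quality. The function names and all parameter values are hypothetical.

```python
import math


def root_mean_squared_error(p, n, bias):
    """Total error of an estimated proportion: sampling variance plus squared bias.

    Assumes a simple random sample of size n (hypothetical simplification).
    """
    return math.sqrt(p * (1.0 - p) / n + bias ** 2)


def best_split(budget, p=0.05, cost_per_interview=10.0, base_n=1000,
               base_bias=0.01, bias_halving_cost=5000.0, steps=100):
    """Search a grid of budget splits and return the one with the smallest error.

    Returns a tuple (error, spent_on_sample, spent_on_response_quality).
    The cost figures and the bias-reduction rule are illustrative assumptions.
    """
    best = None
    for i in range(steps + 1):
        to_sample = budget * i / steps
        to_quality = budget - to_sample
        # More interviews shrink the sampling error.
        n = base_n + to_sample / cost_per_interview
        # Better questioning and training shrink the nonsampling (response) bias.
        bias = base_bias * 0.5 ** (to_quality / bias_halving_cost)
        error = root_mean_squared_error(p, n, bias)
        if best is None or error < best[0]:
            best = (error, to_sample, to_quality)
    return best


error, to_sample, to_quality = best_split(budget=20000.0)
print(f"spend {to_sample:.0f} on sample size, {to_quality:.0f} on response quality")
print(f"resulting root mean squared error: {error:.5f}")
```

Under these assumed figures neither extreme is best: putting the whole allotment into sample size leaves the response bias untouched, while putting it all into response quality leaves the sampling error at its original level.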

Enduring problems of official statistics. A sur- 
vey of the statistical agencies of any government, 
even those of a very advanced country, would show 
the continuance of some out-of-date practices and the
publication of some figures that are not usable be- 
cause they are based on purely administrative defi- 
nitions of the entities counted or because the error 
is not stated by the agency but may be presumed 
to be large. Besides the valuable statistics they 
contain, library shelves are weighted with irrele- 
vant, useless, and inaccurate statistics, a situation 
to be deplored in proportion as good techniques are 
available. 

Government statistics’ judicial function. Espe- 
cially pernicious are those inaccuracies intended to 
serve some political purpose. The strongest argu- 
ment for statistical centralization relates to what 
may be called the judicial function of the official 
statistician. The statistics of foreign trade may
show that the government's trade policy is going 
badly; the statistics of prices may show that its 
monetary policy is leading to inflation. Protection 
of the public requires that the agency responsible 
for the policy not be the one with exclusive control 
over the statistics; to give it such control is to ask 
of it superhuman strength to withstand temptation. 
In a hundred minor decisions of statistical compila- 
tion, an operating agency may be swayed by noting 
how the result will come out. Presentation also requires
impartiality. On issues such as whether a
price index ought to be released now, when it has 
been scheduled to appear, or next week, after a na- 
tional election has taken place, hangs the virtue of 
the statistician. Without this virtue and the result- 
ing public confidence in statistics, much of the cost 
of collecting data is wasted; statistics that are not 
believed are of no use. In a statistician, as in a 
judge, ignorance is less dangerous than corruption. 
The statistician will be most useful to the com- 
munity when his statistical honor is not in conflict 
with some nonstatistical responsibility. At a higher 
administrative level, the political head of the de- 
partment that includes the statistical office must 
accept the self-discipline of permitting decisions 
regarding statistics to be made on statistical con- 
siderations, not because such decisions are always 
correct but because, when they are wrong, they are 
wrong in a disinterested way. One method that les- 
sens the need for self-discipline is to have a com- 
mission entirely outside of politics control the 
statistical office, as is done in the Netherlands. 

Much less subtle than the issues of the preced- 
ing paragraph are the distortions as a matter of 
policy that have been seen in some totalitarian 
countries. During the Stalin period in the Soviet 
Union, statistics were not considered to pertain to 
science but to revolution and to mobilization of the 
people for national construction. If the statistics do 
not show progress, the masses will become discour- 
aged and their effort will flag. The same attitude 
was found in China in the 1950s (Li 1962). The 
competition between districts to be in the van of 
socialist production was extended to competition 
with respect to the figures of production, and this 
local zeal for declaring high figures reached its 
peak in 1960, when total production of cereals was 
announced as 275 million tons, apparently an ex- 
aggeration of over 100 per cent. It had been 
thought democratic to bring the masses into the 
work of compiling statistics; the consequent social- 
ist rivalry seems to have genuinely deceived the 
regime, as well as its subjects, with subsequent dis- 
astrous consequences. There are clear signs that in 
the Soviet Union, as in older industrial societies, 
trustworthy statistics are now recognized as some- 
thing more than a bourgeois luxury; whether the 
dependency of good decisions on good statistics is 
now understood in China is not clear. 


Technical contributions 

Extensive contributions have been made to tech- 
nique in nearly every division of statistics by agen- 
cies attempting to implement efficiently their legis- 
lative assignment to collect data. One may mention 
the methods of calculating life tables; the deseasonalization
of economic time series; the theory and
practice of index-number calculation, as it has 
been developed in the United States Bureau of 
Labor Statistics and in the Dominion Bureau of 
Statistics in Canada; the classification of occupa- 
tions, industries, and commodities (Coats 1925). 
Particularly important for the growth of demog- 
raphy were the technical contributions during the 
nineteenth century by William Farr and other stat- 
isticians of the United Kingdom General Register 
Office [see INDEX NUMBERS; LIFE TABLES; POPULA- 
TION; TIME SERIES]. 

The most striking single example of these con- 
tributions is the intimate relation of the U.S. 
Bureau of the Census to the development of tabu- 
lating and computing equipment, from the primi- 
tive punch-card devices of the 1890s, built in the 
Census Bureau itself by Herman Hollerith, to the 
Univac, built privately with bureau encouragement 
and financing and technical help from the Bureau 
of Standards. The Canadian census office also 
pioneered in this, with some highly original electric 
and compressed-air tabulators, the result of the 
ingenuity of Fernand Belisle, a lifelong employee 
of the Canadian Bureau of Statistics. Both Cana- 
dian and United States census offices have in re- 
cent years built or stimulated the building of input 
devices to avoid the keypunching of data, the most 
tedious part of the processing of large-scale surveys. 

Besides having had a part in the development of 
computers, government statistical agencies have 
shown great initiative in using them. Such use, 
whether in an insurance office, an oil refinery, or a 
government statistical office, requires extensive re- 
thinking of processes that have developed in the 
course of decades. Several of the more advanced 
of the world’s statistical offices have drastically 
reorganized their work in order to exploit electronic 
computation, with consequences of more extensive 
cross tabulation, better control of error in process- 
ing, and economy. In the United States the demand 
for machine-readable results has increased as users 
of statistics have acquired their own computers. 
The Bureau of the Census has an extensive cata- 
logue of results available on punch cards or elec- 
tronic tape; this mode of publication is sure to 
become widespread. 


Publication of results 


Prompt and full publication is the highest of 
virtues in government statistics—provided that the 
data published are accurate as well as relevant. 
This follows from the place of statistics in a 
decision-making process: the appropriateness of
the decision (for example, to build a factory) will
depend on the situation that exists at some future 
date; and if the statistician is not in a position to 
count the population of five or 25 years hence, he 
can at least describe what the situation was up 
to as recent a moment as possible. On the content 
of what is published—what cross classifications 
of distributions, what percentages, what time com- 
parisons, what charts—the judgment of the gov- 
ernment statistician becomes better the greater his 
contact with users of his results. With the multi- 
plicity of governmental and private sources, the 
need increases for good indexing within any pub- 
lication, as well as for cross referencing of com- 
parable data in other publications. 

Government statistical offices have entered the 
field of social science in their efforts to interpret 
their data. This runs from comment on statistical 
tables that enables the reader to know how the 
entities counted have been defined, along with 
measures of sampling and nonsampling error,
through calculations distributing not-stated ages 
and deseasonalizing economic series, to full-scale 
monographs that have become a regular part of the 
census, in the United States and Canada going 
back at least a full generation. India is preparing a 
set of monographs based on the 1961 census, and 
Pakistan is planning to do likewise. If one thinks 
of the continuum, from the collection of raw statis- 
tical data to its final use as an ingredient either in 
scientific investigation or in the making of de- 
cisions, then there is room for differences of 
opinion as to where the role of the government sta- 
tistical office ends and that of the user begins. 
The vital minimum is that the government statis- 
tician describe all aspects of his collection pro- 
cedures that can possibly affect the interpretation 
of the results. It is to be hoped that in some future
time no survey will be issued without realistic 
estimates of the accuracy of its figures, but that 
time still seems to be far in the future. 

Notwithstanding all this, a certain abstemious- 
ness is forced on the official statistician by virtue 
of his position; he cannot afford to take the sorts of 
risks in the interpretation of data that newspaper- 
men can take so freely. Public confidence in the 
accuracy of his figures, and hence their usefulness, 
would be jeopardized by palpable arbitrariness in 
judgments expressed in his text. This consideration 
often drives him to the writing of text that is no 
more than a gloss on what is obviously revealed in 
the tables and is perhaps less clear than the figures 
themselves. On the other hand, the government 
statistician can argue with justice that interpretation,
lively or dull, made without his intimate
knowledge of the basis of the figures is likely to
be misleading. The answer is that, as a minimum, 
he give enough detail on his procedures to put the 
reader in a position to interpret correctly. 

Secrecy. The matter of secrecy enters into sta- 
tistical work in two ways, to which exactly op- 
posite principles apply. In the report made by the 
individual to a government agency, it is generally 
considered that the most honest reporting will be 
secured if a guarantee of confidentiality is provided, 
giving the supplier assurance that his contribution 
will not be identified outside the statistical office 
and that only the statistical aggregates of which 
it is a part will be made public. The government 
statistician is endowed with the power to enforce 
reporting on the part of the persons or corporate 
bodies to which he addresses his questionnaires; 
the obverse of this power is the obligation to keep 
individual returns secret (Dobrovits 1947). In the 
large number of government surveys for which re- 
porting is not mandatory, the degree of voluntary 
response may be proportional to the public con- 
fidence in provisions for maintaining secrecy. 

On the other hand, the aggregate results of every 
survey ought to be given the widest possible pub- 
licity. One of the difficulties of government pub- 
lication is that the government does not have com- 
plete access to the distribution channels of the 
private book trade; sometimes a solution is possible 
through private publication, as was done for the 
monographs on the 1950 census of the United 
States and as is done in the Netherlands and Ger- 
many. The principle of equal access of the public 
to statistics is important; this is safeguarded by 
preannounced release times, at which statistics are 
simultaneously made available to all who are inter- 
ested. In many countries there is much diffusion of 
government statistical results through nongovern- 
mental intermediate sources, including trade year- 
books and scientific journals; it is customary to 
allow republishing without specific permission. 

A serious problem arises from the desire of gov- 
ernment to keep certain statistical results secret. 
This was a regrettable necessity in the United 
States and Canada during World War II in respect,
for instance, to detailed foreign trade figures, be- 
cause the information would have been used by the 
Axis powers in their submarine campaign against 
Allied shipping. Aside from such exceptional cases, 
it is fair to say that there are no instances in which 
the public as a whole benefits from concealment of 
statistical reports that provide accurate data, al- 
though governments and particular departments 
within them may indeed benefit at the expense of 
the public. The discussion of the publication of government
statistics merges at this point with the
wider question of the free flow of information in 
the society as a whole. 

NATHAN KEYFITZ 


[See also ELECTIONS, article on ELECTORAL SYSTEMS.]
Fritz Graebner (1877-1934), German ethnologist
and one of the principal exponents of the
culture-historical approach to ethnology, was born 
in Berlin. The noted botanist Paul Graebner was 
his brother. Like many German ethnologists at the
turn of the century, Graebner started out in another 
field. He obtained his Ph.D. in Berlin in 1901 as
a historian, with a dissertation on a topic in me- 
dieval history. But as early as 1899, he became a 
research assistant on the staff of the Royal Museum 
of Ethnology in Berlin, which was directed by Adolf 
Bastian. Graebner was assigned to classify the mu- 
seum’s rich holdings of South Seas material in 
Felix von Luschan’s Africa and South Seas de- 
partment. 

Here he worked in close contact with Bernhard 
Ankermann, the Africanist, and together they de- 
veloped the method of the Kulturkreis. They col- 
laborated on two lectures that were delivered on 
November 19, 1904, to the Berlin Gesellschaft für
Anthropologie, Ethnologie, und Urgeschichte; these
were published in 1905 in the Zeitschrift für Ethnologie.
The lectures attempted to interpret Oceanian
history on the basis of a study of artifacts
and an extensive knowledge of the literature, in
terms of the geographical diffusion of clusters of 
cultural traits. It is of course true that Leo Fro- 
benius had already used a similar approach to 
Africa in his Ursprung der afrikanischen Kulturen 
(1898), but Ankermann’s work and Graebner’s 
research on Kulturkreise in Oceania were consider- 
ably more thorough and precise. 

Graebner plotted the diffusion of individual cul- 
ture traits cartographically throughout the South 
Seas; he recognized the linkages that seemed to 
occur; and he arranged complexes or clusters of 
traits into Kulturkreise. He then proposed a chrono- 
logical sequence for the diffusion of the Kreise: 
the Tasmanian culture, the Old Australian culture, 
the western Papuan or totem culture, the eastern 
Papuan or two-class culture, the Melanesian or bow 
culture, the Polynesian culture, and a few more re- 
cent “cultural movements.” 

In his article “Die melanesische Bogenkultur 
und ihre Verwandten” (1909a), he tried for the 
first time to establish in a larger region the Kultur- 
kreise he had worked out locally for parts of the 
Pacific. This marked the beginning of the trend in 
European ethnology that was called Kulturkreislehre.
After World War I, Wilhelm Schmidt of
Vienna, who had initially had strong disagreements 
with Graebner (see, for example, 1909c), at- 
tempted a world-wide application of Graebner’s sys- 
tem. Graebner, in his contribution to the volume 
on anthropology in Hinneberg’s anthology, Die Kul- 
tur der Gegenwart (1923), himself transferred 
some of his South Seas Kulturkreise to other re- 
gions of the globe, although he did so with much 
greater moderation than did either Father Schmidt 
or Father Wilhelm Koppers. 

In 1906 Graebner left the Berlin Museum, ac- 
cepting the invitation of Willi Foy, director of the 
Rautenstrauch-Joest Museum in Cologne, who be- 
came one of his most active supporters. In Cologne 
he published his Methode der Ethnologie (1911), 
a work that explicitly acknowledged a debt to Ernst 
Bernheim’s Lehrbuch der historischen Methode und 
der Geschichtsphilosophie (1889), and thus re- 
established his connection with the discipline of 
history. By now he had at his command great 
quantities of ethnological material, and he and Foy 
considered ethnology as essential to a universal 
history. 

The Methode der Ethnologie became the corner- 
stone for the culture-historical school of ethnology. 
In this work Graebner’s principal concern was to 
work out methodological guidelines for research 
into culture affinities, although this work also contained
an exemplary chapter on critical study of
the original sources and a discussion of the ways 
of interpreting ethnological sources. The “criteria 
of cultural links” (1911, pp. 104-125) are still 
valid. If there are numerous homologous elements 
of culture in a region of the globe that are not 
functionally interdependent (criterion of quantity), 
and if their similarity of form is extensive without 
being due to identity of purpose and identity of 
material (criterion of form), it must be presumed 
that they did not originate independently of one 
another but came from the same original source or 
are related by borrowing. If these elements of cul- 
ture belong to many or all cultural spheres (econ- 
omy, society, religion, art, etc.), they constitute a 
Kulturkreis or a “culture complex.” If parts of this 
complex are widely separated geographically, cul- 
tural relationship is more likely if certain elements 
that are, as it were, cultural bridges can be located 
between them (criterion of continuity). Moreover, 
if it can be shown that the separated complexes 
are most alike where their areas of distribution are 
nearest (criterion of degree of relatedness), cul- 
tural relationship and nonindependent origin can 
be inferred from this situation as well. Zones of 
“leveling” and “draining” of a Kulturkreis, discon- 
tinuity of the area, penetration by groups from 
another culture complex, and so forth, produce a 
sequence of Kulturkreise that subsequently may be 
regarded as culture layers. 

Graebner never did any field work. When he did 
go to Australia at the invitation of the Australian 
government, the outbreak of World War I prevented
him from doing research. He was interned and not 
released until 1919. Nevertheless he made use of 
his time there to do much studying, and he pro- 
duced a comparison of the myths of Indo-European, 
Mongolian, Polynesian, and Hamito-Semitic peoples,
“Thor und Maui” (1919/1920), and a study
of Old and New World calendars (1920/1921). 
These two works were important preliminary 
studies for Das Weltbild der Primitiven (1924). 
Here Graebner advanced for the first time a broad 
theory of history, visualizing an archaic “advanced 
culture” that was diffused throughout the Old 
World and the New World. Many of his arguments 
in support of links between the advanced cultures 
of the Old World and the New are still valid. His 
theory was wholly independent of the so-called 
British school of diffusionism led by Grafton Elliot
Smith and W. J. Perry. 

In 1925 Graebner succeeded Foy as director of 
the Cologne museum. The following year he was 
appointed to an honorary professorship at the University
of Cologne. A prolonged illness forced him
to resign in 1928; he returned to Berlin, where he 
died in 1934. 

HERMANN BAUMANN 


[For the historical context of Graebner’s work, see DIF- 
FUSION; HISTORY, article on CULTURE HISTORY; and 
the biographies of BASTIAN and FROBENIUS. For dis- 
cussion of the subsequent development of Graebner's 
ideas, see the biographies of KOPPERS and SCHMIDT.]
Marcel Granet (1884-1940), pre-eminent French
sociological Sinologist, was born at Luc-en-Diois 
and died at Sceaux. He was admitted to the École 
Normale Supérieure in 1904 and received the 
agrégation d'histoire in 1907. He was named directeur
d'études at the École Pratique des Hautes
Études in 1913 and chargé de cours at the Sorbonne 
in 1920, the same year that he received his doctorat 
ès lettres. In 1926 he became the first director of
the Institut des Hautes Études Chinoises of the
Sorbonne as well as professor at the École des
Langues Orientales of Paris. He received the Croix
de Guerre for his services in World War I.

It was his teacher Édouard Chavannes who suggested
to Granet that his interest in feudalism might
well lead to the study of Sinology. Another teacher
of Granet’s was Émile Durkheim. The historian
Marc Bloch was a friend and fellow student of
Granet’s at the École Normale, as was also Marcel
Mauss, the nephew of Durkheim. Granet married 
Marie Terrien in 1919. While Granet was studying 
in China, from 1911 to 1913, the Ch’ing dynasty 
was replaced by the Republic of China. He visited 
China once more, after World War I, as a member
of a mission. 

Granet was a special sort of pioneer in the social 
sciences—he became a Sinologist in pursuit of 
more general interests in social phenomena. After 
an initial thorough grounding in the discipline of 
history, he was exposed to Durkheim’s ideas, and 
these were decisive in developing his more general 
sociological interests. Granet was also a pioneer 
figure in stimulating the interest of non-Sinological 
social scientists in the Far East—most directly in 
Chinese society, but also, as a consequence of new 
attention to comparative studies, in Japanese 
society. 

As a Sinologist, Granet freed himself from the 
dependence on secondary Chinese materials that 
had hampered earlier social scientists who studied 
China. He made it clear that difficult language bar- 
riers are no excuse for inferior standards of ex- 
pertise. Moreover, unlike most orthodox Sinologists 
of his day, he found “inauthentic” texts no less val- 
uable than authentic ones as evidence of social 
life. Rather than concentrating largely on the ques- 
tion of the “authenticity” of documents, Granet felt 
social scientists would do well to examine these 
documents critically in order to distinguish between 
ideal and actual patterns in Chinese society. 

Granet addressed his efforts at broad interpreta- 
tions of Chinese life, cutting across historical 
epochs. Mainly, however, he concentrated on feudal 
China and the transition to imperial China. As was
pointed out by Emile Benoit-Smullyan, in a review 
that introduced Granet’s work to many American 
sociologists (1936), Granet found Durkheim’s 
“sociologistic epistemology” especially relevant to 
his work, leading him to important empirical elab- 
orations of Durkheim’s analysis. Thus, Granet 
showed the fundamental differences between Chinese
and modern European modes of thought and
suggested that those differences stemmed from far-reaching
differences in social structure.

For scholars with a general interest in the social 
sciences, as well as for many scholars who are not 
Sinologists in the strict sense, Granet’s most im- 
portant contributions are to be found in his two 
volumes Chinese Civilization (1929a) and La pensée 
chinoise (1934). Both volumes have excellent in- 
troductions by Henri Berr, with carefully chosen 
titles: “L’originalité de la Chine” and “Mentalité
chinoise et psychologie comparée.” 

Granet’s subtitle of Chinese Civilization con- 
veys an important distinction: “La vie publique 
et la vie privée” (the English translation dropped 
the subtitle). In this book he subdivided his 
treatment into political history and Chinese society, 
and these parts in turn contain discussions of tra- 
ditional history (through the beginning of the im- 
perial era in Han times), the chief data of ancient 
history, the people of the countryside, the founda- 
tion of the chieftainships, the seignioral town, and 
society at the beginning of the imperial era. Granet 
used the ancient texts, the legends, dances, and 
feasts, and what archeological evidence he could 
find in an attempt to reconstruct the preimperial 
background and demonstrate its relevance for an 
understanding of imperial China. Imperial Chinese 
society represents one of the most elaborate and 
sophisticated developments in the varied history of 
mankind: it was perhaps the only society with a 
large-scale membership, spread over a vast area, 
which was nevertheless sufficiently stable in its 
main outlines to endure for some two thousand 
years. Granet used the available materials to make 
a careful attempt to reconstruct both the life of the 
peasants—the vast majority of the people—and 
the public roles and the domestic lives of the nobles. 

The most complex, powerful, and difficult of 
Granet’s contributions is La pensée chinoise, and 
it is this work which still constitutes one of the 
most important attempts to apply empirically the 
leads furnished by Durkheim in Elementary Forms 
of the Religious Life. Granet explicitly rejected any
interpretation of Chinese thought as mystical or
prelogical (any such interpretation, he believed,
was “lacking in the humanistic spirit”), and he
rested his interpretations of Chinese thought 
squarely on Durkheim's view that modes of thought 
are themselves social manifestations and reflect 
other social facts. Most notably, modes of thought 
reflect those forms of social organization whose 
very persistence, Granet held, in some sense proves 
their value (1934, pp. 27-29). He began his analysis
of Chinese thought with an examination of
language and writing, and in so doing, he was at
once very French and precociously modern. His 
chapters on the Chinese conceptions of time and 
space, yang and yin, numbers, and the Tao are 
classics, although they are controversial. 

Even at the time Granet published his interpre- 
tation of China, some scholars felt that his treat- 
ment of China as a special instance of social uni- 
versals led him to overlook or misunderstand many 
features of Chinese society, and materials dis- 
covered since Granet’s time necessitate other quali- 
fications of his interpretation. Yet there is general 
admiration among scholars for the stimulus 
Granet’s work gave to Sinology in particular and to 
social science more generally. His intellectual de- 
scendants do him the honor of challenging his 
positions within the whole field he opened for them. 


MARION J. LEVY, JR.


[For the historical context of Granet’s work, see the 
biographies of BLOCH; DURKHEIM; MAUSS. Also related
is the article CHINESE SOCIETY.]
GRAPHIC PRESENTATION


Graphic presentation represents a highly devel- 
oped body of techniques for elucidating, interpret- 
ing, and analyzing numerical facts by means of 
points, lines, areas, and other geometric forms and 
symbols. Graphic techniques are especially valu- 
able in presenting quantitative data in a simple, 
clear, and effective manner, as well as facilitating 
comparisons of values, trends, and relationships. 
They have the additional advantages of succinct- 
ness and popular appeal; the comprehensive pic- 
tures they provide can bring out hidden facts and 
relationships and contribute to a more balanced 
understanding of a problem. 

The choice of a particular graphic technique to 
present a given set of data is a difficult one, and no 
hard and fast rules can be made to cover all cir- 
cumstances. There are, however, certain general 
goals that should always be kept in mind. These 
include completeness, clarity, and honesty; but 
there is often conflict between the goals. For in- 
stance, completeness demands that all data points 
be included in a chart, but often this can be done 
only at some sacrifice of clarity. Such problems can 
be mitigated by the practice (highly desirable on 
other grounds as well) of indicating the source of 
the data from which the chart was constructed so 
that the reader himself can investigate further. 
Another problem occurs when it is necessary to 
break an axis in order to fit all the data in a rea- 
sonable space; clarity is then served, but honesty 
demands that attention be strongly called to the 
break. 
A choice among graphic techniques also depends
upon the proposed use to which the chart will be 
put. As Schmid (1954) has pointed out, graphs 
that are satisfactory in memoranda for private cir- 
culation may be inappropriate for a published book 
or paper. 

In classifying charts and graphs, criteria of pur- 
pose, of circumstances of use, of type of compari- 
son, and of form have been used. On the basis of 
form, charts and graphs may be classified as (1) 
rectilinear coordinate graphs; (2) semilogarithmic 
charts; (3) bar and column charts; (4) frequency
graphs and related charts; (5) maps; (6) miscel- 
laneous charts, including pie diagrams, scatter- 
grams, fan charts, ranking charts, etc.; (7) pic- 
torial charts; and (8) three-dimensional projection 
charts. 

In graphic presentation, three different basic 
geometrical forms can be utilized for purposes of 
comparing magnitudes of coordinate items: (1) 
linear or one-dimensional, (2) areal or two-dimen- 
sional, and (3) cubic or three-dimensional. The 
simplest and most exact comparisons are those 
made on a linear basis; comparison of relative 
sizes of areas is more difficult and of volumes most 
difficult. Accordingly, where possible, the use of 
areal and cubic forms should be avoided in graphic 
presentation. 
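
One familiar source of this difficulty, offered here as an illustration rather than as a claim made in the text, is that scaling the linear dimensions of a symbol in proportion to a value inflates its area by the square and its volume by the cube of that value. The function names in the sketch below are hypothetical.

```python
def apparent_ratio(linear_scale, dimensions):
    """Ratio of apparent sizes when every linear dimension of a symbol
    is multiplied by linear_scale (1 = line, 2 = area, 3 = volume)."""
    return linear_scale ** dimensions

def linear_scale_for(value_ratio, dimensions):
    """Linear scale factor needed so that the symbol's length, area, or
    volume, rather than its edge, is proportional to the value."""
    return value_ratio ** (1.0 / dimensions)

# A quantity that doubles, drawn by doubling each linear dimension:
for dims, form in [(1, "linear"), (2, "areal"), (3, "cubic")]:
    print(form, apparent_ratio(2, dims))
```

A value that doubles therefore appears four times as large when drawn as an area and eight times as large when drawn as a volume, unless the linear dimensions are scaled by the square root or cube root instead.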

Rectilinear coordinate graphs. Perhaps the best 
known form of graphic presentation, and certainly 
one of the most frequently used, is the simple 
arithmetic line chart, one of several types of the 
rectilinear (or Cartesian) coordinate graph. 

Figure 1. Basic structural features of a rectangular
coordinate graph


The basic form of this type of graph is derived 
by plotting one or more series of figures on a co- 
ordinate surface in which the successive plotting 
points are joined together in the form of a continuous
line, customarily referred to as a “curve.”
A curve on a graph of this kind is not necessarily 
smooth and regular but instead may be straight 
and angular. Figure 1 indicates the basic struc- 
tural characteristics of a rectilinear coordinate 
system. 

Figure 2 portrays the characteristic features and 
basic standards of the rectilinear coordinate graph. 
Many of the essential elements and specifications 
of the rectilinear coordinate graph are also ap- 
plicable to other graphic forms. 

Semilogarithmic charts. The semilogarithmic 
chart is often superior to the arithmetic chart (as 
used in Figure 1), for the former can show relative 
changes clearly; hence it is sometimes referred to 
as a ratio chart. Sometimes the semilogarithmic 
chart has the merit of representing as nearly 
straight lines functions that otherwise would be
appreciably curved. 

The essential feature of the semilogarithmic 
chart is that one scale is logarithmic and the other 
arithmetic, so that the chart effectively is a con- 
venient device for plotting log y against x, or log 
x against y. There is no zero line on the logarithmic 
scale, since the logarithm of zero is minus infinity. 
The logarithmic scale consists of one or more sets 
of rulings calibrated in terms of logarithmic values; 
each complete set of rulings is referred to as a 
“deck,” “cycle,” “bank,” or “tier.” The rulings for 
each deck are the same, but the scale values change 
from one to the other. For example, if one deck 
runs from 1 to 10, the adjacent deck above will 
vary from 10 to 100, the third from 100 to 1,000, 
and so forth. On the other hand, the adjacent deck 
below the one from 1 to 10 would vary from 0.1 to 
1.0. The logarithmic scale can thus be extended 
upward or downward indefinitely. 
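
The arrangement of decks described above can be sketched in a few lines. The helper `deck_bounds` is a hypothetical name introduced for illustration; it assumes a base-10 scale in which each deck runs from one power of ten up to the next.

```python
import math

def deck_bounds(value):
    """Return the (lower, upper) scale values of the base-10 deck
    in which a positive value falls."""
    if value <= 0:
        # The logarithm of zero is minus infinity, so zero (and any
        # negative value) has no position on a logarithmic scale.
        raise ValueError("value must be positive on a logarithmic scale")
    k = math.floor(math.log10(value))
    return 10.0 ** k, 10.0 ** (k + 1)

for v in (0.5, 5, 50, 500):
    print(v, deck_bounds(v))
```

Each successive deck covers a tenfold range, which is why the same set of rulings can be reused with only the scale values relabeled.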

Figure 2 — Essential characteristics and specifications of a rectilinear coordinate graph

In a semilogarithmic chart, relative rate-of-change
comparisons can be made readily between
different parts of a single series or between two or
more series. The relative rate of change of y with
respect to x is the slope of y as a function of x,
divided by y; equivalently it is the slope of log y
except for a constant depending on the logarithmic
base. The slope of the logarithmic scale variable is
the relative rate of change of the variable. If the
slope is steep, the relative rate of change is great.
It makes no difference on what part of a semilogarithmic
chart a curve is located; the same slope
means the same relative rate of change. This is
particularly convenient if the two series being compared
have very different ranges of values.
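
The slope property can be checked numerically. The sketch below uses invented figures (a city and a state that each double in a decade); the names `log_slope` and `arithmetic_slope` are hypothetical helpers, not part of any library.

```python
import math

def log_slope(y_start, y_end, x_start, x_end):
    """Slope of log10(y) against x between two plotted points."""
    return (math.log10(y_end) - math.log10(y_start)) / (x_end - x_start)

def arithmetic_slope(y_start, y_end, x_start, x_end):
    """Slope of y against x between two plotted points."""
    return (y_end - y_start) / (x_end - x_start)

# Hypothetical populations in thousands, both doubling from 1950 to 1960.
city = (50.0, 100.0)
state = (4000.0, 8000.0)

city_log = log_slope(city[0], city[1], 1950, 1960)
state_log = log_slope(state[0], state[1], 1950, 1960)
city_arith = arithmetic_slope(city[0], city[1], 1950, 1960)
state_arith = arithmetic_slope(state[0], state[1], 1950, 1960)

# Equal slopes on the logarithmic scale mean equal relative growth.
annual_rate = 10 ** city_log - 1
print(city_log, state_log, city_arith, state_arith, annual_rate)
```

On the logarithmic scale the two slopes agree, although the arithmetic slope of the state series is eighty times that of the city series.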

This situation is illustrated in Figure 3, where 
the same data have been plotted on both arith- 
metic and semilogarithmic grids, placed in juxta- 
position. On the arithmetic grid, the relative rate 
of growth of the city in comparison to that of the 
entire state cannot be easily determined; by contrast,
the curves on the semilogarithmic chart portray
relative rate of growth clearly and correctly.
Note, however, that this clarity is achieved only at 
the expense of some distortion; careful labeling is 
imperative to prevent the reader from mistakenly 
concluding that the absolute growth of Dallas is 
greater than that of the state as a whole. 

Other special graph papers exist, such as double 
logarithmic, normal, and hyperbolic. 




[Figure 3 panels: SEMILOGARITHMIC and ARITHMETIC. Vertical axis: population in thousands (logarithmic panel marked 10, 100, 1,000, 10,000). Horizontal axis: 1890 to 1960 by decades.]


Figure 3 — Comparison* of rectangular coordinate and semilogarithmic graphs 


* Comparison clearly demonstrates the superiority of the semilogarithmic graph in portraying relative rates of change. 


Bar and column charts. Bar and column charts 
are simple, flexible, and effective techniques for 
comparing the size of coordinate values or of parts 
of a total. The basis of comparison is linear or one- 
dimensional; the length of each bar or column is 
proportional to the value portrayed. Bar and column
charts are very much alike; they differ mainly
in that the bars are arranged horizontally in a bar 
chart and the columns are arranged vertically in a 
column chart. In addition, the bar chart is seldom 
used for depicting time series, whereas the column 
chart is often used for that purpose.

There are several different kinds of bar and 
column charts. Four of them are illustrated in 
Figure 4.

Source (Figure 3): Data from U.S. Bureau of the Census 1963, pp. 45-19, 45-22.

Frequency graphs and related charts. The most
common graphic forms for portraying simple frequency
distributions are the frequency polygon, the
histogram, and the smoothed frequency curve.
Frequency graphs are usually drawn on rectilinear
coordinates, with the Y axis representing frequencies
and the X axis representing the class intervals.
The Y axis always begins with zero, and
under no circumstances is it broken. The horizontal
scale does not have to begin with zero unless,
of course, the lower limit of the first class interval
is zero. In laying out a frequency polygon the
appropriate frequency of each class customarily is
located at the midpoint of the interval, and the 
plotting points are then connected by straight lines. 
The typical histogram is constructed by erecting 
vertical lines at the limits of the class intervals and 
forming a series of contiguous rectangles or columns.
The area of each rectangle represents the
respective class frequency. (If the class intervals
are unequal, special care must be taken: the heights
must be adjusted so that areas, not heights, remain
proportional to the frequencies.) Smoothed
frequency curves may be constructed by either
mathematical or graphic techniques. The main
purpose of smoothing a frequency graph is to remove
accidental irregularities resulting mainly from
sampling errors; the data from which the smoothed
curve was obtained should always be presented, as
should some indication of the method of smoothing
employed.
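
One common way of taking the special care needed for unequal class intervals is to plot frequency per unit of class width, so that each rectangle's area still equals its class frequency. The sketch below assumes that convention; the function name `histogram_heights` and the data are invented for illustration.

```python
def histogram_heights(class_limits, frequencies):
    """Heights that make each rectangle's area equal its class frequency."""
    widths = [upper - lower
              for lower, upper in zip(class_limits[:-1], class_limits[1:])]
    return [freq / width for freq, width in zip(frequencies, widths)]

# Hypothetical distribution with classes 0-5, 5-10, 10-20, and 20-40.
limits = [0, 5, 10, 20, 40]
freqs = [10, 20, 30, 20]

heights = histogram_heights(limits, freqs)
widths = [b - a for a, b in zip(limits[:-1], limits[1:])]
areas = [h * w for h, w in zip(heights, widths)]
print(heights)
print(areas)
```

Plotting the raw frequencies as heights would make the widest class look far more important than it is; dividing by the class width avoids that.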

For some purposes, the cumulative-frequency 
curve, or ogive, is more useful than the simple 
frequency graph. In a cumulative-frequency
distribution, the frequencies of the successive class
intervals are accumulated, beginning at one end of
the distribution. If the cumulation process is from
the lesser to the greater, it is referred to as a “less
than” type of distribution; if from the greater to
the lesser, it is known as a “more than” type of
distribution. In constructing an ogive, the cumula- 
tive frequencies are represented by the vertical 
axis and the class intervals by the horizontal axis. 
The cumulated frequencies are plotted at the upper
end of the respective class intervals when the
cumulation is of the “less than” type and at the
lower end when it is of the “more than” type. A
common configuration of the cumulative-frequency
curve is that
of an elongated S. 
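
The two kinds of cumulation can be sketched as follows. The class limits and frequencies are invented, the function names are hypothetical, and the plotting positions follow the usual convention: upper class limits for the “less than” type, lower class limits for the “more than” type.

```python
from itertools import accumulate

def less_than_ogive(class_limits, frequencies):
    """(upper limit, cumulative frequency) pairs, cumulating upward."""
    return list(zip(class_limits[1:], accumulate(frequencies)))

def more_than_ogive(class_limits, frequencies):
    """(lower limit, cumulative frequency) pairs, cumulating downward."""
    from_top = list(accumulate(reversed(frequencies)))
    return list(zip(class_limits[:-1], reversed(from_top)))

# Hypothetical distribution with four classes of width 10.
limits = [0, 10, 20, 30, 40]
freqs = [5, 15, 20, 10]

print(less_than_ogive(limits, freqs))
print(more_than_ogive(limits, freqs))
```

The first list rises from 5 to the total of 50; the second falls from 50 to 10.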

[Figure 4 panel labels surviving from the original layout: (A) Simple bar chart; (C) Grouped bar chart, with legend entries “1st class” and “2d class.”]

Concentration curves. A special kind of graph,
related to cumulative frequency graphs, is known
as a concentration curve or a Lorenz curve. Such
a graph is used to portray the nature of nonuniformity
in the distributions of inherently positive
quantities like wealth, income, amount of retail
sales, etc. The graph shows the proportions of total
wealth (to be definite) held by various proportions
of the relevant population; one reads from the
graph that the least wealthy 10 per cent of the
population holds, say, 1 per cent of total wealth,
that the most wealthy 5 per cent of the population
holds, say, 30 per cent of total wealth, etc. Thus
the graph is a curve running through points given
parametrically by cumulative relative frequencies
and cumulative relative totals; it is usually presented
in square form, with both axes taking values
from 0 to 1 (for proportions) or 0 to 100 (for
per cents). If wealth, or whatever, is uniformly
(equally) distributed in the population, the
concentration curve is a straight line, the diagonal of
the square from (0,0) to (1,1). The more the
concentration curve deviates from the straight line the
greater is inequality.
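
A minimal sketch of how the plotting points of such a curve might be computed is given below. The function name `lorenz_points` and the holdings are invented for illustration; individual (ungrouped) holdings are assumed.

```python
def lorenz_points(holdings):
    """Points (cumulative share of population, cumulative share of total)."""
    ordered = sorted(holdings)          # least wealthy first
    total = sum(ordered)
    count = len(ordered)
    points = [(0.0, 0.0)]
    running = 0
    for i, amount in enumerate(ordered, start=1):
        running += amount
        points.append((i / count, running / total))
    return points

equal = lorenz_points([5, 5, 5, 5])      # lies on the diagonal
unequal = lorenz_points([6, 1, 2, 1])    # sags below the diagonal
print(equal)
print(unequal)
```

With equal holdings every point lies on the diagonal; with the unequal holdings the least wealthy half of the population holds only one-fifth of the total.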

Statistical maps. There are many varieties of 
maps used in portraying statistical data. They can 
be grouped under the following basic types: (1) 
crosshatched or shaded maps; (2) spot or point- 
symbol maps; (3) isoline maps; (4) maps with 
one or more types of graphs superimposed, such as 
the bar, column, line, flow, or pictorial forms (see 
Figure 7); and (5) a combination of two or more 
of the preceding types. [For an overview of map 
making, see CARTOGRAPHY.] 

Figure 4 — Four types of bar charts

Crosshatched maps. The crosshatched or
shaded map, characteristically, is used to portray
rates and ratios that are based on clearly delineated
areal units such as regions, nations, states, counties,
or census tracts. Value ranges of rates and
percentages are represented by a graded series of
crosshatchings. Figure 5 is an illustration of a
crosshatched map. Since this figure is for illustra- 
tion only, the medians were computed by the sim- 
plest formula, with no attempt made to correct for 
the fact that most school dropouts occur at the end 
of a school year. 

Spot maps. In spot or point-symbol maps em- 
phasis is placed on frequencies or absolute amounts 
rather than on rates or proportions as in the cross- 
hatched map. Although there may be some over- 
lapping, spot maps may be differentiated into five 
types on the basis of the symbols used. Symbols 
may stress (1) size, (2) number, (3) density, 
(4) shading, or (5) form. In the first type of map 
the size of each symbol is proportional to the fre- 
quency or magnitude of the phenomena repre- 
sented. Symbols may be either two-dimensional or 
three-dimensional and are normally in the form of 
circles or spheres rather than rectangles, cubes, or 
irregular forms. In the second type of spot map the
basic criterion is not size but number or frequency 
of spots or point symbols. The spots are uniform 
in size, each representing a specific value. The 
spots are designed and arranged to make them as 
readily countable as possible. The third type of map 
is also a multiple-spot variety; but instead of em- 
phasizing countable frequencies, comparative den- 
sity and distribution are emphasized. Figure 6 is 
an example of the third type of spot map, in which 
over-all density patterns are stressed. In the fourth 
type of spot map the criterion is shading. The size 
of the symbols is uniform, but the amount of shad- 
ing is indicative of the magnitude or value repre- 
sented. The form of the symbol in the fifth type of 
map represents certain qualities or attributes rather 
than quantities, as in the previous types. For exam- 
ple, if the dichotomy male-female is to be portrayed 
on a map, one type of symbol would represent male 
and another symbol would represent female. 

Isoline maps. There are two fundamental types 
of isoline (from the Greek isos, meaning equal) 
maps: the isometric map, in which the lines are 
drawn through points of equal value or intensity, 
and the isopleth map, in which the lines connect 
equal rates or ratios for specific areas. The isopleth 
map is particularly valuable in the social sciences. 

Miscellaneous graphic forms. Although not as 
basic or as widely used as the graphs and charts 
discussed in the preceding sections, there are cer- 
tain other graphic forms that possess advantages 
for certain problems.

Pie charts. The pie or sector chart is widely 
used to portray proportions of an aggregate or
total. Given the several proportions of the total, a
pie chart is made by dividing a circle into pieces, 
one for each separate proportion, by boundary radii. 
Each proportion corresponds to a single slice, or 
sector, thus formed; and the central angles (equiv- 
alently, the circumference arcs, or again the sector 
areas) are proportional to the magnitudes of the 
proportions. Frequently, shading or coloring is used 
to help distinguish the sectors, and of course proper 
labeling is essential. Although a pie chart can be 
very effective in simple situations, it becomes diffi- 
cult to use if there are more than four or five sec- 
tors or if one wishes to compare corresponding 
proportions for several pie charts. 
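
The division of the circle can be sketched as a short calculation. The function name `sector_angles` and the three values are invented for illustration.

```python
def sector_angles(values):
    """Central angle in degrees for each part of the total."""
    total = sum(values)
    return [360.0 * value / total for value in values]

# Hypothetical parts making up 50, 30, and 20 per cent of a total.
angles = sector_angles([50, 30, 20])
print(angles)
```

The angles always sum to 360 degrees, whatever units the original values are in.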

Trilinear charts. The trilinear chart (or barycentric
coordinate system) is used to portray simultaneously
three nonnegative variables with a fixed
sum. Usually they are percentages and the sum 
is 100. It is drawn in the form of an equilateral 
triangle, each side of which is calibrated in equal 
percentage divisions ranging from 0 to 100. 
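
As a sketch of how a point is located on a trilinear chart, the function below converts three components with a fixed sum into ordinary x, y plotting coordinates. The placement of the triangle (unit side, one vertex at the origin) and the name `trilinear_to_xy` are assumptions made for illustration.

```python
import math

def trilinear_to_xy(a, b, c):
    """Plotting position of (a, b, c) inside an equilateral triangle.

    Assumed vertices: 100 per cent a at (0, 0), 100 per cent b at (1, 0),
    and 100 per cent c at (0.5, sqrt(3)/2).
    """
    total = a + b + c
    share_b = b / total
    share_c = c / total
    x = share_b + 0.5 * share_c
    y = (math.sqrt(3) / 2) * share_c
    return x, y

print(trilinear_to_xy(100, 0, 0))
print(trilinear_to_xy(0, 0, 100))
print(trilinear_to_xy(20, 30, 50))
```

Because the three components have a fixed sum, two plotting coordinates are enough to show all three.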

Scatter diagrams. The scatter diagram (scatter- 
gram) and other types of correlation charts portray 
in graphic form the degree and type of relationship 
or covariation between two series of data. The 
scatter diagram shows a two-way or bivariate fre- 
quency distribution. Customarily, arithmetic scales 
are used in the construction of scatter diagrams, 
although semilogarithmic or double-logarithmic 
scales sometimes may be more appropriate. 

Pictorial charts. Pictorial graphs are used 
mainly because of their popular appeal, although 
they rarely convey more information than do more 
conventional graphic forms. In general, there are 
four basic types of charts in which pictorial sym- 
bols are used, distinguished by criteria of purpose 
and emphasis. 

The four types of pictorial charts are (1) charts 
in which the size of the pictorial symbol is propor- 
tional to the values portrayed; (2) pictorial unit 
graphs, in which each symbol represents a definite 
and uniform value; (3) cartoon and sketch charts, 
in which the basic graphic form, such as a curve 
or bar, is portrayed as a picture; and (4) charts 
with pictorial embellishments ranging from a single 
pictorial filler to elaborate and detailed pictorial 
backgrounds. 

Three-dimensional projection charts. In recent 
years it has become common practice to portray 
various kinds of graphs and charts in axonometric, 
oblique, and perspective projection. Charts in three- 
dimensional form, with depth and other picturelike 
qualities, unquestionably possess definite popular 
appeal. 

Figure 5 — Crosshatched map*

* Note hatching gradation from light to dark. Each type of crosshatching represents a value interval in a series of averages.

Source: Data from U.S. Bureau of the Census 1962, pp. 15-24.

Figure 6 — Spot map*

* [Beginning of note not recoverable] . . . ity and basic distributional patterns. Note also census tracts, concentric circles, and generalized land-use features on base map.

Source: Schmid & McVey 1964, p. 9.

Figure 7 — Chart in three-dimensional projection(a)

[Chart title: Enrollment, colleges and universities, full-time students, State of Washington, fall terms: 1960 and 1961. “Full-time students” includes those who are registered for 12 or more credit hours. For each institution, left column represents 1960, right column 1961.]

a. Note especially that columns are drawn in oblique projection.

The design of charts and graphs in three-dimensional
form should be based on technically acceptable
principles of axonometric, oblique, and
perspective projection. Axonometric and oblique
projections are the most satisfactory for
three-dimensional graphs. Although perspective
projection is perhaps the most realistic of the three, it
possesses serious limitations as a technique in
graphic presentation. Charts constructed in
perspective projection are generally distorted; they do
not portray exact distance, shape, or size. Figure 7
illustrates a map in three-dimensional form.


CALVIN F. SCHMID 


[See also TABULAR PRESENTATION.]
GRAS, N. S. B.


Norman Scott Brien Gras, pioneer business his- 
torian, was born in 1884 in Toronto of German 
and Irish parentage. His father was an unsuccess- 
ful general retailer and moved from place to place 
in search of a living. As a child, Gras had to work 
as a mill delivery boy and at other lowly jobs, hard 
beginnings which help to explain him as an adult. 
Endowed with extraordinary intellectual capacity, 
he was able to overcome economic difficulties and 
complete his undergraduate training at the Univer- 
sity of Western Ontario. He went on to obtain his 
Ph.D. in economics at Harvard University. His
thesis was written under Edwin F. Gay, America’s 
first economic historian, whom Gras himself con- 
sidered his “intellectual father.” From 1912 to 
1917 Gras was an assistant professor at Clark 
College and in 1917/1918 an associate professor 
at Clark University. From there he went as a full 
professor to the University of Minnesota and then 
in 1927 to the Graduate School of Business Admin- 
istration, Harvard University, where he remained 
as Isidor Straus professor of business history until
1950, when he retired. From 1949 to his death in 
1956, he was president of the Business History 
Foundation, Inc., under whose auspices the history 
of the Standard Oil Company (New Jersey) was 
being written. 

Gras’s career falls into two main periods. In the 
first he devoted himself chiefly to problems in 
English social and economic history, and he car- 
ried on most of his research in England. English 
scholars appreciated his work, and he was made 
a member of the Council of the English Economic 
History Society and eventually, in 1939, a vice- 
president. 

The second period in Gras’s career began with 
his appointment to the faculty of the Harvard 
Business School. In the 1920s and early 1930s, 
which constituted his most creative period, Gras 
had begun to focus on a new intellectual problem:
he began to make business the subject of systematic
historical research. Similar ideas were then
being broached by leading professors at the Har- 
vard Business School, and when the new field was 
established at that school, Gras’s appointment was 
eminently suitable. 

The result of his endeavors was “business his- 
tory,” a new concept to which Gras gave content 
and which he initially defined broadly. Because he 
was the first American teacher in this field, it was 
incumbent on him to collect source material. Since 
the Harvard Business School used the case method 
in its teaching, in 1939 Gras and his lifelong asso- 
ciate, Henrietta M. Larson, brought out a Casebook 
in American Business History for use in his classes. 
Even earlier he had begun to edit a series of com- 
pany histories, the Harvard Studies in Business 
History. He wrote two of them himself, one on the 
First National Bank of Boston (1937) and another 
on the Harvard Co-operative Society (1942). In 
and outside of the school, Gras worked incessantly 
for the recognition of the new field. It is his merit 
that the use of business records has become com- 
mon among economic historians. 

Shortly after Gras’s arrival at Harvard, the Jour- 
nal of Economic and Business History was founded. 
It was the first American journal to deal with 
economic history and the first in the world to aim 
at combining economic history with the new busi- 
ness history. Gras became its highly efficient man- 
aging editor. Unfortunately, he pushed his special 
interest somewhat too hard and came into conflict 
with the powerful Edwin F. Gay, the editor of the 
journal. Gay withdrew in 1931, and Gras succeeded 
him as editor, but his opportunity to mold the
journal was short-lived, since financial reasons 
compelled the ceasing of publication in 1932. The 
conflict between Gay and Gras was ultimately to 
isolate Gras from the rest of the American eco- 
nomic historians, since most of them were Gay’s 
former students; worse yet, it produced a separa- 
tion between business history and the better-devel- 
oped older field, economic history. 

Gras’s attempt to publish a synthesis and gener- 
alization of the material on business history he had 
accumulated for more than 15 years was hardly 
successful. He failed to distinguish between
analytical constructs, such as his stages of capitalist
development, and the economic reality which they
were designed to explain.

After its auspicious beginnings, the development 
of business history ran into difficulties in the 1930s. 
Financial trouble in the wake of the depression 
made it necessary to abandon the original broad
concept of “business history” and to concentrate on
company histories. Also, Gras was now faced with 
that resistance which many innovators are bound 
to meet. Besides these problems beyond Gras’s 
control, his endeavors did suffer from his aliena- 
tion from his colleagues. The narrowing down of 
the field, originally forced on Gras, became after a 
while a matter of his choice, and he came to iden- 
tify “business history” with the study of the admin- 
istration and policies of individual enterprises. His 
work, moreover, was empiricistic. His attempts to 
stimulate interest in the new area through an ap- 
peal to businessmen and his way of gaining access 
to business records aroused the suspicion of intel- 
lectuals and social scientists, because in this period 
they tended to be hostile toward business. Finally, 
although Gras was an inspiring teacher and a 
friend of students, in his later years he seemed 
unable to deal with his peers on the basis of intel- 
lectual give-and-take. He appeared unwilling to 
cooperate with other scholars who came to the new 
field of research on business with more refined 
tools and with related, yet different, goals. By the 
time of Gras’s death the exponents of business 
history had returned to the broad and fruitful con- 
ception of his earlier years and were developing 
the field as the social and economic history of 
business. 

FRITZ REDLICH


[For discussion of the subsequent development of Gras’s 
ideas, see History, article on BUSINESS HISTORY.] 
GRAUNT, JOHN


John Graunt (1620-1674) is generally regarded 
as having laid the foundations of demography as 
a science with the publication of his Natural and 
Political Observations Made Upon the Bills of 
Mortality (1662a). Graunt began, as many scien- 
tists have done, by exercising idle curiosity—“Now 
having (I know not by what accident) engaged my 
thoughts upon the Bills of Mortality . . .” ([1662a]
1939, p. 3)—and proceeding to “observations, 
which I happened to make (for I designed them 
not)...” (ibid., p. 5). Characteristically, he did not 
believe that curiosity should remain idle: “finding 
some Truths, and not commonly-believed Opin- 
ions . . . I proceeded further, to consider what ben- 
efit the knowledge of the same would bring the 
World” (ibid., p. 18). 

Graunt’s application of the statistical method to 
raw material had two essential elements: (1) clas- 
sification of like with like, so as to break the data 
down into homogeneous groups, and (2) the 
comparison of those groups in order to recognize 
significant differentials. Another attribute of Graunt 
as a statistician was his ability to apply logic to
his arithmetic. He never accepted a calculation if 
it offended his common sense; nor did he accept it 
without a test, if he had the means of testing it. 

As to classification, we may read in Chapter 2 of 
his Observations his very sound remarks about the 
identification of causes of death, then dependent 
on the observation of the “ancient matron” search- 
ers (ibid., pp. 27-32). Deaths at advanced ages, 
he argued, can hardly be safely attributed to one 
specific cause, and not much worthwhile informa- 
tion can be expected about sudden deaths beyond 
their suddenness. As to comparison, we find in the 
same chapter his observation about the stability of 
proportionate mortality rates for certain causes, 
and in Chapter 4 (ibid., pp. 45-48) his observa- 
tion that in plague years, deaths from causes other 
than plague were inflated to such an extent as to 
lead to his conclusion that of deaths from plague 
fully one-fifth were attributed to some other cause 
(in order to avoid the closure of the plague-infested 
dwelling and the virtual incarceration of the surviving
members of the household). Again in Chapter
ter 3 (ibid., pp. 33-44), we may see how com- 
parison of mortality rates for rickets with mortality 
rates referring to possible similar names for the 
same disease led Graunt, after examining the fig- 
ures, to conclude that rickets was a new disease 
and not a new name for an old one. (This was 
probably a wrong conclusion, but Graunt was not a 
physician and would not have known that medical 
attention was at that time being drawn to the 
disease.)

The method of comparison was again employed 
when Graunt estimated the population of London 
in Chapter 11 (ibid., pp. 67-70). He used three 
methods and reconciled them. First, he estimated 
that a fertile woman had a baby every other year, 
that fertile women numbered half the married 
women, and that there were seven other members 
of a family (husband, 3 children, 3 servants);
therefore, the population was 32 times the number 
of annual births. This gives 384,000. Next he esti- 
mated that in certain parishes there were annually 
three deaths from every 11 families: therefore, the 
number of deaths (13,000) multiplied by 11/3 
yielded 48,000 families, or 384,000 persons. Finally, 
he observed 54 families per square of 100 yards
on a side within the walls (12,000 families) and
guessed that there were three times as many families
outside the walls, which again gives 48,000 families
in all.
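
The arithmetic behind the three estimates can be laid out as a short calculation. This is a sketch only: the figure of 12,000 annual births is not stated in the text but is implied by 384,000 divided by 32, and the variable names are invented for illustration.

```python
PERSONS_PER_FAMILY = 8           # wife plus seven other members

# Method 1: births. One birth per fertile woman every other year, and
# fertile women are half the married women, so families = 4 x births.
annual_births = 12_000           # implied by 384,000 / 32 (assumption)
families_from_births = annual_births * 2 * 2
population_from_births = families_from_births * PERSONS_PER_FAMILY

# Method 2: deaths. Three deaths a year for every 11 families.
annual_deaths = 13_000
families_from_deaths = annual_deaths * 11 / 3
population_from_deaths = families_from_deaths * PERSONS_PER_FAMILY

# Method 3: the map. 12,000 families within the walls and three
# times as many outside them.
families_within_walls = 12_000
families_from_map = families_within_walls * (1 + 3)
population_from_map = families_from_map * PERSONS_PER_FAMILY

print(population_from_births)
print(round(population_from_deaths))
print(population_from_map)
```

The first and third methods give 384,000 exactly; the second gives about 381,000, which rounds to the 48,000 families quoted in the text.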

Graunt’s most important contribution to demography
was his rudimentary life table. Its importance
lies not in the table itself, which is indeed
defective, but in the novelty of presenting mortality 
in terms of survivorship. Graunt began with only 
two observations—the proportion of births surviv- 
ing to age 6 (.64) and the proportion surviving 
to age 76 (.01). He then assumed that a constant 
multiplier is involved in proceeding from the first 
proportion to the last by decennial intervals of
age. He did not make clear how he got his multiplier
(or “mean proportion,” as he called it), but
it was obviously about .6. Michel Ptoukha (1938)
has put forward the hypothesis that the multiplier
was (64 - 1)/100 = .63. D. V. Glass has suggested
(1950) that Graunt made a second difference
interpolation from $l_{0} = 100$, $l_{6} = 64$, and
$l_{76} = 1$. The method and any error in it are not
important. (Halley was not to make the correct
calculation for another thirty years or so.) So much
has been developed
from this simple but immensely powerful
thought that actuarial criticism would be quite out 
of place. 
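
The idea of a constant multiplier can be illustrated with a short calculation. It is a sketch under stated assumptions, not a reconstruction of Graunt's own working: the function name `survivors` is invented, the multiplier .63 is the hypothesis attributed to Ptoukha in the text, and the second multiplier is simply the exact geometric ratio that carries 64 survivors at age 6 to 1 survivor at age 76 in seven ten-year steps.

```python
def survivors(start, multiplier, steps):
    """Survivors per 100 births at each successive ten-year age."""
    values = [start]
    for _ in range(steps):
        values.append(values[-1] * multiplier)
    return values

AGES = [6, 16, 26, 36, 46, 56, 66, 76]

# Multiplier of .63, as in the hypothesis cited in the text.
by_hypothesis = survivors(64, 0.63, 7)

# Exact constant multiplier linking 64 at age 6 to 1 at age 76.
exact_multiplier = (1 / 64) ** (1 / 7)
by_exact_ratio = survivors(64, exact_multiplier, 7)

for age, a, b in zip(AGES, by_hypothesis, by_exact_ratio):
    print(age, round(a, 1), round(b, 1))
print(round(exact_multiplier, 3))
```

A multiplier of .63 leaves between two and three survivors per hundred at age 76 rather than one, so a strictly constant ratio of that size does not by itself join the two observations.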

Graunt is rarely considered apart from another 
English scientist of the time, Sir William Petty.
Both were of Hampshire stock; Graunt was born
in 1620 and Petty in 1623. They became acquainted 
in or before 1650. It appears that their relationship 
was initially that of client and patron (Graunt 
being Petty’s patron; see Greenwood 1941-1943), 
but the roles were reversed after the fire of London 
in 1666. 

Graunt was the son of a city tradesman and 
became a haberdasher and a man of substance. 
Petty, who had sampled the merchant navy, studied 
mathematics in France, spent a short time in the 
Royal Navy, returned to the Continent to study 
anatomy, spent some time in business in London, 
and went to Oxford in 1649, becoming a doctor of 
medicine by dispensation. Later he rose to be pro- 
fessor of anatomy and vice-principal of Brasenose. 
He became a candidate for a Gresham professor- 
ship in London and made contact with Graunt. 
We do not know what interests they shared at that 
time or whether Graunt (or his father) had any 
influence in Petty’s subsequent appointment as 
Gresham professor of music. Soon after, Petty 
(who seems to have had no professional duties) 
went to Ireland and made a fortune. For many 
years Graunt remained a prosperous tradesman, 
but the fire of 1666 destroyed his business. A little 
later he became a Roman Catholic convert and was 
apparently not interested in rebuilding his busi- 
ness, for he was soon bankrupt. It was now Petty’s 
turn to be the patron, and Graunt, the client—and 
apparently not an easy one. Despite Petty’s attempts
to help him, Graunt’s financial troubles
were with him until he died in 1674. 

Petty’s vital statistical work was on a different 
level from that of Graunt. Petty had inspiration 
and brilliance, many ideas and a breadth of vision, 
but he did not pursue his ideas as persistently as 
Graunt did. It was Petty who proposed a central 
government statistical office and a system of census 
taking. He anticipated William Farr in estimating 
the economic loss due to mortality. Many of his 
calculations, however, do not stand up to the tests 
of consistency that Graunt would have applied. It 
seems to be generally agreed that Graunt made an 
immeasurably greater contribution to demography. 

Essentially, Graunt was a man with an inquiring 
mind whose ideas laid the foundation of the politi- 
cal arithmetic that we now accept as part of good 
social organization and government. The Royal 
Society of London, of which Graunt was elected a 
fellow, commemorated the tercentenary of the pub- 
lication of his Observations by holding a special 
series of scientific meetings to which leading 
demographers in England and other countries
contributed papers (Glass 1963). The Institute of
Actuaries of England paid its own tribute to Graunt by
republishing, in a more modern format but without 
abridgment or alteration, his original Observations. 


B. BENJAMIN 


[For the historical context of Graunt’s work, see PUBLIC
HEALTH; VITAL STATISTICS; and the biography of
PETTY. For discussion of the subsequent development
of his ideas, see LIFE TABLES; MORTALITY; and
the biographies of KŐRÖSY; LOTKA.]
GREEN, T. H.


Thomas Hill Green (1836-1882), English phi- 
losopher, social theorist, and reformer, was born 
in Birkin, Yorkshire, where his father was rector. 
He became a fellow of Balliol College, Oxford, in 
1860, and professor of moral philosophy in 1878 
but died four years later at the age of 45. Green 
initiated and inspired the British renaissance of
Hegelian idealism, which started in Oxford in the
1870s and dominated academic philosophy in 
Britain until its retreat before the criticisms of 
Bertrand Russell and G. E. Moore in the early 
years of the twentieth century. 

Green’s aim was to demolish the prevailing sys- 
tem of thought that grafted evolutionary ideas onto 
the stem of traditional empiricism and to lead his 
contemporaries away from J. S. Mill and Huxley 
and toward Kant and Hegel. He began with an 
attack on received views in metaphysics and the- 
ory of knowledge, which appeared in an introduc- 
tion to the works of David Hume (1874). Knowl- 
edge, he held, could not be derived by a merely 
passive mind from a sequence of discrete sensa- 
tions. Nature is an unalterable scheme of intelligi- 
ble relations constructed by the mind; this implies 
an equally unalterable and objective eternal
consciousness of which all finite minds are parts. As
the constructer of nature, mind cannot be a part or 
product of it. 

The prevailing system of philosophy, which re- 
garded man as a part of nature governed by 
deterministic laws, ruled out conduct as much as 
it ruled out knowledge. Green believed will and 
choice, the rational determinants of action, must 
instead be distinguished from passive compliance 
with the promptings of natural desire. In an act of 
will, a man identifies himself with his ideal self 
or character, which is part of the eternal conscious- 
ness and not of the mechanical order of nature. 
The moral end is the full realization of one’s poten- 
tialities, and these can be realized only in a social 
community. For Green, as for Rousseau and Hegel 
before him, genuine human personality was an 
essentially social phenomenon, and it was incon- 
ceivable that isolated natural man should be a 
moral agent. Green was not entirely hostile to Mill’s 
chastened utilitarianism; he shared its rejection of 
customary rules but conceived the ideal end of 
conduct as moral perfection, not individual con- 
tentment. (For Green’s metaphysics and ethics see 
Green 1883.) 

Green’s social theory (1882) follows Rousseau 
and Hegel to some extent in subordinating the 
individual to the community. The duty of the citi- 
zen is to follow the general will for the common 
good. But for Green the community is not coter- 
minous with the state; he remained in the liberal 
tradition in asserting the sovereignty of morals over 
politics. There is some of Hegel’s contempt for the 
individual conscience in Green’s well-known view 
that there can be no unrecognized rights, but he 
did not say that all true rights were legal ones. The 
recognizing authority is not the state but the moral 
consensus of the community. Positive law can be 
criticized and improved upon in the light of the 
state’s ideal purpose: the moral perfection of men. 
Since morality presupposes freedom the state can- 
not serve this end directly, but it can create favor- 
able conditions for it. Green valued democracy as 
giving men the kind of responsibility that is con- 
ducive to moral improvement. Participation in 
democracy is a duty, not a right; an opportunity 
for moral development, not a negative emancipa- 
tion from authority and constraint. 

Green’s special combination of moral individual- 
ism and collectivism, in particular his idea of the 
state as an indirect agent of moral improvement, 
led him to favor the intervention of the state to 
secure the welfare of its citizens (1881). He was 
opposed to Mill’s libertarianism and Spencer’s bar- 
barous version of laissez-faire. In his support of 
the enlargement of the state’s responsibilities he 
anticipated the doctrines of the British Labour 
party. He became the first Oxford tutor to serve on 
the city council, where he actively promoted tem- 
perance reform and public secondary education. 
Realistic in his respect for the force of circum- 
stances and the moral sense of the community, 
Green was no traditionalist. He admired Cromwell 
(from whom he claimed descent) and shared his 
republicanism. He was hostile to class distinction, 
militarism, and patriotic display. The first impor- 
tant English philosopher since Ralph Cudworth to 
spend his active life in a university, he supported, 
by precept and example, the involvement of phi- 
losophers in the social and political life of their 
society. The character of Mr. Grey in Mrs. Hum- 
phry Ward’s novel Robert Elsmere is an admiring 
portrait of Green’s nobly serious personality, in 
which a determined puritanism was at once 
strengthened and mitigated by decency and prac- 
tical good sense. 


ANTHONY QUINTON 


[For the historical context of Green’s work, see the 
biographies of HEGEL; MILL; ROUSSEAU. For discus- 
sion of the development of his ideas, see the biog- 
raphies of BOSANQUET; FIGGIS; LINDSAY.] 
GROTIUS, HUGO 


Huig de Groot (1583-1645), better known by 
the Latinized version of his name, Hugo Grotius, 
was born at Delft in the province of Holland. He 
was carefully educated by his father, whose intel- 
lectual interests were both broad and profound, 
and who imbued in him the characteristic civic 
pride of the Dutch patriciate. 

Grotius was extraordinarily young when his ex- 
ceptional gifts became apparent. From 1594 to 
1597 he studied in the faculty of arts of the Uni- 
versity of Leiden, the Protestant university that had 
been opened in 1575. In 1599 he was called to the 
bar at The Hague; and in 1607 he received his first 
public office as advocaat fiscaal (deputy attorney 
general) at the highest law court in the province 
of Holland. Five years later he was appointed pen- 
sionary of Rotterdam, a political office that gave 
him power not only in the city itself but also in the 
States (the representative assembly) of the prov- 
ince, where he acted as Rotterdam’s representative. 

Meanwhile, Grotius’ publications established him 
both as an accomplished neo-Latin poet and drama- 
tist and as an ambitious historian and jurist. As a 
jurist he wrote a short book The Freedom of the 
Seas (1609)—it was, in fact, just one chapter of 
the manuscript De jure praedae (1604), not pub- 
lished until 1868—in which he tried to prove that 
no authority is entitled to claim sovereignty over 
the high seas; although it was aimed at Spanish 
pretensions, the book also aroused the wrath of 
James I of England. 

In the 1610s Grotius became one of the major 
supporters of the grand pensionary, Johan van 
Oldenbarnevelt, who was engaged in a bitter strug- 
gle with the stadholder, Prince Maurice of Orange, 
on two issues: Oldenbarnevelt and Grotius sup- 
ported the cause of the Arminians and of pro- 
vincial sovereignty, particularly that of Holland. 
After their defeat in 1618, Oldenbarnevelt was 
executed and Grotius sentenced to life imprison- 
ment. Thanks to the resourcefulness of his wife, 
Grotius escaped to France after three years; he was 
welcomed by Louis XIII, who paid him a pension, 
albeit irregularly. From 1621 to 1631 he lived in 
Paris with his family in relatively poor circum- 
stances. He lived by his pen, expecting all the while 
that the justice of his cause would lead to his 
eventual rehabilitation in the Netherlands. In 
1631 he made an attempt to re-establish him- 
self there but was forced to leave the following 
year. In 1634 he accepted the offer of Queen 
Christina of Sweden to become her ambassador 
to France, and for ten years he worked in that 
capacity in Paris, a somewhat eccentric scholar 
of bourgeois origin among titled professional 
diplomats. 

Diplomacy bored Grotius, and he could not af- 
ford to wait long years for his salary to be paid, as 
could his noble colleagues. He tried to compensate 
for his inadequacy as a diplomat by working for a 
goal far above the pettiness of routine politics: the 
restoration of Christian unity. During his last years, 
this came to be his major preoccupation. In 1645 
he went to Sweden to offer his resignation. He went 
by way of Holland, where at last he was welcomed 
with the warmth he had been expecting in vain for 
two decades. On the journey back from Sweden, 
where Christina had received him with utmost po- 
liteness and with equal politeness accepted his 
resignation, his ship was driven off course, and he 
went ashore on the Pomeranian coast. He took the 
road to Lübeck but did not reach that city: forced 
to rest in Rostock, he died there from exhaustion. 

Grotius’ intellectual achievement has a para- 
doxical character. He won great fame as a poet and 
dramatist, as a historian, a philologist, a theo- 
logian, and, of course, a jurist. But learned dis- 
cussions about the contributions he made to all 
these different fields have not come to any con- 
clusion about whether he was essentially a con- 
servative who put together in magnificent syntheses 
opinions previously held by others or an innovator 
boldly treading new ground. The endless variety of 
his work, the mixture of precision and suppleness 
in his thought, and the sheer bulk of his learning 
make it almost impossible to determine with any 
certainty the degree of originality of his views. 

He was very much a man of the baroque age. 
His motto, ruit hora (time flies), the Latin lan- 
guage in which he wrote most of his books, and 
his profound awareness of life's antinomies indi- 
cate how fully he belonged to the civilization of his 
time. Yet in the deeply pessimistic early seven- 
teenth century his optimism and rationalism were 
exceptional and so effortless that they may seem 
shallow in comparison with the views of such phi- 
losophers as Descartes. His theological studies were 
inspired by deep religious feelings, but they brought 
him into conflict with Protestants of various de- 
nominations and gave both Catholics and, later, 
deists the erroneous impression that he supported 
their views. His constant endeavors to heal the 
breaches in the Christian church caused the most 
confusing misunderstandings and involved him in 
acrimonious controversy. 

During his imprisonment, from 1618 to 1621, 
Grotius wrote, in rhyme, The Truth of the Christian 
Religion; it was published in Dutch in 1622, and 
in 1627 Grotius’ own Latin version followed. In 
this simple book, written for seamen who might 
be impressed by foreign religions, Grotius ex- 
plained and lauded the main tenets of Christianity. 
The book was a great success; it was translated 
not only into all the major European languages but 
also into Danish, Irish, Hungarian, and Arabic, and 
there are 110 known editions of it. 

Grotius’ masterpiece, The Law of War and Peace 
(1625), was also an immediate and widespread 
success, if not on the same scale as his little book 
on Christianity: it has gone through at least 75 
editions and has been translated 24 times. In this 
book he presented his famous doctrine of the just 
war. War, in his view, is justified as a means of 
obtaining justice in cases where no law court exists 
to give a ruling upon the matter under dispute. 
Most of these cases are, of course, international 
conflicts, such as the revolt of the United Provinces 
against Spain. A contestant may take up arms in 
order to defend his property or his rights, to take 
possession of what is due to him, or to punish 
criminal offenses. Thus, war is essentially a law- 
suit carried out by armed force because there is no 
court that can deal with it. 

Other concepts are also presented in this large 
book, which is like a warehouse of opinions, quo- 
tations, conflicting doctrines, and debates. Espe- 
cially noteworthy is Grotius’ doctrine of natural 
law, for it exerted considerable influence, even if 
it was neither coherent nor strikingly original. 
Since he considered natural law to be basic to 
all social organization, international or national, 
Grotius’ initially juridical theory developed into a 
more general one that analyzed and explained not 
only the conditions of international justice but 
every aspect of human society. In other words, his 
work pertains as much to general sociology as to 
international law. 

Grotius defined the law of nature as “a dictate 
of right reason, showing the moral necessity or 
moral baseness of any act according to its agree- 
ment or disagreement with rational nature, and 
indicating that such an act is either commanded or 
forbidden by the author of nature, God” (1625, 
pp. 20-21 in a 1949 edition). This does not differ 
essentially from scholastic conceptions, and it is 
somewhat misleading to claim, as has often been 
done, that Grotius made an original contribution 
by secularizing the medieval interpretation of 
natural law. For Grotius, just as for the medieval 
thinkers and the sixteenth-century Spanish lawyers 
whom he quoted, the law of nature is an objective 
datum, an absolute norm given for all eternity. It 
is only later in the century, with Hobbes and other 
theorists, that the law of nature, identified with 
the instinct of self-preservation, developed into an 
essentially individualistic, subjective, and secular 
concept. Grotius, according to Erik Wolf (1939), 
saw God, nature, and reason as only differ- 
ent names for the metaphysical foundation of life, 
which to human beings becomes manifest in law. 
Society is a natural and necessary form of concrete 
law because man is a social being endowed with 
reason. Thus, human society is also by definition 
rational. 

These premises served Grotius as starting points 
for determining the rational quality in social and 
political life. It was not his purpose to draft a po- 
litical theory. Various political concepts that seemed 
essential to his contemporaries were rather indif- 
ferently treated by him. Sovereignty, for example, 
did not mean much to a thinker educated in the cor- 
porative, patrician Dutch republic that was slowly 
emerging when he was young. Yet he transcended 
his origins by regarding all nations, sovereignties, 
and even churches as mere elements of the largest 
possible social entity, the human race in general. 
Therefore, the societies under examination ap- 
peared to him as manifestations of a social and 
ethical order in which each social element has its 
place, determined by its own existential principle. 


But all of them are embraced by the corporative 
unity which is humanity—primarily, of course, 
Christian humanity. 

The Law of War and Peace is more than a 
philosophical and sociological monograph. To make 
it useful to the practical statesman, Grotius en- 
deavored to determine the justifiability of particular 
actions that are often taken prior to war or in war, 
according to simple principles laid out by him. 
Although, for example, Gustavus Adolphus is said 
to have consulted the book frequently while cam- 
paigning in Germany, it is obviously impossible to 
assess the actual impact of its prescriptions on the 
conduct of war. But this much is certain: the work 
exercised a profound influence on the development 
of international law, and even in the present cen- 
tury it is referred to not only by professional 
lawyers but also by statesmen. The remarkable mix- 
ture of idealistic optimism—for in spite of interna- 
tional chaos, Grotius viewed the ethical content of 
natural law as a principle that automatically asserts 
itself, since, in the last analysis, rational and 
natural behavior are identical—and realism, which 
sprang from a critical study of history, rendered his 
concepts and advice applicable to many situations 
and attractive to many different groups. 


E. H. KOSSMANN 


[See also INTERNATIONAL LAW and the biography of 
VATTEL.] 
I 
THE STUDY OF GROUPS 


In social science the study of groups is often 
called small-group research. A “group” is defined as 
a number of persons, or members, each of whom, 
while the group is meeting, interacts with every 
other, or is able to do so, or can at least take per- 
sonal cognizance of every other. This requirement, 
which cannot be met for larger social units, such 
as armies, justifies calling groups “small.” Although 
no investigator has specified the number of mem- 
bers that would make a group “large” rather than 
“small,” the groups actually studied have seldom 
had more than fifty members. It is, moreover, often 
difficult to draw the boundaries between groups, as 
marginal persons may interact with members of 
two or more groups. 

Investigators working within the field of small 
groups seldom study the behavior of groups as 
such. More often they study the characteristics of 
face-to-face interaction among men, of which the 
behavior of groups considered as units is only a 
part. Accordingly, the subject should perhaps be 
called “elementary social behavior” (Homans 1961) 
rather than “small groups.” A group so small that 
an investigator can observe the behavior of each of 
its members in some detail is a good setting for the 
study of this subject, but the group usually remains 
the setting rather than the object of investigation. 
For this reason the problem of defining a group or 
demarcating its boundaries is seldom of theoretical 
importance. When the behavior of a group as such 
is to be studied, its boundaries, in practice, are apt 
to be quite clear. 


History of small-group research 


The interest in groups has sprung from two main 
sources, which may be called psychological and 
sociological after the disciplinary affiliations of the 
chief investigators. In psychology the interest per- 
haps began with studies of social facilitation. From 
the 1930s on, it was stimulated by the experi- 
mental and theoretical work of Kurt Lewin on 
social influence. Lewin trained a group of students, 
including R. Lippitt, D. Cartwright, A. Bavelas, and 
L. Festinger (Cartwright & Zander 1953; compare 
Lewin 1939-1947). Members of the Tavistock In- 
stitute in England largely shared their viewpoint. 
Although many psychologists in the field were not 
students of Lewin’s, few escaped his influence. 

In sociology much of the work of Georg Simmel 
(1902-1917) lay in the field now called small- 
group research. But his influence was not direct: 
indeed, later research had the effect of reviving 
interest in him. The other early name of impor- 
tance is that of Charles H. Cooley, who pointed out 
the influence of certain kinds of groups, such as 
the family and the children’s play group, which he 
called primary groups, in forming the attitudes of 
men (Cooley 1909). 

Simmel and Cooley were intuitive observers and 
theorists. For empirical investigation, Elton Mayo 
was to sociology what Lewin was to psychology. 
The researches carried out in the 1920s in the 
Hawthorne plant of the Western Electric Company 
by Mayo and his associates of the Harvard Gradu- 
ate School of Business Administration were con- 
ceived as studies of human relations in industry 
(Roethlisberger & Dickson 1939). But several of 
the researches put small industrial groups under 
close scrutiny and suggested the possible fruitful- 
ness of more general work. E. D. Chapple and 
C. M. Arensberg (1940), who developed opera- 
tional measures of face-to-face interaction, were 
indirectly associated with the Western Electric re- 
searches; so were sociologists like W. F. Whyte 
(1943) and G. C. Homans (1950), who have 
worked in the small-group field, as well as investi- 
gators at the Harvard Business School. Yet there is 
plenty of such work in sociology that does not stem 
directly from the school of Mayo. 

A third, rather specialized, influence in the 
1930s was that of J. L. Moreno (1934) and his 
associate, H. H. Jennings, who worked at the New 
York State Training School for Girls. They con- 
tributed interesting empirical findings and, more 
important, the technique they called the socio- 
metric test, which in one form or another is now 
used by most investigators. 

Great Britain, France, and the Scandinavian 
countries have also made important contributions 
to the field; in France an early theoretical influence 
was the “microsociology” of G. Gurvitch (1950). 


Methodology 

One justification for speaking of psychological 
and sociological “schools” is that until recently they 
tended to use somewhat different research designs. 
True to the traditions of their discipline, the psy- 
chologists emphasized experimental and survey 
methods. They would bring groups, sometimes 
groups artificially constituted for research pur- 
poses, into laboratories where some of the variables 
affecting the subjects’ behavior might be held con- 
stant and others experimentally manipulated. They 
developed quantitative measures of behavior and 
administered standardized questionnaires; they 
formulated hypotheses relating the independent, 
experimentally manipulated variables to the de- 
pendent ones; they tested the significance of these 
hypotheses statistically. They also exploited natural 
situations that provided some approximation to ex- 
perimental control—for instance, a set of groups 
all alike in some respect, such as residents of 
houses of identical design or groups having iden- 
tical tasks to perform. The psychologists would cor- 
relate statistically the variations along a number of 
dimensions of behavior occurring naturally in 
these groups. 

Instead of studying several similar groups under 
more or less controlled conditions, the sociologists 
were apt to concentrate on single, “real-life” groups. 
Instead of using systematic quantitative and survey 
methods, they would use the techniques of anthro- 
pological field work and nondirective interviewing, 
which provided, as they hoped, a richer and more 
rounded understanding. And instead of testing hy- 
potheses statistically, they would work out a quali- 
tative description of the social structure of the 
group in question. 

It is easy to exaggerate the differences between 
the psychological and sociological methodologies. 
Although the Western Electric researches are 
thought of as belonging to the sociological tradi- 
tion, the study of the Relay Assembly Test Room 
employed a more rigorous method of measuring 
output, gathered a greater body of quantitative 
data, and made a more elaborate statistical analysis 
than any other small-group study has ever done 
(Whitehead 1938). Today all investigators use in 
different mixtures much the same battery of tech- 
niques. The sociologists are ready to use survey 
methods in order to provide support for hypotheses 
first suggested by field work; and the psychologists 
will admit both that field work is a necessary pre- 
liminary to the design of a survey and that it helps 
the investigator understand what the findings of 
his survey “mean”—that is, how they are to be ex- 
plained. Most investigators now think of them- 
selves as social psychologists rather than as either 
psychologists or sociologists. 

Above all, both methodologies lead to the same 
end. The first business of any science is to state 
propositions, or hypotheses, about the relations be- 
tween variables. The psychologists tend to make 
their propositions explicit and to pretend they were 
formulated before the data were gathered. The 
sociologists are more apt to pretend that they 
studied their groups with open minds, ready for 
any relationship to appear in the data, and they 
often leave their propositions implicit in their de- 
scriptions of social structure. But their propositions 
can always be made explicit, and a proposition sup- 
ported by data remains a finding, whether formu- 
lated ex ante facto or ex post facto. Nor is it easy 
to understand why one should put greater con- 
fidence in a proposition tested on a set of groups 
similar in some controlled respect than in one that 
can be shown, as many of the findings of field re- 
search can be shown, to hold good for a number 
of otherwise quite dissimilar groups. In any event, 
no one has ever demonstrated that the findings of, 
for instance, laboratory research are inconsistent 
with those of field research, once the very different 
conditions that obtain for artificial and for “real” 
groups are taken into account. They are not always 
taken into sufficient account. 


Some leading problems 


Within the field of small groups numerous prob- 
lems call for investigation, and there have been a 
number of surveys and anthologies of research 
(Cartwright & Zander 1953; Hare 1962; Hare et al. 
1955; Olmsted 1959). Here only the chief ques- 
tions to which research has sought answers are 
mentioned. 

What determines whether a man changes his 
behavior or opinion under the influence of others? 
In particular, what determines whether or not he 
conforms to the norms of a group? What gives 
some men power over others? 

Small-group research deals with all these ques- 
tions within the context of a general concern about 
interpersonal relations, which may often be re- 
garded as stabilized resultants of processes of in- 
fluence. Accordingly, the researcher in this area, 
starting with a broad interest in the relations be- 
tween the behavior and opinions of men and their 
attitudes toward one another, may come to ask 
more specific questions. Here the ideas that have 
come to be called “balance theory” (Heider 1958) 
should be specially mentioned. What are the con- 
ditions of status congruence and distributive jus- 
tice, and what are the effects on interpersonal rela- 
tions of failure to meet these conditions? What 
determines an attitude of “respect” for another, 
rather than a more relaxed “liking”? What de- 
termines embarrassment or a “joking relationship”? 

Interpersonal relations combine to form larger 
social structures; consideration of this elementary 
social fact raises a new order of problems. How do 
coalitions form in the smallest social groups, such 
as triads? How do differences in status build up 
among the members of a group? What are the de- 
terminants of different forms of status system? 
What are the relations between status, influence, 
and conformity, or between status, the channels of 
communication in a group, and its division of 
labor? What are the determinants of conflict be- 
tween groups, and what are its effects on social 
structure within groups? Finally, there is the ques- 
tion of social control: What are the conditions 
maintaining stability in a social structure? 

Many studies have been devoted to questions 
arising from collaboration among group members 
in attaining a common goal. Is a group more effec- 
tive than an individual in solving problems? What 
are the results of competition as opposed to col- 
laboration? How do the motivation of its members, 
their interpersonal relations, patterns of communi- 
cation, and division of labor render a group more 
or less effective in attaining a goal? Are there any 
necessary phases a group goes through in solving 
its problems and reaching its goals? What are the 
effects of success or failure upon its internal struc- 
ture? What are the determinants of individual 
satisfaction with collaborative work? 

There are also a number of special questions re- 
lated to what English-speaking people call leader- 
ship. A “leader” may be defined as someone whose 
orders are in fact obeyed by many other persons, 
especially by the members of a group to which he 
belongs. What, if any, are the traits of personality 
that make a man a leader? What are the relations 
between the values of members and the behavior 
of the man who emerges as leader of a group? 
What is the relation between status and leadership, 
or between leadership informally won and authority 
formally assigned to a man by a larger organiza- 
tion of which his group is a part? What makes a 
leader successful in attaining individual goals or 
goals assigned to, or adopted by, a group? What 
are the effects of strong leadership on the other 
behavior of the leader himself and on the behavior 
of others toward him? 

Many of these questions lend themselves to 
applied research, which is research designed to 
provide people with knowledge that might allow 
them to change social behavior for the better from 
their point of view. The “better” behavior envisaged 
has ranged all the way from more productive indus- 
trial groups to discussion groups better organized 
to train the leaders of other discussion groups. In 
this field, research has been none the worse for 
being “applied” [see LEADERSHIP]. 


Codification of research findings 


The findings, the tested empirical propositions of 
small-group research, possess the following char- 
acteristics. In substance, they are very varied. They 
are formulated in many different terminologies: 
almost every investigator makes up one of his own. 
And they often appear to have no high degree of 
generality, to be contradictory or to hold good only 
in special circumstances. 

Since, nevertheless, the findings are all con- 
cerned with face-to-face social behavior, their cha- 
otic nature in other respects has stimulated a few 
investigators to the kind of criticism sometimes 
called codification (Homans 1950; 1961; Thibaut 
& Kelley 1959). When carried out deliberately, 
which it seldom is, codification goes behind the dif- 
ferent names given to the variables entering the 
findings and asks how these variables were actu- 
ally measured. When the same or equivalent opera- 
tions of measurement were used in different in- 
vestigations, codification gives these variables the 
same name. By this means it can sometimes show 
that the same findings have been reached in differ- 
ent investigations and so reduce the number of 
findings. 

When apparently contradictory findings survive 
such criticism, codification goes on to examine the 
conditions under which the findings were estab- 
lished, since two findings are not contradictory 
unless all the circumstances in which each holds 
good are identical. If one research, for instance, 
shows that two persons who interact often with 
each other tend to like each other and another 
shows that they do not, an examination of the other 
features of the two researches may reveal that in 
the latter the two persons were constrained to inter- 
act, whereas in the former they were not. 

These first steps in codification immediately sug- 
gest the next: an effort to reduce the number of 
independent findings still further by showing that 
a number of different findings, including contra- 
dictory ones, can be derived from the same set of 
general propositions under specified given condi- 
tions. From an assumption, for instance, about the 
determinants of sociometric choice (expressions 
of liking), together with the condition, which is 
approximately true of some groups, that members 
use the same criteria for choice, it follows both that a 
few members will receive many more choices than 
others do—which is an empirical finding—and that 
highly chosen members will choose one another— 
which is another empirical finding. 
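
The derivation just described can be illustrated with a minimal Python sketch. It is not taken from any study cited here. It assumes, purely for illustration, that each member has a single numerical standing on the shared criterion (the `scores` values are invented), that every member names the `k` others who rank highest on it, and that ties are broken alphabetically.

```python
def sociometric_choices(scores, k):
    """Each member chooses the k other members who rank highest
    on the one criterion that all members are assumed to share."""
    choices = {}
    for chooser in scores:
        others = [m for m in scores if m != chooser]
        # Highest score first; ties broken by name so the result is deterministic.
        others.sort(key=lambda m: (-scores[m], m))
        choices[chooser] = others[:k]
    return choices


def choices_received(choices):
    """Count how many choices each member receives."""
    received = {member: 0 for member in choices}
    for chosen in choices.values():
        for member in chosen:
            received[member] += 1
    return received


# Hypothetical group of six; the scores are illustrative, not data.
scores = {"A": 9, "B": 8, "C": 5, "D": 4, "E": 3, "F": 1}
choices = sociometric_choices(scores, k=2)
received = choices_received(choices)

# A few members receive most of the choices ...
print(received)
# ... and the most highly chosen members choose one another.
print("B" in choices["A"] and "A" in choices["B"])
```

Under these assumptions both findings fall out of the single premise about shared criteria: the choices pile up on the two highest-ranking members, and those two name each other. Where members use different criteria, neither result need follow.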

At this point codification begins to organize and 
unify the findings by explaining them. In any sci- 
ence, what is to be explained is always an empirical 
proposition of the general form x varies as y. Ex- 
planation is the process of showing that the prop- 
osition follows logically in a deductive system from 
more general propositions under specified given 
conditions, themselves stated in the form of propo- 
sitions. The more general propositions are more 
general in the sense that they enter into deductive 
systems by which other empirical propositions are 
explained; this is their unifying function. The 
given conditions are not general propositions, nor 
does the investigator undertake to explain them; 
he simply accepts them as matters of fact. A char- 
acteristic given condition in small-group research 
is the physical layout of the room in which a group 
meets. 
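
The structure of such an explanation can be set out schematically. The symbols are introduced here only for illustration and do not appear in the literature cited: G stands for the general propositions, C for the given conditions, and E for the empirical proposition to be explained.

```latex
\[
\underbrace{G_1,\ \dots,\ G_m}_{\text{general propositions}},\quad
\underbrace{C_1,\ \dots,\ C_n}_{\text{given conditions}}
\;\vdash\;
E \qquad (\text{e.g. } x \text{ varies as } y)
\]
```

On this reading, the same general propositions, combined with different given conditions, yield different empirical propositions, which is how apparently contradictory findings can be derived from a single set of premises.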

To explain a phenomenon is to produce a theory 
of the phenomenon: a theory is nothing if it is not 
an explanation. Many investigators, especially psy- 
chologists, begin their research reports by formu- 
lating a theory from which, they assert, the propo- 
sitions (hypotheses) they tested were derived. 
Codification begins by disregarding these theories 
and looking at the empirical findings themselves. 
But it must come back in the end to asking how 
they are to be explained. For this purpose it need 
not come back to the investigators’ own theories. A 
theory may be fruitful in suggesting hypotheses to 
be tested without in the long run providing the 
most general way of explaining the hypotheses 
[see SCIENTIFIC EXPLANATION]. 


The uses of theory 


The process of codification almost forces the 
codifier to ask himself what his general theory is. 
Two main types of general theory have been used, 
not always explicitly, in small-group research. They 
may be called sociological and psychological, but 
here these words refer to the characteristics of 
their general propositions and not to the academic 
affiliations of the men who use them, since a soci- 
ologist may perfectly well use a psychological the- 
ory, and vice versa. 

Sociological theory. The sociological type of 
theory is an offshoot of the “functional” theory with 
which the name of Talcott Parsons is particularly 
associated. It is sociological because its general 
propositions are propositions about social units as 
such and not about the behavior of individuals. It 
attempts to state the general conditions under 
which any social system survives, maintains itself, 
or remains in equilibrium. From these general con- 
ditions it claims to derive the particular features a 
given social system remaining in equilibrium “must” 
possess. The empirical findings represent these 
features. 

In small-group research a good example of such 
a theory is that of R. F. Bales (1950). He argues 
that if a discussion group is to remain in equi- 
librium and reach its goal, such as the formulation 
of a decision, the behavior of the members must 
collectively display certain characteristics. For in- 
stance, the group must pass through certain char- 
acteristic phases in discussion, or its members must 
develop a certain distribution of attitudes of respect 
and liking for one another. Like other functional 
theories, Bales’s derives the features of individual 
behavior from the necessities of group equilibrium, 
and not vice versa [see INTERACTION, article on 
INTERACTION PROCESS ANALYSIS]. 

The difficulties with functional theories of small 
groups are those of functional theories in general. 
No social scientist has ever succeeded in formu- 
lating a rigorous statement of the general condi- 
tions of social equilibrium. The statements actu- 
ally used are so vague that nothing definite can be 
derived from them. Moreover, many social groups 
do not remain in equilibrium, however it is defined, 
so that the range of phenomena the theory could 
explain must at best be limited. Finally, it is not at
all clear that another type of theory cannot explain 
the phenomena, one that lacks the disadvantages 
of functional theory and possesses the further ad- 
vantage of explaining findings functional theory 
cannot cope with. Indeed, functional theorists often 
find themselves inadvertently using the proposi- 
tions of the other type of theory. 

Psychological theory. Most students of small 
groups use in explanation, not always explicitly, 
some form of psychological theory. Psychological 
theories are so called because their most general 
propositions are propositions about the behavior of 
individual men, and not about the conditions of 
equilibrium in groups. They do not in the least deny 


that the behavior of men is social. What they do, 
in effect, deny is (for instance) that the general 
propositions describing the behavior of men when 
rewarded by the physical environment are different 
from those describing their behavior when re- 
warded by other men, although the analysis may be 
much more complex in the latter case than in the 
former. Psychological theories also assume there is 
no characteristic of a group that cannot be ex- 
plained by the characteristics of the individuals 
making it up: in this sense the whole is not greater 
than the sum of its parts. Finally, psychological 
theories assume that their propositions hold good 
for all men, that although, for instance, members 
of different societies learn to behave in concretely 
different ways because the present circumstances 
and past histories of these societies have been dif- 
ferent, yet the general characteristics of the learn- 
ing process remain the same for all men. 

Behavioral theory. In particular, some of the
most prominent codifiers of small-group research 
have come to find most satisfactory for the ex- 
planation of the phenomena one or another of the 
psychological theories (called “behavioral” or “learn- 
ing” theories) first developed by experimental psy- 
chologists working with animals. These theories 
need not be incompatible with other psychological 
theories, such as psychoanalytic ones, although the 
terminologies employed may be quite dissimilar. 

The chief propositions of behavioral theory are
of the following sorts: A man whose activity (some 
item of “voluntary” behavior) has been rewarded 
(or “reinforced”) is apt to repeat that activity. He 
will repeat it the more often, the more successful 
it is in obtaining the reward. He is also more apt 
to repeat it, the more valuable (reinforcing) the 
reward is to him, and the value of the reward de- 
pends on the degree to which he has been deprived 
of it. Any activity that allows a man to escape or 
avoid punishment is also by that fact rewarded. A 
man usually has more than one activity open to 
him and more than one reward for these activities. 
The probability of his emitting one of the activities 
rather than another depends on the relative value 
of the alternative rewards and the relative proba- 
bility of the success of the activities in obtaining 
them. When a particular stimulus situation (some 
set of cognitive elements) has also attended an 
occasion in the past when an activity has been 
rewarded, the presence of some similar stimulus on 
a new occasion will render it more probable that 
the activity will again be emitted. Finally, be- 
havioral psychology states various propositions 
about emotional behavior, such as that when a 
man’s activity has been regularly rewarded under 
particular circumstances and then suddenly ceases
to be so, he is apt to become angry, and in anger 
he is apt to find aggressive behavior rewarding. 
This is sometimes called the frustration-aggression
hypothesis. 
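
The proposition about alternative activities can be made concrete with a small sketch. It assumes, purely for illustration, that the probability of emitting an activity is proportional to the value of its reward multiplied by its chance of success. The text says only that the probability depends on these relative quantities, so the multiplicative form and the name `choice_probabilities` are assumptions, not part of the theory as stated.

```python
def choice_probabilities(values, success_probs):
    """Return the probability of emitting each alternative activity.

    Assumed form (not taken from the text): the probability of an
    activity is proportional to (value of its reward) times
    (probability that the activity obtains the reward).
    """
    if len(values) != len(success_probs):
        raise ValueError("one success probability is needed per reward value")
    weights = [v * s for v, s in zip(values, success_probs)]
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one activity needs a positive expected reward")
    return [w / total for w in weights]


# Two activities: the second reward is twice as valuable
# but the activity is only half as likely to succeed.
probs = choice_probabilities([1.0, 2.0], [0.8, 0.4])
print(probs)
```

Under this assumed form a more valuable but less attainable reward can leave a man indifferent between the two activities, which is the kind of trade-off the proposition describes.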

Behavior is social when the activities of each of 
at least two men reward (or punish) the activities 
of the other. Accordingly, the propositions of be- 
havioral psychology may be used to explain the 
characteristics of social behavior, which is the real 
subject of small-group research. The variables 
whose relations are explained are the relative fre- 
quency with which each person emits alternative 
activities, the values to each of the rewards pro- 
vided by the other, and the variations of stimuli 
along various cognitive dimensions, including the 
stimuli each person presents to the other. Since the 
value and frequency of the activities each emits 
affect the value and frequency of the other's ac- 
tivities, the way is at once opened for the explana- 
tion of social influence. As the number of persons 
in interaction increases the analysis obviously be- 
comes very complex, but it can sometimes be 
handled with the help of simplifying assumptions 
that may approximate real conditions. Particularly 
interesting is the situation in which many mem- 
bers of a group set a high value on a scarce reward 
(scarce in the sense that only a few of the mem- 
bers are able to supply it). In this situation be- 
havioral psychology is able to explain why there 
should be differences between the members in 
status and power. It can also begin to explain how 
a social structure, a stabilized set of social rela- 
tions, may develop and maintain itself in a group. 
Indeed, most of the phenomena investigated by 
small-group research can be explained, at least in 
a gross way, by the propositions of behavioral 
psychology [see LEARNING THEORY]. 

Since this theory envisages social behavior as an 
exchange between persons of goods (rewards) in 
different amounts and of different values, it en- 
visages social behavior as a generalized economy, 
and the specialized and highly developed economics 
dealing with prices and markets can be used to 
sharpen its formulations. Alternatively, behavioral 
psychology can help to explain the propositions of 
elementary economics, thus contributing to the 
intellectual unification of social science. 


The need for explanatory principles 

Too high hopes should not be held out for de- 
tailed explanation and prediction. The problem is 
partly one of getting the necessary information. 
Among the crucial variables, for instance, is the 
relative value to individuals of different rewards. 
Yet many values are not innate in men but ac- 
quired by them in the course of their past experi- 
ence. If the investigator knows that a man grew up 
in a particular culture, he may infer correctly that 
the man has shared the experiences, and hence the 
values, of other members of that culture. In other 
cases, the man’s values may be quite atypical, the 
precipitate of idiosyncratic experiences. Without 
detailed knowledge of the man’s past, the investi- 
gator, even if he commands the best psychology in 
the world, may be unable to predict the man’s social 
behavior. Behavioral psychology itself explains why 
this should be so. 

It must also be remembered that any small 
group, even one artificially formed, is part of a 
larger institutional structure. The behavior of its 
members cannot be wholly explained without ref- 
erence to that structure. Thus a jury is certainly a 
small group, but one required by law to reach a 
unanimous decision in secret. So far as its members 
conform to the legal norms, their conformity af- 
fects their other relations. This does not mean that 
conformity to institutional norms, or even the na- 
ture of the norms themselves, cannot be explained 
by psychological principles. It does mean that insti- 
tutional norms are always among the given con- 
ditions to be used in explaining face-to-face social 
behavior. Differences in cultural and institutional 
norms may help explain why apparently similar 
small-group experiments do not always yield the 
same results in different societies. 

Detailed explanation and prediction also presents 
a problem of a different kind. The fundamental 
propositions of behavioral psychology are very gen- 
eral and refer to the behavior of individuals. How 
is the investigator to show what the resultant, or 
synthetic, implications of a set of propositions 
would be when many individuals are interacting in 
complicated and varying circumstances? Here the 
best hope lies in the increasing use of high-speed 
computers, whose virtue is precisely that of working 
out quickly the detailed implications of general 
propositions under varied parametric conditions. 

Although students of elementary social behavior 
will take their basic explanatory principles from 
behavioral psychology, they will get little other help 
from the behavioral psychologists. Psychologists 
who have experimented with animals tend, when 
applying their principles to human behavior, to 
explain the learning processes of individuals or to 
jump to the gross features of institutions like re- 
ligion and government in the larger society. Yet it 
may well be that in the long run a more convincing 
link between psychology and sociology will be 
forged by those who study at the same time the 


detailed behavior of individuals in interaction and 
the simpler social structures that emerge therefrom. 
They may indeed establish the first theoretical 
organization of the social sciences on a sound 
basis. This is the continuing justification of small- 
group research. 

GEORGE CASPAR HOMANS 


[Directly related are the entries COHESION, SOCIAL; IN- 
DUSTRIAL RELATIONS, article on HUMAN RELATIONS; 
SOCIOMETRY. Other relevant material may be found 
in FUNCTIONAL ANALYSIS; ROLE; SOCIAL PSYCHOL- 
OGY; STATUS, SOCIAL; and in the biographies of 
COOLEY; LEWIN; MAYO; SIMMEL.]
II
GROUP BEHAVIOR 


Although William James, Charles H. Cooley, and 
George H. Mead had, at the beginning of the cen- 
tury, stressed the importance of primary groups in 
the development of individual personality, it was 
not until the 1930s that the small group became a 
serious focus of scientific attention. A most remark- 
able independent development of research interest 
in small groups by a number of creative people in 
different social science disciplines occurred during 
this period. Elton Mayo and his colleagues at the 
Harvard Business School conducted their famous 
studies of industrial work groups at Western Elec- 
tric; Jacob Moreno and Helen H. Jennings devel- 
oped sociometric methods for investigating group 
structure; William F. Whyte employed anthropo- 
logical techniques in the study that led to Street 
Corner Society; and Kurt Lewin, Ronald Lippitt, 
and Ralph White initiated the experimental study 
of democratic and authoritarian group leadership. 
At the same time, Paul Schilder, Samuel Slavson, 
and Trigant Burrow were doing pioneer work in 
group therapy. 

These early interests and activities collectively 
seemed to have reached the “critical mass” by the 
end of World War II, resulting in an explosive and
rapidly mushrooming interest in small groups. By 
1960, about 2,200 small-group studies had been 
published, more than 80 per cent of which ap- 
peared in the decade 1950-1960. Since 1960, ar- 
ticles have been appearing at the rate of more than 
250 per year. 

No attempt will be made in this article to present 
the detailed findings of this enormous outpouring 
of research. Hare (1962) and McGrath and Alt- 
man (1966) provide useful summaries of the liter- 
ature. Here I shall attempt to provide a framework 
within which some of the important findings can 
be integrated. The framework is oriented to such 
questions as: what is a group? what are the sig- 
nificant ways in which groups differ from one 
another? what are the effects of such differences? 

What is a group? An examination of the dif- 
ferent usages of the term “group” suggests that 
each combines a greater or lesser number of the 
following distinguishing criteria: two or more per- 
sons who (1) have one or more characteristics in 
common, (2) perceive themselves as forming a 
distinguishable entity, (3) are aware of the inter- 
dependence of some of their goals or interests, and 
(4) interact with one another in pursuit of their 
interdependent goals. In addition, some writers, 
particularly those with sociological backgrounds, 
indicate that (5) groups endure over a period of 
time and as a result develop (6) a set of social 
norms that regulate and guide member interaction 
and (7) a set of roles, each of which has specific 
activities, obligations, and rights associated with it. 
I shall use the term “group” to signify at least the 
first four of the distinguishing criteria listed above. 
This usage is consonant with the intuitive notion 
that a group is an entity that consists of interacting 
people who are aware of being psychologically 
bound together in terms of mutually linked inter- 
ests. A group is thus to be distinguished from an 
aggregate, class, category, or type, which consist 
of people who are classified together because of 
some common characteristic. Also, “group” implies 
a psychological or perceived bond, not merely an 
objective linkage, between the members’ interests 
or goals. Moreover, the psychological linkage has 
some cohesive feature to it—i.e., members of a 
group see that in some respects they sink or swim 
together. This latter statement is not meant to deny 
that divisive and disruptive tendencies may exist 
within a group; rather, it is meant to indicate that 
by definition a group does not exist if its cohesive 
bonds are not strong enough to contain its dis- 
unifying influences. 

How groups differ. There are endless ways in 
which groups differ. It is useful to have some 
simplifying outline that highlights the central char- 
acteristics of groups and that permits a prolifera- 
tion of detail as this becomes necessary. It is well 
to recognize that an outline abstracts variables 
from their contexts and their interrelationships, 
and thus presents them in somewhat distorted 
form. The outline that follows is guided by—but 
not limited to—the criteria of groups listed in the 
preceding section. 

(1) Group size: the number of members in a 
group. 

(2) Group composition: the individual charac- 
teristics of the members, including their distribu- 
tion and patterning. 

(3) Group structure: the patterning of member 
characteristics as perceived by the group members. 

(4) The existential criteria of groups: the cri- 
teria for recognizing a group’s existence, members, 
action, property, etc. 

(5) Group cohesiveness: the type and strength 
of the interests binding the members to the group. 

(6) Group task and environment: the task con- 
fronting the group, and the environment within 
which the group functions. 

(7) Interactional process: the modes and pat- 
terns of interaction between members and with the 
task environment. 

(8) Group culture: the norms, standards, role 
patterns, traditions, and customs operating within 
the group. 

(9) Group effectiveness: the task performance, 
the viability of the group, the membership satisfac- 
tion, and the change within individual members. 

It is useful to recognize that any causal arrow 
connecting an item with any other item in this 
outline is likely to be bidirectional rather than uni- 
directional. Consider group size and group compo- 
sition. It is evident that increasing the number of 
members in a group will affect the composition of 
the group—e.g., the more people there are in a 
randomly composed group, the more likely it is 
that the group will contain an individual whose 
intelligence is above or below any specified level. 
However, the causal arrow also points in the other 
direction; if a group is composed of a certain kind 
of members, its size is likely to be affected. Thus, 
groups of young children are likely to have fewer 
members than groups of older children. Of course, 
the causal path in the direction from group com- 
position to group size is longer and more circuitous 
than the path in the opposite direction: it weaves
from group composition to interactional process to 
group effectiveness (member satisfaction) to group
cohesiveness and arrives finally at group size (pre- 
sumably a size that permits interactions that are 
satisfying to the children because they are within 


their cognitive capabilities, and hence, the children 
are motivated to continue the group). 


Some findings from the study of groups 


I shall employ the outline presented above to 
organize the discussion of illustrative findings ob- 
tained from the study of groups. The studies of 
groups have been mainly of temporary ad hoc 
laboratory groups, created by the investigator, 
rather than of ongoing natural groups in their 
native habitats. Hence, there is no assurance that 
the research findings are generalizable. However, 
they do not seem inconsistent with everyday ob- 
servations. Necessarily, the presentation is over- 
simplified. It largely ignores the evident fact that 
the effects of any given variable upon another (e.g.,
of group size upon interaction patterns) are very 
much influenced by other factors in the situation 
(e.g., the nature of the group task). 

Group size. Hare (1962) and Thomas and 
Fink (1963) have reviewed the relevant research 
literature in some detail. The latter have presented 
a useful framework for viewing the major effects 
of variations in size. The schema presented here 
borrows from theirs. It is helpful to make a distinc- 
tion between the statistical properties of size and 
the psychological properties of size. The former 
are those properties of a group that come from 
taking a given-sized sample of individuals, accord- 
ing to a given procedure, from a population with 
certain characteristics. The group is considered as 
an aggregate, and the psychological properties that 
arise from the compresence and interaction of its 
members are disregarded. 

The statistical properties of size can be appre- 
ciated by observing how variations in size affect 
some common statistical measures: the sum, the 
mean, intragroup variability, the probability of the 
occurrence of any characteristic, the probability of 
concordance of characteristics, the variability of 
the statistical properties. Consider the resources 
(perceptual capacity, memory capacity, informa- 
tion, intellectual capacity, physical strength, skills, 
money, tools, and so forth) available as size in- 
creases. Clearly, the total resources, such as the 
total money in the group, will increase as a linear 
function of size. However, the usable resources will 
be determined by the task and environment. In 
certain tasks it does not help the group to have any 
duplication of a resource (e.g., to have more than 
one person who knows how to type; to have several 
mimeographing machines). Hence, the usable re- 
sources will often increase at a slower rate than 
the total resources and often will, beyond a certain 


point, not increase at all. If this is so, as size in- 
creases, the average usable resource per member 
will decrease. This reasoning explains why, for 
certain kinds of tasks, group performance as meas- 
ured by production per group member decreases as 
size increases, even as total production goes up: 
not all of the total resources are usable.
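
The reasoning above can be illustrated with a minimal sketch. It assumes a task that can put at most a fixed amount of a resource to use, so that duplicates beyond that ceiling add nothing. The function name `usable_resources` and the specific numbers are hypothetical and chosen only to show the pattern.

```python
def usable_resources(size, per_member=1.0, ceiling=4.0):
    """Usable resource for a group of `size` members.

    Assumption for illustration: every member brings `per_member`
    units, but the task can use no more than `ceiling` units in all.
    """
    return min(size * per_member, ceiling)


# Total resources grow linearly with size; usable resources level off,
# so the usable amount per member falls once the ceiling is reached.
for n in (2, 4, 8, 16):
    total = n * 1.0
    usable = usable_resources(n)
    print(n, total, usable, usable / n)
```

With these assumed values the usable amount per member stays constant up to four members and declines thereafter, even though the total keeps rising.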

As the size of a sample increases, the probability 
that any given characteristic will appear (that 
someone will have red hair, that someone will favor 
vegetarianism) increases. The probability that at 
least one individual of a group has some given 
characteristic clearly depends on the frequency of 
the characteristic in the population from which the 
group was formed, the size of the group, and the 
manner of group formation. If the members of the 
group manifest statistical independence regarding 
the characteristic, and if the probability, $P_i$, that
any individual has the characteristic is a constant,
then the probability that at least one member of
the group has the characteristic is $1 - (1 - P_i)^N$,
where N is group size. Thus, as N increases, the 
probability increases toward 1. Even for more real- 
istic assumptions about group formation, it is clear 
that, as group size increases, so does the probability 
that the group contains at least one individual 
having any preassigned characteristic. Thus, the 
larger the size of a group, the more likely it is to 
have any or all of the following: a very bright 
person, a very stupid person, a quiet person, a 
talkative person, a “right-winger,” a “left-winger.”
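
A short sketch of the calculation, under the same assumptions of independence and a constant individual probability (called `p` in the code), may help. The function name and the sample values of `p` and group size are illustrative choices, not figures from the research literature.

```python
def prob_at_least_one(p, n):
    """Probability that at least one of n independent members has a
    characteristic that each member has with constant probability p."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie between 0 and 1")
    if n < 0:
        raise ValueError("group size cannot be negative")
    return 1.0 - (1.0 - p) ** n


# A characteristic held by 5 per cent of the population (assumed value).
for n in (3, 10, 50):
    print(n, round(prob_at_least_one(0.05, n), 3))
```

The printed probabilities rise toward 1 as the group grows, which is the pattern described in the preceding paragraph.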

Similarly, the probability that every member of 
the group has the given characteristic generally 
decreases as group size increases, for most reason- 
able assumptions about group formation. Under 
the assumptions of independence and constancy, 
the probability that all group members have the 
characteristic is $P_i^N$, which decreases exponentially
as N increases. Hence, as the size of a group 
increases, it is less likely that all individuals will 
have the same opinion or speak the same language 
or be equally informed or be equally resourceful. 
Although heterogeneity is likely to be greater within 
larger groups, larger groups are less likely to vary
from one another in aggregate properties than are 
smaller groups. (A group of 20 persons is more 
likely to have the same average IQ as another 
group the same size than are two groups of three 
persons.)
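
Both points in the preceding paragraph can be sketched numerically. The first function applies the independence and constancy assumptions directly. The second uses the standard result that the standard deviation of a mean of n independent scores is the individual standard deviation divided by the square root of n; the value of 15 for the spread of IQ scores is a conventional scaling assumed here for illustration.

```python
import math


def prob_all(p, n):
    """Probability that all n independent members share a characteristic
    that each member has with constant probability p."""
    return p ** n


def sd_of_group_mean(sd_individual, n):
    """Standard deviation of the average score of a randomly drawn
    group of n members, assuming independent members."""
    return sd_individual / math.sqrt(n)


# Unanimity becomes rarer as the group grows (p = 0.8 is an assumed value).
for n in (2, 5, 20):
    print(n, prob_all(0.8, n))

# Group averages vary less from group to group when groups are larger.
print(sd_of_group_mean(15.0, 3), sd_of_group_mean(15.0, 20))
```

The smaller spread for groups of 20 than for groups of three is what makes two large groups more likely to resemble each other in average IQ.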

Size affects not only the statistical properties of 
the aggregated resources and other characteristics 
of the group but also the opportunity to satisfy in- 
dividual wants—for example, the larger the group, 
the more time, space, supplies, and facilities it will 




need to enable all individuals in the group to talk 
and be heard. Thus, if the total amount of time, 
reward, and space remain constant for a given 
task environment, the opportunity for individual 
participation and reward will decrease as the size 
of the group increases. On the other hand, the 
number of potential interpersonal relations in- 
creases quadratically as size increases. For exam- 
ple, the number of possible dyadic relations in any 
group increases with size (N) according to the 
formula: $(N^2 - N)/2$. Since there appears to be
a numerical limit to the capacity to establish close 
associations with others, a smaller proportion of the 
possible linkages will be formed as size increases. 
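
The count of possible dyads can be checked by enumeration, as in the sketch below. The member labels are hypothetical. The count grows with the square of group size, so it soon outruns any fixed capacity for close association.

```python
from itertools import combinations


def dyad_count(n):
    """Number of possible two-person relations in a group of n members."""
    return (n * n - n) // 2


# Enumerate the pairs in a four-person group and compare with the formula.
members = ["A", "B", "C", "D"]
pairs = list(combinations(members, 2))
print(len(pairs), dyad_count(len(members)))

# Growth of possible dyads with group size.
for n in (2, 5, 10, 20):
    print(n, dyad_count(n))
```

If each member can sustain only a fixed number of close ties, the share of possible dyads actually formed must fall as the group grows, as the paragraph above concludes.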

From our discussion so far, it should be clear 
that different-sized aggregates of noninteracting in- 
dividuals should differ predictably. The larger-sized 
aggregates should have more resources and more 
handicaps; more good solutions and more bad solu- 
tions; more diversity and difference; more demands 
for the available opportunities; more opportunity 
for diversified interpersonal contact but less for 
repeated contact. If the task environment is such 
that the presence of resources, good solutions, het- 
erogeneity, and so forth, are more important fac- 
tors than the presence of handicaps, bad solutions, 
and homogeneity in determining productivity, then 
larger aggregates should be more productive than 
smaller aggregates. But groups are not simply ag- 
gregates; they are composed of interacting indi- 
viduals. A group’s performance may differ from 
the performance of a comparable aggregate of 
individuals because the contributions the individ- 
uals make in the group will be affected by the group 
milieu and because the group will combine or as- 
semble the individual performances in a unique 
manner. 

Audience effects. Social influence on individ- 
ual thinking may reflect either the effect of work- 
ing before an audience (such as other group mem- 
bers) or the impact of the contributions being 
made by the other group members. Research re- 
sults, generally, indicate that there is increased 
motivation and increased distraction when a person 
works on intellectual problems before an audience 
rather than by himself (Kelley & Thibaut 1954). 
In addition, there tend to be fewer idiosyncratic 
thoughts, more moderation in judgments, more 
common associations, more cautiousness, and a 
general taking into account of the anticipated reac- 
tions of the audience. Over a period of time, adap- 
tation to being observed tends to occur (Deutsch 
1949), and hence the “audience effects” tend to 
decrease. Some research evidence (Atkinson 1964) 
suggests that once motivation to achieve passes
beyond a certain moderate level, further increase 
in motivation tends to result in less effective per- 
formance. 

Problem solving and productivity. The preced- 
ing discussion of “audience effects” suggests the 
possibility that as the size of a group increases, the 
intellectual functioning of its members will deteri- 
orate: they will be “overmotivated,” more distracted, 
and more conventional. But other members of a 
group do not merely serve as an audience. They 
also contribute new information and different per- 
spectives; they invoke more aspects of memory and 
demand greater attention; they provide more mate- 
rial for the individual to think with and about. 
That is, the contributions of other members may 
provide new associational starting points, may help 
the individual to break out of an ineffective set by 
suggesting new orientations, may fill in gaps or
reveal unnoticed errors in the individual's thinking. 
On the other hand, the contributions of other mem- 
bers may distract, interrupt a chain of thought, blot 
out an individual's own associations, or confuse 
him by providing too much material for him to 
assimilate at one time. The meager relevant re- 
search indicates that the contribution of others may 
be more distracting than useful when the task con- 
fronting the individual is one that requires sus- 
tained, directed attention to a complex pattern 
where the relations between sequentially ordered 
parts must be kept in mind (e.g., in “reasoning” 
problems). In such a task, individuals working 
alone will hit upon different approaches (or will 
symbolize the same approach differently), and 
once they have taken a few steps on their respec- 
tive approaches, understanding each other may be 
difficult without going back to the initial formula- 
tion of the approach or until an obvious solution 
has been reached. Hence, with such a task, the 
larger the group, the more likely it is that it will 
interfere with the individual's thinking. On the 
other hand, when the task is such that the indi- 
vidual is likely to have an initial set that would 
lead to a clichéd or superficial solution to the prob- 
lem and would overlook some of its major dimen- 
sions, the contributions of other members (starting 
from different sets, which may also be superficial) 
may force the individual to go deeper into the 
problem [see PROBLEM SOLVING]. 

A group solution will depend not only upon the 
abilities of its members to think within the milieu 
of the group but also upon the readiness and ability 
of the members to contribute to the group and upon 
the way in which their contributions are coordi- 


nated, assembled, or weighted to produce the re- 
sultant group solution. With regard to the readiness 
and ability of members to contribute, research in- 
dicates that the inequality of participation among 
the various members of a group increases as the 
size of the group increases (Stephan & Mishler 
1952; Bales et al. 1951). These results suggest that 
individuals who tend to be shy are unlikely to par- 
ticipate actively in larger groups, although they 
may contribute much in small groups. On the other 
hand, individuals who tend to be assertive are 
likely to have a disproportionately large influence 
in larger groups as compared with smaller groups. 

With regard to the coordination, assembling, or 
weighting of the contributions that individual mem- 
bers make, investigations have found that the diffi- 
culty of keeping track of the contributions of the 
various members, of coordinating and assembling 
them, will increase as the size of the group in- 
creases. For success in resolving this difficulty, 
larger groups have to devote more of their energy 
to activities directed toward coordination than do 
smaller groups. In addition, it is reasonable to 
hypothesize that such personality factors as self- 
confidence, assertiveness, and persuasiveness are 
more likely to play a significant role in determining 
the individual's impact upon the group solution in 
larger groups than in smaller groups. 

Member satisfaction, also, is affected by the size 
of the group. Laboratory and field studies both 
indicate that members of small groups are more 
likely to feel satisfied with their group, more likely 
to inhibit expression of disagreement, and less 
likely to develop cliques and factions. Large groups, 
on the other hand, are characterized by more ab- 
senteeism, more formality, and more internal con- 
flict than are smaller groups. 

Group composition. How are the individual 
members of a group characterized? The answers 
to this question presuppose that one knows what 
features of the members may influence the way 
they interact, interrelate, and function together. 
There is as yet, however, little systematic knowl- 
edge of how group composition—the distribution 
and patterning of member characteristics—affects 
group behavior. Nevertheless, it is reasonable to 
think that a group’s behavior will be affected by 
the distribution and patterning of such member 
characteristics as abilities, knowledge, resources, 
attitudes, interests, personality dispositions, age, 
sex, and social status. The combined characteristics
of the members may be considered in terms of 
their influence upon the group’s effectiveness in 
coping with the task confronting it. Or they may be 


related to the compatibility of the members with 
one another, the attraction of the group for various 
members, the likelihood of the formation of cliques, 
and so forth. 

Effectiveness. With regard to group effective- 
ness, considerable research supports the common- 
sense proposition that groups whose members have 
high abilities, training, or experience are more 
effective than groups whose members are lacking 
in these respects (McGrath & Altman 1966). How- 
ever, it is not simply the average level of abilities 
that is important, but rather whether the kinds of 
abilities necessary to carrying out the role require- 
ments set by the group's task exist among the group 
members and are appropriately distributed. Group 
composition must be evaluated in reference to the 
demands confronting the group rather than in a 
vacuum. Homogeneity of member characteristics 
is an asset when the various group members are 
called upon to fulfill the same task functions, but 
it is a liability when there are varied functions to 
perform. For example, it is reasonable to assume 
that a group composed of both “abstract thinkers” 
and “concrete thinkers” will be more effective in
performing a task requiring both intellectual analysis
and action than a group composed exclusively
of one or the other type. Hoffman, Harburg, and 
Maier (1962) report results indicating that groups 
composed of individuals with dissimilar person- 
alities are more productive than homogeneous 
groups. Schutz (1958) has theorized that compat- 
ibility is likely to be greater when people with 
different but complementary personality disposi- 
tions are paired together (e.g., people who wish to 
give affection and people who wish to receive it) 
than when people of similar dispositions are paired 
together (e.g., two people who wish to dominate). 
His research indicates that groups composed of 
members with compatible interpersonal tendencies 
are more productive than those composed of mem- 
bers with incompatible tendencies. 

Patterns of interaction. There is a vast body of 
research on the effect of the personal and social 
characteristics of members on the development of 
attitudes toward the group, interactional patterns 
within the group, etc. Much of this research is
considered in the sociometric literature. In general,
research supports the saying “Birds of a feather
flock together.” People prefer to associate and interact
with others who are similar rather than
dissimilar to themselves in attitudes, status, back- 
ground, interests, and so forth. The major excep- 
tions to this generalization occur when similarity 
enhances competition (e.g., when two suitors are
interested in the same girl); personal needs require
complementarity rather than similarity (e.g., in 
heterosexual relations); or task requirements ne- 
cessitate differentiated functions and statuses. 

Individual behavior. There has also been much 
study of the personality characteristics that affect 
the performance of members in small groups. 
Mann (1959), reviewing the literature from 1900 
through October 1957, focused on seven personality
factors: intelligence, adjustment, extroversion-introversion,
dominance, masculinity-femininity,
conservatism, and interpersonal sensitivity. He 
summarized the relations between each of the 
seven personality variables and each of the follow- 
ing measures of an individual's status and behavior 
in groups: leadership, popularity, total activity rate, 
task activity, social-emotional activity, and con- 
formity. His survey indicates that the best single 
predictor of an individual’s behavior in the group 
is his intelligence. Intelligence and also extrover- 
sion and adjustment are positively related to total 
activity rate, leadership, and popularity in the 
group. In addition, the more intelligent and better- 
adjusted members are likely to contribute a rela- 
tively larger share of their total activity to building 
up group solidarity and providing emotional sup- 
port for other members and a relatively smaller 
share to being critical or rejecting other members. 
Dominance is related positively to leadership and 
negatively to conformity; conservatism, on the 
other hand, is associated negatively with leader- 
ship and correlated positively with conformity. 
Masculinity and interpersonal sensitivity show pos- 
itive relationships to leadership and popularity. 
Mann (1959, p. 266) concludes his review with 
the caution that the magnitude of the median of 
the correlations between an aspect of personality 
and performance is in no case higher than .25, 
and most of the medians of the correlations are 
nearer .15. 

Group structure. Popular conceptions of group 
and organizational structure have been very much 
influenced by organizational charts, developed in 
the military and other large bureaucracies, that 
stress lines of formal authority. This is too limited 
a view. It is more fruitful to think of structure in 
terms of the way members actually relate to one 
another. In a sense the term “group structure” is 
a misnomer; there may be many different “struc- 
tures” within a group—the work structure, the com- 
munication structure, the friendship structure, the 
power structure, the prestige structure, and so forth. 

There is often a correspondence among the positions
an individual holds in the different structures,
so that an individual who holds a central position
in one structure (e.g., the communication structure)
is likely to hold a central position in other
structures (power, friendship, and prestige). The
research of Galtung (1964) in Norway indicates 
that this is the case for Norwegian society: people 
who are more central on social variables (income, 
education, occupation, residence, age, and sex) are 
also more central in the communication and power 
structures. In the status-equilibration hypothesis, 
Benoit-Smullyan (1944) and, later, Homans (1961) 
have stressed the forces that operate to make for 
similarity in the positions of an individual in dif- 
ferent structures. However, status equilibrium is 
not always achieved. Research by Adams (1953) 
with air crews demonstrated that lack of congru- 
ence on such status dimensions as age, military 
rank, education, reputed ability, popularity, com- 
bat time, and position importance was related to 
poor morale, less friendliness, and lack of mutual 
confidence. Exline and Ziller (1959), working with 
experimentally created groups, found that groups 
constructed so as to have incongruent status hier- 
archies manifested more interpersonal conflict and 
less productivity than congruent groups. 

In addition to research on status congruency, 
research on group structures has investigated such 
topics as (1) the effects of different communica- 
tion structures (Guetzkow 1953); (2) the effects 
of different leadership structures, for example, 
leaderless groups versus groups with leaders, and 
of leadership style (Fiedler 1964); (3) the effects 
of different residential and propinquity structures 
(Festinger et al. 1950); (4) the effects of similar- 
ity or dissimilarity on various social and personal 
characteristics, such as age, sex, religious belief, 
attitudes; (5) the Kafkaesque effects of complex 
organizational structures, which are dimly perceived
and little understood by members; (6) the
determinants of sociometric structure, leadership 
structure, communication structure, etc.; (7) meth- 
ods of classifying and identifying roles within 
groups (Bales 1950); and (8) mathematical pro- 
cedures for characterizing different types of struc- 
tures and different positions within a structure 
(Coleman 1964).

The existential criteria of groups. Although a 
vast body of legal principle and practice has been 
concerned with the conditions under which a “legal 
personality,” such as a corporation, can come into 
existence, and with the identifying of those who 
can act in its name, there has been little social- 
psychological research that bears upon such re- 
lated problems as how a group is identified and 
what determines whether an action of a member
is attributed to the group or to him personally.
However, drawing upon studies of perceptual or- 
ganization, it is possible to indicate some general 
principles concerning the conditions that are con- 
ducive to the perception that a collection of indi- 
viduals or units are part of a system rather than an 
unorganized aggregate of elements. 

As Koffka (1935) pointed out, abrupt discon- 
tinuity produces segregating forces between the 
parts of a visual field that it separates, as well as 
unifying forces within the separated parts. Further, 
he indicated that homogeneity tends to produce 
unifying forces in the visual field. Homogeneity 
may be based upon (1) the common fate of the 
elements perceived (they move together); (2) their 
qualitative or quantitative similarity (they have the 
same color or the same luminosity); (3) proximity 
(they occur in spatial or temporal contiguity); (4) 
a common boundary; (5) past experience or cus- 
tom that has led to similar responses to the various 
elements; and (6) set or the expectation that the 
elements are to be grouped together [see GESTALT 
THEORY].

It seems evident that processes analogous to 
these determine whether an individual will per- 
ceive a collection of individuals as a social group 
and whether he will perceive himself to be part of 
the group. Thus, if an individual perceives that he 
and some others are strikingly different in certain 
respects from the remainder of the people in their 
surroundings; that he and the others tend to be 
satisfied or dissatisfied at the same time or under 
similar circumstances; that he and the others have 
similar attitudes or similar backgrounds; that he 
and the others live or work in close proximity; that 
he and the others are associated together in other 
people’s minds or treated similarly by other people 
—if he perceives any of these patterns, the indi- 
vidual is likely to perceive himself and the others 
as cooperatively interdependent. I would stress, as 
does Campbell (1958), the central role of the per- 
ception of common fate in determining the con- 
sciousness of being joined with others to form a 
group. 

Group cohesiveness. In everyday usage “cohe- 
siveness” refers to the tendency to stick together; 
its usage in social psychology is much the same. 
It refers to the linkages that bind the members of a 
group together. Deutsch (1949; 1962) has stressed 
that the linkages among members are cohesive, 
rather than disruptive, when the goals and interests 
of the members are cooperatively, rather than com- 
petitively, interrelated. Various aspects of these link- 
ages have been the focus of research: the nature of 
the mutually linked goals or interests, such as
friendship, work, money; the strength of the mutually
linked goals or interests; the degree of linkage,
and the availability of other means of obtaining 
one’s goals; the forces operating to restrain mem- 
bers from leaving the group; other interests or 
memberships that are in opposition to continued 
membership in the group. 

Since group cohesiveness is central to the exist- 
ence of groups, it is natural that its determinants 
and also its consequences have been studied extensively
(Hare 1962; Collins & Guetzkow 1964;
McGrath & Altman 1966). Research findings, over-all,
indicate that cohesiveness (as measured by inter- 
personal congeniality, the desire to remain a mem- 
ber of the group, attitudes toward the group's func- 
tioning, or other similar measures) is consistently 
associated with greater communication between 
group members, greater readiness of group mem- 
bers to be influenced by the group, more consensus 
among members on attitudes and beliefs that relate 
to group functioning, more sense of responsibility 
toward each other among group members, a greater 
feeling of personal ease and security within the 
group by the group members, and so forth. Also, 
task effectiveness is generally positively correlated 
with cohesiveness if high accomplishment on the 
task is valued by the group (some groups restrict 
performance to achieve their objectives) and if the 
task is such that its performance is likely to be 
enhanced by increased group effort. It should be 
noted that the causal arrow is bidirectional: group 
cohesiveness not only increases intragroup com- 
munication and group success, but group success 
and intragroup communication increase group co- 
hesiveness [see COHESION, SOCIAL]. 

Group task and environment. It is self-evident 
that the task confronting the group and the en- 
vironment within which the group functions can 
influence all the other characteristics of a group. 
Unfortunately, however, the research relating to 
task and environmental characteristics has been 
meager, largely because there has not yet been 
developed any systematic way of characterizing 
tasks or environments. Nevertheless, certain useful 
distinctions have been made. 

It is possible to characterize many tasks and en- 
vironments in terms of the type of requirements 
for success that they impose upon the group. It is 
apparent that tasks differ in the types and amounts 
of skills, knowledge, effort, and resources required 
and in the way these factors have to be inter- 
related. In other words, the roles within the group, 
the structure of the group, the size of the group, 
etc., may vary as a function of the group's task 
and environment.

Fiedler (1964), for example, has shown that the
effectiveness of different types of leaders is very 
closely related to the structure of the task confront- 
ing the group. In both field and laboratory studies, 
his findings indicate that controlling, authoritarian 
leaders tend to be most effective either in very 
favorable or else in relatively unfavorable group- 
task situations, while the permissive, considerate, 
democratic leaders are most effective in situations 
that are intermediate in favorableness. Fiedler in- 
dicates that a situation is very favorable when 
(1) the leader-member relations are positive, (2) 
the task is clear and well structured, and (3) the 
leader has well-defined authority and power to re- 
ward and punish. Thus, for example, if a leader 
has good personal relations with his group and the 
task is routine and his authority well-defined, he is 
likely to be more effective if he is “directive,” rather 
than democratic and permissive. On the other hand, 
if the task is novel and unstructured but the situ- 
ation is otherwise similar, democratic, permissive 
leadership is likely to be more effective [see LEAD- 
ERSHIP]. 

Task structure helps to determine the types and 
amounts of interaction and communication within 
a group and also the sequencing and organization 
of the activities within the group. These, in turn, 
will often affect the social relations that develop 
within a group. Much research has supported the 
proposition (Homans 1950) that people who inter- 
act frequently with one another tend to like one 
another, and vice versa. Thus, tasks that require 
certain group members to work closely with one 
another and limit their interaction with other mem- 
bers may, if the task is long-enduring, help to cre- 
ate patterns of friendship that parallel the inter- 
actional requirements of the task. Further, the 
research indicates that if a task places a given in- 
dividual in a central position because he is able to 
communicate readily with other members or be- 
cause he possesses a scarce resource (a skill, spe- 
cialized information, or a particular tool) that is of 
critical value to group success, then he is likely to 
have high status within the group (and high satis- 
faction with the group). 

Tasks differ in the degree to which they permit 
division of labor and specialization of function. 
The problem of dividing tasks into subtasks, of 
sequencing them, and of assigning personnel and 
resources to them has been of major interest to 
economists and operations-research analysts, and 
they have had considerable success in developing 
rational methods of predicting the effectiveness of 
different methods of dividing up a task. Here let 
us note some psychological aspects of the division
of labor. Among its possible negative consequences
are the loss of one’s identification with the over-all 
group objectives; the loss of a sense of an over-all 
significance and meaning to one’s activities; the 
development of vested interests in one’s specialized 
activities; the development of specialized languages, 
values, and modes of thought that interfere with 
coordination and communication between the vari- 
ous specialized activities. Among the possible posi- 
tive consequences of specialization is, in addition 
to increased group productivity and individual eco- 
nomic reward, the greater chance that an individ- 
ual will be able to find some activity that matches 
his particular interests and abilities. 

Tasks and environments differ not only in their 
activity and interaction requirements but also in 
their stressfulness. The term “stress” has been used 
to refer to a hypothetical state of tension, frustra- 
tion, or internal conflict induced by such conditions 
as task difficulty (e.g., a problem without any solu- 
tion), lack of information about how well the task 
is being performed, threat of punishment for task 
failure, danger (e.g., as in combat or survival in 
the Arctic), intense criticism, time pressure, and an
unpredictable environment. The results of studies 
of the effects of such varied conditions are not 
univocal. The safest generalization seems to be that 
mild stress often improves group performance and 
enhances group cohesiveness, while severe stress 
often has the opposite effects. Optimal stress for a 
group is presumably higher the more able and 
cohesive the group is and the more the members 
see themselves as able and motivated to cope with 
problems (Deutsch 1959). 

Finally, it is relevant to note that environments 
differ in the probability of reward and the amount 
of reward they provide for effective group action 
and also in the manner in which rewards are dis- 
tributed within a group. Little research has been 
done on the effects of different “schedules of reinforcement”
upon group behavior, yet there is reason
to assume that they would influence group
performance (Shapiro 1963). On the other hand, 
a considerable number of research studies (e.g.,
Deutsch 1949; Raven & Eachus 1963) have 
demonstrated that whether rewards are distributed 
cooperatively or competitively within a group may 
have a striking effect on member behavior. In gen- 
eral, group members who are rewarded coopera- 
tively show more positive response to one another, 
have greater involvement in the group, are less 
likely to work at cross-purposes, communicate with 
one another more effectively, and work more pro- 
ductively together than group members who are 
rewarded competitively. 


Interactional process. The observable trans- 
actions between members and their observable 
transactions with their task environments are 
lumped together under the term “interactional proc- 
ess.” It is, in effect, what goes on in groups. There 
are many different ways of characterizing what 
goes on. Most of them focus on one or more of the 
following aspects of a transaction: who communi- 
cates or does what to or with whom; with what 
intent or function; how, when, or under what condi- 
tions; through what media or channels; with what 
effects upon whom, as perceived by whom. Each 
of the italicized terms could be elaborated in considerable
detail. For example, if one specified the
characteristics of the potential communicators and
communicatees (the who and the whom), including their
statuses, their personality tendencies, and their
pre-existing attitudes toward one another, one could
predict, to some degree, who will talk to whom and
how they will talk. (The demeanor of a subordinate 
making a critical remark to a superior will be rather 
different from that of a superior criticizing a sub- 
ordinate.) Similarly, knowledge of the conditions 
—e.g., what stage in problem solving the group is 
at—enables one to predict what kinds of content 
(the what) a transaction is likely to have. 

The most widely used system for categorizing 
interactions is the one developed by Bales (1950). 
It focuses on the who-to-whom-and-what interaction.
His system consists of 12 distinct categories
of the content of communication: content that (1) 
shows solidarity, (2) shows tension release, (3) 
agrees, (4) gives suggestion, (5) gives opinion, 
(6) gives orientation, (7) asks for orientation, (8) 
asks for opinion, (9) asks for suggestion, (10) 
disagrees, (11) shows tension, and (12) shows 
antagonism. The categories are grouped in various
ways. One major grouping is into task categories 
(subdivided into questions and attempted answers) 
and social-emotional categories (classified as posi- 
tive and negative). More recently Stone and his 
co-workers (1962) have devised a more generalized 
system of content analysis called the General In- 
quirer, which employs a computer to code the 
actual verbal text of group interaction into 164 
categories [see INTERACTION, article on INTER- 
ACTION PROCESS ANALYSIS]. 
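
The category scheme just described can be made concrete with a small sketch. It is an illustration only, not Bales's own procedure: the text names the two major groupings but not which numbered categories fall under each, so the assignment used here (1-3 positive social-emotional, 4-6 attempted answers, 7-9 questions, 10-12 negative social-emotional) is an assumption following the customary presentation of the scheme. The function names and the sample observation records are hypothetical.

```python
from collections import Counter
from typing import Iterable, Tuple

# The 12 content categories, numbered as in the text.
BALES_CATEGORIES = {
    1: "shows solidarity",
    2: "shows tension release",
    3: "agrees",
    4: "gives suggestion",
    5: "gives opinion",
    6: "gives orientation",
    7: "asks for orientation",
    8: "asks for opinion",
    9: "asks for suggestion",
    10: "disagrees",
    11: "shows tension",
    12: "shows antagonism",
}


def grouping(category: int) -> Tuple[str, str]:
    """Return (major area, subdivision) for a category number.

    Assumed ranges: 1-3, 4-6, 7-9, 10-12 (see the lead-in).
    """
    if category not in BALES_CATEGORIES:
        raise ValueError(f"unknown category: {category}")
    if category <= 3:
        return ("social-emotional", "positive")
    if category <= 6:
        return ("task", "attempted answers")
    if category <= 9:
        return ("task", "questions")
    return ("social-emotional", "negative")


def profile(acts: Iterable[Tuple[str, str, int]]) -> Counter:
    """Tally acts per initiator and grouping.

    Each act is a hypothetical (who, to_whom, category) record.
    """
    counts: Counter = Counter()
    for who, _to_whom, category in acts:
        counts[(who, grouping(category))] += 1
    return counts
```

Given a list of coded acts, `profile` yields for each member a count of acts in each grouping, which is the kind of tabulation on which comparisons of members' participation rest.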

Slater (1955) has shown that members who are 
ranked high as “idea men” by the other members 
in problem-solving groups initiate interaction more 
markedly in “attempted answers,” while members 
who are ranked high in likability participate more 
heavily in the categories grouped as “positive 
social-emotional.” His research has also indicated 
that often the task leader, or idea man, and the
social-emotional leader are not the same person;
this specialization of interaction function is more 
evident if the group exists over a period of time. 
In other words, as Barnard (1938) had noted 
earlier, the two major problems confronting groups 
—adaptation to their task environment and pro- 
vision of personal satisfaction to the individual 
members—do not necessarily lead to the same 
emphases within a group. 

Why do people interact? Few theorists have gone 
beyond the common-sense viewpoint that they do 
so because it is instrumental to a given end or be- 
cause it is gratifying in itself. Festinger and his 
associates (1950), however, have suggested that 
one of the major instrumental functions of inter- 
action is helping to establish “social reality”: the 
validation of opinions, beliefs, abilities, and emo- 
tions in terms of a social consensus. That is, one of 
the functions of communication within a group is 
to establish uniform views about reality, so as to 
provide the members with some confidence in their 
beliefs and to enable them to coordinate their be- 
havior for effective group action. Thus, group mem- 
bers whose views deviate from those held by the 
rest of the group will be subject, through communi- 
cation, to pressures to change their views to con- 
form to those of the rest, or they will be rejected 
or isolated by the group, so as to eliminate a source 
of disturbance to the group. Festinger hypothesized 
that these pressures are greater the more cohesive 
the group is, the more relevant the belief is to the 
group, the more discrepant the deviant’s viewpoint 
is, etc. A considerable body of research is consistent 
with these hypotheses. Festinger has also suggested 
that communication may function as a substitute 
for social locomotion—people who would like to be 
in powerful positions direct their communication to- 
ward those who hold such positions. The study by 
Kelley (1951) of communication in experimentally 
created hierarchies is consistent with this hypothesis.
However, Cohen (1958) suggests that upward
communication may be more directly motivated
by the desire to receive the benefits that a
higher-status person can bestow upon someone of 
lower status. 

Group culture. The members of any group who 
have had a prolonged experience of interacting 
with one another tend to develop shared values, 
expectations, and rules—a normative consensus 
that helps to regulate interaction between members 
and between the group and its task environment 
and that also serves to define the roles of the various
members, including their specialized activities,
rights, and responsibilities. A normative consensus, 
or group norm, sets criteria for evaluating the
desirability-undesirability, acceptability-unacceptability,
of the group members’ activities, beliefs, appearance,
etc., and for responding with various
sanctions, positive or negative, such as reward- 
punishment, approval-disapproval, to a member's 
conformity with or violation of the norm [see 
NORMS].

Norms develop about many things, from the 
type of pronoun to be used in addressing intimates 
or strangers to the type of wine one should serve 
on certain occasions. Yet not everything is regu- 
lated. Norms tend to develop mostly in areas that 
are relevant to the group’s functioning, and it 
seems likely that the more important an area is to 
the group, the more norms there will be, the more 
intense will be the sanctions employed to obtain 
conformity to them, and the smaller will be the 
range of acceptable behavior. Thus, as Sutherland 
has shown, the norm of punctuality in appearing 
at a prearranged time and place is strictly enforced 
among professional thieves because its violation 
may endanger an enterprise and lead to arrest (Con- 
well 1937). Norms are more often developed with 
regard to overt behavior than private beliefs, not 
only because the former are usually more impor- 
tant to group functioning but also because beliefs 
are less controllable, being less observable than be- 
havior. Further, it is apparent that the norms of 
different types of groups will differ—for example, 
the norms of a friendship group and those of a 
work group. It is not uncommon for a person to 
experience conflict because he belongs to groups 
that have conflicting norms [see CONFORMITY]. 

There is an extensive research literature on the 
determinants of conformity to and deviation from group
norms (for a summary, see Symposium on Con- 
formity . . . 1961). It is not a great oversimplifica- 
tion to sum up the findings as indicating that con- 
formity appears to be a function of such factors 
as the person’s awareness of the norm, the strength 
of the norm, the strength of the person’s attraction 
to the group, the likelihood that conformity or de- 
viation will be observable by others, the strength 
of the sanctions expected for conforming or deviating,
and personality predisposition (such as dependency,
acceptance of authority, self-confidence).
Whether a person will conform will depend not 
only on these factors but also on the strength of 
the tendency to deviate, which is determined by 
parallel considerations (e.g., is the tendency to 
deviate from the norms of one group a tendency 
to conform to the norms of another group to which 
the person belongs?).

Groups not only develop norms, which specify 
the “shoulds” and “should nots”; often they also
develop styles, traditions, or customs (and these
often become the object of norms), which are the 
habitual ways of dealing with recurring situations. 
Thus, a group may develop a unique language (al- 
most every profession and trade develops its own 
“slang,” its own peculiar abbreviations); distinc- 
tive garb, insignia, or appearance, to permit ready 
identification; a distinctive locale for meeting and 
engaging in its activities; a special style of inducing 
emotional responsiveness and of expressing emo- 
tion (e.g., distinctive dances, ceremonies), etc. 
Case-study material suggests that groups are most 
likely to develop idiosyncratic traditions if they are 
relatively isolated, as a result of geographical or 
social factors (due to superior or inferior status); 
if they are in conflict with other groups; or if their 
task environment is unique. 

Anthropologists have, of course, studied and de- 
scribed in considerable detail the customs and 
traditions of many simple societies. A similar kind 
of analysis could be made of the development of 
customs and traditions in experimentally created 
or naturally formed groups, to investigate some of 
the determinants of particular kinds of traditions 
and customs. Unfortunately, little research has 
been conducted, apart from the pioneering studies 
of Sherif (1936), Merei (1949), and Rose and 
Felton (1955). 

Group effectiveness. A group’s effectiveness may 
be characterized in such terms as (1) task per- 
formance—the quality and quantity of the group’s 
outputs, as measured in terms of external criteria;
(2) group viability—the group's ability to main- 
tain itself as a functioning group under varying 
conditions; (3) member satisfaction—the desire 
of the members to maintain their membership and 
to contribute to the group's viability and the at- 
tainment of group goals; and (4) member change 
—the change in knowledge, skills, attitudes, adjustment,
or personality of the individual members
of the group. Although there may be relations between
these different types of group effectiveness
over the long run, in the short run it is
evident that these different types of outcome may 
vary independently of one another. For example, 
high member satisfaction may result from or re- 
sult in high task accomplishment, but high task 
accomplishment may also occur at the cost of mem- 
ber satisfaction (as when a demanding leader 
drives the group members on despite their protests). 

The bulk of research on group effectiveness has 
been concerned with the determinants of task per- 
formance. This research highlights the importance 
of the following types of determinants: (1) the 
strength of the values associated with effective
task performance—e.g., the greater the potential 
rewards for good performance and the more task- 
oriented the group norms, the more effort the 
group will be willing to put into the task; (2) the 
cohesiveness of the group—e.g., the more the mem- 
bers value the group and one another, the more 
willing they will be to expend effort in compliance 
with group norms or to achieve group-defined goals; 
(3) the perceived difficulty of the task—e.g., a 
task that is perceived to be very easy or virtually 
impossible is likely to stimulate less effort than a 
task that is viewed as difficult but attainable; (4) 
the amount of task-relevant abilities, information,
and experience of the group members; (5) the 
appropriateness of the group structure to the re- 
quirements of the task—e.g., how efficient is the 
particular kind of division of labor for the task? 
how do the abilities, knowledge, and interests of 
the role occupants fit the requirements of their 
roles?; (6) the central role of the group leader 
and the appropriateness of his leadership style to 
the task and to the group. 

Although there have been many studies of task 
performance, research in this area has been 
plagued by the problem of establishing reliable and 
valid measures of group achievement. There is 
little evidence to indicate that one group tends to 
perform reliably or consistently better than another 
group on a given task. Nor can one predict with 
much confidence, from a group’s performance on 
one task, how it will do on another, similar task.
Research investigators in this area have not yet 
begun to develop any measures of group achieve- 
ment that have the usefulness of many of the 
measures of individual achievement. 

The determinants of member satisfaction have 
been studied as extensively as those of task effec- 
tiveness (for summaries, see Collins & Guetzkow 
1964; McGrath & Altman 1966). In brief, the 
relevant research indicates that a member's satis- 
faction is affected by (1) the status of the group 
—its successfulness, its task achievements, its 
prestige; (2) the interpersonal relations within the 
group—the attractiveness of the other group mem- 
bers, their attitude toward him, their attitude toward 
belonging to the group; (3) the member’s role 
within the group—its prestige, communication cen- 
trality, power, significance, interest; (4) the direct 
rewards and benefits received from membership; 
(5) the group atmosphere, as determined by such 
factors as leadership style, group size, group com- 
position; and (6) the nature and desirability of 
conflicting memberships or activities. 

Little research has been done on the determinants 
of environmental input and of group viability.
There is, however, an extensive literature on member
change. Much of the relevant research has
been done under the rubric of conformity and de- 
viation (for summaries, see Symposium on Con- 
formity . . . 1961). Also, the growing literature on 
group psychotherapy (for representative papers, 
see Rosenbaum & Berger 1963) contains many in- 
sightful case discussions, even though the amount 
of systematic research is still quite small [see 
MENTAL DISORDERS, TREATMENT OF, article on 
GROUP PSYCHOTHERAPY]. 

In addition, there is a rapidly developing list of 
publications dealing with human-relations training 
groups that are concerned with helping people 
learn how to function more effectively in groups. 
Here, too, the amount of published research is 
negligible. Nevertheless, since the stimulus to this 
approach to human-relations training came from 
the theoretical writings on re-education and 
action research of Kurt Lewin (1935-1946), much 
of the literature on training groups is imbued with 
social science concepts and suggests research. The 
major ideas of Kurt Lewin that underlie the 
training-group approach are that the re-education 
process basically involves the equivalent of a 
change in culture, and that for the individual to 
accept a new system of values and beliefs, he must 
come to value his membership in a group that has 
these new values and beliefs as a central compo- 
nent of its culture. 

Morton DEUTSCH 


[Other relevant material may be found in SOCIAL 
PSYCHOLOGY and in the biography of LEWIN] 

III 
GROUP FORMATION 


Groups are composed of individuals. But when 
and how does a collection of individuals become a 
group? The phrase “group formation” suggests that 
something is formed. The task here is to trace the 
appearance of essential properties or characteristics 
that distinguish a human group. 

Both for theory and research, it is instructive to 
adopt the strategy of tracing the formation of in- 
formal groups rather than of those instituted 
formally through blueprints handed down by out- 
side authority (such as a committee or board). 
Despite the limitation of considering groups formed 
through the interaction of the membership, the 
implications are broad. Many social institutions and 
formal organizations (including governments) had 
informal beginnings. Informal groups are almost 
invariably found within stable formal structures 
such as industrial, commercial, political, educa- 
tional, religious, military, and recreational organ- 
izations. Finally, informal groups possess the mini- 
mal characteristics essential to any organized 
association, whether large or small. 

Background. The properties characterizing the 
formation of a group were treated by the nineteenth- 
century social philosophers for various reasons and 
with varying emphasis, each using illustrative ex- 
amples known to him. But the topic was doomed 
to controversy until data were collected through 
scientific methods. 

In the 1920s, Robert E. Park at the University 
of Chicago inspired and directed a series of investi- 
gations into human groups and their relationships 
to their settings (for example, Thrasher 1927; 
Landesco 1929; Shaw 1929; Zorbaugh 1929). Initi- 
ated to deal with the problem of homeless and anti- 
social children, the work of the Soviet educator 
Makarenko included concrete data on the formation 
of group properties, which he gradually came to re- 
gard as crucial conditions for the effectiveness of 
his educational efforts (Makarenko 1933; Bowen 
1962). 

In the 1930s, under the impetus of Elton Mayo 
of the Harvard Business School, studies in Western 
Electric’s Hawthorne plant revealed the emergence 
of groups and their impact on the behavior of 
workers who had initially been placed together in 
observation rooms for the purpose of studying the 
effects of varying illumination and rest periods 
(Roethlisberger & Dickson 1939). J. L. Moreno 
(1934) and his co-workers began the systematic 
sociometric mapping of friendship choices among 
girls in reformatory cottages and children in class- 
rooms. Using the methods of laboratory psychology, 
Sherif (1936) tested sociological theories dealing 
with the formation of social norms in situations of 
ambiguity and instability, demonstrating the sub- 
sequent retention of the norms as personal stand- 
ards when individuals were alone. Lewin and his 
students (Lewin et al. 1939) initiated experiments 
varying the manner of adult supervisors of chil- 
dren’s clubs. Meanwhile, the failure of “person- 
ality” or “intelligence” tests in selecting military 
leaders led the military in several countries to 
sponsor studies of what came to be called “leader- 
less groups” (Gibb 1954; U.S. Office of Strategic 
Services 1948). 


Properties 


The definition presented here includes only the 
minimal properties found essential through ex- 
tensive surveys of empirical and theoretical litera- 
ture on groups of all kinds (Sherif & Cantril 1947; 
Sherif & Sherif 1948). 

Definition. A group is a social unit consisting 
of a number of individuals who stand in status and 
role relationships to one another that are stabilized 
in some degree at a given time and who possess a 
set of values or norms regulating their behavior, 
at least in matters of consequence to the group. 

By this definition, the “groupness” of a group is 
a matter of degree. A collection of persons forms 
into a group proportional to (a) the degree of sta- 
bility of its organization (consisting of roles and 
status relations) and (b) the degree to which its 
particular set of norms for behavior are shared and 
binding for the participants. The undefined terms 
in the definition (role, status, norms) will be speci- 
fied through research operations with conceptual 
relationship to the process of group formation. 

It should be noted that this definition includes 
the properties covered in many modern works, 
while excluding others. Similar specifications are 
found in Bales (1950), Blau and Scott (1962), 
Bonner (1959), Cartwright and Zander (1953), 
Golembiewski (1962), and Hare (1962). The fol- 
lowing characteristics, included by some investi- 
gators, were omitted here for the reasons specified: 

Interaction and communication are not distinc- 
tive to group formation but are essential to any kind 
of human association of consequence. Shared senti- 
ments, attitudes, and behavior patterns of group 
members are implied in the normative property; 
in fact, the extent of sharing is one of the research 
measures for the degree of group formation at a 
given time. Many properties of existing groups (for 
example, morale, solidarity or cohesiveness, and 
loyalty of members) are dependent upon the condi- 
tions of group formation, especially the degree of 
stability attained. 

The properties essential to group formation will 
aid the reader in evaluating the large body of ex- 
perimental research on so-called “small groups” 
conducted in both the United States and Europe 
since World War II. In summarizing this literature, 
Golembiewski (1962, p. 47) found that the great 
majority of these “groups” consisted of strangers 
exposed briefly in the laboratory to tasks or instruc- 
tions creating temporary interdependence. Very few 
studies have allowed a sufficient time span for 
group properties to form. 

Generality of group formation. Beneath the or- 
ganized forms and routines of societies, the forma- 
tion and disintegration of groups occur in all walks 
of life, frequently with important consequences. 
Informal group formation is well documented with- 
in industrial, military, school, prison, and neigh- 
borhood settings (see Hare 1962; Sherif & Sherif 
1948). In studying the Near North Side of Chicago, 
Zorbaugh (1929, p. 192) reported group forma- 
tions in neighborhoods of all socioeconomic ranks 
with “an enormously important role in the lives of 
their members”: exclusive clubs on the fashionable 
Gold Coast, intimate groups of nonconformists in 
artists’ studios, mutual benefit societies in foreign 
colonies, gangs in slum areas, and cults and sects 
in the rooming house district. 

The extensive documentation on the generality 
of group formation also shows the striking depend- 
ence of the process on other groups and on the 
material and ideational features and facilities of 
the environment. The process of group formation 
is not insulated by the bounds of the membership. 
The formation that results is not a closed system. 
The circumstances bringing individuals together 
initially, their motives in continuing to interact, the 
particular organization and norms that develop, 
and the degree of their stability are inevitably de- 
pendent upon environmental circumstances. In- 
cluded in the environmental circumstances are 
other groups whose activities and aims impinge 
favorably or unfavorably upon those of the group 
in formation. 


Four essentials of group formation 


The essentials of group formation can be traced, 
starting with the initial conditions for interaction 
among individuals. The encounter with another 
person is the most elementary social situation. Even 
the mere presence of other persons has consequen- 
tial effects on behavior and task performance. 
From the time when individuals are merely to- 
gether to the time when the properties of a group 
begin to appear, we see that the consequential 
effects on behavior begin to assume regularities. 
As time goes on, these regularities reflect patterns 
that are the organizational and normative proper- 
ties of the group. Accordingly, the essentials in the 
process of group formation are the following: (1) 
a motivational base conducive to repeated interac- 
tion; (2) formation of an organization (structure) 
consisting of roles and statuses; (3) formation of 
rules, traditions, values, or norms; and (4) differ- 
ential effects of the group properties on the attitude 
and behavior of participants over time. 

Motivational base. Any human motive, frustra- 
tion, problem, or desired goal that an individual 
cannot handle effectively alone is conducive to his 
interaction with others who are seen as being in the 
same plight. The prerequisite for group formation 
is that persons with motives conducive to initial 
interaction have the opportunity over a span of 
time to recognize the concerns they share or recip- 
rocate and to attempt to deal with them in concert. 

The common motive or motives faced in the 
initial stages of interaction may be one or several 
of those found in any society—for example, hun- 
ger, sexual desire, desire for recognition or power 
in some respect, and fear or anxiety in the face of 
threat. They may be culturally defined—for exam- 
ple, desire for material possessions or prestige 
through particular activities, or pursuit of political 
goals. Here only a few points can be considered: 

The common problem, motive, or goal conducive 
to repeated interactions is necessarily dependent 
on environmental circumstances, both in its oc- 
currence and in attempts to deal with it. Whatever 
its nature, the motivational base for group forma- 
tion invariably affects the activities and tasks en- 
gaged in by the group's members and the kinds of 
personal qualities that become prized by them. 
When a set of norms takes shape, those most bind- 
ing are typically related to the motives or problems 
that initially brought the persons together. One 
reason why the many controversies over problems 
of conformity-nonconformity are inconclusive is 
that many theorists pay scant attention to the re- 
lationship between the particular norms of a group 
and the initial motivational base underlying them. 

However, to the degree that group formation 
achieves a degree of stability over time, new 
sources of motivation and new goals, arising from 
the existence of the group itself, are generated 
among the members. These may even take prece- 
dence over those that originally brought the mem- 
bers together. Thus, the hungry person may refrain 
from eating until he can share with his starving 
fellows; the politician may spurn an advantageous 
political bargain out of loyalty to his supporters; 
the member of a group struggling for equal op- 
portunity and freedom from fear may undergo 
great deprivation and bodily injury to secure recog- 
nition of his group. 

Formation of organization or structure. As in- 
dividuals interact over a period of time in activities 
related to the common problems that brought them 
together, their behavior and their expectations for 
one another's behavior assume regularities from 
which a pattern can be constructed. Here we shall 
define certain features of these regularities that 
appear to be crucial in any group formation. Heavy 
reliance will be placed upon findings from three 
experiments on group formation and relations be- 
tween groups, each experiment lasting several 
weeks (summarized in Sherif & Sherif [1948] 1956, 
chapters 6 and 9; and Sherif 1966), and from more 
recent studies of naturally formed groups (Sherif & 
Sherif 1964). In each case the experiments started 
with unacquainted persons divided into collections 
of 10-12 as similar as possible in composition. All 
of these studies were conducted under naturalistic 
conditions, and data were collected without con- 
stant awareness by the individuals that they were 
being investigated. 

The development of organization has been de- 
fined in terms of role and status relationships 
among a number of individuals. Role denotes reci- 
procities in the treatments and expectations of in- 
dividuals, each for the others. Unlike well-defined 
occupational or sex roles, definite prescriptions for 
behavior are lacking when unacquainted persons 
first meet. Reciprocities among them must be built 
on the basis of performance in the activities en- 
gaged in, the reactions of others to a person, and 
his reactions to them. The typical finding at early 
stages of interaction is that individual contributions 
to task performance differ from one activity to the 
next (U.S. Office of Strategic Services 1948; Gibb 
1954). Thus, observers rating behavior in the dif- 
ferent situations find that the degree of participa- 
tion and prominence of the individuals differs from 
one task to the next, according to individual dif- 
ferences in skills, abilities, temperament, or physi- 
cal resources and tools relative to the activities 
in question. 

The single most salient feature of group forma- 
tion is that over time the various member roles 
become differentiated, not merely with regard to 
task performance or personal qualities but accord- 
ing to the evaluation of the roles by the members 
themselves. Members are accorded differing degrees 
of prestige and respect by their fellows. The mem- 
ber roles acquire different degrees of relative power 
to initiate and control activities important to all 
of the individuals in the group. 

A member’s position (rank) in a developing 
power structure is his status in the group, defined 
in terms of the relative effectiveness of his actions 
in initiating activities, making or approving deci- 
sions affecting the group, coordinating interaction, 
and invoking correctives for deviation. 

Power, defined as effective initiative, is not 
identical with influence, in the limited sense that 
person A affects the actions of person B. Influence 
of this kind may occur with little or no relation to 
the effectiveness of person A’s actions in the group. 
Power is implemented by sanctions, while influence 
is not. 

Status (rank in power) is not identical with 
popularity or degree to which the person is liked. 
In fact, status and popularity may be poorly cor- 
related (see Hare 1962, p. 115; Sherif & Sherif 
1964, chapter 6). Nor should status be confused 
with the use of force or aggression. As Whyte 
(1943) showed, even in a street corner group 
whose members valued masculine toughness, the 
best potential fighter was not necessarily highest 
in status. Status was rooted in “mutual obligations” 
incurred among the group members over time and 
the reliability with which a member lived up to his 
obligations. 

Since it is defined as effective initiative, status 
in a group is necessarily hierarchical. The highest 
status represents the leadership role. Especially in 
societies or situations where social equality is em- 
phasized, the operational leader, defined by observa- 
tion of his effectiveness over time, may not be 
openly chosen “leader” by the group's members. 

The accompanying figure is a diagram of the 
stabilization of statuses, based on experimental 
findings for six groups. At the first time point (the top) the 
individuals in two collections have first encountered 
one another. The circles represent the individuals 
and indicate that, at that time, independent ob- 
servers do not agree on regularities in the relation- 
ships among them from one situation to the next. 
Instead, their ratings of effective initiative are dif- 
ferent in various activities. (See Figure 1.) 

[Figure 1. Formation of status structure in two groups 
(Group A and Group B), from initial togetherness to group 
stabilization. Time runs from the top row, with no 
differentiated statuses, downward to rows with stabilized 
high and low statuses. Circles: individuals whose status is 
not yet stabilized. Triangles: individuals whose status is 
stabilized. Source: Sherif & Sherif 1964.] 

Just below, at the second time point, the observers’ ratings be- 
gin to agree (from one activity to the next and 
one day to the next) that the highest and lowest 
positions (represented by triangles) are stabilized. 
Both in the experiments and in real-life groups, the 
leader position typically stabilizes earlier than other 
high positions. This does not imply that group 
formation consists of the “search for a leader.” On 
the contrary, leadership is subject to change. As 
Hofstätter correctly pointed out (1957, especially 
p. 24), tracing group organization over time is 
necessary to clear up many glib formulas pro- 
pounded without sufficient evidence. 

At the third time point in the figure, observers are able to 
agree on the positions most members occupy, ex- 
cept in the middle of the diagram. Again, this is a 
typical finding. In part it reflects attempts by those 
in the middle to improve their standing or to align 
themselves with those higher in status. At the last 
time point in the diagram the status relationships are stabi- 
lized, all observers agreeing on the status structure, 
which is also revealed in members’ perceptions of 
it as manifested by sociometric choices obtained 
from them. 
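
The criterion of stabilization used above, agreement among independent observers on members' ranks, can be illustrated with a small computation. What follows is a minimal sketch and not the investigators' own procedure: it assumes that each observer supplies a complete rank ordering of the same members with no ties, and it uses Kendall's coefficient of concordance (W), a standard statistic chosen here only for illustration. The function name is hypothetical.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance for complete rankings without ties.

    rankings[o][i] is the rank (1 = highest status) that observer o
    assigns to member i. The result runs from 0 (no agreement among
    observers) to 1 (all observers give the identical ordering).
    """
    m = len(rankings)       # number of observers
    n = len(rankings[0])    # number of members ranked
    if m < 2 or n < 2:
        raise ValueError("need at least two observers and two members")
    # Total of the ranks each member receives across all observers.
    totals = [sum(observer[i] for observer in rankings) for i in range(n)]
    mean_total = m * (n + 1) / 2
    # Sum of squared deviations of the rank totals from their mean.
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))
```

Computed separately for each period of observation, a value near zero would correspond to the early stage in which ratings differ from one activity to the next, and a value near one to the stage at which all observers agree on the status structure.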

The diagram is intended to be representative. 
The different patterns of status in the two groups 
are intended to suggest that there is no predeter- 
mined form to the “steepness” or “flatness” of the 
hierarchy. Group formation represents an ideal oc- 
casion to study factors affecting the organization 
of groups, but little research has been done on this 
problem. 

The rate of stabilization varies. In the experi- 
ments, the groups stabilized within about a week 
of continuous living together. Other investigators 
have reported the discernible beginnings of group 
structure among individuals meeting in the same 
location with similar activities within three to five 
meetings of a few hours’ duration (Merei 1949). 
Environmental events are as important as internal 
events in affecting the speed of stabilization. The 
stability of the pattern is sensitive to the introduc- 
tion of new members, to changes in location and 
facilities, and to outside threat or emergency. 

In particular, the stabilization of group structure 
is never independent of relationships with other 
groups. Prolonged competition between groups for 
mutually incompatible goals is particularly effec- 
tive in quickly stabilizing a structure. Important 
intergroup confrontations, especially those result- 
ing in defeat or humiliation, produce changes in 
the internal organization of a group (Sherif 1966). 

The leader of a group, although most powerful, 
is still a member subject to loss of status. When the 
group structure is stabilized in some degree for the 
time, no person within it is free to ignore its regu- 
lation. It defines for members the bounds of “we” 
or the in-group, as compared to others who are not 
members. If sufficiently stabilized, the group can 
continue after a leader's departure with little dis- 
ruption (see Toki 1935). This persistence of a 
group structure and the effects of the group even 
on the leader are clearer in terms of the normative 
property of group formation. 

Formation of group norms. As a group struc- 
ture takes shape, members come to prefer certain 
ways of going about their important activities. They 
may adopt a group name. They set up standards 
for the ways members should and should not be- 
have among themselves and with outsiders. “Norm” 
is a general term to refer to such results of inter- 
action which produce regularities among group 
members. 

Unlike the “norm” on an examination or a test, 
a group norm does not necessarily refer to the 
average of individual behaviors. It designates what 
is expected as proper, as moral, or even as ideal. 
Yet a group norm seldom denotes a single action 
as the only way to behave. A range for individual 
variations is permissible in any group. A norm de- 
notes the range of behaviors that members come to 
deem socially desirable and acceptable (latitude of 
acceptance) and a range of behaviors condemned 
as objectionable (latitude of rejection). 

A norm implies, therefore, an evaluative scale 
(measuring rod) defining for individual members 
a latitude of acceptance and a latitude of rejection, 
to regulate their behavior in matters of consequence 
to the group (Sherif et al. 1965). Not all social 
behavior is regulated by clear-cut norms, particu- 
larly when groups are in formation. 
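
The idea of a norm as an evaluative scale with a latitude of acceptance and a latitude of rejection can be pictured as ranges on a single ordered scale. The following is a minimal sketch under stated assumptions: the numeric scale, the class and field names, and the label "unregulated" for positions falling in neither latitude are hypothetical illustrations and are not taken from the studies cited.

```python
from dataclasses import dataclass
from typing import List, Tuple

Interval = Tuple[float, float]  # inclusive (low, high) on a hypothetical scale


@dataclass(frozen=True)
class Norm:
    acceptance: Interval        # latitude of acceptance
    rejection: List[Interval]   # latitude of rejection (may lie on both sides)

    def classify(self, position: float) -> str:
        """Place one observed behavior on the group's evaluative scale."""
        low, high = self.acceptance
        if low <= position <= high:
            return "accepted"
        if any(lo <= position <= hi for lo, hi in self.rejection):
            return "rejected"
        # Neither latitude applies: no clear-cut norm covers this behavior.
        return "unregulated"
```

For example, `Norm((4, 6), [(0, 2), (8, 10)]).classify(5)` returns `"accepted"`, while a position of 3 falls in neither range, which reflects the remark that not all social behavior is regulated by clear-cut norms.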

How can a group norm be detected? There are at 
least three objective ways: 

(1) By observing similarities and regularities in 
the behaviors (words and deeds) found among one 
set of persons but not another set in a similar 
situation. 

(2) By observing correctives (sanctions) for 
certain behaviors and praise or reward for others. 
Reactions to deviations are among the best evidence 
of the bounds of acceptable behavior. These may 
range from disapproval, frowns, and correctives 
to threats and actual punishment. 

(3) By noting the increasing similarity or con- 
vergence over time in the behaviors of individuals 
who initially behaved differently. For example, the 
entrance of a new member into a group provides 
an opportunity to detect the existence of its norms. 

When groups are in the process of formation, as 
in the experiments, one of the best indicators of 
their stability is the degree of consensus among 
members on the correctness of their norms and the 
degree to which the latitude of acceptance is bind- 
ing without direct social pressure or threat of sanc- 
tions. Stabilization of the set of norms is indicated 
when members privately regulate their own behav- 
ior within the latitude of acceptance. The person’s 
own conception of how he should behave and how 
others should act comes to fall within bounds de- 
fined by the norms. Especially when the individual 
has had a part in creating the norms as a group 
member, they become aspects of his self concept 
relative to others. He experiences personal guilt or 
shame if he violates them. 

The personal acceptance of group norms during 
group formation accounts in large measure for the 
tenacity of tradition once established. Merei (1949) 
demonstrated this tenacity by permitting play 
groups to develop procedures and rules and then 
introducing a new child who was older and had 
evidenced leadership skills in other situations. In 
every group the new child quickly found that his 
superior skills in coordinating play were not ef- 
fective, for no one paid attention. In every case, the 
new child was then absorbed into the group, adopt- 
ing its traditions and rules. 

Sherif and Sherif (1964) present evidence that 
the stringency of norms and resistance to their 
change varies according to the importance of the 
norm for the group. Violations in major activities 
or in dealings with outsiders, exposure of group 
secrets, or other behavior jeopardizing the main- 
tenance of the group were unerringly responded 
to by strong sanctions such as expulsion, threat, or 
physical punishment. Members whose actions 
exposed the group or its members to humiliation, 
embarrassment, or danger were chastised. 

In less important activities the range of tolerance 
for individual differences was much wider, partic- 
ularly for the leader and higher-status members. 
In matters of daily routine or amusement strictly 
within the group, leaders were free to innovate and 
sometimes engaged in behavior that would not have 
been tolerated in lesser members. In these fairly 
stable groups, the great bulk of conforming be- 
havior occurred without direct social pressure or 
threat of sanctions, particularly behavior by mem- 
bers of moderate or high status. 

Differential effects on behavior. The formation 
of a group structure and norms has consequences 
for the attitudes and behavior of individuals within 
its fold. These consequences may be referred to as 
the differential effects of group formation. 

Any social situation provides a context for be- 
havior which differs from that in a solitary situa- 
tion. The context includes the other people present, 
the activities and tasks undertaken, the physical 
site and its facilities, and the person’s relationship 
to all of these. Experiments have repeatedly shown 
the differential effects of various aspects of the 
social context on behavior as compared to behavior 
when alone. 

The formation of a role system and norms dur- 
ing interaction among persons over time brings 
about alterations in the relative contribution of the 
task, activity, setting, and individual reactions. 
When the persons are at first simply together, with- 
out stabilized reciprocities, their personal charac- 
teristics and skills relative to the tasks and those 
of other people are important determinants of be- 
havior (Gibb 1954; Hare 1962). As the process of 
group formation starts taking shape, the developing 
organizational and normative schemes become more 
and more binding on members. As a result, over 
time, the characteristics of the task and location— 
in short, immediate situational factors—recede in 
relative importance, and behavior increasingly re- 
flects the person’s role in the group, the roles of 
others, and the emerging norms. 

The group formation experiments of Sherif and 
Sherif ([1948] 1956, chapters 6 and 9) traced the 
development of a “we-feeling.” It was found that 
sociometric friendship choices became almost ex- 
clusively concentrated within the group, even 
though initial choices before group formation had 
been given predominantly to persons placed (de- 
liberately) in another group. In one experiment it 
was shown that estimates made by members of 
one another's performance became significantly re- 
lated to the member's status, the relationship being 
closer when the structure was more stable. Per- 
formance by high-status members was overesti- 
mated, and that of low-status persons was mini- 
mized. 
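
The concentration of sociometric choices within the group, reported above, can be expressed as the share of all choices that go to fellow members. A minimal sketch, assuming that choices are recorded as (chooser, chosen) pairs and that each person belongs to exactly one group; the function name and data layout are hypothetical and are not the scoring method of the experiments cited.

```python
def in_group_choice_share(choices, membership):
    """Fraction of sociometric choices directed to the chooser's own group.

    choices: iterable of (chooser, chosen) pairs.
    membership: dict mapping each person to a group label.
    """
    choices = list(choices)
    if not choices:
        raise ValueError("no choices recorded")
    within = sum(1 for chooser, chosen in choices
                 if membership[chooser] == membership[chosen])
    return within / len(choices)
```

Comparing this share before and after group formation would show the shift described in the text, from choices given predominantly to persons in the other group to choices concentrated almost exclusively within one's own.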

In another experiment the groups formed sep- 
arately, then competed for a series of mutually 
exclusive goals. As predicted, norms developed in 
each group justifying hostility to the other group. 
The performance by members of the other group 
in a novel task was estimated to be significantly 
lower than performance by members of the in- 
group, revealing the prejudicial norm in the judg- 
ments of individual members (Oklahoma, Univer- 
sity . . . 1961). 

In proportion to the significance a particular 
group has in a person’s life, his membership in it 
affects his attitude and behavior. As the group for- 
mation stabilizes, his sense of identity becomes tied 
to being a member of that group, proportional to 
its scope and importance in his daily living. For 
this reason the socialization of the person is incom- 
pletely described by reference only to his acquisi- 
tion of formal prescriptions from family, school, 
and other official institutions. From early childhood 
through adolescence, groups formed among age- 
mates exert compelling impact upon the person’s 
conceptions of what is desirable for him, what is 
acceptable in others, and what is right and wrong 
(Campbell 1964; Sherif & Sherif 1964). In other 
words, they become aspects of the individual’s own 
conscience. 

Recognition of the consequences for the self 
concepts and attitudes of participants has led to 
attempts in various countries to utilize group forma- 
tion for corrective and therapeutic purposes. The 
varied outcomes reveal both gaps in the knowledge 
of group formation and lack of familiarity with 
the knowledge available on the part of many prac- 
titioners (Rosenbaum & Berger 1963, especially 
pp. 1-32). 

The emphasis earlier in this article on the moti- 
vational base of group formation and on the im- 
portance of environmental alternatives suggests 
fruitful lines of inquiry. The significance of the 
motivational base was revealed in a study of group 
formation among “emotionally disturbed young ado- 
lescents of poor prognosis” by Rafferty (1962, p. 
263). They interacted rather freely in a wide range 
of activities for five hours daily, five days a week 
for nine months. They lacked motivation toward 
the institution’s aim of changing their behavior, 
and their personal disturbances hindered any kind 
of stable interpersonal relationships. However, they 
did unite with incipient group formation in activi- 
ties reflecting a motivation genuine to them: de- 
fiance of the hospital staff in forbidden activities. 

This instructive finding raises the issue of pre- 
dicting or controlling the character of the structure 
and norms during group formation. Here the im- 
portance of the environmental setting and the be- 
havioral alternatives it encourages or permits be- 
comes evident. In the group experiments referred 
to frequently here, solidary groups devoted to con- 
structive activities were formed simply by placing 
unacquainted persons in situations of high appeal 
to them, with facilities and conditions so arranged 
that coordination of activity was the only way to 
secure individual satisfaction. Subsequently, con- 
flict and hostility between the groups were pro- 
duced, followed by their reduction through coop- 
erative efforts by the groups. These effects were 
achieved merely by varying the facilities available, 
the other persons present, and other conditions 
external to both groups. Future research on group 
formation and its applied implications might prof- 
itably focus on the effects of varying the envi- 
ronmental alternatives and facilities available to 
groups, including other groups and persons, upon 
the character of the organization and norms that 
develop. 


Conclusions 


The material briefly presented above warrants 
the following conclusions: 

Whenever individuals with similar motives, simi- 
lar frustrations, and similar personal concerns for 
acceptance, for recognition, and for stabilizing their 
perception of themselves encounter one another, 
and when these goal-directed concerns are not ef- 
fectively dealt with through the established chan- 
nels of custom and law or the routine of prevailing 
arrangements of social organization—these indi- 
viduals then tend to interact among themselves. 

Repeated interaction in some common striving 
is conducive to differentiation of roles or functions 
to be performed toward the common end. And 
differentiation of roles and functions among the 
participating individuals, over a time span, is the 
pattern or formation that can be designated as 
the group. Every such human formation creates its 
own set of rules or norms to stabilize the regulation 
of behavior and the attitudes of members within 
its bounds. 

In a natural group, as in any other group, the 
rules or norms that count and have salience in the 
eyes of the members are generally the ones that 
pertain to the existence and perpetuation of the 
group and to the spheres of activity that are related
to the common motivational concerns that were 
initially conducive to repeated interaction among 
the individuals in question. 

The main properties of the group thus formed 
are an organization (structure) of roles and sta- 
tuses and a set of rules or standards (norms) for 
its activities toward the common ends. The “or- 
ganization” (which need not be formally recog- 
nized) and the set of norms (which need not be 
formally written in blueprints) define the sense of 
“we-ness” cherished within the group and upheld 
by its members in their dealings with outsiders. 

In time, the standards or norms shared in the 
feeling of “we-ness” become personally binding for 
individual members. The members who are worthy 
and true make their judgments and justify or con- 
demn events within the sphere related to their 
“we-ness” in terms of their sense of identification 
with the group. Proportional to the importance of 
the group in the lives of its members, the person’s 
self-picture, his sense of personal accountability,
his loyalty, and the “do’s” and “don'ts” of the group 
become parts of his conscience. Hence, group for- 
mation has broad implications for the regulation 
of individual attitude and behavior with and with- 
out external sanctions and controls. 

[Directly related are the entries COHESION, SOCIAL;
CONFORMITY; LEADERSHIP.]


IV
ROLE STRUCTURE 


“Role” refers to a set of “expectations” for inter- 
action between a person who holds one position in 
a group and another person who holds a reciprocal 
position. In other words, there can be no “leader” 
role without a “follower” role. Since the person
playing the role has a personality (or self) which 
he brings to the situation, the behavior of a person 
in a role will be some combination of the tendencies 
of his personality and the expectations of his role. 

Roles in discussion groups, which are the focus
of much of the research on groups, are usually not 
fixed. Any aspect of an individual’s behavior that 
is initially an expression of his personality can 
come to be expected by other group members and 
thus become part of his role. In general, the dimen- 
sions of role are the same as the dimensions of 
interaction and of personality. Roles have a form 
and a content, where the form includes the fre- 
quency of interaction and the communication net- 
work and the content includes task and social- 
emotional behavior. Within the task area, the 
expectations refer to problem-solving behavior and 
within the social-emotional area, according to one 
recent formulation, to behavior along at least three 
dimensions: dominance-submission, positive-negative,
and joking-serious (Couch 1960; Hare 1962).

Patterns of role differentiation. Natural groups 
which have been in existence for some time will 
probably have a greater degree of role differentia- 
tion than laboratory groups (Sherwood & Walker 
1960), unless the group organization allows only 
one leader who is expected to perform all func- 
tions. Thus, in some industrial and governmental 
conference groups a “sharing” of leadership is re- 
sisted by the members (Berkowitz 1953). Over the 
life of a group there may be more differentiation 
in roles when functional problems of the group be- 
come more acute, for example, when the group is 
under stress or the task is complex (Bales 1953). 
Similarly, members in cooperative groups are more 
apt to be differentiated in their functions than 
members who are competing (Homans 1961, 
p. 135). 

The small military unit or industrial work team 
is perhaps the best example of the simplest form 
of role differentiation. In each case one or two 
members may have clearly defined functions with 
little overlap in their “expectations,” while the re- 
mainder of the members play undifferentiated roles 
of “soldier” or “worker.” In discussion groups (and 
especially in laboratory groups) it is not always 
clear whether a single person plays one role or 
many roles, or whether these roles are assumed at 
the same or at different times.

In some research all members are viewed as 
specializing in some content area, with those with 
the highest interaction rates identified as the lead- 
ers (Heinicke & Bales 1953; Grusky 1957). In 
other research some individuals may be described 
as playing a variety of roles and others none (Davis 
et al. 1961; Cloyd 1964). In either case the authors 
are not usually concerned with the rights and 
duties associated with each role. They rather use
the term “expectations” in the sense that group
members can predict or “expect” that a group member
will act in a given way on the basis of past
ber will act in a given way on the basis of past 
performance. There is no sense that the member 
is “obliged” to act in this way to fulfill a function 
in the group or that he is entitled to any special 
privileges for performing this function [see ETHICS, 
article on ETHICAL SYSTEMS AND SOCIAL STRUC- 
TURES]. 

In some research, the evidence that a person is 
playing a given role is that other members associ- 
ate a certain set of behavioral characteristics with 
him; these behavioral characteristics are presented 
as a cluster without any special rationale concern- 
ing their interconnection (Bates & Cloyd 1956; 
Davis 1961; Cloyd 1964). In other research a per- 
son is described as playing a role because he has 
been nominated as a frequent contributor of some 
specified type of behavior, for example, giving in- 
formation or giving opinion (Bales 1958; Borg 
1960). An even less functionalist view is taken in 
research in which some members are described as 
playing individual or self-oriented roles that pre- 
sumably are extraneous to the group task (Benne 
& Sheats 1948). Another variation on the use of 
the term “role” is for the experimenter to describe 
someone as playing a role such as “newcomer” 
(Mills et al. 1957) or “doctor’s assistant” (Margolin 
1952) without actually assessing the perceptions 
of the other group members. 

Most of the research on small groups has so far 
not used a definition of role as a set of rights and 
duties but rather as a description of recurring pat- 
terns of behavior. But the former, more limited 
definition of role still appears to be theoretically 
useful even though it is not widely used. For in- 
stance, in those studies that report the interaction 
in initially “leaderless” groups, the expectations for 
the average member are probably most clearly re- 
lated to the directions given by the experimenter 
at the beginning of the session. Often, indeed, the 
experimenter will test to see if the subjects have 
been as friendly or as competitive as he directed 
them to be, but he will not refer to these instruc- 
tions as “role expectations” (Olmsted 1954). 

The power of an experimenter’s instructions is 
evident in an experiment in which one member in 
each of several discussion groups was given more 
information about the task than the other mem- 
bers. In some of these groups the experimenter 
announced, “Some of you may have more informa- 
tion than others.” In other groups he gave the im- 
pression that all members were equally informed. 
It was observed that group members reacted negatively
to the best-informed man unless they had
been led to “expect” that he would play a different 
role (Shaw & Penrod 1962). 

Although early research on groups, especially 
with children, often described “individual” or “self- 
oriented” roles as if the only role of some indi- 
viduals was to satisfy their own needs in the group 
(Benne & Sheats 1948), later formulations by Redl 
(1942), Bion (1961), and Stock and Thelen 
(1958) suggest that all roles in the group serve 
some function. However, some of these roles, par- 
ticularly the ones that allow members to deal with 
emotional themes, may not be recognized as part 
of the “official” group structure. For example, in a 
training group composed of teachers who were be- 
ing led by Bion’s methods of “interpretive group 
discussion,” the group members persisted, without 
success, in trying to induce the leader to act as a 
therapist. Finally six members did not appear at 
the scheduled group session and a spokesman for 
the absentees sent in a paper on truancy to be dis- 
cussed by the remaining members. Thus, by taking 
the “role” of absent members, some of the group 
acted out the group’s need for flight from the task 
(Herbert & Trist 1953). 

Task versus social-emotional roles. The most 
common division of roles that has been described 
in small discussion and work groups, as well as in 
families viewed as small groups, is specialization 
in the task and social-emotional areas (Bales & 
Slater 1955; Bales 1958). This has been shown most
clearly in small laboratory groups in which mem- 
bers have a high degree of consensus at the end of 
a meeting on the relative amount of interaction in 
each of these areas exhibited by group members. 
These groups appear to recognize two kinds of role 
specialists: one an “idea man” who concentrates 
on the task and plays a more aggressive role; the 
other a “best-liked” man who concentrates on 
social-emotional problems of group process and 
member satisfaction, giving emotional rewards, 
and playing a more passive role. However, in 
groups similar to these in which there is less con- 
sensus on the status of members, a third type of 
person appears to be present, one who talks a great 
deal but who is not well liked or highly rated on his 
task ability. He has been referred to as a “deviant.” 
In addition, researchers have found a more passive 
task specialist and a “popular” person (Slater 
1955).

The so-called “deviant” who overtalks is probably 
expressing the group’s anxiety about the discussion
task. To please the experimenter, the group
allows a member to fill the time with “discussion”
even though he may not be the most effective at
the task. This tendency to have a high or low inter- 
action rate has often been reported as the first of 
three factors or dimensions that may be used to 
describe behavior, the other two being task be- 
havior and sociability (Carter 1954). However, the 
frequency of interaction should probably be con- 
sidered an independent dimension because it de- 
scribes the “form” of interaction and not because it 
represents a separate area of “content.” 

In families, the father has been identified as the 
task specialist, the mother as the social-emo- 
tional specialist (Parsons & Bales 1955). A similar 
dichotomy has been reported among caseworkers 
in a welfare agency where some colleagues were 
respected and sought out for consultation on cases 
and others were attractive because of their sociable 
companionship (Blau 1962). However, a study of 
therapy groups reported that the distinction was 
not evident in this setting, since the “task” of the 
group was to deal with social-emotional problems 
(Talland 1957). 

When the same individual is required to play the 
role of both the task and the social-emotional 
leader, he may find some aspects of the roles
incompatible and thus experience “role conflict”
(Seeman 1953). Officers in small military units, for
example, may be required to consider the personal 
problems of their men as well as to be task leaders. 
The second role requires distance, since some 
assignments must be made without regard to per- 
sonal feelings, yet the first role requires closeness 
and intimacy (Hutchins & Fiedler 1960). 

The joker. In addition to these two roles, a 
third role has been identified in some groups which 
actually has a long history in many cultures. This 
is the role of the clown or the joker. Just as the 
English court jester’s costume set him apart from 
the group, so the joker tends to take a somewhat 
marginal position with regard to the task. He tends 
to look at things differently, providing both a source 
of humor and of new ideas. Under the cover of wit 
he is able to introduce ideas that the group might 
otherwise find unacceptable. 

In its literary form, comedy has been described 
as “an escape, not from truth but from despair: a 
narrow escape into faith” (Fry 1960, p. 27). One 
must be able to grasp the tragic nature of life be- 
fore one can go on to grasp its comic nature. Thus, 
the comic person in a play or in a real-life small 
group is often one who gives special insight into 
the problem. This is evident from a study of a 
series of “great-books” discussion groups where the 
joking role was most highly correlated with the role 
of providing “fuel” for the discussions in the form
of new ideas and opinions (Davis et al. 1961). In a
study of laboratory discussion groups the following
cluster of behavioral characteristics was identified
as composing a role: jokes and makes humorous
remarks, is liberal, challenges others’ opinions, gets
off the subject, is egotistical, is cynical, and
interrupts others (Cloyd 1964). Within the dimension
of “joking versus task-serious behavior” there may 
be different styles of wit. Thus, sarcastic wit may 
be perceived as powerful but unpopular in a group, 
while clowning wit may be seen as popular but 
powerless (Goodchilds 1959).

The member. The fact that simple membership 
in a group carries with it a set of rights and duties 
is evident in the group’s concern for the “silent 
member.” Although silence may be functional if it 
means that more able persons are being allowed to 
solve problems (Homans 1961, p. 136), group 
members are usually dissatisfied with a member's 
performance if he does not participate. This dis- 
satisfaction may be reduced if it is made clear at 
the outset that certain members will not partici- 
pate at all (Smith 1957). On the other hand the 
group will be more concerned if a member appears 
indifferent and neglectful in his role (Rosenthal & 
Cofer 1948). 

Another type of member role that has received 
some attention in the literature is that of the “new- 
comer.” Group members will have an easier time 
assimilating the newcomer if they have been told 
to expect change (Ziller et al. 1961), provided that
the newcomer is not seen as too different from 
other group members (Ziller et al. 1960) and that 
group members have already had a pleasant time 
with each other (Heiss 1963). In any event there 
will probably be a minimal alteration in the role 
patterns of the old members at first (Mills et al. 
1957). 

Other roles. Other roles may be found in groups 
with special tasks. In therapy groups a “doctor’s 
assistant” may fill the group’s need to keep the dis- 
cussion going when the therapist is playing a rather 
passive role and a “help-rejecting complainer” may
give the group case material to discuss (Frank et al. 
1952; Margolin 1952). Somewhat similar to the 
“doctor's assistant” may be the “feeder-to-leaders” 
identified in a sociometric study of a home for 
girls (Jennings 1947). These were girls who would 
have their ideas accepted after they had been en- 
dorsed by highly chosen leaders, although they 
were not highly chosen themselves [see LEADER- 
SHIP]. 

The classic study of authoritarian and democratic
group atmospheres by Lewin, Lippitt, and
White does not focus on role differentiation; how- 
ever, two types of roles are mentioned in addition 
to the leaders. In the “democratic” groups the 
authors describe two boys who are allies of the 
adult leaders, and in the “autocratic” groups they 
describe a scapegoat who receives the aggression 
of the group (White & Lippitt 1960, pp. 160-186). 
Thrasher (1927) in his study of gangs in Chicago 
gives a longer list of special roles, to which he 
gives such names as brains of gang, funny boy, 
sissy, show-off, and goat. Redl (1942) lists ten
types of roles of central persons, which may be 
grouped into three categories: identification ob- 
jects, objects of drives, and ego supports. 

In addition to roles which may arise naturally 
in groups, some authors have suggested certain 
roles which should be introduced if a group is to 
operate with maximum efficiency. For example, 
Jenkins (1948) suggests that effective discussion 
requires attention to such mechanics of operation 
as awareness of direction, goal, and rate of prog- 
ress. He proposes that groups appoint a “group pro- 
ductivity observer” who will report at the end of 
each meeting. 

Where role differentiation occurs in a group, in- 
dividuals accustomed to playing the same roles in 
other groups will probably continue to play them 
in the new situation (Strodtbeck & Mann 1956; 
Davis et al. 1961). In addition, differences in age, 
sex, social class, and occupation between members 
will result in role differentiation in small discus- 
sion groups even when these differences are not 
related to the task at hand (Maas 1954; Torrance 
1954). 

Six roles in discussion groups. From the review 
of the literature presented above it is evident that a 
variety of roles have been identified in discussion 
groups. The number of roles reported may be small 
or large depending upon the level of analysis, the 
task, the group size, and the length of time the 
members have been together. Although the dichot- 
omy of task versus social-emotional roles is ade- 
quate to describe the basic differentiation in some 
groups, in others there are probably more distinct 
roles that have been grouped under these two more 
general headings. A framework for the description
of more distinct and independent roles is provided 
by Couch’s (1960) factor analysis of categories
of interaction in five-man laboratory discussion 
groups. He finds that interpersonal behavior can 
be described by the following independent dimensions:
dominance versus submission; positive versus
negative; task-serious versus joking; influence
attempts versus receptivity; and surface acquiescence
versus resistance. If we assume that behavior
at the extremes of each dimension may
identify a role, we find a rather good fit with six 
role patterns identified by Cloyd (1964) without 
reference to any particular dimensions and with- 
out actually assigning them names. 

At the dominance end of Couch’s first factor we 
find a set of behaviors similar to a cluster identified 
by Cloyd that includes “aggressive, self-confident, 
and gets things started.” This role might be called 
the “high talker.” At the submissive end of Couch’s 
first factor is Cloyd’s cluster of traits: “modest, 
shy, and ill at ease.” This role might be called the 
“silent member.” 

In a similar way one could match dimensions 
and clusters of traits to identify a “supporter” who 
is friendly and objective, a “critic” who is idealistic 
and argumentative, a “serious worker” who is de- 
pendable and constructive, and a “joker” who 
makes humorous remarks and challenges others’ 
opinions. Cloyd does not provide examples that 
would fit the last two of Couch’s dimensions; how- 
ever, the last dimension suggests the roles of “con- 
formist” and “nonconformist.” 


From this example it will be seen how small 
group experiments, although independently de- 
signed by researchers using different concepts, can 
be said to deal with a recurrent, scientifically 
identifiable class of phenomena. Indeed, such is 
the wealth of suggestive findings in this area that 
the organization of our present knowledge has be- 
come both an indispensable and a rewarding pre- 
liminary to future progress. 

A. PAUL HARE 


[See also INTERACTION, article on INTERACTION PROC- 
ESS ANALYSIS; LEADERSHIP; SOCIOMETRY.] 


V
GROUP PERFORMANCE 


It was a practical interest in group perform- 
ance which led to the massive literature dealing 
with experimental studies of groups. The indus- 
trial revolution and the growth of the large fac- 
tory raised such practical questions as: How can 
groups of workers become more productive? How 
might the performance of the individual worker 
be affected by the presence of others? When is 
a group more effective than the sum of its indi- 
vidual members? Thus, the typical study of group 
performance will vary some factor and measure 
differences in quantity, quality, speed (number 
of units produced per unit time), or efficiency 
(number of units produced per man per unit time). 
Some studies might concentrate on homogeneity 
or similarity of behavior and production of indi- 
viduals in the group as compared to that of in- 
dividuals working alone. The original industrial 
interest spread to other areas, such as military 
situations, research laboratories, and classrooms, 
where the problems were often quite different. 
Reviews of studies of group performance may be 
found in Hare (1962), Kelley and Thibaut (1954), 
and Lorge and his associates (1958). 

Studies of group performance have been grouped 
into four major problem areas: (a) group produc- 
tivity—e.g., studies of how many relays can be 
wired by a group in a given period of time, or 
how many addition problems can be completed; 
(b) group problem solving—e.g., studies of the 
solution of puzzles, construction of a complex 
design involving thought and planning, creativity 
in determining unusual uses for everyday imple- 
ments; (c) group judgment—e.g., group estima- 
tion of the number of parachutes seen in an air 
photo of military exercises, estimation of number 
of beans in a jar, judgment of lengths of lines or 
movement of lights; and (d) group learning and 
retention—e.g., group memorizing of a complex 
story, learning a maze or series of words, military 
units learning complex strategy as a unit. Although 
these categories are convenient, it is obvious that 
a given task will often involve several of the above 
problems in combination: the group must form 
some judgments in learning and retention, learn- 
ing and retention are involved in problem solving, 
group productivity may be improved through creative
problem solving in the reorganization of group
operations, etc.

Individual versus group performance. Early 
observations suggested that workers on production 
lines would tend to increase their level of activity 
when working together. Floyd H. Allport (1920) 
considered such “social facilitation” in terms of 
classical conditioning. The sounds of tapping 
from the shoemaker’s hammer become conditioned 
stimuli for the muscular movements which move 
his hammer. When he works in the presence of 
others, the tapping sounds from his co-workers 
serve as additional conditioned stimuli and thus 
increase further the rate at which the shoemaker 
pounds his shoes. Allport found, in fact, that 
subjects given word-association tasks, vowel-can- 
cellation tests, or multiplication tests performed 
more rapidly when working in the presence of 
others. This was true even when rivalry was di- 
minished by using instructions and by not per- 
mitting knowledge of the scores of others. [See the 
biography of ALLPORT.] 

Early studies of problem solving also indicated 
the superiority of groups over individuals. It was 
suggested that this resulted from erroneous solu- 
tions being rejected in group discussions (Shaw 
1932). It would follow that the superiority of the 
group over individual problem solving would be 
greater in tasks which permitted a greater variety 
of responses, with elimination of incorrect solu- 
tions being more crucial. Such was, indeed, demon- 
strated by Thorndike (1938). [See the biography 
of THORNDIKE.]

In one of the few clear-cut experiments on group 
versus individual learning, subjects were asked to 
learn lists of nonsense syllables. Some learned 
cooperatively, in groups of three, and then alone; 
for others the order was reversed. It appeared that, 
generally, subjects learned more rapidly in groups 
and after the group experience could learn better 
on their own. Some superior subjects, however, 
did not benefit from the group learning experience 
(Perlmutter & de Montmollin 1952). 

There is evidence that groups can inhibit the be- 
havior of individuals. In an early study most chil- 
dren asked to turn a crank to move a flag to a finish 
line tended to show “social facilitation” when oper- 
ating together. However, some operated less effec- 
tively when coacting with others. This decrement 
was attributed to overstimulation (Triplett 1898). 

While the studies of individual versus group per- 
formance indicated some superiority of group per- 
formance, they also suggested limitations on any 
such conclusions. The problem can be reconsidered 
in terms of the factors which lead to variations in
group performance. 

Size of group. In an early study by Moede 
(1927), a man was given a rope to pull as hard as 
he could, then a second man was added and given 
the same instructions, then others, until eight men 
were pulling, with the strength of pull measured. 
The first man pulled, on the average, 63 kilograms; 
the two together pulled 118; three, 160; and eight, 
248 kilograms. Clearly the total amount of pull in- 
creased with each man, but the work per man was 
reduced with additional members—each new mem- 
ber reduced the work per man by approximately 10 
per cent. This early study of group productivity is 
a prototype of later research on size of group in 
group performance. The results of later studies 
seem to be the same. A study of problem solving in 
the game “Twenty Questions” found four-person 
groups asking fewer questions than dyads, but the 
four-person groups were less efficient (Taylor & 
Faust 1952). Thomas and Fink, summarizing the 
literature, say that generally “both quality of per- 
formance and group productivity were positively 
correlated with group size under some conditions, 
and under no conditions were smaller groups su- 
perior. In contrast, measures of speed showed no 
difference or else favored the smaller groups” (1963, 
p. 373). 
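
The arithmetic behind the rope-pulling result can be checked directly from the four totals quoted above. The following is a minimal sketch, not part of Moede's own analysis: the function names are illustrative assumptions, and only the kilogram figures come from the text. It divides each total by the number of men and expresses the result as a share of the one-man pull, then averages the loss over the members added.

```python
# Total pull in kilograms by group size, as quoted in the text (Moede 1927).
TOTAL_PULL_KG = {1: 63, 2: 118, 3: 160, 8: 248}


def pull_per_man(total_kg: float, size: int) -> float:
    """Average pull contributed by each member of a group of the given size."""
    return total_kg / size


def relative_output(totals: dict) -> dict:
    """Per-man pull for each group size as a fraction of the one-man pull."""
    base = pull_per_man(totals[1], 1)
    return {n: pull_per_man(t, n) / base for n, t in sorted(totals.items())}


def average_loss_per_added_member(totals: dict) -> dict:
    """Drop in relative per-man output, averaged over the members added."""
    rel = relative_output(totals)
    return {n: (1.0 - share) / (n - 1) for n, share in rel.items() if n > 1}


if __name__ == "__main__":
    rel = relative_output(TOTAL_PULL_KG)
    loss = average_loss_per_added_member(TOTAL_PULL_KG)
    for n in sorted(TOTAL_PULL_KG):
        line = f"{n} men: {pull_per_man(TOTAL_PULL_KG[n], n):.1f} kg per man, "
        line += f"{rel[n]:.0%} of the one-man pull"
        if n in loss:
            line += f", average loss per added man {loss[n]:.1%}"
        print(line)
```

Running the script lets a reader compare the per-man decline implied by the four data points with the rounded figure of approximately 10 per cent given in the text.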

Cohesiveness and morale. In 1927 the Haw- 
thorne plant of the Western Electric Company 
initiated a series of experiments to raise productiv- 
ity and morale. A group of female workers en- 
gaged in wiring relays was placed in a separate 
room and, in the tradition of efficiency experts 
Frederick W. Taylor and Frank B. Gilbreth, changes 
in working conditions were introduced which were 
calculated to improve efficiency—changes to piece- 
work production, increased rest periods, hot meals, 
earlier quitting times, etc.—and with each change 
productivity increased. However, when all these im- 
provements were removed, productivity rose still 
higher. It was pointed out that, clearly, the physical 
factors were not as crucial for increasing produc- 
tivity as cohesiveness and morale [see COHESION, 
SOCIAL; INDUSTRIAL RELATIONS; see also Roethlisberger
& Dickson 1939, p. 86].

Later studies in various types of work groups, 
including railroad maintenance workers, office 
staffs, and machinery workers, also indicated that 
productivity increased with attraction to the work 
group (Michigan, University of . . . 1954).
Other research indicated similar relationships in 
military crews (Berkowitz 1956). In a careful ex- 
periment, Van Zelst, working in conjunction with 
the supervisor and foreman of a large construction
project, made up some work groups of carpenters 
and bricklayers according to their personal choices 
of co-workers and some of members who were not 
mutually attracted to one another. The former 
groups showed greater job satisfaction, lower labor 
and material cost, and lower turnover rates (1952). 

There are, however, limitations to any generali- 
zation that cohesiveness increases productivity. 
Mutual friendliness, when it shows itself in high 
sociability and horseplay, can decrease productivity 
(Horsfall & Arensberg 1949). Thus, group goals 
and group norms must also be considered. 

Interdependence, cooperation, and competition. 
Patterns of interdependence can have a great effect 
on group performance. The individual worker on the 
production line may be under great pressure to work 
rapidly either through the “pull” of the man next in 
line, who is waiting for the next piece, or through 
the “push” of the preceding worker, whose rapid 
rate of work leaves work piling up on his successor. 
In the British coal mines, a change to a “longwall 
method” of mining led to each shift of workers be- 
coming less aware of other shifts and less aware of 
the extent to which each was dependent upon the 
other. Accidents increased because a preceding 
shift departed without allowing proper safeguards 
for those that followed. For similar reasons, pro- 
ductivity decreased. A major remedy involved in- 
creasing both formal and informal communications 
between shifts and, through other means, making 
shifts and workers aware of their interdependence 
(Trist & Bamforth 1951). Thomas (1957) indicates 
the importance of considering interdependence both
with respect to means and with respect to goals. He
found greatest productivity in a group with division 
of labor where both types of interdependence ob- 
tained, though there were signs of greater emotional 
tension as well. 

Early studies indicated that rivalry, or competi- 
tion, tended to increase rate of performance, though 
sometimes at the expense of quality (Triplett 
1898). However, later studies found more rapid 
performance among persons who are cooperating. 

Part of the answer lies in the extent to which 
there is also interdependence with respect to means. 
When each person works on a parallel task, as in 
the Triplett experiment in which each subject 
turned his own crank, with speed being the basis 
of success, then competition will likely be more 
effective in increasing performance. When each
person depends on others for his own movement, as
when three persons raise or lower corners of a
triangular board so as to center carpenter’s levels
(Raven & Eachus 1963), competition will be
detrimental. With competition and interdependence of
means, each person is concerned about his activity
helping others. There is less division of labor, and
a person may not wish to give information to a
co-worker who might thus be helped to defeat him
(Deutsch 1949).

Division of labor and information. While divi- 
sion of labor may be beneficial to a cooperative 
work group, mutual interdependence also imposes 
additional pressures on the individual, resulting in
greater pressure to produce but also greater tension
and greater loss of individuality and craftsmanship.
This has been observed in industrial settings. It has
been demonstrated that such resulting tensions can be
alleviated somewhat, with greater group cohesiveness,
greater acceptance of influence from co-workers, and
less hostility, through clarifying for the individual
the nature of the final product and the ways in which
the individual worker's task fits in with that of his
co-workers (Raven & Reitsema 1957).

The question arises as to how information should 
be distributed for maximal group retention. If, for 
example, a complex military tactic must be learned 
by a squad, giving all the information to each mem- 
ber would increase learning time and lead to an in- 
creased rate of forgetting. On the other hand, giving 
each person a nonoverlapping portion, while reduc- 
ing redundancy, increases the possibility of a piece 
of information being permanently lost if only one 
member forgets it. Zajonc and Smoke (1959) sug- 
gest a mathematical model for dealing with this 
problem, but extensive empirical testing is still 
lacking. 
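
The trade-off described above can be made concrete with a simple calculation. It is offered only as an illustration and is not the Zajonc and Smoke model. Assume, hypothetically, that a group of n members must retain m items, that each item is assigned to k members, and that each member independently forgets any item he holds with probability q. The symbols n, m, k, and q are introduced here for the example only.

```latex
P(\text{item lost to the group}) = q^{k},
\qquad
\text{items held per member} = \frac{m\,k}{n},
\qquad 1 \le k \le n .
```

With nonoverlapping portions (k = 1) each member holds only m/n items, but an item is lost with probability q. With full redundancy (k = n) the loss probability falls to q^n, but each member must learn all m items. Under these assumptions, intermediate values of k trade individual load against the risk of permanent loss.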

Group norms and decisions. In the Hawthorne 
study, while it was found that group morale con- 
tributed to higher productivity, it was also observed 
that norms developed which prevented production 
from either exceeding a given level (such a worker 
would be considered a “rate-buster”) or falling be- 
low (being a “slacker”). Similarly, the study by 
Seashore found that increased cohesiveness did not 
lead to greater productivity but rather to greater 
homogeneity of performance in a work group 
(Michigan, University of . . . 1954). It has been 
demonstrated in experimental situations that a 
highly cohesive group will not only be better able 
to influence a member to increase his production 
but also to decrease it (Schachter et al. 1951). 

Lewin suggested that changing the level of pro- 
ductivity of a work group involves a process of 
“unfreezing” an old norm, then change, and “re- 
freezing” at a new normative level (1951, p. 228). 
A reduction in productivity, due to a change in 
methods of production, may lead to establishment 
of a lower group norm, which will remain long after 
necessary relearning has taken place. Members who 
exceed the norm may be subject to strong group 
pressure to work more slowly. This suggestion led 
to studies which indicated that involving the work 
group in the re-evaluation of the job, with estab- 
lishment through group discussion of an appro- 
priate group standard of production, will ultimately 
lead to a higher level of production with less ten- 
sion and higher morale among the workers (Coch 
& French 1948). 

With respect to group judgments, group norms 
might also have a deleterious effect. Though a 
group would be more likely to have the combined 
skills necessary to make a proper judgment, it may 
sometimes happen that a majority is wrong. In that 
case, there will be strong pressures for an individ- 
ual who might otherwise judge correctly to refrain 
from uttering the correct answer and, instead, to 
go along with the group. A properly trained leader 
can sometimes assist a group in group judgment 
or problem solving through encouraging the expres- 
sion of minority opinion and bringing out the use- 
ful ideas and contributions of members who are 
otherwise restrained by conformity pressures 
(Maier & Solem 1952). 

Communication networks. An application of 
principles of topology to group behavior by Alex 
Bavelas (1950) led to a series of studies of the 
effects of restriction of communication channels 
on productivity and morale. In an early study, five- 
man groups were given a problem of finding which 
symbols were held in common by all members. The 
communication possibilities were varied. The most 
effective group was that in which four members 
could communicate only with the one “central per- 
son,” a pattern called a “wheel.” A “circle” was least 
effective, with a “chain” and a “Y” being interme- 
diate. The more involved the person was in the net- 
work, the happier he was with the group task. The 
“circle,” while least effective, had the greatest over- 
all morale (Leavitt 1951). The initial findings indi- 
cate that restriction of communication leads to 
greater efficiency. It is suggested that this may be 
true for the simple problems in which information 
must be collated to find a correct answer. The re- 
stricted pattern of the “wheel” saves the group from 
spending time on organization for effective prob- 
lem solving. However, it also lessens flexibility, 
making for less efficiency when the tasks are 
changed or become more complex (Guetzkow & Dill 
1957). When the problem is complex and involves 
a number of alternative paths to solution, the less 
restricted network is more efficient (Shaw 1954). 
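
The four patterns can be compared with a short sketch. It assumes one plausible wiring for each five-person network (the member labels A to E and the edge lists are hypothetical, not taken from the studies cited) and computes a relative centrality index of the kind used in this literature: the sum of all shortest-path distances in the network divided by the sum of distances from a given position.

```python
from collections import deque

# Hypothetical wiring of the four five-person networks; labels A-E are arbitrary.
NETWORKS = {
    "wheel": [("A", "B"), ("A", "C"), ("A", "D"), ("A", "E")],
    "Y": [("A", "C"), ("B", "C"), ("C", "D"), ("D", "E")],
    "chain": [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E")],
    "circle": [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E"), ("E", "A")],
}


def adjacency(edges):
    """Build an undirected adjacency map from a list of two-way channels."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def distance_sum(graph, start):
    """Sum of shortest-path lengths from start to every other member (BFS)."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return sum(dist.values())


def relative_centrality(edges):
    """Network-wide distance total divided by each position's own distance sum."""
    graph = adjacency(edges)
    sums = {member: distance_sum(graph, member) for member in graph}
    total = sum(sums.values())
    return {member: total / s for member, s in sums.items()}


def peak_centrality(name):
    """Centrality of the most central position in the named network."""
    return max(relative_centrality(NETWORKS[name]).values())


if __name__ == "__main__":
    for name in NETWORKS:
        print(f"{name:6s} most central position: {peak_centrality(name):.2f}")
```

Under this wiring the hub of the wheel scores 8.0 and every position in the circle scores 5.0, with the Y and the chain in between. That ordering parallels the efficiency ordering reported for the simple symbol-matching task, though the index itself says nothing about morale or about performance on more complex tasks.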

Personality factors undoubtedly play a role. 
Workers high in acceptance of authority will be 
relatively more efficient in a centralized group 
structure; those low in acceptance of authority will 
be more effective in a decentralized structure (Shaw 
1959). It might also follow that an increase in in- 
telligence and skills would lead to increased effi- 
ciency for the decentralized structure relative to the 
centralized network. 

Leadership and supervision. Clearly, leadership 
and supervision play a very important part in group 
performance, since the leader can often affect and 
control many of the other critical variables dis- 
cussed above. The style of leadership has been given 
particular attention in the literature. In a pioneer 
experiment with groups of boys, it was found that 
the greatest productivity and morale occurred with 
a “democratic” leader, who encouraged participa- 
tion in decisions, gave a clear picture of the group 
activities and the reasons for his requests, took an 
active but not overactive role in the group's activi- 
ties, etc. A “laissez-faire” leader, who allowed com- 
plete freedom and assumed no active role, achieved 
the least productivity. An “authoritarian” leader, 
who was very active in issuing commands without 
giving reasons, was effective in raising productivity 
only insofar as the group was under his immediate 
surveillance. Morale was lowest in the last case 
(White & Lippitt 1960). Other studies in industrial 
settings confirm that productivity and morale are 
higher with a participatory leader or supervisor who 
assumes an active role in the group, gives support 
to his workers, delegates authority, and maintains 
an optimal degree of supervision (Kahn & Katz 
1953). There is evidence that similar group-oriented 
roles on the part of the teacher may also lead to 
improved learning. 

Characteristics of the leader have also received 
some attention in relationship to productivity. For 
example, Fiedler (1958), in investigations in a 
large variety of settings, has found that productivity 
is highest in teams where the leader maintains an 
optimal distance from his team and where he per- 
ceives clear differentiation between his best and 
worst co-workers [see LEADERSHIP]. 

Training and group process. Given a knowledge 
of factors which contribute toward improved group 
performance, it should follow that training and edu- 
cation in the application of these principles would 
be a logical next step. Indeed, there have been many 
attempts to introduce training for improved group 
performance—either through training the groups 
themselves or the leaders and supervisors. It has 
been demonstrated that a discussion leader who is 
trained to encourage expression of minority opin- 
ions will effect superior problem solving and deci- 
sion making (Maier 1953; Maier & Solem 1952). 
It is generally assumed that educators can be taught 
to improve learning in the classroom. 

On the assumption that group ideation can be 
more productive than individual ideation and that 
a group may sometimes inhibit the expression of 
unusual ideas which might be modified so as to be 
fruitful, Alexander F. Osborn introduced training 
in what he called “brainstorming.” In this problem- 
solving situation, members are given a problem, 
such as how to increase the attractiveness of teach- 
ing as a profession, and then are asked to give 
whatever ideas come to mind, no matter how out- 
landish. Thus, group norms are established to en- 
courage rather than discourage strange suggestions. 
These are then gradually selected and evaluated, 
and a unique practical solution may be developed 
from an initially bizarre suggestion. Though the 
basis appears sound, one of the few careful evalu- 
ative experiments suggests that “brainstorming” 
inhibits rather than facilitates creative thinking 
(Taylor et al. 1958). [See CREATIVITY; PROBLEM 
SOLVING.] 

The National Training Laboratories were estab- 
lished at Bethel, Maine, for the purpose of pro- 
viding summer workshops in interpersonal rela- 
tions. Educators, military leaders, supervisors in 
industry, and others participate in programs in- 
volving some theory sessions, but largely free in- 
teraction in groups, in order to become aware of 
how they affect others and others affect them. The 
original program spread widely and may now take 
place in sessions lasting from a few hours to 
months. “Sensitivity training” is said to have pro- 
duced dramatic changes in the effectiveness of or- 
ganizations. Unfortunately, the problems of evalu- 
ating the effectiveness of “sensitivity training” are 
immense, but the clearest evidence to date of its 
effectiveness consists of testimonials from members 
of organizations which have participated and the 
fact that many such organizations have continued 
to train their supervisors and employees in this way 
(Tannenbaum et al. 1961, pp. 233-238). 

Another recent approach to training is that de- 
veloped in the System Development Corporation. 
In “system training,” the working unit (in this case, 
most work has been done with air defense crews) 
is considered as a “man-machine system” involving 
more than the time-motion aspects emphasized in 
an earlier period and the interpersonal relationships 
of a later stage. In practice, an air defense crew at 
its home site is presented with a series of problems 
which are simulated on the crew’s own radar 
scopes, with detailed flight plans, etc. After the 
problem, a careful record is available of all relevant 
interactions, and the crew holds a debriefing session 
in which members analyze their faults and attempt 
to make necessary corrections. One goal is to devel- 
op greater flexibility in the crew, allowing for rapid 
changes in task assignment so as to provide for 
effective adjustment to all sorts of problem situa- 
tions. Data indicate that there is, indeed, improve- 
ment in ability to deal with the simulated problems. 
It is assumed that such improvement also applies to 
future problems which the crew might encounter 
in real life as well (Gagné 1962). 


BERTRAM H. RAVEN 
[See also FIELD THEORY.] 
PSYCHOLOGY; VOCATIONAL INTEREST TESTING. 


GUILDS 
See GILDS. 


GUILT 
See MORAL DEVELOPMENT; PSYCHIATRY, article 
on FORENSIC PSYCHIATRY; PSYCHOANALYSIS; PUN- 
ISHMENT. 


GUMPLOWICZ, LUDWIG 


Ludwig Gumplowicz (1838-1909), of Polish- 
Jewish parentage, was professor of public law at 
the University of Graz, Austria, from 1875 until 
his death. He is best known for his pioneer work 
in establishing sociology as a social science. His 
contributions to political science and jurisprudence 
were also important, but they consisted mainly of 
applications of his sociological generalizations to 
government and law. A clear and vigorous writer, 
he was much given to controversy in all three 
fields. 

The sociological system of Gumplowicz was 
based on a number of fundamental dogmas or 
principles: (1) social phenomena are governed by 
universal laws that operate in a completely secular 
manner, unrelated to religious or moral considera- 
tions, and they must be studied by using a thor- 
oughly scientific method; (2) sociology rejects all 
value judgments; and (3) sociology is the science 
of the interaction of groups. 

Applying these principles to the evolution of 
society and states, Gumplowicz held that the earli- 
est forms of group life were small, natural, ethnic 
or blood-kin hordes. These groups were unified by 
consanguinity and common, rudimentary economic 
interests; their members lived in sexual promis- 
cuity and relative equality of social position. Soli- 
darity developed in a group through a process of 
“syngenism,” which gradually produces unity in 
any social group. Out of this earliest form of group 
life arose, in succession, the matriarchate and the 
patriarchate, the first types of organized control. 

Since the appearance of the matriarchate and 
the patriarchate, social and political evolution has 
been a never-ending process of external conflict 
between groups, in the form of wars, and of inter- 
nal conflict of interests within groups. The motive 
in all conquests has been the desire for the mate- 
rial gain that may be obtained by exploiting the 
labor of the conquered. Material interests, there- 
fore, have furnished the dynamic impulse in social 
evolution. “So conquest and the satisfaction of 
needs through the labor of the conquered, essen- 
tially the same though differing in form, is the 
great theme of human history from prehistoric 
times to the latest plan for a Congo state. . . . It 
cannot be otherwise since man’s material need is 
the prime motive of his conduct” ([1885] 1963, 
pp. 203-205). 

When the process of conquest and subjugation 
becomes well developed, the principle of amalga- 
mation supplements syngenism in producing unity 
in the state, which, as the highest form of social 
grouping, is the culmination of a long process of 
conquest and of many adjustments subsequent to 
conquest. Gumplowicz believed that there is no 
such thing as indefinite social progress; the historic 
process is the record of the rise and fall of states 
and follows an inevitable cyclical course of growth 
and decline. 

In the initial stage of conquest, there were only 
two social classes, the conquerors and the subju- 
gated. Foreign merchants who moved in produced 
a third class—the middle class or bourgeoisie. 
With the development of social and political insti- 
tutions, the activities of the primary classes of 
rulers, merchants, and exploited masses created a 
need for secondary or derived classes, such as 
priests, professional men, and artisans. These so- 
cial classes were based on a division of labor which 
was created and maintained by coercion. 

The rise of social classes produced a complex 
and unending struggle among them to control the 
policy of the state in order to promote their various 
special interests. These interests were best ad- 
vanced through participation in legislation, and 
social classes found that political parties were the 
most effective agencies for controlling the legisla- 
tive process. Whatever form the struggle of classes 
within the state has taken, it has provided the 
dynamic core of history ([1877] 1907, pp. 33 ff.). 


Paralleling political and economic developments 
there were various processes of assimilation, such 
as the adoption of the language, religion, and 
manners and customs of the conquerors, which 
tended to produce cultural unity. Finally, through 
intermarriage, an ethnic unity was achieved. The 
homogeneous society thus created is the “folk-state” 
or nation, which is the ultimate outcome of social 
and political evolution. The tendency throughout 
history has been for such a folk-state or nation to 
seek to conquer a neighboring nation, and when 
it is successful the whole process of subjugation, 
assimilation, and amalgamation is repeated. 

Gumplowicz’ sociological system is not simply 
an application of Darwinian biology to the opera- 
tion of human society. He was indebted to Darwin 
for the general idea of a struggle for existence, but 
he was also indebted to others: to Comte for the 
determination to analyze social phenomena with 
scientific rigor, and to Spencer for the idea of uni- 
versal evolution and the application of the laws of 
universal evolution to society. His conception of 
the primordial and ceaseless conflict of races and 
social groups came from the writings of Count 
Joseph Arthur de Gobineau and from his own life- 
long contact with the intense struggles among 
races and groups in the Austro-Hungarian Empire. 
He had participated in the Polish revolt of 1863 
and had absorbed much of the socialist and anar- 
chist literature of the first half of the nineteenth 
century: his dogma that the basis of conflict is 
always the promotion of material self-interest was 
based chiefly on the stress laid in this literature 
on the exploitation and expropriation of the masses 
by the property owners, and to a lesser extent on 
Marxist writings. 

The outstanding contribution of Gumplowicz to 
sociology was his naturalistic and secular approach 
to society and social evolution and his conception 
of social evolution as a process of conflict. The con- 
cept of “social process” appeared in the writings of 
such European sociologists as Franz Oppenheimer 
and Gustav Ratzenhofer; in America, the works of 
Gumplowicz and Ratzenhofer were read and di- 
gested by Albion Small, founder of the so-called 
Chicago school of sociology. Small and his disci- 
ples, such as Park and Burgess, and other Ameri- 
can sociologists, notably Ross and Cooley, carried 
further the development of the concept of the 
“social process.” 

Gumplowicz’ contributions to realistic political 
science were notable. He assembled and developed 
in a well-integrated and systematic fashion the 
previously scattered suggestions relative to the 
social conditioning of political phenomena: that 
society is a group held together by material inter- 
ests; that the state is a society organized and con- 
trolled by coercion; and that political legislation is 
a product and reflection of the social process at 
any given time ([1877] 1907, pp. 33 ff.; [1892] 
1902, pp. 49-66). Gumplowicz’ well-documented 
argument that the state has always been founded 
through wars and conflict cleared away pietistic, 
idealistic, and metaphysical misconceptions. His 
argument has won general acceptance, although 
some sociologists, notably Jacques Novicow, have 
charged that he neglected the role of pacific and 
cooperative factors in the origins of states. Gum- 
plowicz’ conception of political parties as interest 
groups has become, perhaps, the most useful 
truism of dynamic political science and has been 
adopted and developed by Durkheim, Duguit, Laski, 
and A. F. Bentley, among others. 

Gumplowicz’ fundamental assumptions led him 
to reject the possibility that man can plan his social 
future and promote his happiness by such social 
inventions as the welfare state. He even suggested 
that the practical value of sociology may be to save 
mankind from wasting time and energy on futile 
schemes of utopian reform. There is some evi- 
dence, however, that Lester Ward was able to con- 
vince Gumplowicz to a certain extent that the 
social sciences may enable man to plan a better 
future (1905). 

Gumplowicz spent most of his professional life 
as a professor of law. His legal theories were a 
direct outgrowth of his sociological doctrines, and 
he is generally regarded as one of the founders of 
the sociological school of jurisprudence. He saw 
laws as invariably growing out of the conflict of 
classes and interests within the state, as social 
products rather than divine revelations, or as wise 
and rational human creations, or as derived in 
some mysterious manner from nature. He believed 
that there are no natural laws, except in the sense 
that laws are a result of the nature of man and 
of the social processes. This precludes the validity 
of classifying laws as good or bad: laws are not 
passed to promote justice in any abstract sense 
but rather to enable the dominant social class or 
classes to carry out their exploitation ([1892] 1902, 
pp. 55 ff.). Justice does not determine political 
and legal rights; rather, these rights are the prod- 
uct of the conflict of interests among social classes: 


The premise of “inalienable human rights” rests upon 
the most unreasonable self-deification of man and 
overestimation of the value of human life, and upon 
complete misconception of the only possible basis of 
the existence of the State. . . . Rights are not founded 
upon justice. On the contrary, justice is created only 
by the actual rights as they exist in the state. . . . It is 
the simple abstraction of political rights, and it stands 
or falls with them. ([1885], pp. 180-181, 263 in 1899 
edition) 

HARRY ELMER BARNES 


[For the historical context of Gumplowicz’ work, see 
LEGAL SYSTEMS; and the biographies of COMTE; 
DARWIN; GOBINEAU; SPENCER. For discussion of the 
subsequent development of his ideas, see CONFLICT; 
CONSTITUTIONAL LAW, article on CIVIL RIGHTS; LAW, 
article on THE LEGAL SYSTEM; and the biographies 
of BENTLEY; DUGUIT; DURKHEIM; LASKI; OPPEN- 
HEIMER; RATZENHOFER; SMALL.] 


WORKS BY GUMPLOWICZ 

(1877) 1907 Allgemeines Staatsrecht. 3d ed. Innsbruck 
(Austria): Wagner. > First published as Philosoph- 
isches Staatsrecht. 

1881 Rechtsstaat und Socialismus. Innsbruck (Austria): 
Wagner. 

1882 Verwaltungslehre mit besonderer Berücksichtigung 
des österreichischen Verwaltungsrechts. Innsbruck 
(Austria): Wagner. 

(1883-1905) 1926-1928 Ausgewählte Werke. Edited by 
F. Oppenheimer, F. Savorgnan, M. Adler, G. Salomon. 
4 vols. Innsbruck (Austria): Wagner. > Volume 1: 
Geschichte der Staatstheorien, (1905) 1926. Vol- 
ume 2: Grundriss der Soziologie, (1885) 1926. Vol- 
ume 3: Der Rassenkampf, (1883) 1928. Volume 4: 
Soziologische Essays (1899), Soziologie und Politik, 
(1892) 1928. 

(1885) 1963 The Outlines of Sociology. Edited with an 
introduction and notes by Irving L. Horowitz. 2d ed. 
New York: Paine-Whitman. > First published in 
German. 

1887 System socyologii. Warsaw: Orgelbrand. 

(1892) 1902 Die soziologische Staatsidee. 2d ed. Inns- 
bruck (Austria): Wagner. 

1905 An Austrian Appreciation of Lester F. Ward. Ameri- 
can Journal of Sociology 10:643-653. 

1910 Sozialphilosophie im Umriss. Innsbruck (Austria): 
Wagner. 


SUPPLEMENTARY BIBLIOGRAPHY 

BARNES, HARRY E. 1919 The Struggle of Races and So- 
cial Groups as a Factor in the Development of Political 
and Social Institutions: An Exposition and Critique of 
the Sociological System of Ludwig Gumplowicz. Jour- 
nal of Race Development 9:394-419. 

BEROLZHEIMER, FRITZ (1905) 1912 The World’s Legal 
Philosophies. Boston: Boston Book. > First published 
as Die Kulturstufen der Rechts- und Wirtschaftsphilo- 
sophie. 

KOCHANOWSKI, I. 1909 Ludwig Gumplowicz. American 
Journal of Sociology 15:405-409. 

POSNER, STANISLAW 1911 Ludwik Gumplowicz 1838- 
1909: Zarys zycia i pracy. Warsaw: Orgelbrand. 

WARD, LESTER F. 1909 Ludwig Gumplowicz. American 
Journal of Sociology 15:410-413. 

ZEBROWSKI, BERNHARD 1926 Ludwig Gumplowicz: Eine 
Bio-Bibliographie. Berlin: Prager. 


GUTHRIE, EDWIN R. 


American psychologist, educator, and philoso- 
pher, Edwin Ray Guthrie (1886-1959) was the 
oldest of five children. He was born in Lincoln, 
Nebraska, where he spent his boyhood. His mother, 
Harriet Pickett Guthrie, the daughter of a news- 
paperman, was an elementary-school teacher be- 
fore marriage; his father was the son of a minister 
and the manager of a piano store. 

Guthrie had exhibited vivid intellectual interests 
even as a child. At the University of Nebraska, 
which he entered in 1903, he majored in mathe- 
matics and minored in philosophy. After receiving 
his A.B. in 1907 he remained at Nebraska for grad- 
uate work, majoring in philosophy and minoring in 
mathematics and psychology. In 1910 he received 
his A.M. from Nebraska and was made a Harrison 
Fellow at the University of Pennsylvania, where he 
received his Ph.D. in 1912 with a thesis resolving 
various paradoxes of Bertrand Russell (1915a). 

Professional life. For five years Guthrie taught 
high-school mathematics, from 1907 to 1910 in 
Lincoln and from 1912 to 1914 in Philadelphia. 
He then joined the faculty of the University of 
Washington, where he remained, except for tem- 
porary leaves, for the rest of his life. 

At Washington, Guthrie started as instructor in 
the department of philosophy, then chaired by 
William Savery. Savery (a student of William 
James at Harvard) and Guthrie (a student of 
Edgar A. Singer at Pennsylvania) became lifetime 
friends. In 1918 Guthrie became assistant profes- 
sor and in 1919 joined the department of psychol- 
ogy, chaired by Stevenson Smith, who also became 
Guthrie’s close friend. Guthrie was made associate 
professor in 1925 and professor in 1928. 

Guthrie married Helen Macdonald in 1920. With 
her, he translated Pierre Janet’s Principles of Psy- 
chotherapy (1924) and traveled extensively—espe- 
cially in France, where Guthrie met Janet, and in 
England, Italy, and Hungary. 

Guthrie was the second winner of the gold medal 
awarded by the American Psychological Foundation 
for “outstanding lifetime contributions to psychol- 
ogy.” Awarded an honorary LL.D. from the Univer- 
sity of Nebraska in 1945, he was elected president 
of the American Psychological Association in the 
same year. During World War II, he was chief 
consultant to the overseas branch of the general 
staff of the U.S. War Department in 1941 and chief 
psychologist of the overseas branch of the Office 
of War Information in 1942. He was dean of the 
graduate school of the University of Washington 
from 1943 to 1951 and was honored by a building 
on the campus being named after him while he was 
still alive. 


Contributions to science 


Guthrie’s close association with philosophers and 
his formal training in philosophy at an advanced 
level had considerable impact on the problems to 
which he addressed himself and on his resolutions 
of some of these. 

Philosophical clarifications. The nature of ex- 
planation. Guthrie deemed the question “why” as 
unprofitable and almost nonsensical—unless by 
“why” one means “how come,” “under what circum- 
stances,” or “what next.” “Why” in the sense of 
“for what purpose” carries implicit teleological as- 
sumptions perhaps unwarranted and not the most 
fruitful for understanding. 

The most illuminating explanations, Guthrie 
pointed out, are those which summarize sequences 
of observable events: Given this set of observable 
circumstances, what observable subsequent events 
may be most reasonably expected? Such explana- 
tions answer the question “what next” or “how 
come.” An explanation states the general class of 
which some particular sequence is an instance, 
nothing more. 

Causation and the nature of theory. Seeking 
“causes,” as ordinarily conceived, distracts and con- 
fuses us. The usual notion of “cause” implies that 
there is a force within some set of circumstances 
which somehow pushes events in a certain direc- 
tion. Building on the work of David Hume, Guthrie 
abandoned the usual notion of “cause” and thought 
instead in terms of sequences of events. Certain 
events precede other events in time. The earlier 
events need not be assumed to force the later 
ones; they simply precede them. 

Certain events do precede others with great regu- 
larity. Our problem as theorists then becomes one 
of devising general terms to label these classes of 
events and to describe these more regular se- 
quences. Theory construction, so conceived, is an 
inductive process and consists in devising general 
statements (principles) to summarize as many 
sequences as possible. 

Hedonism. One of the very ancient Greek phi- 
losophies (refined by Aristotle and Plato) held that 
man by his nature is a pleasure-seeking organism. 
A variation of this turns up in the popular idea that 
learning occurs only when some satisfaction or 
need reduction is involved for the learner [see 
LEARNING, article on REINFORCEMENT]. This is a 
very comforting view. It is heartening to believe 
that, whenever we learn, at least some of our needs 
are thereby being met. The view poses difficulties, 
however, in accounting for the many instances in 
which learning is followed by distress, and no ap- 
parent satisfaction or need reduction. To handle 
this difficulty, psychologists posited an ever-length- 
ening list of motives (conscious and unconscious) 
or drives (primary, secondary, tertiary) or needs, 
in an attempt to account for seemingly dysfunc- 
tional learning. 

Guthrie handled the difficulty by abandoning the 
idea that we learn only what is followed by some 
need reduction. His approach has economy in the 
numerosity of presumed “needs” or “drives.” It also 
facilitates understanding of mistakes, learned awk- 
wardnesses, and habitual stupidities—without hav- 
ing to presume a “death instinct,” “will to failure,” 
or anything similar—and alerts us to circumstances 
in which learning may lead to damage or disaster. 

The unity of learning. There is, Guthrie sug- 
gested, one kind of learning only; the same prin- 
ciples which hold for learning in one instance hold 
also for learning in all other instances. The ap- 
parent diversity of learning does not stem from 
there being different kinds of learning following 
different principles but arises instead from differ- 
ences of other sorts: different kinds of situations, 
different kinds of responses possible to organisms 
with different musculatures, and different kinds of 
stimuli that can become cues for differing species 
with their differing sense organs and differing 
neural structures. We need not formulate separate 
principles for each of the differing situations, dif- 
fering response types, or differing stimulus sensi- 
tivities. The same set of principles may hold for all 
and be illustrated by all. 

New concepts. Multiple “causation.” At the 
time when Guthrie wrote, a prevalent way of 
thinking about the responding organism was in 
terms of “a” stimulus or “the” CS (conditioned 
stimulus) for various responses. In contrast, gestalt 
psychologists (Köhler, Koffka, et al.) were stressing 
the importance of the totality of the situation. Their 
emphasis was on the totality as a unit—responses 
purportedly being evoked by the total situation as 
a unified, in some ways indivisible, whole. 

Guthrie’s idea, inspired by both theories, was also 
different from both. He emphasized multiple stimu- 
lation as a more adequate basis for comprehending 
behavior than either the single stimulus or than 
the whole situation as merely a unitary totality. He 
suggested we view any response as a consequence 
of the interplay and, in a way, the summation of 
all stimuli impinging on the organism at that mo- 
ment. He held that the nature of the response made 
by the responding organism is not a function 
merely (or even primarily) of the feature most 
salient to the observer or experimenter—for ex- 
ample, not a function of “the” conditioned stimulus 
merely. Neither is the response a function of an 
unanalyzable total situation. Rather, the total situa- 
tion can be analyzed into component stimulus 
patterns and their relationships, the organism’s 
response being predictable from these various com- 
ponents simultaneously considered. 

Role of internal stimuli. Throughout the first 
half of the twentieth century, psychologists primar- 
ily emphasized external stimuli, except for those 
internal stimuli considered under the heading of 
“drives.” In accord with his view that all stimuli 
acting on the organism should be considered in 
predicting behavior, Guthrie called attention to the 
many internal stimuli besides those thought of as 
drives. He emphasized that proprioceptive stimuli, 
kinesthetic stimuli, stimuli from visceral responses, 
stimuli from endocrine states, fatigue states, and 
chemicothermal conditions, and other internal stim- 
uli all are present too and should be considered for 
the best understanding of behavior. 

He saw no particular advantage in subdividing 
internal stimuli into two classes: drive-connected 
stimuli versus others. It appeared to Guthrie that 
the various internal stimuli and the various ex- 
ternal stimuli all act in substantially the same way 
and are of equal importance when of equal dura- 
tion and equal intensity. Our difficulty in under- 
standing behavior stems largely from our failure 
to note enough of the dimensions of the stimulus 
situation acting upon the organism rather than 
from a lack of knowledge about an individual's 
“motives” or “drives.” 

Movement-produced stimuli. Every time we 
make any response whatsoever, a wealth of new 
stimuli is brought into existence: new visual stim- 
uli, new tactual stimuli, new proprioceptive stimuli, 
sometimes new auditory and olfactory stimuli, and 
others. This class of stimuli Guthrie called “move- 
ment-produced stimuli,” emphasizing that our own 
responses always change the stimulus world we 
are in. 

Conditioning. The work of Bekhterev and Pav- 
lov was immensely stimulating to Guthrie, as it 
was to other American psychologists [see LEARN- 
ING, article on CLASSICAL CONDITIONING]. One re- 
sult of their influence was Guthrie's suggestion 
concerning the relationship between conditioning 
and learning. Instead of thinking of the phenom- 
ena found in conditioning experiments as arising 
primarily from a pairing of the so-called CS and 
US (unconditioned stimulus), Guthrie considered 
them as arising from the pairing of the CS and 
some response (the unconditioned response [UR] or 
others), or more precisely, from the concomitance
of various stimuli and some ongoing response. 
Conditioning so conceived becomes a paradigm of 
one subclass of the more general class of “learn- 
ing,” and all learning, including that called condi- 
tioning, becomes the result of a pairing of new 
stimuli with ongoing responses. In what appears 
to be Pavlov’s only paper published in a United 
States journal (1932), he restated his original 
position that it is the pairing of stimuli rather than 
of stimuli and responses that is critical. Guthrie 
(1934a) rejected Pavlov’s view, and the problem
remains moot today.


A theory of learning 


Guthrie thus created a highly original, parsimo- 
nious theory of learned behavior, presenting much 
of it in The Psychology of Learning (1935). For- 
mulations of the basic principles and concepts are 
given in a paper by Voeks (1950). 

Learning, as Guthrie conceived of it, is the proc- 
ess of establishing new stimuli as cues for some 
specified response. This process occurs in a single 
trial and is disrupted only through unlearning. 

Recency versus postremity. Ebbinghaus, Wat- 
son, and others, on the basis of experimental work, 
stressed that the length of time elapsed since learning
is a key dimension in the preservation of learning
[see FORGETTING]. Recency was adopted as a
crucial factor in the preservation of learning. Sigmund
Freud, through his clinical work, came to
the conclusion that learning which has occurred 
early in childhood often is preserved strikingly even 
when there has been little or no opportunity for 
further strengthening of that learning. People ob- 
serving daily life noted that sometimes things re- 
cently learned are best preserved, whereas at other 
times things learned in the distant past seem most 
intact and reappear after long lapses of time. A 
hodgepodge of chaotic data accumulated. 

Guthrie proposed a new conceptualization and 
a new principle which reconciles these divergent 
findings. He suggested that what most other psy- 
chologists call a “response” or “behavior” is actually 
a series of discrete, more-or-less integrated re- 
sponses. For example, the response labeled “picking 
up a pencil” involves a large number of separate 
muscular activities. Similarly, even so simple a 
“stimulus” as a pencil combines many visual, tac- 
tile, and other stimuli. Each component of such a 
compound stimulus object can be, according to 
Guthrie, a cue for a separate response and may
tend to elicit such a response. The response remaining
cued to each component will always be the
response most recently made in the presence of
that particular component. 

When a series of responses is made to a chang- 
ing series of situations which—while changing— 
have some components in common, responses are 
successively attached and detached from the re- 
appearing stimulus components. Again, the re- 
sponse remaining cued to each stimulus component 
is the response last made to that particular part. 
Even when the stimulus component and the re- 
sponse occur early in the stimulus series and at a 
remote time, nonetheless the response remains 
cued to that component whenever the component 
has not turned up subsequently.

The role of recency in this theory differs a great 
deal from the role it plays in the traditional recency 
principle. According to the latter, as has been 
mentioned, a given stimulus-response connection
tends to grow weaker with elapsed time. But 
Guthrie held that the cue properties of stimuli can- 
not be weakened by time alone. So long as the 
response is the last-made response to the particular 
stimulus component, the cue properties remain at 
full strength; as soon as some other response is 
made contiguously with the perceived stimulus, the 
original cue property will cease to exist and a new 
cue property will be at full strength. This sequential 
regularity has been formulated in the principle of 
postremity: the last response an organism makes 
to each component in a particular situation is the 
response remaining cued to that component—re- 
gardless of how long the time since that stimulus 
occurred. 
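
The principle of postremity can be read as a simple update rule. The Python sketch below is an illustration only and is not part of Guthrie's or Voeks's formulation; the class name `CueTable` and the stimulus and response labels are assumptions introduced here. Each stimulus component is mapped to the response last made in its presence, and elapsed time plays no part in the lookup.

```python
class CueTable:
    """Hypothetical model: each stimulus component cues the last response made to it."""

    def __init__(self):
        self.cued = {}  # stimulus component -> last response made in its presence

    def respond(self, stimuli, response):
        # Every component present is re-cued to the response just made,
        # replacing whatever response it cued before.
        for s in stimuli:
            self.cued[s] = response

    def cue_for(self, stimulus):
        # Time since the last occurrence is deliberately not consulted.
        return self.cued.get(stimulus)


table = CueTable()
table.respond({"bell", "dim light"}, "withdraw")     # early, remote episode
table.respond({"bell", "bright light"}, "approach")  # later episode shares "bell"

print(table.cue_for("bell"))       # re-cued by the later response
print(table.cue_for("dim light"))  # never recurred, so the early cue persists
```

In this toy model the component that never reappears keeps its original response indefinitely, which is the sense in which very early connections can outlast more recent ones.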

Thus, Guthrie would argue, the old principle of 
recency on occasion tends to operate because the 
more recent the stimuli and the given bit of behav- 
ior, the less time there has been for those stimuli 
to reappear while any other behavior is occurring; 
and hence the less likelihood there is that some 
other response can have become cued to those 
stimuli. However, certain very early stimulus- 
response connections will tend to be perpetuated 
(as Freud reported) because the particular stim- 
ulus conditions are highly unusual, thus reducing 
the possibility of their being present while some 
other new response is being made; hence it be- 
comes likely no new response will be cued to those 
sets of stimuli. 

Removal of stimuli established as cues. The 
view derived from philosophical hedonism, that we 
learn only those forms of behavior which are in 
some way rewarding, was common when Guthrie 
was writing. Both psychologists and laymen believed
that unless a response is reinforced by drive
reduction or tension reduction or in some other
way satisfies some need, no learning will occur. 

Guthrie, however, did not hold that view. He 
noted that under some circumstances men and 
animals learn forms of behavior that make matters 
worse for them, satisfy no need whatsoever, accord 
with no “motive,” and attain no goal. Furthermore, 
under some circumstances these unwanted habits 
are preserved. 

The necessary and sufficient condition for learn- 
ing, Guthrie suggested, is the occurrence of a 
response in the presence of a stimulus not 
already a cue for that response. Similarly, the 
necessary and sufficient condition for the pres- 
ervation of learning is to have the current cues 
for some response absent whenever incompatible
responses occur. By this theory, a “reward”
will preserve learning either when it is itself a cue 
for the new response or when it closely follows the 
response and removes the stimulus established as 
a cue for that response. But any state of affairs, in- 
cluding “punishment,” will likewise preserve learn- 
ing equally well, so long as the cue is removed 
from the organism (or the organism is removed 
from the cue) immediately after the response is 
made—i.e., before another response can be made 
in the presence of the same stimulus, thus cancel- 
ing the original association. Punishment will dis- 
rupt learning when it induces the organism to make 
a response incompatible with the previously learned 
response while the original stimulus is yet present. 
Rewards also will disrupt learning under those
circumstances. Degree of “removal from a situation”
(or partial removal of the situation from the
animal) may enable us to predict learning and its 
preservation or disruption more fully than do “re- 
ward” and “punishment.” 

One-trial learning. For two thousand years, fol- 
lowing Aristotle and his basic assumptions, phi- 
losophers, laymen, and behavioral scientists alike 
have put great faith in frequency of association 
as a mode of strengthening associations. In our 
century, Pavlov, Thorndike, and Hull all stressed 
the desirability or even necessity of repetitive trials 
to inculcate learning. In contradistinction, Guthrie 
was much impressed by the experimental work of 
Köhler and others on “insight” behavior. 

Guthrie suggested that frequency of trials as 
ordinarily conceived may well be a misleading way 
of thinking about learning and an often futile way 
of addressing oneself to the practical problems of 
learning. He offered his revolutionary principle of 
one-trial learning: Whatever stimuli happen to be 
acting on the behaving organism become full-strength
cues for whatever responses the organism
is making at that time. A single occasion on which 
the specified response occurs concomitantly with 
various stimuli will establish all those stimuli as 
full-strength cues for the specified response. Addi- 
tional trials are useful only for establishing addi- 
tional stimuli as cues for the specified response. 

According to this principle, repetition is often 
futile and under some circumstances actually worse 
than no trials at all. Repetition is futile to the 
degree that the stimuli are the same from one trial 
to another, that the desired response is not being 
made in each trial, or that the stimuli one wishes 
to establish as cues for the response are not those 
actually present. 

Repetition is worse than futile whenever the 
responses actually being made by the organism are 
incompatible with the desired response; for under 
those circumstances more and more stimuli be- 
come cues for responses incompatible with the 
desired responses. This has the dual effect of in- 
creasing the number of stimuli which are cues for 
some undesired response (thus increasing the prob- 
ability of the undesired response being the one to 
appear on subsequent occasions) and of decreasing
the pool of stimuli which could elicit the desired 
response. 

Frequency has value only to the degree that new 
stimuli are present from trial to trial and then only 
when the desired response actually is being made 
on the various trials. Under these circumstances, 
additional stimuli become cues for the response 
each time it occurs. Thus, repeatedly practicing a 
difficult passage on the piano will be profitable only 
when correct notes actually are being played in a 
variety of circumstances, the correct notes thus 
becoming cued to additional combinations of audi- 
tory, muscular, and other stimuli accompanying 
them or immediately preceding them. What hap- 
pens otherwise is either nothing or the learning of 
errors. 
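
The consequences of one-trial learning for repetition can be made concrete with a small set-based model. This is a sketch under stated assumptions, not Guthrie's own formalism: the function name `trial`, the treatment of cues as a Python set, and the piano labels are all hypothetical. A trial in which the desired response is made turns every stimulus present into a cue; a trial in which an incompatible response is made takes the stimuli present out of the pool of cues for the desired response.

```python
def trial(cues, stimuli_present, response_made, desired):
    """Return the updated set of cues for the desired response (illustrative model)."""
    if response_made == desired:
        # All stimuli present become full-strength cues in this single trial.
        return cues | set(stimuli_present)
    # An incompatible response captures the stimuli that are present.
    return cues - set(stimuli_present)


cues = set()
cues = trial(cues, {"piano", "quiet room"}, "correct notes", "correct notes")
after_first = set(cues)

# Repeating the trial with identical stimuli adds nothing.
cues = trial(cues, {"piano", "quiet room"}, "correct notes", "correct notes")
same_after_repeat = cues == after_first

# A trial under changed circumstances adds a new cue.
cues = trial(cues, {"piano", "audience"}, "correct notes", "correct notes")

# Practicing errors removes cues for the desired response.
cues_after_error = trial(cues, {"piano", "audience"}, "wrong notes", "correct notes")

print(same_after_repeat, sorted(cues), sorted(cues_after_error))
```

The three cases correspond to repetition that is futile, repetition that is useful, and repetition that is worse than futile.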

Probability of response. The likelihood of any 
specified response occurring at some particular 
time is directly proportional to the extent to which 
the stimulating situation is composed of cues for 
that response. The greater the number of cues 
present at some particular time for the response 
desired, the greater the probability that that re- 
sponse will occur (if the total number of stimuli 
is the same for the various situations being com- 
pared). This principle in the hands of William 
Estes became the basis of what currently is called 
“modern statistical learning theory” [see MODELS, 
MATHEMATICAL; see also Estes 1950; 1959]. 
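
As a rough numerical illustration of this principle, the following Python sketch takes the probability of a response to be the share of the stimuli present that are cues for it. The text says only that the likelihood is directly proportional to that share, so setting the constant of proportionality to 1 is an assumption made here; the function name `response_probability` is likewise hypothetical, and the sketch is not a statement of Estes's model.

```python
def response_probability(situation, cues):
    """Share of the stimuli present that are cues for the response (assumed constant of 1)."""
    situation = set(situation)
    if not situation:
        raise ValueError("situation must contain at least one stimulus")
    return len(situation & set(cues)) / len(situation)


cues_for_response = {"a", "b", "c"}

# Two situations with the same total number of stimuli, as the principle requires.
p_few = response_probability({"a", "x", "y", "z"}, cues_for_response)   # 1 of 4 are cues
p_many = response_probability({"a", "b", "c", "z"}, cues_for_response)  # 3 of 4 are cues

print(p_few, p_many)
```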

Cross-disciplinary implications

Education. Dewey emphasized that “we learn 
by doing.” Guthrie extended this to “we learn only 
what we ourselves do.” If in classrooms the student 
watches a teacher skillfully solve problems or en- 
gage in intricate feats of cogent reasoning, the 
student will become an intent observer of the 
teacher solving problems—but he will not become 
better at solving problems himself, unless he him- 
self is doing something other than watching. The 
responses we wish to cue to various stimuli must 
be made by the individual himself in the presence 
of those stimuli. 

Guthrie’s theory of learning leads one also to 
place considerably less faith in drill. Making the 
same response over and over again will not further 
learning unless the circumstances are changed— 
and only to the extent that the circumstances are 
changed. Sitting in the same seat in the same room 
with the same internal stimuli from the same emo- 
tional make-up acting upon one while making the 
same response would add nothing to what was 
gained by making the response in the presence of 
those stimuli once. 

A further implication is that the circumstances 
under which one wishes the desired response to be 
made in the future should be approximated as 
closely as possible by the present circumstances. 
The responses made get cued only to those stimuli 
actually present. 

The theory implies too that teachers commonly 
are too prominent a part of the schoolroom situa- 
tion. Whenever a learner is making desired re- 
sponses, the teacher would be wise to be as small 
a part of the stimulating situation as possible. 
When the teacher is a large part of the situation, 
the learner’s responses will, of course, be cued to 
the sight and sound of the teacher (as well as to 
other stimuli from the teacher); when these stim- 
uli constitute a major part of the total situation, 
the desired responses are being cued to relatively 
little else besides the teacher. Hence, in the teacher’s
subsequent absence, the desired response is
less apt to be made than would be the case had 
the teacher been less prominent. 

Many further implications for formal education 
are presented in a book by Guthrie and Powers 
(1950).

Psychotherapy. We ask “Why did you do this?” 
assuming the person acts toward some end. We 
search for “motives” and speculate on what the 
person gains by his unwanted behavior. But possibly
he gains nothing by it, even in his own eyes.
That could be the crux of his problem: he does not
customarily act in accord with his own values or
goals. His behavior is largely unaimed.

From Guthrie’s theory we should expect that 
much learned behavior may be unsatisfying even 
to the person engaged in it. The person does not 
necessarily direct his behavior toward any goal, 
conscious or unconscious. He is not necessarily 
acting in a fashion that meets his basic needs or 
fulfills his major hopes. He simply is doing what 
he has been trained to do, making the responses 
to various stimuli that he last made to those stim- 
uli, responses that in some prior context he had 
perhaps been forced to make. The responses may 
meet no need and yet persist. 

Instead of assuming that all behavior is goal- 
directed and meets some need, we should try in 
psychotherapy to help the person develop more 
goal-directed behavior and develop behavior con- 
sonant with his needs: for example, we should help 
him to acquire habits of aiming at goals, habits 
of evaluating his actions, habits of ascertaining 
whether any need is indeed being met by his be- 
havior [see MENTAL DISORDERS, TREATMENT OF, 
article on CLIENT-CENTERED COUNSELING]. 

Further, in light of Guthrie’s theory, instead of 
concentrating primarily on the circumstances un- 
der which the unwanted behavior or feelings occur, 
we should search also for the circumstances under 
which the individual currently does make valued 
responses. The latter offer leads concerning the
stimuli which already are cues for valued responses
and thereby offer a basis on which to build.
Subsequently, when confronted with unwanted 
behavior, the appropriate question is not “Why?” 
but “What are the circumstances eliciting that un- 
wanted response?” and “What stimuli are cues for 
it?” To get a different response cued to those stim- 
uli, one should not present all at once many of 
the stimuli that are cues for the unwanted re- 
sponse. (That is, one should not, for example, put 
the individual back in the original situation.) This 
would make the unwanted response very likely, 
according to the probability principle. Rather, we 
should present only a few of those stimuli that now 
are cues for the unwanted response while simul- 
taneously presenting many stimuli that are cues 
for a desired response. When the new response 
occurs, a few more of the stimuli may be intro- 
duced that are cues for the unwanted response. 
Thus, gradually, we could detach from unwanted 
behavior more and more of the stimuli formerly 
cues for unwanted behavior and establish them as 
new cues for the new response. On this installment
plan, unlearning of the old and learning of the
new proceeds most efficaciously. 
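
The installment plan described above can be sketched as a loop. This is a deliberately crude, deterministic toy model and not a clinical procedure: the function name `installment_plan`, the `step` parameter, and the rule that the response with more cues present always occurs are assumptions introduced for illustration.

```python
def installment_plan(unwanted_cues, desired_cues, step=1):
    """Re-cue unwanted stimuli a few at a time (deterministic toy model)."""
    unwanted, desired = set(unwanted_cues), set(desired_cues)
    sessions = 0
    while unwanted:
        # Present only a few unwanted cues alongside all the desired-response cues.
        batch = set(sorted(unwanted)[:step])
        # Assumed reading of the probability principle: the response with
        # more cues present is the one that occurs.
        if len(desired) <= len(batch):
            raise ValueError("too many unwanted cues presented at once")
        # The new response occurs, so the presented stimuli are re-cued to it.
        unwanted -= batch
        desired |= batch
        sessions += 1
    return desired, sessions


desired, sessions = installment_plan({"u1", "u2", "u3"}, {"d1", "d2"}, step=1)
print(sorted(desired), sessions)
```

Presenting all the unwanted cues at once (a large `step` against few desired cues) raises an error in this model, mirroring the warning against putting the individual straight back into the original situation.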


One of the most brilliant and kindly of men, 
Guthrie has given us a rich heritage through his 
teaching, articles, and books. He worked with sustained
endeavor on an extraordinary array of topics.
His writings, remarkably illuminating, thoughtful,
and thought-provoking, reflect his high good
humor, courage, and great concern for his fellow 
men. 

VIRGINIA VOEKS 


[For the historical context of Guthrie’s work, see the 
biographies of BEKHTEREV; KÖHLER; PAVLOV.] 

HADDON, ALFRED CORT


Alfred Cort Haddon (1855-1940), “the father of 
Cambridge anthropology,” was the son of strictly 
Nonconformist parents living on the outskirts of 
London. His father, John, was more successful as
a Baptist deacon and Sunday-school teacher than 
in business, and the printing works that he had 
inherited produced dwindling profits. Haddon’s 
mother, Caroline Waterman, eked out their scanty 
income by writing children’s books, mostly religious, 
but, significantly, her best seller was Look, Listen 
and Learn, an introduction to nature study based 
on walks with her children. Her son dedicated his 
Head-hunters “to my Mother who first taught me 
to observe.” 

After a scrappy schooling Haddon entered the 
family business, “duty and drudgery,” as he called 
it. There, however unwillingly, he learned some 
useful lessons: careful and exact lettering and 
drawing, printing technique, proofreading, and busi- 
ness methods. But his whole heart was devoted to 
the study of plants and animals, and his spare time 
was spent in collecting, drawing, dissecting, bot- 
tling, and noting in his diary everything he could 
find, from squashed hedgehogs and unwanted kit- 
tens to water fleas and animalcula. 

“Duty and drudgery” came to an end in 1875, 
when his father decided that the young scientist 
was a bungling failure in the office and that he 
would lose less money by sending him to the university.
Haddon was enrolled at Christ's College,
Cambridge. He won a scholarship and did brilliantly
in his tripos. He was nominated to the
zoological station at Naples and in 1880 was ap- 
pointed professor of zoology at Dublin. 

Haddon’s marriage the following year to Fanny 
Rose was the climax of a college joke that the par- 
ticipants never regretted. He and his friend Holland 
Rose had arranged to exchange sisters for the May 
Week festivities of 1877. Rose married Laura Had- 
don in 1880, and at their wedding Haddon an- 
nounced his engagement to Fanny Rose. 

Armchair science never satisfied him, and in 
1888, “tired of lecturing about things I have never 
seen,” he planned his first visit to the Torres Strait, 
a turning point in his life and in the history of 
anthropology. His carefully kept journal shows how 
his interest in coral reefs was gradually eclipsed 
by interest in the natives. He realized that corals 
could wait but native culture was disintegrating so 
rapidly with the impact of civilization that the 
saving of vanishing data was an urgent need. 

When he left the islands, in 1889, he was already 
planning the Cambridge Anthropological Expedi- 
tion to the Torres Strait, a landmark in anthropo- 
logical research. It took him almost ten years to 
collect the men and the money. He felt a thorough 
record of native life needed a team of specialists 
that included not only those equipped to study 
physical characters, arts and crafts, music, lan- 
guage, religion, and folklore but also, for the first 
time, a psychologist who might interpret native 
thought. In the end there were two psychologists, 
W. H. R. Rivers, a lecturer in the medical school, 
and his pupil C. S. Myers, who was a musician as 
well; from the medical school also came William 
McDougall and C. G. Seligman. All these names 
later became famous in the social sciences. The
team was completed by S. H. Ray, an elementary- 
school teacher with an uncanny knowledge of Mel- 
anesian languages, and Anthony Wilkin, a capable 
photographer interested in material culture. Their 
work is enshrined both in the six volumes of the 
Reports (1901-1935) and, more popularly, in Head- 
hunters (1901). 

The expedition was a tremendous success, mainly 
because of Haddon’s foresighted organization and 
the affectionate welcome he received from the 
islanders. Its members hoped that on its return it 
would be crowned by some recognition from the 
university. But Haddon’s only reward was a lecture- 
ship in ethnology at £50 a year. He had never been 
interested in money and was used to a frugal life, 
but with a wife and three children to support he 
had to augment his meager income by lecturing 
all over the country and writing articles and re- 
views at the same time he was writing and editing 
the Reports. 

Americans were more appreciative than “poor 
blind Cambridge,” as Haddon’s own university was 
described. G. A. Dorsey of the Field Columbian 
Museum invited him to lecture at Chicago and else- 
where in 1900, offering twice as much as Cam- 
bridge paid him in a year. And when, later, at the 
invitation of W. Z. Ripley, Haddon gave the Lowell 
lectures at Boston, he calculated gleefully that he 
was getting two dollars a minute. He would return 
to England from these visits enriched and encour- 
aged, only to be depressed again by the apathy and 
financial anxieties at home. It was not until 1904
that a readership at £200 a year was established, 
and Haddon assumed this post and undertook all 
the lecturing it entailed, pursuing his duties with 
unflagging energy and enthusiasm until his resig- 
nation, in 1925. 

When Haddon first came to Cambridge, in 1875, 
the science of anthropology was nonexistent. When 
he died, in 1940, it had been raised, mainly by his 
selfless devotion, to an honored position in the 
university. His influence was due primarily to his 
personality. He was not a good lecturer, his thoughts 
dashing ahead of his hesitant speech. He lacked 
social graces, retaining to the end of his life some- 
thing of the gaucherie of a schoolboy. At the same 
time that his Nonconformist conscience never for- 
sook him, he was intolerant of cramping conven- 
tions, ignored creeds and sects, and respected only 
those aspects of religion, civilized or uncivilized, 
that were heartfelt and genuine. He made friends 
with high and low. He was a welcome associate of 
academic dignitaries (although barely polite to the 
pompous or pretentious) and was always at his best
with those in the humbler ranks: illiterate Irish
peasants, fishermen, and his beloved “savages.” His 
students were among his most devoted admirers, 
and they have maintained his high standards of 
honest work in the study of mankind. 

A. H. QUIGGIN 


[For discussion of the subsequent development of Had- 
don’s work, see OCEANIAN SOCIETY and the biogra- 
phies of RADCLIFFE-BROWN; RIVERS; SELIGMAN, 
C. G.] 

HALBWACHS, MAURICE


Maurice Halbwachs (1877-1945), a sensitive 
and humane scholar whose works are among the 
most important in the sociology of the first half of 
the twentieth century, was arrested by the Gestapo 
in Paris in July 1944 and died about eight months 
later in the Buchenwald concentration camp. 

Halbwachs was born in Rheims. For three years 
one of his teachers at the Lycée Henri IV in Paris
was Henri Bergson, who influenced him greatly. He 
completed his studies at the Ecole Normale Su- 
périeure with a well-written book on Leibniz (1907) 
and participated in the preparation of the Catalogue
des manuscrits leibniziens. Soon, however, the social
sciences attracted his interest. He worked with
Simiand, Lévy-Bruhl, and especially Durkheim and 
helped edit L’année sociologique. His dissertation, 
La classe ouvrière et les niveaux de vie (1913a),
marked the beginning of his extensive studies in 
social psychology. 

This study is characteristic of Halbwachs’ work. 
It reveals his sense for the concrete and his con- 
cern for getting down to reality. Only after immers- 
ing himself in concrete data did he develop general 
ideas, working hypotheses, theories, and laws. 
Moreover, Halbwachs made specific methodological 
and theoretical contributions in this study. By dis- 
covering what the group accepted as the proper or- 
der of importance of expenditures, he established 
a scale of working-class needs. He revealed the pri- 
mary importance in these choices of the group's 
representation and of what is consistent with it. 
Clearly, Halbwachs was here applying Durkheim’s 
theory of collective representations. 

This scientific study of the scale of working-class 
needs was supported by a theory of the worker's 
position in modern capitalist societies, a theory that 
now seems questionable at certain points. Accord- 
ing to Halbwachs, these societies assign to the 
working class the task of mastering matter, and 
since the members of this class are thereby limited 
to contact with things, they become isolated from 
the rest of mankind. In their isolation as a group 
from other groups in society, the solidarity among 
the members of the working class becomes en- 
hanced, and it is only by virtue of their class con- 
sciousness that they remain attached to the larger 
society at all. 

A later work, L’évolution des besoins dans les 
classes ouvrières (1933), applied the principal con- 
clusions of the 1913 thesis to those social groups 
in the great industrial nations of the West that oc- 
cupy positions analogous to that of the older work- 
ing class. 

Georges Gurvitch raised some questions about 
Halbwachs’ criteria of social-class membership, but 
he gave him credit for having been clearly aware 
of the great complexity of social class phenomena 
and for having kept his concepts entirely separate 
from all philosophy of history and all political po- 
sitions (Gurvitch 1950). 

Halbwachs displayed a distinct tendency to ex- 
plain needs by relative and subjective factors. He 
admitted that trends of opinion are important 
factors in defining needs, but he did not consider 
the extent to which objective elements also enter 
into the subjective experience of need. For example, 
a real lack of food or clothing is felt by the workers 
when they compare their standard of living with
that of the middle class, or the bourgeoisie. 

Halbwachs’ influence as a pioneer in research on 
the definition of needs, as well as on budgets and 
consumption patterns, has been stressed by P. 
Chombart de Lauwe in his book La vie quotidienne 
des familles ouvrières (1956). Chombart de Lauwe
also points out that Halbwachs was the first to make 
a genuinely sociological study of the development 
of workers’ budgets. 

Halbwachs’ study Les causes du suicide (1930) 
extends and refines Durkheim's classic work on the 
same subject. For Durkheim, religious factors are 
most important in determining the extent of an in- 
dividual’s integration into a society (and he related 
a lack of integration to a high rate of suicide). 
Halbwachs felt that the nature and importance of 
the religious factor vary with the social and psy- 
chological context and that this context, in turn, 
varies in different countries. 

He was interested in the influence of collective 
memory and tradition on beliefs, and his essay La 
topographie légendaire des évangiles en terre sainte 
(1941) was a specific study of this influence. Thus, 
in various periods of history, the appearance attrib- 
uted to the holy places has changed according to 
the needs and hopes of the Christian groups de- 
scribing these places. 

In his recent book on social psychology, Jean 
Stoetzel (1963) shows how Halbwachs influenced 
the study of memory by distinguishing between in- 
dividual memory, i.e., the present knowledge of the 
past, and collective memory and by seeking the 
patterns governing this collective, or group, mem- 
ory. Halbwachs’ contributions in this area are only 
now beginning to be appreciated outside France. 

During the last years of his life, Halbwachs be- 
came increasingly preoccupied with social morphol- 
ogy and demography. He contributed an important 
article on demographical statistics to the Encyclopédie
française. He also developed the concept of
social morphology that appears in a rudimentary 
and rather abstract form in Durkheim’s work. Halb- 
wachs was frequently able to discern the social and 
economic structure behind morphological data, es- 
pecially the influence of social classes. In his Psy- 
chology of Social Class (1938), he examined moti- 
vation among the peasant classes, the bourgeoisie, 
industrial workers, and the lower middle classes. 

It is possible here only to suggest the importance 
of the results of Halbwachs’ studies and the orig- 
inality of his methods. He invariably looked at real- 
ity from many different perspectives. There is no 
rigidity or dogmatism in any of his work. He culled 
important elements from several intellectual traditions:
he was inspired by Durkheim's vigorous mind
and was concerned to use such new scientific ap- 
proaches as those of American sociology, but he 
also incorporated Marxist sociology in certain broad 
orientations of his work. 

GEORGES FRIEDMANN 


[For the historical context of Halbwachs’ work, see the 
biographies of DURKHEIM; LE PLAY; LÉVY-BRUHL;
SIMIAND. For discussion of the subsequent development
of Halbwachs’ ideas, see SUICIDE, article on
SOCIAL ASPECTS.] 

HALE, HORATIO


Horatio Emmons Hale (1817-1896), American 
ethnologist, was born at Newport, New Hampshire. 
His mother was Sarah Josepha Hale, for many 
years editor of Godey’s Lady’s Book. While Hale was 
still a student at Harvard, which he entered in 
1833, he made a study of the language of an
Algonquian-speaking band encamped on the college
grounds; he himself set into type and published 
the results of the study. This early attempt at lin- 
guistic research within an ethnological context set 
the tone for all his later work in the emerging 
anthropology of the nineteenth century: through- 
out a long life of periodic, although always enthu- 
siastic, involvement in anthropological research, 
he continued to stress language as the “true basis 
of ethnology.” 

Upon his graduation from Harvard, Hale ac- 
cepted a position as philologist with a United States 
expedition to survey and chart areas of the Pacific, 
under the command of Captain Charles Wilkes. 
Hale’s senior colleague in the scientific corps of this 
first research expedition sponsored by the United 
States government was Charles Pickering, who with 
du Ponceau and Gallatin was a leader among those 
early nineteenth-century ethnologists who stressed 
the collection of comparative linguistic data from 
the human societies recently discovered and con- 
stantly threatened by western expansion. When the 
expedition stopped at the Oregon Territory, toward 
the end of its four-year voyage in the Pacific, Hale 
left it to devote more time to the study of the 
languages of the Indians of the Northwest Terri- 
tory, after which he returned overland to the East. 
Hale’s contribution to the Reports of the Expedi- 
tion was published in 1846 as Volume 6, Ethnog- 
raphy and Philology. Its primary emphasis was 
upon his collection from the southern Pacific, and 
although it contained a mass of miscellaneous and 
often superficial ethnographic observations, Hale’s 
linguistic data, especially those from the Polynesian 
islands, were carefully recorded and are still useful. 
It was upon this linguistic evidence that Hale built 
his theory of Polynesian migrations, which, except 
for refinements derived from more recent data, is 
still essentially sound. 

After the Wilkes expedition and Gallatin’s death 
in 1849, the linguistic emphasis of American eth- 
nology gave way to the rapid rise of archeological 
interests. About this time, also, Hale married and 
moved to Clinton, Ontario, where for the next 
twenty years he devoted himself to business ac- 
tivities. Yet, he was to resume his ethnological in- 
vestigations, stimulated by the proximity of the Six 
Nations Reserve, on the Grand River at Brantford 
and probably also by a developing friendship with 
Lewis H. Morgan. Toward the end of the 1860s he 
began the collection of traditional literature from 
older informants on the reserve. These form the 
basis of his most important substantive contribu- 
tions to the anthropological literature and particu- 
larly to that of Iroquoian studies. 


His interest in language as the foremost indi- 
cator of ethnological status led him to concentrate 
upon that aspect of Iroquois culture. And it was 
his continuing concern for language that led Hale 
in all his work to stress the distinction between 
race, defined biologically, and language and to 
stress the superiority of language for establishing 
the historical (i.e., ethnological) relations of exist- 
ing groups. More particularly, he defined Tutelo as 
a Siouan language within the Iroquoian geograph- 
ical range; he rescued Wyandot Huron from dis- 
appearance and demonstrated that it had been the 
most ancient Iroquoian language; using a technique 
which anticipated more recent glottochronological 
methods, he attempted an arrangement of existing 
Iroquoian languages into a historical sequence; and 
perhaps most important of all for later ethnological 
methods, he used the traditional tales of the Iro- 
quois as a source for ethnological data. 

In his most significant work, The Iroquois Book 
of Rites (1883a), he summarized much of his work 
on the Iroquois. Here, using his several approaches, 
he reconstructed the late prehistory of the Six 
Nations, with a particular emphasis, drawn from 
the traditions he had recovered, upon the story of 
the legendary hero Hiawatha (which he rescued 
from the romanticism and errors of H. R. School- 
craft) and the formation of the League of the 
Iroquois. Hale’s published contributions to the eth- 
nological literature are relatively few, but all of 
them are graced with his intelligence and original- 
ity, with his theoretical bent and his humanism. 

Hale's greatest contribution to American anthro- 
pology, however, lies in his influence upon Franz 
Boas, much of whose early field work on the North- 
west Coast was done under Hale’s supervision and 
direction. Although Boas had spent a productive 
season among the Northwest Coast Indians in 
1886, his subsequent field work in this area during 
the early 1890s was subsidized by a special com- 
mittee of the British Association for the Advance- 
ment of Science, of which Hale was the driving 
force. It was Hale who recruited Boas for this work; 
and although differences of approach to field prob- 
lems developed, Hale was Boas’ constant and en- 
thusiastic supporter. Correspondence between the 
two men indicates that Hale not only provided Boas 
with the material support required but was also 
the source of much in Boas’ emerging point of 
view toward the anthropological enterprise, a point 
of view which was to form the foundation of Amer- 
ican anthropology for half a century. Hale trans- 
mitted to Boas the intensity of his feeling for the 
diversity of culture and language and his distaste 
for the construction of universal historical systems; 
he provided Boas with the technique for the recap- 
ture of lost elements of culture through the collec- 
tion and analysis of myths and legends; he per- 
suaded Boas of the insufficiency of biological 
criteria for the establishment of a classification of 
the varieties of man and for the evaluation of his 
varying capacities; and by his insistence that Boas 
collect all the data relating to the differences be- 
tween human groups, Hale laid the foundation in 
a field approach for the concept of a “general an- 
thropology,” which came to be the hallmark of 
the work of Boas and his followers. 

Horatio Hale died in 1896. He was mourned by 
Boas as “a man who contributed more to our knowl- 
edge of the human races than perhaps any other 
single student.” Sufficient praise indeed. 


JACOB W. GRUBER 


[For discussion of the subsequent development of 
Hale's ideas, see INDIANS, NORTH AMERICAN and the 
biography of BOAS.] 
HALEVY, ELIE 


Elie Halévy (1870-1937), French historian, pro- 
duced a monumental history of England and also 
wrote about socialism and other modern political 
ideas. He was born into a family known for its 
contributions to the arts. Its founder, a Bavarian 
cantor, came to Paris at the end of the eighteenth 
century. He believed in the possibility of combining 
the moral principles of Judaism with the political 
program of the French Revolution, which had 
emancipated his people from their ancient disabil- 
ities. This faith found its way into the title of the 
review L'Israélite français, which he founded dur- 
ing the Bourbon restoration. His two sons were 
soon assimilated into Parisian intellectual and 
musical circles. Jacques Fromental became a 
prominent composer, professor at the Conservatory, 
the first Jewish member and, ultimately, the secre- 
tary of the Academy of Fine Arts. His wife was a 
member of the Rodrigues family. Like so many 
other Saint-Simonians, members of this family 
came to be bankers and entrepreneurs during the 
Second Empire. Fromental’s daughter Geneviève 
married Bizet. She was a friend of Proust and was 
among his models for the Duchesse de Guermantes. 

Léon Halévy, Fromental’s brother, succeeded 
Comte as Saint-Simon’s secretary. Léon himself 
wrote extensively, but he was most notable for 
having two sons who were elected to the Académie 
Française. Lucien Anatole Prévost-Paradol, an illegitimate
son, was a brilliant essayist and a critic 
of the Second Empire. His ideas greatly influenced 
the authors of the Third Republic’s constitution. 
Léon’s other son, Ludovic Halévy, was a prolific 
and successful author as well as a librettist for 
Offenbach and Bizet. Although brought up as a 
Catholic, his mother’s faith, Ludovic married Louise 
Breguet, whose Protestant family of artisan in- 
ventors had gone from watchmaking to the position 
of France's leading constructors of precision in- 
struments. Breguets were among the pioneers of 
telegraphy and aviation. 

Ludovic's sons, Elie and Daniel, were raised as 
Protestants. Their home was the center of a bril- 
liant set, which included their uncle, Marcellin 
Berthelot, the great chemist, as well as the painter 
Degas. Both sons were ardent supporters of Drey- 
fus and helped organize the famous “Manifesto of 
the Intellectuals.” Daniel, who collaborated with 
Péguy on the Cahiers de la quinzaine, later sup- 
ported Georges Sorel, contributed to the foundation 
of L'Humanité and the universités populaires (what 
the English call workingmen’s colleges). His name 
was prominent for more than half a century as 
historian, journalist, and editor. 

Elie was more ascetic and brilliant, placing first 
in the national philosophy examination he took 
before entering the Ecole Normale Supérieure. 
There philosophy continued to be his main con- 
cern, as it was of such life-long friends as Xavier 
Léon, Alain [E. Chartier], Celestin Bouglé, Léon 
Brunschvicg, and Dominique Parodi. Committed to 
a secular and rationalist individualism, passionately 
concerned to unite theory and practice, Elie Halévy 
helped found the Revue de métaphysique et de 
morale and the Société Française de Philosophie, 
associations he maintained until his death. Like 
Max Weber and Durkheim, Halévy gained much 
from participation in a great journal that brought 
him into contact with the pivotal works and au- 
thors of his time. Even after he turned from philos- 
ophy to the history of ideas and thence to general 
history, his analytical method still marked every- 
thing he wrote. 

In his first book, La théorie platonicienne des 
sciences (1896), Halévy carefully demonstrated 
Plato's apparent contradictions before establishing 
their ultimate congruity. This way of proceeding 
stemmed from Halévy’s belief that if “one has the 
courage to go to the very end of a ‘negative dialec- 
tic,’ he then perceives that such criticism prepares 
the way for a ‘progressive dialectic.’ By distinguish- 
ing levels, this positive dialectic leads to the discern- 
ment of values in terms of a conception of the 
world and of life that is at once hierarchical and 
constructive, that not only founds and justifies but 
also fulfils the no less essential function of con- 
demning what does not meet its standards” (Brun- 
schvicg 1937, p. 680). Plato served as an example 
of a thinker who had made his way from criticism 
of others to positive construction of his own. His 
standards were rational and self-conscious; he used 
them to pass judgment on the institutions of his 
society. 

As a historian of ideas, Halévy made extensive 
use of “negative dialectic” as a means of penetrat- 
ing to the real, if often contradictory, values and 
motives underlying a school of thought. In addition, 
he sought to chart and explain the discrepant actual 
effects of a doctrine. In The Growth of Philosophic 
Radicalism (1901-1904), Halévy used his method 
to order the results of his extraordinarily thorough 
research on Benthamism. Its political and eco- 
nomic aspects could not, he thought, be logically 
reconciled. In his political theory, Bentham as- 
sumed that only by a legislator can conflicts of 
interest be reconciled in such a way as to secure 
the greatest good of the greatest number; in his 
economic theory, he assumed that individual inter- 
ests are harmonized naturally by some sort of 
invisible hand. Thus, utilitarianism could produce 
either faith in bureaucratic paternalism based on 
legislative intervention or belief in anarchical in- 
dividualism and laissez-faire. Both conclusions were 
in fact drawn by Bentham’s disciples; both had 
important effects upon political theory and public 
policy. 

From this model work in intellectual history, 
Halévy moved to a study of England that owed 
much to his teaching responsibilities at the Ecole 
Libre des Sciences Politiques. The school was estab- 
lished after the defeat of 1871 and the Commune. 
Its founders sought to discover the sources of 
France's political instability as well as her failure 
to develop an elite and institutions such as those 
of England, the pre-eminent example of a nation 
that combined national power with liberty. Halévy 
taught two courses at the “Sciences Po” alternately 
for over forty years: English history and European 
socialism. The first was the subject, the second the 
polemical target, of his life's work, A History of 
the English People in the Nineteenth Century 
(1912-1932). In the first volume, England in 
1815, he isolated his dominant theoretical concern: 
“Why [is] it that of all the countries of Europe 
England has been the most free from revolutions, 
violent crises, and sudden changes?” (ibid., vol. 1, 
p. 424). His answer was a sustained effort to 
demonstrate the inadequacy of Marxism as a mo- 
nistic theory of historical explanation. But like Max 
Weber, Halévy respected Marx enough to accept 
his formulation of the problem: England, the first 
country to be dominated by the capitalist mode of 
production, should have been the first to experience 
a revolution produced by the internal contradictions 
of that economic system. It did not. What could 
explain this? Again like Weber, Halévy did not 
wish simply to replace Marxism by an idealist 
hypothesis asserting that ideas and beliefs cause 
events. Rather, he wished to show that this was an 
instance, and a crucial one, in which religion had 
exercised a force independent of the means of 
production. Weber emphasized the effects of reli- 
gious movements upon economic activity; Halévy, 
their political consequences. 

The decisive difference between British and 
French development stemmed from indirect and 
unanticipated consequences of the Methodist re- 
vival. In France religion had been identified with 
an established church, controlled by the same class 
that dominated the state. Consequently, radicals 
were forced to attack both institutions simulta- 
neously. But in England, Wesleyanism, rejected by 
the Church of England, became associated with the 
Nonconformist tradition of the free churches and 
contributed to their reinvigoration. Thus, Non- 
conformity could act as a social control, disposing 
radicals to think in terms of piecemeal reforms. 
Because in every class Methodism had made con- 
verts who shared its philanthropic values, channels 
were created for communicating the grievances of 
the lower classes. Reforms from above moderated 
injustices unacknowledged by the Continental bour- 
geoisie. 

Working-class leadership was likewise affected. 
The sects supplied many causes with men from 
the working class, who had learned organizational 
techniques and means of influencing opinion from 
their chapels and classes. Yet the teaching of these 
same sects imposed internal restraints upon any 
tendency to resort to violence. Thus, those mem- 
bers of the working class who led their fellows 
differed from the middle-class and atheist revolu- 
tionary elite on the Continent. Orderly change 
focused on specific measures, and abuses were 
attacked as violations of Christian morality. These 
elements of political style were discovered rather 
than invented by the Independent Labour party. 

In seeking in the sociology of religion the expla- 
nation for British stability, Halévy did not perform 
a speculative leap. Before discounting the signifi- 
cance of explanations based on political institu- 
tions on the one hand and economic forces on the 
other, he subjected the evidence for both to the 
rigorous “negative dialectic” essential to his method. 
It was only after this that he passed to his own 
positive theory based on inquiry into “beliefs, emo- 
tions, and opinions, as well as [into] the institutions 
and sects in which these take a form suitable for 
scientific inquiry” (ibid., vol. 1, p. 383). 

This hypothesis remained at the center of the 
following volumes in Halévy’s History, although as 
he approached his own time, he became more 
narrative and covered subjects essential to general 
history. Imperialism and the Rise of Labour and 
The Rule of Democracy, which took his account 
up to 1914, are more remarkable for their thorough- 
ness and objectivity than for the analytical and 
sociological qualities that characterized England in 
1815. 

As his work progressed, Halévy became one of 
the intellectual symbols of the entente cordiale. 
Sequestered and indefatigable in France, he was 
more sociable during his frequent visits to Eng- 
land, where he spent several months every year. 
Accompanied by his wife, who aided him in his 
research, he mingled freely with politicians of all 
parties, as well as with his university friends. The 
Webbs, Bertrand Russell, H. A. L. Fisher, Sir Ernest 
Barker, G. P. Gooch, and Graham Wallas were 
among his oldest and best friends. During World 
War I he voluntarily served in the medical service. 
When fighting ended, he was offered an important 
position in the League of Nations Secretariat. This 
he refused, as he did the offer of a chair at the 
Sorbonne, so that he could devote himself to the 
writing of his life work. Although he never ac- 
cepted any of the French decorations offered him, 
he was touched by the honorary D.PHIL. awarded 
him by Oxford. Halévy played an important part 
in the commission charged with the publication of 
official French documents on the origins of World 
War I. 

Two other volumes of Halévy’s work were not 
published until after his death. The Histoire du 
socialisme européen (1948) consists largely of stu- 
dents’ notes from his lectures. Halévy had planned 
to write such a book but died before he could do 
so. More valuable is the collection called L’ère des 
tyrannies (1938). The title address, given in 1936, 
summed up his views not only of socialism but also 
of the regimes of Stalin, Hitler, and Mussolini. 
Halévy was among the first to assert that commu- 
nism and fascism, as actually practiced, shared 
crucial characteristics: 


In both cases the country is governed by an armed 
sect, which claims to rule in the interests of the com- 
munity, and which is able to impose its will because 
all of its members are inspired by a common faith. 
(Ibid., p. 226) 

On the one side and the other, the same mixture of 
a proletarian ideology with militarism. Labor camps. 
Labor front. Battles for this or that. As for the regime 
itself, it can be defined only as a permanent state of 
siege under the control of para-military formations 
united by a common faith. (Ibid., p. 245) 


Because such regimes constitute a durable form of 
rule rather than a temporary expedient, Halévy 
called them “tyrannies” in preference to “dictator- 
ships.” Despite his use of classical terms, he con- 
sidered such governments to be unprecedented. 
Their rulers exercised political and intellectual con- 
trols never before available. As for their origin, he 
attributed such tyrannies less to ideas than to cer- 
tain means adopted by the belligerents during 
World War I: central economic planning, nationalization,
use of union leaders to eliminate strikes, 
“organization of enthusiasm” by propaganda, and 
suppression of all opinions judged by the regime 
as adverse to the national interest. 

In The World Crisis of 1914-1918 (1930), 
Halévy examined the concepts of “revolution” and 
“war” to show their interrelation and cumulative 
effects during that period. Previously, historians 
had treated these two subjects in isolation from 
each other. Halévy found the causes of the war 
and the Russian Revolution not in decisions con- 
sciously taken by statesmen but in collective and 
anonymous forces, such as the beliefs in national 
self-determination and the reality of class struggle. 
As in his History, he turned to collective sentiments 
for explanation. He concluded that the waste of 
lives and resources in twentieth-century wars and 
revolutions was due to what men thought and believed.
Only by transforming their ideas and emotions
can international peace be attained, only by 
substituting compromise for fanaticism can there 
be an end to violence. No one knew better than 
Halévy how unlikely it was that such advice would 
be accepted. He died in 1937, believing that war 
was inevitable and that it would perpetuate tyranny 
throughout Europe. 

MELVIN RICHTER 


[Directly related are the entries LIBERALISM; RADICAL- 
ISM; UTILITARIANISM. Other relevant material may 
be found in ANGLO-AMERICAN SOCIETY; DICTATORSHIP;
HISTORY, article on SOCIAL HISTORY; and in 
the biographies of BENTHAM; DURKHEIM; MARX; 
WEBER, MAX.] 
HALL, G. STANLEY 


G. Stanley Hall (1844-1924), American psychol- 
ogist and educator, was born in the rural hamlet 
of Ashfield, Massachusetts. In 1863 he enrolled at 
Williams College, where he studied with Mark 
Hopkins. After graduation, although he was with- 
out a strong sense of vocation, he enrolled at the 
Union Theological Seminary in New York City. He 
spent a year there, more intent on absorbing the 
various facets of city life than on theological study. 
He then sought and received from Henry Ward 
Beecher a loan of $500 for foreign study. He went 
to Bonn, where he concentrated on philosophy and 
theology, but afterward, at Berlin, his interests 
broadened to include physiology, physics, and work 
in a clinic for mental diseases. He also enjoyed 
the lighter side of Berlin, its beer gardens and 
theaters. In 1871, heavily in debt and without a 
degree, he returned home. Failing to receive an 
appointment at a midwestern university, he ac- 
cepted a position as tutor to the children of a New 
York City banker and remained for more than a 
year in this post. 

A charming, enthusiastic person, Hall had a 
lively interest in many of the intellectual issues and 
writers of the day, from associationism to evolu- 
tionism, and from John Stuart Mill to Thomas 
Carlyle. His religious leanings, which had never 
been strong, disappeared and were replaced by a 
mild skepticism. University teaching became his 
professional goal. 

He was offered a post at Antioch College, the 
midwestern outpost of Unitarianism, where for 
three years he taught English literature, French 
and German language and literature, and philoso- 
phy, as well as serving as librarian, choirmaster, 
and even occasionally as preacher. He read Wil- 
helm Wundt’s Principles of Physiological Psychol- 
ogy when it first appeared and immediately decided 
to return to Germany to study psychology. But he 
got only as far as Cambridge, Massachusetts, when 
an offer of an instructorship in English at Harvard 
diverted him from his plan. Although the grading 
of sophomore English themes consumed much 
of his time, he also carried on research in the 
Medical School laboratory of Henry P. Bowditch. 
This physiological research and some study with 
William James were accepted by the department of 
philosophy as fulfilling the requirements of a PH.D. 
degree in psychology. Hall was the first in the 
United States to receive this degree. 

Immediately thereafter he left for Europe. He 
did work in physiology at Berlin and then went to 
Leipzig where he was the first of Wundt’s many 
American students. Yet the enthusiasm engendered 
by his reading of Wundt's work did not survive di- 
rect contact: although he attended Wundt's lectures 
and served as a subject in his psychological experi- 
ments, he apparently did not carry on any research 
of his own. 

On returning to Boston he was without a job or 
even any prospects of one. A providential offer 
from President Eliot of Harvard to sponsor him in 
a series of extension lectures on education saved 
the day financially and, what is more important, 
brought him to the attention of Daniel C. Gilman, 
president of the then recently founded Johns 
Hopkins University, an institution already cele- 
brated for its pioneer graduate program, organized 
after the plan of the German universities. In 1882 
Hall arrived in Baltimore, and in 1884 he was 
made professor of psychology and pedagogics. 

To further scientific psychology, in 1883 Hall 
set up a laboratory in a private house adjacent to 
the campus. The following year he opened the first 
university psychology laboratory to win formal ac- 
ceptance in the United States. While at Johns 
Hopkins he and his students published papers on 
optical illusions, skin sensitivity, the perception of 
space, and the time sense. John Dewey completed 
a dissertation on the psychology of Kant, William 
Burnham published on memory, and James Mc- 
Keen Cattell (1890) wrote papers on reaction time 
and speed of association, as well as his famous 
“Mental Tests and Measurements,” which intro- 
duced the term mental test. 

Hall had for some time dreamed of the possi- 
bility of establishing a journal devoted to psychol- 
ogy. One day a stranger walked into his office and 
offered him $500 to help found a journal. Although 
the gift was apparently based on a misunderstanding (the
benefactor discontinued his subscription after the
second year, when he discovered that the journal did
not intend to foster psychic research, that is,
spiritualism), it did result in the appearance, in 
1887, of the American Journal of Psychology, the 
first psychological journal in this country. 

Meanwhile, Hall was preparing to leave for Clark 
University in Worcester, Massachusetts, to become 
its first president. The school was being financed 
by Jonas G. Clark, a wealthy, retired merchant who 
wished to endow an educational institution in his 
home town. After what eventually turned out to 
be too sketchy an understanding with Clark, Hall 
departed for a tour of European educational 
centers to inspect possible models for the new uni- 
versity and to find likely prospects for professional 
appointments (Rush 1948). He had visions of a 
graduate school modeled on the German univer- 
sities and perhaps even superior to Johns Hopkins. 
Research was to be paramount, teaching secondary. 

In 1889 Clark University opened its doors with 
only a few departments staffed and with no pre- 
tense at covering all academic fields. Mr. Clark, a 
reticent man, had different ideas from those of 
the eager Hall and proved to be secretive about 
his financial plans for the university. The amount 
of money he advanced was small, very much 
smaller than Hall had been led to expect. Hall did 
not disclose this discrepancy to the faculty, who 
blamed him for the less than expected equipment, 
salaries, and assistantships. “Calls” from the Uni- 
versity of Chicago, then being founded—and sup- 
ported by the Rockefeller millions—decimated the 
Clark faculty. Over the next few years, those who 
remained made the necessary adjustments, some 
more money was raised, and toward the end of the 
century, the bulk of the Clark estate did come to 
the university. It was divided among the library, 
the graduate school, and a new undergraduate college.
Although Clark’s will stipulated that Hall was to 
have no connection with the college, he continued 
as president. 

Hall, fortunately, had appointed himself profes- 
sor of psychology and all through these hectic 
years taught in the graduate school; it was his most 
brilliant and productive period. From Johns Hop- 
kins he had brought along Edmund C. Sanford to 
head the laboratory and W. H. Burnham to teach 
pedagogy, which was interpreted to mean educa- 
tional psychology and mental hygiene. Adolf Meyer, 
then with the Worcester State Hospital and destined 
to become the leading psychiatrist of his day, gave 
lectures in abnormal psychology. 

Hall was an inspiring teacher to a majority of 
the first generation of American-trained psychol- 
ogists. By 1893, 11 of 14 and by 1898, 30 of 54 of 
the PH.D.s granted in psychology were awarded to 
his students (Harper 1949). Among them were 
Lewis M. Terman, who developed the Stanford- 
Binet Scales of Intelligence, Henry H. Goddard, 
who was a pioneer student of the mentally re- 
tarded, and Arnold Gesell, who did so much pains- 
taking research on the mental and physical de- 
velopment of children. 

Their work indicated that Hall’s own interest 
had shifted from experimental to developmental 
psychology, although he continued eloquently to 
defend laboratory work of all sorts. While at Hop- 
kins he had published a paper entitled “Content of 
Children’s Minds” (1883). With the aid of a questionnaire
technique, which he was the first to 
apply in the United States, he unearthed a variety 
of information about the thinking of children. He 
questioned them about such matters as the sense 
of self, religious experience, fears, and favorite 
foods. This approach generated great public en- 
thusiasm and before long led to the so-called child 
study movement. Large numbers of parents and 
teachers, outside of Hall’s personal sway, uncriti- 
cally and dogmatically used his technique to secure 
information about children. Hall lost interest in 
the technique and not long afterward a reaction, 
both within psychology and from the public itself, 
set in and the movement disappeared. It did have 
the salutary effect of bringing to the public some 
awareness of the importance of child study and, by 
its very excesses, created a more critical attitude 
toward research in child psychology (Bradbury 
1937). 

Hall had a more sustained interest in adoles- 
cence. His monumental and influential work, Ado- 
lescence (1904), contained the most complete state- 
ment of his particular theory of recapitulation in 
development. The individual child was seen as 
repeating the life history of the race: when a child 
played at cowboys and Indians, for example, he 
was seen as behaving at the level of primitive man. 
In his old age Hall returned to the problem of de- 
velopment and published a volume entitled Senes- 
cence (1922). 

Along with his research, writing, teaching, and 
administrative duties, Hall was concerned with psy- 
chology on the national scene. In July 1892 he 
called the meeting which resulted in the founding 
of the American Psychological Association (Dennis 
& Boring 1952). Almost as a matter of course he 
was elected its first president. The scientific pur- 
pose of this organization was firmly established un- 
der his guidance. He founded and edited still other 
journals: in 1891 the Pedagogical Seminary (now 
the Journal of Genetic Psychology), in 1904 the 
Journal of Religious Psychology, and in 1917 the 
Journal of Applied Psychology. 

Hall was one of the first Americans to arrange 
for a hearing of Freud’s views and to teach his 
theories. Indeed, Freud’s only visit to the United 
States was at Hall’s invitation to the twentieth an- 
niversary celebration of Clark University in 1909. 
In view of the suspicion and dislike that the psy- 
choanalytic movement raised at this time, the 
invitation showed courage on Hall’s part. 

Hall's administrative-organizational efforts continue
to live on: the American Psychological Association
that he founded had 24,000 members by 
1966; Johns Hopkins and Clark University depart- 
ments of psychology each went through a series of 
unsettling experiences but at present are among 
the strong graduate departments; the journals he 
founded, in all but one instance, continue to 
flourish. He was very modern in his stress on de- 
velopment despite the fact that his way of using 
Haeckel’s discredited biogenetic principle has dis- 
appeared without a trace. He has had little direct 
effect upon current intellectual traditions, but his 
enthusiasms, his broad convictions, and his em-
phasis on developmental issues were transmitted to
his students, many of whose formulations are very 
much part of the current scene. 


ROBERT I. WATSON 


[Other relevant material may be found in ADOLES-
CENCE; AGING, article on PSYCHOLOGICAL ASPECTS;
DEVELOPMENTAL PSYCHOLOGY; INTELLECTUAL DE- 
HAMILTON, ALEXANDER


Alexander Hamilton (1757?-1804) was born on
the West Indian island of Nevis and therefore 
lacked the attachment to state or region which 
characterized many eighteenth-century Americans. 
His experiences during the American Revolution 
and the decade of government under the Articles 
of Confederation reinforced his belief that Ameri- 
can greatness must be based on a strong and 
energetic central government. 

In 1777, five years after his arrival in the United 
States, he was appointed aide-de-camp to General 
George Washington. Viewing the Revolution from 
the vantage point of Washington’s headquarters, he 
was deeply concerned about the state rivalries, the 
weakness of the Continental Congress, and the ab- 
sence of national sentiment that impeded the Amer-
ican war effort. In the years that followed inde-
pendence, Hamilton worked steadily to breathe life 
into the moribund Confederation government. As 
a delegate to the Continental Congress in 1782- 
1783, he worked closely with such like-minded 
nationalists as James Madison to invest the Con- 
federation with powers, in his phrase, “adequate 
to the exigencies of the Union” (Papers, vol. 3, 
p. 689). Despairing of this effort, he became one of 
the prime promoters of the Constitutional Conven- 
tion and served as a delegate from the state of 
New York. 

Political ideas. Hamilton’s proposals for a gov- 
ernment modeled closely on the British system 
were unacceptable to most members of the conven- 
tion, but he labored industriously to have them 
“tone their Government as high as possible” (ibid., 
vol. 4, p. 218). Although ultimately “no man’s 
ideas were more remote from the plan” (ibid., vol. 
4, p. 253), Hamilton signed the constitution and 
was one of its most eloquent and persuasive de- 
fenders. At the New York State ratifying conven- 
tion, which met in June 1788, Hamilton combined 
the gifts of political theorist and practical politician 
to lead a reluctant New York into the federal fold. 

Hamilton’s greatest contribution to American 
political thought was The Federalist (1788), a 
brilliant defense of the convention’s work, which 
he, John Jay, and James Madison wrote to persuade 
the people of New York to adopt the proposed con- 
stitution. Hamilton wrote more than half of the 
85 essays. Some of the issues he discussed at length 
are no longer important (the dangers of a standing 
army, for example), but his analysis of a viable 
federalism as a system in which laws operate di- 
rectly upon individuals rather than upon states, his 
exposition of the powers of the presidency, and his 
arguments for the necessity of judicial review in a 
federal system remain among the most astute com- 
ments on American government. 

Hamilton was not an original political thinker 
of the stature of Hobbes, Locke, Montesquieu, Rous- 
seau, or Hegel. His particular genius lay in the bold 
and imaginative way in which he adapted and ap- 
plied the ideas of others to the needs of an infant 
republic. He had read many of the political philos- 
ophers of the eighteenth century, among them 
Adam Smith, David Hume, and Sir William Black- 
stone, and he was familiar with the work of such 
European statesmen as William Pitt and Jacques 
Necker. But he learned the most from his experi-
ence in the American Revolution, the Continental 
Congress, and the New York legislature, namely, 
that a strong government is necessary to promote 
national prosperity and safeguard personal liberty.
Like other Federalists, Hamilton believed that lib- 
erty must be predicated on order and that a just 
and enduring government must be one of laws and 
not of men. Far from subscribing to Thomas Jef- 
ferson’s dictum that that government is best which 
governs least, he believed that government to be 
best which, possessing ample powers, uses them 
energetically to achieve national goals. In any dis- 
cussion of the state, his favorite word was “energy.” 
As Clinton Rossiter has remarked, to Hamilton 
“energy was the essence of good government, im- 
potence the sign of bad: here was one obsessive 
principle of his political science that set him off 
sharply from the progressives of his time” (1964, 
p. 163). 

The belief that Hamilton favored a government 
of the rich, the well-born, and the able, and that he 
despised democracy, has long been a commonplace 
of American history. That he wished to win over 
the wealthy to the support of the Union is indis- 
putable; that the purpose of his policies was to en- 
rich any particular class is debatable. The object 
of his statecraft, as he said, was “. . . a great Fed-
eral Republic, closely linked in the pursuit of a
common interest, tranquil and prosperous at home,
respectable abroad . . .” (Papers, vol. 3, p. 106).
That he distrusted democracy is also certain. But 
by democracy he meant a government that is 
directly responsive to the people’s whims or moods,
as in ancient Athens or in eighteenth-century New 
England towns. He was, on the other hand, “affec- 
tionately attached to the republican theory” (Works, 
vol. 9, p. 533), by which he meant democracy 
modified by a system of checks and balances such 
as those imposed by the U.S. constitution. 

Economic ideas. By common consent Hamilton 
is ranked as the most forceful and influential secre- 
tary of the treasury in U.S. history. He was ap- 
pointed to that post on September 11, 1789, and 
within the brief span of two and a half years 
(1790-1792) submitted to Congress a number of 
reports that are among the greatest state papers 
in U.S. history. The first and most controversial 
of these papers was the “First Report on the Public 
Credit” ([1790] 1934, pp. 1-50), in which he
called for the payment in full of the foreign and 
domestic debt contracted by the Confederation gov-
ernment; for the assumption of the debts incurred
by the separate states in the prosecution of the 
War of Independence; and for a system of finance 
and taxation adequate to meet these new national 
obligations. Hamilton’s report was based on the as- 
sumption that national honor, strength, and pros-
perity are inseparably connected with the estab-
lishment of sound national credit. After six months
of acrimonious debate and a famous compromise 
in which Hamilton gave his support to a site on the 
Potomac River as the national capital in exchange 
for the support of his plan by Thomas Jefferson 
and James Madison, Hamilton’s proposals were 
adopted in only slightly modified form. 

In December 1790, six months after the funding 
and assumption measures became law, Hamilton 
submitted to Congress a “Report . . . on a National 
Bank . . .” ([1790] 1934, pp. 51-95), calling for
a great quasi-public institution that would provide 
a uniform circulating medium for the country, in- 
crease its “active wealth,” and furnish a safe de- 
pository for government funds and a source of 
government loans in time of national need. 

The “Report on Manufactures” ([1791] 1934, 
pp. 175-276), Hamilton’s only major report re- 
jected by Congress, was perhaps his most impor- 
tant state paper and certainly the clearest statement 
of his economic philosophy. Rejecting the laissez- 
faire economics of Adam Smith as well as the 
agrarian philosophy of Thomas Jefferson, he called 
on the federal government to foster and encourage 
manufactures. The report was based on Hamilton’s 
interpretation of the stage of economic development 
which the United States had reached and the meas- 
ures necessary for further economic growth. 

Administrative activity. Hamilton’s work as sec-
retary of the treasury was not confined to recom- 
mendations for the country’s financial and indus- 
trial growth. His organization and management of 
the Treasury Department was a model of efficiency 
and his administrative theory was, in the words of 
the leading student of the history of public admin- 
istration in the United States, “the first systematic 
exposition of public administration, a contribution 
which stood alone for generations” (White [1948] 
1959, p. 127). 

Hamilton’s role in the Washington administra- 
tion was not restricted to fiscal policy. He provided 
congressional leadership, preparing detailed reports 
for congressional guidance, drafting legislation, de- 
fending his conduct of the treasury, and occasion- 
ally preparing speeches for political allies. Nor did 
he refrain from interfering in the management of 
other departments. His help was welcomed by his
friend Henry Knox, the phlegmatic secretary of 
war, but resented by his political opponent Thomas 
Jefferson, the secretary of state. Hamilton, in brief, 
was the chief policy maker as well as the storm 
center of the administrations of George Washington. 

In the perspective of a century and a half, it may 
well be that Hamilton's most important contribu- 
tion to American history was his interpretation of
the constitution. To him it was not a mere compact
among states but a grant of power to the central 
government. The powers conferred on the federal 
government were not to be construed narrowly but 
interpreted so broadly as to allow the exercise of 
powers adequate to its needs. That the constitution 
has been sufficiently flexible to endure for so long 
is due in no small part to Hamilton’s role in making 
it the charter for a strong national state. 


JACOB E. COOKE


[See also the biographies of JEFFERSON and MADISON.]
HAMILTON, WALTON H.


Walton Hale Hamilton (1881-1958), economist 
and lawyer, was born in Madisonville, Tennessee. 
He was educated at the University of Texas and 
later taught in the economics department there.
He then taught economics at the University of 
Michigan and the University of Chicago. In 1915 
he went to Amherst College and remained at Am- 
herst until 1923. During this period there occurred 
the celebrated intrafaculty dispute over the resig- 
nation of the president of Amherst, Alexander 
Meiklejohn, who was allegedly compelled to resign 
because the trustees of Amherst objected to his 
progressive educational views. Hamilton was the 
leader of a group among the faculty that protested 
against what they considered interference by con- 
servative businessmen in the educational policies 
of the college. Subsequently, the entire group re- 
signed, and Hamilton was immediately appointed 
to the Brookings School of Economics and Political 
Science in Washington, D.C. (now the Brookings 
Institution). He was entirely in sympathy with the 
liberal economic and political views that prevailed 
at the Brookings School. 

In 1928 Hamilton shifted from teaching eco- 
nomics to teaching law. Although he had no previ- 
ous legal training, he was selected by Robert 
Hutchins, then dean of the Yale Law School, as 
one of a new group of professors who would teach 
the law as a part of a larger social, political, and 
economic system. Hamilton remained at Yale for 
20 years. 

While at Yale, Hamilton also served the federal 
government. He was one of the brain trusters of 
Roosevelt’s New Deal, and he held important posts 
in the National Recovery Administration. He 
worked with the antitrust division of the Depart- 
ment of Justice in connection with the economic 
problems involved in the administration of the 
antitrust laws. He served on the staff of the Tem- 
porary National Economic Committee, which was 
appointed by President Roosevelt to study and re- 
port on the economic structure of the United States 
during the depression, especially with regard to 
the operation of the free competitive market and 
the sources of its failure to achieve economic 
equilibrium. 

In 1948, Hamilton retired as professor of law 
at Yale, and at age 67 he began a new career as a 
practicing lawyer. Joining the newly organized firm 
of Arnold, Fortas & Porter, he brought to the prac- 
tice of law and the writing of legal briefs a fresh 
economic outlook, and he was very successful in 
actual litigation before courts and administrative 
bodies. 

The dominant theme of Hamilton’s academic 
thought and of his legal arguments was the essen- 
tial unity of law, economics, political science, and 
social science. He maintained that all of the social 
sciences deal with precisely the same problems
and that only their points of reference with regard 
to social phenomena differ. This orientation was 
most noticeable in Hamilton’s contribution to the 
reform of legal education that was undertaken 
at Yale. 

Hamilton and his colleagues at the Yale Law 
School, William O. Douglas, Wesley A. Sturges, 
Charles E. Clark (who succeeded Hutchins as 
dean), Myres S. McDougal, George H. Dession, 
Fleming James, Jr., Thurman W. Arnold, and 
others, criticized the conservative method of legal 
analysis, epitomized by the Harvard Law School, 
for its fragmentation of subject matter and its 
rigid classification and separation of the traditional 
fields of law—contracts, torts, conflicts of law, etc. 
Instead, they adopted an entirely new philosophical 
attitude toward the law, based on the premise that 
the only laws to have any practical impact on so- 
ciety are the rules of procedure. Procedure, there- 
fore, became the center of the Yale curriculum, 
and this required a complete reorganization of the 
so-called substantive law courses. 

What was involved here was not a debunking of 
the law to show that legal principles are simply 
horses which judges and attorneys ride to get 
wherever they choose to go; rather, the legal real- 
ists at Yale held that reverence for the immutable 
logic of the substantive law is essential to the main- 
tenance of social institutions. Although the func- 
tion of this logic is purely symbolic, it has con- 
siderable importance in that it is part of the process 
by which we form images of social institutions. 
Widely as this approach to the law is accepted to- 
day, it was revolutionary when Hamilton and his 
colleagues first advocated it. 

What Hamilton mainly taught at Yale was called 
“public control of business,” which in fact covered 
a sizable part of public and constitutional law. He 
had initially tackled law untutored, out of the real- 
ization that no one could understand the nation’s 
economy without knowing the legal rules-upon- 
rules that had made Adam Smith’s and Herbert 
Spencer's theories obsolete, and without an under- 
standing of the ways and workings, the facts and 
folklore of the U.S. Supreme Court, where so many 
of the final decisions affecting business, industry, 
commerce, and agriculture seemed to be made. 
This concern with the noneconomic factors in eco- 
nomics had as a by-product an important impact 
on the development of American sociology: Tal- 
cott Parsons was a student at Amherst when 
Hamilton was teaching there; Hamilton’s approach 
aroused his interest in the study of the institutional
bases of economics, which led ultimately to his
study of sociology. 

Throughout his life Hamilton combined keen 
observation and commentary on the American po- 
litical, legal, and economic scene with active par- 
ticipation in the cause of social justice. 


THURMAN ARNOLD 


[See also PUBLIC LAW.]
John Lawrence le Breton Hammond (1872-
1949) was the son of a Yorkshire clergyman; Lucy 
Barbara Bradby (1873-1961), the daughter of the 
headmaster of Haileybury School. Both took greats 
at Oxford—Barbara with distinction—and it was 
by standards derived from Greece and Rome that 
they passed judgment on later times. When they 
married in 1901 Hammond was already deeply in- 
volved in Liberal journalism; he published a study 
of the political career of Charles James Fox (1903) 
and for several years edited the Speaker, a weekly 
journal that later became the Nation. But from 
1907, when he was appointed secretary to the Civil 
Service Commission, he gave his spare time to col- 
laboration with Barbara. She had become deeply 
involved in research in public and private archives; 
it was his role to transmute her findings into vivid 
and arresting prose. In 1933 Oxford gave recogni- 
tion to the parity of their contribution by conferring 
on each an honorary D.LITT. 

Their most widely read books, The Village La- 
bourer (1911), The Town Labourer (1917), and
The Skilled Labourer (1919), treat the period from
the accession of George III to the first Reform Act
and consist primarily of “a discussion of the lines 
on which Parliament regulated the lives and for- 
tunes of a class that had no voice in its own desti- 
nies” ([1911] 1948, Preface). These were path-
breaking works, and, as is usual with such, it is not 
difficult to point to errors of fact and emphasis. It 
is now generally agreed that the enclosure of land 
was carried out less ruthlessly than they thought. 
Their use of the report of Sadler’s Committee of 
1832 on the employment of children was uncritical. 
Their concentration on the records of the Home 
Office led them to present the abnormal cases 
brought to the notice of this department as though 
they were typical. But few historians would deny 
that the dismal account of class struggles presented 
in this trilogy is substantially accurate. In 1923 
came their sympathetic, though critical, biography 
of Lord Shaftesbury (1923), to be followed two 
years later by an admirable background study, The 
Rise of Modern Industry (1925). In The Age of the 
Chartists: 1832-1854 (1930), they extended their
inquiries to a later generation in which repeal of the
Corn Laws, the Ten Hours Act of 1847, the
Public Health Act of 1848, and other measures re- 
flected a growing sensitivity of the wealthier 
classes to the needs of the poor and the beginnings 
of what the Hammonds called “common enjoy- 
ment.” In none of these works did the authors pay 
much attention to strictly economic or demographic 
trends. But, at a time when the “depersonalization 
of history” has gone quite far enough, it is worth- 
while to turn again to books which are emphatically 
about people and in which moral judgments are 
freely and forcefully expressed. 

Lawrence Hammond was special correspondent 
of the Manchester Guardian at the Paris Peace Con- 
ference in 1919 and the Conference on Ireland in 
1921. He continued for the rest of his life to con- 
tribute articles, reviews, and occasional editorials 
to the paper and in World War II became a perma-
nent member of its staff. His later books are all 
biographical. James Stansfeld: A Victorian Cham- 
pion of Sex Equality (1932) was written jointly 
with Barbara; but C. P. Scott of the Manchester 
Guardian (1934) and Gladstone and the Irish Na- 
tion (1938)—the greatest, if least known, of his 
works—are from his pen alone. They show his 
liberalism, insight, and command of words. 

After his death in 1949, Barbara continued to 
follow her earlier interests: she lived to the age of 
88, leaving an almost completed (but as yet un- 
published) book on the enclosure of commons in 
the nineteenth century. The Hammonds were
modest about their own achievements and gener- 
ous toward opponents. Their books had wide pop- 
ular appeal and, apart from quickening interest in 
the social life of the past, did much to shape con- 
temporary opinion and policy. 

T. S. ASHTON 
HANKINS, FRANK H.


Frank Hamilton Hankins, an American sociol- 
ogist, devoted special attention to the role of bio- 
logical factors in human traits and in social 
behavior. He was born in Wilshire, Ohio, in 1877 
and received his bachelor’s degree from Baker Uni- 
versity in Baldwin, Kansas, in 1901 and his PH.D. 
from Columbia University in 1908. In 1964 Clark 
University awarded him an honorary degree. Most 
of his academic career was spent at Clark, from 
1906 to 1907 and again from 1908 to 1922, and 
at Smith College, from 1922 until 1946. 

Hankins was a philosophical realist, tough- 
minded, and rigorously scientific. He was most 
deeply influenced in philosophy and logic by John 
Stuart Mill; in his general sociological thinking by 
Herbert Spencer, Lester F. Ward, William Graham 
Sumner, and Franklin H. Giddings; and in the 
quantitative approach to social problems by Adolphe 
Quetelet, about whom he wrote his doctoral dis-
sertation, Francis Galton, Karl Pearson, and Henry 
L. Moore, who in turn had studied under Benini in 
Italy. To be sure, although Hankins himself was a 
talented statistician, he was caustically skeptical 
of the assumptions and methods of the extreme 
quantitative approach of some of his sociological 
colleagues: he characterized their work as “merely 
sinking postholes here and there in the vast field 
of social phenomena.” 

He was a strict scientific determinist and con- 
ceived all phenomena, personal and social, as 
undergoing endless change and rearrangement in 
adjustment to environmental conditions. Thus, a 
particular personality was for Hankins the product 
of an individual’s genetic constitution and his mold- 
ing experiences. The self so formed has to express 
itself, and this self-expression is the sociological 
equivalent of the traditional metaphysical freedom 
of the will. By his reason and by the acquisition of 
scientific methods man can discover causal se- 
quences, not only in the physical world but also in 
the psychosocial and cultural realms, and he can 
learn to adjust to these sequences and even to 
modify or direct them. Yet Hankins had no utopian 
illusions; he doubted that the social sciences could 
achieve control of change comparable to that of 
the physical sciences. Whereas the laws of physical 
science do not vary with time and place, the causes 
of cultural sequences differ from one era to an- 
other, even in the same society. Cultural phe- 
nomena emerge in what are often new causal 
combinations, and predictability is therefore se- 
verely limited. Hankins agreed with Spencer that 
efforts to control political and economic life more 
often than not produce unexpected results. 

In the realm of social institutions, Hankins’ 
determinism was economic: he saw successive po- 
litical theories as rationalizations of new class or 
sectional interests and new religious doctrines as 
sanctions of new economic modes. He believed that 
cultural change is the cumulative product of slow 
alterations in the everyday activities of increasing 
proportions of the populations—changes in daily 
routine that carry with them modifications of folk- 
ways, mores, techniques, and political policies and 
that eventuate in new values and new ideologies. 

Deeply influenced as he was by the work of 
Galton and Pearson, Hankins was perennially con- 
cerned with the effect of such selective processes 
as war, celibacy, persecution, urbanization, and 
education on the quantity and quality of the popu- 
lation, and with the consequences of demographic 
changes on social life. He regarded population 
pressure as an important factor in group hostilities,
and he believed uncontrolled differential fertility
of the social classes to be dysgenic: modern medi- 
cine and hygiene, combined with humanitarian 
views, made it possible for people with hereditary 
deficiencies to bear children and so to imperil racial 
soundness and human well-being. He believed ge- 
netically above-average parents are most likely to 
produce children who are morally superior and 
capable of greater success in a competitive social 
order. 

Hankins did of course recognize the moral im- 
perative of a humanitarian perspective and re- 
jected any scheme for the ruthless elimination of 
the unfit. He did not believe that the speedy adop- 
tion of positive eugenic measures was likely and 
therefore favored the active spreading of informa- 
tion about contraception, of facilities that would 
give the less favored strata of the population access 
to contraceptive equipment, and of realistic eugenic 
instruction for all. He doubted on both historical 
and genetic grounds the validity of the assumption 
that the number of potential men of genius is con- 
stant in any given population at all times. However, 
further advances in the science of human genetics 
may conceivably enable man not only to modify 
human nature in general but also to increase the 
number of potential geniuses, to discover them 
early in life, and to develop their powers. 

These concerns naturally led Hankins to con- 
sider the role of heredity and inborn ability in 
relation to social leadership and to question the 
assumptions of egalitarian democracy. He was 
highly critical of egalitarian doctrine, holding that 
for “democratic society . . . to continue in a sound, 
healthy condition, it must concern itself quite as 
much, if not more, with the hereditary constitution 
of its people as with efforts at a further equaliza- 
tion of material conditions” (1923, p. 411). Any 
realistic approach, he thought, must take the form 
of effective birth control policies, which will, in a 
quantitative sense, keep the number of people 
down to a level where their material needs can be 
supplied and, in the qualitative realm, assure an 
increase in the number of the superior types re-
quired to deal with the increasingly difficult prob-
lems of our era. Thus, in order to survive, democracy 
must both accept hereditary differentiation in abil- 
ity and, by education and other means, mitigate 
and undermine hereditary stratification in society. 
If the channels between the different strata are kept 
as wide-open as possible, the basic fact of individ- 
ual differences may be harmonized with democratic 
ideology. 

Hankins was deeply interested in the problem of 
race and the impact of racial dogmas on political
theories. His chapter “Race as a Factor in Political
Theory” (1924) remains the most authoritative ex- 
position and appraisal of the subject. In his book, 
The Racial Basis of Civilization (1926a), he took 
a middle ground between the fervid exponents of 
racial superiority and racial purity and the follow- 
ers of Franz Boas and others who contended that 
there is no real validity in the notion of racial 
differences and hence, scientifically, no race prob- 
lem. Hankins believed that all nations and nation- 
alities, although greatly mixed racially through 
conquest and immigration, ultimately acquire a 
sense of racial solidarity and pride of both race 
and culture. He maintained that race crossing is 
a source of racial soundness and strength and that 
the historical record shows that periods of high 
civilization have been preceded by an extensive 
mingling of racial stocks. While he doubted, in 
principle, that all racial stocks have the same ca- 
pacity for cultural achievement, since this would 
mean that they all had had identical mutational 
and selective processes, he recognized that actual 
racial superiority or inferiority cannot be proved or 
disproved because race mixture is so extensive and 
cultural environments are so different. 

Hankins’ place in American sociology is secure. 
His concern with the problem of population quan- 
tity was shared by his professional colleagues, and 
he stood out among them by stressing the impor- 
tant issue of population quality. His work on the 
race issue was a sane and valuable contribution. 
Despite the fact that most of his students were 
undergraduates, a surprisingly large number of 
them became professional sociologists as a result 
of his influence, and some of them, like Howard 
Odum, Clifford Kirkpatrick, and Franklin Frazier, 
became well-known in the field. The esteem in 
which he was held by his fellow sociologists is 
evident from the fact that he was elected first 
president of the Eastern Sociological Society in 
1930, president of the American Sociological Soci- 
ety in 1938, and president of the Population As- 
sociation of America in 1945. Hankins’ most impor- 
tant editorial contribution in the sociological field 
was to act, from 1936 to 1937, as the first editor- 
in-chief of the American Sociological Review, the 
official journal of the American Sociological So- 
ciety (now Association).

HARRY ELMER BARNES


[For the historical context of Hankins’ work, see CRE-
ATIVITY, article on GENIUS AND ABILITY; RACE; and
the biographies of BENINI; GALTON; GIDDINGS; MILL;
Moore, HENRY L.; PEARSON; SPENCER; SUMNER; 
WARD, LESTER F.] 
Alvin Hansen’s place in the history of economic
thought and the importance of his role as an eco- 
nomic adviser are undisputed. By propagating and 
developing the ideas formulated by Keynes, Han-
sen did more to effect the Keynesian revolution than 
any other American economist. His books are widely 
read in the United States and abroad; his Guide to 
Keynes (1953), for example, was translated into 
Spanish, Japanese, Italian, French, Hindi, and 
Korean. Most of his other books have also been 
widely translated. 

Born in 1887 in South Dakota of Scandinavian 
parentage, Hansen achieved academic eminence 
despite the handicap of having received his educa- 
tion at small, impoverished Yankton College. For 
some years he worked as a schoolteacher, principal, 
and school superintendent. Then he went for grad- 
uate training to the University of Wisconsin, where 
he also began teaching. From Wisconsin he moved 
to Brown and then to the University of Minnesota 
before going to Harvard in 1937 as the Lucius 
N. Littauer professor of political economy. He spent 
almost twenty years as an economist in Cambridge. 
After retirement he yielded to demands to serve as 
a visiting professor at various institutions, includ- 
ing the University of Bombay, Yale, Smith, Vassar, 
Michigan State, Haverford, and Wesleyan. 

Hansen excelled as a teacher. A Harvard semi- 
nar in fiscal policy, which he shared with John H. 
Williams, became one of the most famous in the 
country. Week after week, for almost twenty years, 
outstanding scholars and public servants attended 
this seminar. The gifts of Hansen and Williams 
were dissimilar but complementary. Williams was 
a cautious theorist and rather critical of many of 
Keynes's ideas, but open-minded and far from dog- 
matic either in criticism or advocacy and acute in 
analyzing issues. Hansen, too, could be critical, but 
his dominant attitude was one of enthusiasm for 
the New Economics. The interplay between 
Williams’ skepticism and Hansen’s acceptance pro- 
vided just the right atmosphere for the exploration 
and examination of the new ideas. Hansen’s zest 
was contagious: he not only inspired his students; 
there were few visitors who came away from these 
seminars uninfluenced. The seminars served as a 
platform for debates among the participants, 
whether regular members, other Harvard col- 
leagues, or guests. Faculty members who were too
sensitive to indulge in public debates with their
peers soon learned to stay away. It is a tribute both 
to Williams and Hansen that their debates never 
led to personal differences, 

Although Hansen’s influence is most effectively 
conveyed by cataloguing his students, it is not pos- 
sible here to list the hundreds of students he in- 
spired and the numerous doctoral dissertations he 
supervised. His former students are members of
virtually every distinguished department of economics,
for example, the departments at the Massachusetts
Institute of Technology, Yale, Harvard, the
University of California, Stanford, the University 
of Chicago, Princeton, and Cambridge, England. 

There was a reciprocal exchange of ideas be- 
tween Hansen and his students. His influence is 
clearly evident in An Economic Program for Amer- 
ican Democracy (1938), a prescient document 
signed by R. V. Gilbert and six other young Harvard 
and Tufts economists. Hansen, in turn, was influ- 
enced by the young economists who wrote the book. 
At age 50, he had a receptivity to new ideas that 
was quite remarkable. 

It is likely that no American economist of the 
twentieth century has had more influence on econ- 
omists in the government than Hansen. As early 
as 1933-1934 he served as director of research and 
secretary of the important Committee of Inquiry on 
National Policy in International Economic Rela- 
tions. In the years when Cordell Hull’s reciprocal 
trade agreements were first being established, Han- 
sen served as an aide to the State Department; this 
was a time when economists were rare indeed in 
the department. A few years later, he was the eco- 
nomic adviser to the prairie provinces before the 
Canadian Royal Commission on Dominion-Provincial
Relations. At the same time, 1937-1938, he
became a member of the Advisory Council on So- 
cial Security. Again, in 1941-1943, he had the priv- 
ilege of being the chairman of the United States- 
Canadian Joint Economic Committee. One of his 
most important assignments was that of special 
economic adviser to the Federal Reserve Board, 
a post which he held from 1940 to 1945. 

Before 1933 Hansen had been somewhat of a 
deflationist and a critic of those who questioned 
Say’s law, which assumes that what is produced will 
be taken off the market. Keynes was the boldest 
critic of Say’s law: he asserted that investment may 
not equal the gap between what the economy is 
capable of producing and what people wish to con- 
sume. Hence his great emphasis on investment and 
the means of stimulating it. In fact, Keynes built a
theory of income formation on the basis of a rela- 
tively stable consumption function and multiplier. 

Hansen was always capable of changing his 
mind, and in a review of Keynes’s General Theory 
(1936) he was generally favorable to Keynes’s 
views. But neither Hansen nor, very likely, any 
other economist in the United States at that time 
anticipated the tremendous impact the General 
Theory was to have. (In part, its many obscurities 
concealed the cogency of its argument.) Hansen 
appreciated Keynes’s intellectual adventurousness:
“We are living in a time when economics stands in
danger of a sterile orthodoxy. The book under re- 
view warns us once again, in a provocative manner, 
of the danger of reasoning based on assumptions 
which no longer fit the facts of economic life . . .” 
(1936, p. 686). But Hansen was clearly at odds 
with Keynes on some issues. Keynes had urged a 
reduction of the rate of interest as a means of stim- 
ulating investment, since the amount of investment 
depends upon the relation of the rate of interest 
and the marginal efficiency of capital, although his 
stress on manipulation of the interest rate was not 
nearly so great in the General Theory as it had been 
in his earlier Treatise on Money. Hansen was not 
as enthusiastic as Keynes about interest rate policy, 
cautioning that efforts at general monetary control 
had already revealed the weakness of this method.

Both in the Treatise on Money and in the Gen- 
eral Theory, Keynes had shown an awareness of 
the need for adequate investment and of the de- 
pendence of the amount of investment on the op- 
portunity for investment. Hansen was the principal 
architect of a theory of stagnation, built on this 
theme of Keynes and concerned with the circum- 
stances of insufficiency of demand. Hansen agreed 
with Keynes that the business cycle reflects the var- 
iations in the marginal efficiency of capital and 
that the impact of secular trends is more disrupt- 
ing than that of the cycle. He wrote in 1938 that 
“not every period can be characterized as a kind of 
new industrial revolution. . . . Add to this the wholly 
new fact of a rapidly approaching cessation of pop- 
ulation growth. Let the perennial optimist reflect 
on the enormous masses of capital that found in- 
vestment outlets during the nineteenth century for 
no other reason than that the population of Eng- 
land quadrupled, that of Europe trebled, while that 
of the United States increased fifteen-fold. For these 
and other reasons we shall do well to ponder deeply 
the problem of . . . secular stagnation” (1938a, pp. 
477-478). 

Time and again, and especially in the 1930s, 
Hansen returned to the theme of the insufficiency 
of demand and the method of treating it. One of 
his great contributions was to establish and popu- 
larize the idea that compensatory fiscal policy is 
the most powerful means of maintaining the right 
level of aggregate demand. One cause of the unsat- 
isfactory conditions of the 1930s, he believed, was 
the accumulation of reserves in the Old Age Insur- 
ance Fund. This meant that purchasing power was 
being hoarded. The recovery of the 1930s was, ac- 
cording to Hansen’s theory, a consumption recovery 
generated by a $4,000 million expansion in con- 
sumer installment credit and by $14,000 million 
of federal expenditures for recovery and relief; and
it vanished when these two stimuli were played out. 
Hansen was among the earliest to develop a pump- 
priming policy to deal with recessions. 

In the 1930s Hansen’s stagnation theory at- 
tracted much attention, but in the prosperous 1940s 
and 1950s its validity was doubted and even ridi- 
culed. Detractors of the theory did not take into 
account, however, the extent to which World War II
created a pent-up and temporarily sustained de- 
mand, which Hansen had predicted but which he 
did not expect to be permanent. By the late 1950s 
Hansen appeared to be vindicated: demand was 
insufficient to curb the rise of unemployment that 
resulted from productivity gains and the inflow of 
new entrants into the labor market. (This influx 
rises with improved conditions and requires an 
even larger increase in output—and consequently 
an increased demand—to prevent rising unemploy- 
ment.) And it was the expansion of government, 
which Hansen had advocated and which his critics 
had considered absurd, that served to prevent the 
stagnation that Hansen had warned against. Iron- 
ically, the same critics who had denied that the 
slowing of population growth could depress busi- 
ness were predicting booms for the 1960s as the 
result of the acceleration of population growth. 

In the 1930s Hansen anticipated some of the 
problems that were to confront the United States 
in the 1960s (see 1938b, pp. 303-318). Unlike 
Keynes, he was not inclined to assume that if gen- 
eral demand is adequate, most of our problems will 
be solved. He found that between 1870 and 1930 
the rise in the white-collar group was six times as 
large as that of other classes and that the gains of 
the managerial and professional classes had also 
been large. He related this relative growth to the 
large amount of unemployment among unskilled 
workers and concluded that these workers needed 
more education. He was well aware that although 
adequacy of demand is crucial, there are some 
pockets of unemployment that must be treated di- 
rectly, for example, the mining areas and the tex- 
tile towns. 

Keynes's theory was broadly conceived and left 
much for others to do. He was not concerned about 
the consequences of a rising national debt, and 
Hansen was able to show that the elimination of 
the national debt would raise more problems than 
it would solve. Interested in the consumption func- 
tion, Keynes was aware of the relation of consump- 
tion to tax structure; but it was Hansen, in his State 
and Local Finance in the National Economy (Han- 
sen & Perloff 1944), who revealed the perversity 
of modern state and local tax systems and spelled 
out the required tax and spending policies for periods
of boom and depression. In this book Hansen
revealed that a basic inadequacy of fiscal policy 
in the 1930s was the lack of control over state and 
local government contributions to total spending. 
In periods of boom these governments, with their 
slight dependence on income taxes, were unable to 
achieve the automatic rise of tax receipts that 
would help contain the boom, while in a period of 
depression they aggravated the economic situation 
and discouraged consumption when they compen- 
sated for the declining yield of direct taxes by an 
increasing recourse to consumption taxes. 

Today there is wide acceptance of views that 
were generally rejected before Keynes’s General 
Theory was published—witness the 1964 federal 
tax cut. Among the innovations in economic think- 
ing that owe much to Hansen's influence are the 
following: an appreciation of the importance of 
keeping interest rates low enough vis-a-vis the mar- 
ginal efficiency of capital to assure adequate invest- 
ment; an awareness that the level of investment 
depends not only on interest rates and the margi- 
nal efficiency of capital but also on the relation of 
consumption and income, and that without ade- 
quate consumption investment outlets will soon dry 
up; the concept of an equilibrium in modern econ- 
omies at a point far below full employment, with 
full employment only a “limiting point of the whole 
range of possible positions of equilibrium”; a policy 
of national debt management that does not only 
strive to keep interest costs at a minimum but also 
considers the effects on the economy of issuing se- 
curities when rates are low and money is being 
created to stimulate the private economy. Hansen 
deserves as much credit for these advances as any 
American economist. He helped clothe the Keynes- 
ian skeleton, and his simple, clear presentation, 
in the classroom and out, won many adherents to 
the New Economics. 

Hansen did not, however, limit his work to the 
elaboration of Keynesian economics. He had, for 
example, much to say about international monetary 
policy, and he contributed to the creation of the 
International Monetary Fund. Yet even in this area 
he was influenced by Keynes (1944), asserting that 
international disequilibrium prevails even if gold 
or foreign exchange reserves are not being reduced, 
so long as the domestic restraints which are used 
to conserve reserves bring about unemployment. 
Adequacy of demand is crucial. Hence the need to 
keep international reserves large enough so that 
temporary imbalance can be treated, thus leaving 
enough time to deal with structural change. 

As an economic analyst, also, Hansen made significant
contributions. Here are but a few examples:
(1) The familiar diagram with the marginal
cost curve cutting up through the unit cost curve 
at its bottom seems to have been first presented by 
Hansen in the 1920s. (2) About 1930, when the 
Anglo-Saxon literature stressed monetary factors in 
cycles, Hansen (with D. H. Robertson) was one of
the few to stress real fluctuations in exogenous in- 
vestment opportunities, in the fashion of Marx, 
Tugan-Baranovskii, Spiethoff, Cassel, Schumpeter, 
and other Continental writers. (3) Hansen pioneered
accelerator-multiplier models, and he and
Keynes stressed the importance for investment of
steady income growth. (4) He and some of his
students were among the independent discoverers 
of the balanced-budget multiplier theorem. Asso- 
ciated with Hansen in this discovery were such 
economists as William A. Salant, Paul A. Samuel- 
son, Harold Somers, and Henry C. Wallich, and the 
theorem was also discovered in Denmark by J. Gelting
and in the United Kingdom by Keynes and
N. Kaldor. (5) Independently of J. K. Galbraith, 
Hansen emphasized the important social utility at 
the margin of the public sector as against the 
private sector. 

The diversity and the importance of Hansen’s 
contributions to the theory and application of eco- 
nomics place him among the great economists of 
recent times. 

SEYMOUR E. HARRIS


[For the historical context of Hansen’s work, see the 
biography of KEYNES, JOHN MAYNARD. For discus- 
sion of the subsequent development of his ideas, 
see FISCAL POLICY.] 
HARRINGTON, JAMES


James Harrington (1611-1677), English politi- 
cal theorist, was the oldest son and heir of Sir 
Sapcotes Harrington, a member of an aristocratic 
family that had risen to eminence under the 
Tudors. Orphaned at an early age, he enjoyed 
throughout his life the means to live as he pleased. 
After a brief stay in Oxford, where he perfected his 
knowledge of languages, he traveled extensively on 
the Continent. There he befriended the exiled elector
palatine and professed a militant Protestantism.
By the time he returned to England in 1635 he was 
a firm republican. He was, however, also deeply 
attached to Charles I and remained with him to the
end. Torn by such a conflict between personal 
friendship and political conviction, he took no di- 
rect part in the Civil War. After the execution of 
the king he devoted himself entirely to his studies. 
The product of his retirement was The Common- 
wealth of Oceana, a constitutional blueprint. Pub- 
lished in 1656 after some opposition by Cromwell, 
it remained Harrington’s only book. All his other 
writings are explanations of its main points or 
answers to its critics. His public life was limited to 
publishing and debating his proposals for a constitution.
After the Restoration he was put in prison,
where his health, mental as well as physical, was 
utterly destroyed. He died without ever recovering 
his mental health. 

It was perhaps Harrington’s own social position 
that made him a decidedly aristocratic republican: 
no society, he felt, can do without the military and 
political talents of the nobility. The greatest influence
upon his thinking was not personal experience,
however, but the works of the “masters of
ancient prudence”—Plato’s Laws, Aristotle's Poli- 
tics, Polybius’ Histories, and the works of their only 
heir, Machiavelli. In stark contrast to these sources 
of wisdom, he saw the “modern prudence” of feudal 
and absolutist theories as nothing but a series of 
errors. Among his contemporaries only Hobbes won 
his grudging approval. For although he despised 
Hobbes’s reasoning “by geometry” and his mo- 
narchical ideas, he shared with Hobbes a distaste 
for religious warfare. Harrington's solution to the 
problem of religious strife was to establish a state 
church that would represent “the public conscience” 
without limiting toleration, leaving theological dis- 
putes to the universities. 

Political theory, according to Harrington, has to 
be historical, and he based his own ideas on a 
theory of historical development. As he saw it, the 
“Gothick balance,” or feudalism, had come to an 
end. It had never been a stable order, but a tug of 
war between the nobility and the kings, who had 
uneasily shared both property and power. In Eng- 
land this order had come to an end when Tudor 
legislation reduced the nobility and put the balance 
of property into the hands of the people. This had 
made the old order impossible and had brought 
about the Civil War. The same impersonal forces, 
moreover, were at work in the revolutions in Europe 
as well, and this linking together of all the up- 
heavals of the age was one of Harrington’s most 
original perceptions. 

Government is, in Harrington’s view, a super- 
structure based on property. Ruling requires armies 
and whoever feeds the soldiers will thereby rule. 
Hence, if the balance of property is in the hands of 
the people, stability requires a republican form of 
government, and stability is the great end of politi- 
cal organization. 

Harrington saw his own task as that of revealing 
to the people of England the necessities of their 
situation and of suggesting to them the appropriate 
political institutions. This was “political anatomy,” 
which he considered a science analogous to that of 
William Harvey. The analogy implies that atten- 
tion to details, to every artery of the body politic, 
is necessary, and such was the approach he used.
His suggestion for producing the necessary order 
in England was a bicameral legislature, indirectly 
elected by all “who lived off their own,” by secret 
ballot, with rotation in office, and a citizen army to 
defend the new society. Eternal stability was to be 
ensured by an agrarian law that would permanently 
limit the amount of property any one person might 
own. For changes in the balance of ownership of 
land had always been the source of strife and had
caused the decline of ancient and modern republics. 

While Harrington’s influence upon his contem- 
poraries was limited, he enjoyed a great reputation 
in eighteenth-century America. Constitution makers 
have continued to admire him, since he was the 
very epitome of the constitutional engineer. In re- 
cent years his contribution to social history has 
become the subject of debate. Since Eduard Bern- 
stein, many Marxist historians have regarded Har- 
rington as a precursor of dialectical materialism. 
They have found evidence in his ideas for the 
theory that the Civil War in seventeenth-century 
England was a bourgeois revolution caused by the 
“rise of the gentry.” Against this view, it has been 
said that the gentry was a declining class, in no 
sense bourgeois, and that Harrington used his polit- 
ical fantasy to justify its power. All these theories 
tend to underestimate his passionate classicism. 
Harrington avowedly looked to the remote past to 
find guidance for a desirable future. From Polybius 
he learned that deliberately planned constitutions 
are more likely to endure than those that develop 
haphazardly. From him, too, he took the theory of 
the inevitable cycle of decline that all the “pure,” 
unbalanced forms of government—monarchy, aris- 
tocracy, and democracy—must suffer. However, 
while Polybius thought of these constitutions en- 
tirely in terms of divisions of governmental power, 
Harrington held that they depend upon the distribu- 
tion, or “balance,” of landed property. If stability 
was his end, no less than Polybius’, he did not stop 
with the latter's theory of a mixed constitution, in 
which democratic, aristocratic, and monarchical 
elements are poised in such a way as to prevent any 
changes in the political order. Instead Harrington 
proposed his agrarian law as the means of achiev- 
ing stability: its enactment would permit a “natural” 
aristocracy to form a deliberative assembly and the 
people to be represented in a deciding body. This
proposal was, indeed, his own contribution to constitutional
theory. His ideas of historical change,
however, were derivative, as he was perfectly happy 
to admit—witness his endless tributes to “ancient 
prudence.” The necessity for a return to the classi- 
cal past both in theory and in practice was not, in 
his view, a historical inevitability, but the sole 
means by which England might learn to become 
stable and powerful. 


JUDITH N. SHKLAR


[For the historical context of Harrington’s work, see 
CONSTITUTIONS AND CONSTITUTIONALISM; UTOPIAN- 
ISM; and the biographies of ARISTOTLE; HOBBES; 
MACHIAVELLI; PLATO. For discussion of the subsequent
development of his ideas, see the biography
of BERNSTEIN.]
HARTLEY, DAVID


David Hartley (1705-1757), called the father of
British psychology, was indeed the first to use the 
word “psychology” in its modern sense. He attempted
a comprehensive interpretation of psychological
phenomena based on the observation of
behavior, bearing in mind its physiological sub- 
strate. His Observations on Man: His Frame, His 
Duty and His Expectations (1749) sets out a phi- 
losophy of human life in terms of a special theory 
of association. It is as the first comprehensive 
“associationist” that Hartley is chiefly remembered 
in histories of philosophy and psychology. But he 
also deserves study for his general method and his 
grasp of the special complexities involved in any 
attempt to elucidate mental processes by reference 
to physiological mechanisms. 

Hartley's theory of association is derived from 
Locke's Essay Concerning Human Understanding 
(1690) and John Gay’s preface to King’s Essay on 
the Origin of Evil (King 1702). Parallel to the 
formulation in terms of the association of ideas 
and equally important is his physiological theory. 
This avowedly hypothetical physiology is based on 
Newton’s suggestion that the sensation of seeing 
comes from “vibrations, being propagated along the 
solid Fibres of the optic Nerves into the Brain. . . . 
For because dense Bodies conserve their heat a 
long time . . . the Vibrations of their parts are of a 
lasting nature, and therefore may be propagated 
along solid Fibres of uniform dense Matter to a 
great distance, for conveying into the Brain the 
impressions made upon all the Organs of sense” 
(Newton [1704] 1952, Query 12, p. 345). Hartley 
was thus among those who denied the view, then 
commonly held, that the nerve fibers are hollow 
and allow the transmission of influence by flow of 
animal spirits. He thought such vibrations might
underlie the transmission of all sensory and motor 
messages to and from the brain and considered 
how the resulting cerebral activity might represent 
the whole variety of human experience. He sup- 
posed that all coding and decoding of both sensa- 
tions and ideas take place within the “medullary 
substance” of the brain—that is, the white matter 
—and that “Vibratiuncles”—miniature vibrations
that correspond to the incoming vibrations of sensa- 
tion—are generated and persist there, giving rise, 
by combination and contiguous association, to 
memory and ideas. He illustrated and tried to test 
these suppositions against his own observations of 
human and animal behavior, ranging from “auto- 
matic actions” (that is, reflexes) such as respira- 
tion and digestion to such “ideas of intellect” as 
“theopathy.” 

For this ambitious and possibly premature un- 
dertaking, Hartley had good qualifications. Son of 
a country clergyman, a scholar of Jesus College, 
Cambridge, he studied classics, mathematics, and 
divinity, intending to take holy orders. Doctrinal 
scruples caused him to turn instead to medicine. 
While a fellow of Jesus College, he came across
Gay's suggestion of the all-pervasive power of as- 
sociation and elaborated on this theme in a brief 
pamphlet, Conjecturae quaedam de sensu, motu, 
et idearum generatione (1730). All his further 
work was devoted to the development of the ideas 
here outlined, and he determined to try physio- 
logical and philosophical theory “to give guidance 
to the art of ethics and customs.” Nineteen years 
elapsed before the publication of his magnum opus. 
During this time he was occupied with a medical 
practice among rich and poor, with family life, and 
with the cultivation of numerous friends. Among 
his intimates were Stephen Hales, Sir Hans Sloane, 
William Cheselden, Bishop Butler, and John Byrom. 
He was elected fellow of the Royal Society in 1743. 
Always receptive to other people’s ideas, he was 
courageous, if not always discreet, in trying out 
new ones, and his contemporaries did not take him 
quite seriously as a philosopher or scientist. 

The Observations seems to have made little im- 
pression at the time of its publication. Joseph 
Priestley, however, who found the implied neces- 
sitarianism congenial, republished the work in 
1775, omitting all the physiological parts, which 
he regarded as valueless speculation. Hartley's theory
of association became the major influence on
the English empiricist school of the nineteenth 
century, notably in the psychological works of the 
Mills and Alexander Bain (Warren 1921; Ribot 
1870). 

Although there is little in Hartley's work that 
directly influenced his immediate successors, it still 
bears examination for its freshness of outlook and 
catholicity of interest. Hartley wove into his formal 
argument a mass of material whose relevance to 
the science of psychology has progressively in- 
creased since his day. Parallel to the analysis of 
consciousness, from primitive sensations to ab- 
stract ideas, runs an account of the evolution of 
behavioral mechanisms, from the simplest “auto- 
matic actions” (reflexes) through the “secondarily 
automatic actions”—that is, basic muscular skills
such as walking, whose character Hartley was the 
first to define and comment on—to the supposed 
cerebral concomitants of such “mental” skills as 
thinking. The reflexes, of course, derive from Des- 
cartes (1662). Of particular interest is the analysis 
of the mechanics of habit, which Hartley evidently 
gained from some lesser known passages in Malebranche’s
De la recherche de la vérité (1674-1675).
In his account of the higher mental functions,
Hartley revealed the considerable influence
of the work of John Wilkins, bishop of Chester, 
on language (1668). Thus, his theory of memory 
relies on the “perpetual recurrency of the same impressions,
and clusters of impressions” which come
to form “the rudiments or elements of memory” in 
the same way as letters of the alphabet form the 
rudiments of language (1749, part 1, pp. 374- 
375). Similarly, his discussion of conjecture and 
hypothetical thinking draws on the work of Abra- 
ham de Moivre (1718) and other contemporary 
mathematicians who were concerning themselves 
with the practical aspects of probability theory. 

It was from Thomas Willis (1664; 1672) that 
Hartley gained the knowledge of the anatomy of 
the brain which brought him close to a post- 
Darwinian view of the relation of man and animals. 
“The brute creatures,” he wrote, “prove their near 
relation to us, not only by the general resemblance 
of the body but by that of the mind also” (1749, 
part 1, pp. 413-414). Hartley was led, on the basis 
of the very meager information available, to a 
primitive doctrine of the localization of cerebral 
function. This preoccupation passed out of psy- 
chology and neurology and, in the form in which 
he stated it, only reappeared in late nineteenth- 
century experimental and clinical studies of brain 
function. Nor had Hartley neglected that other pre- 
occupation of nineteenth-century medicine, the 
hysterical disorders. 

Hartley's theology, sometimes trite and always 
constrained by contemporary Biblical interpreta- 
tions, is nevertheless nodal to his philosophy as a 
whole. It was his primary concern to provide re- 
ligion with a natural and scientific basis through a 
genuinely psychological analysis of religious ex- 
perience. Hartley regarded the second (and much 
longer) part of the Observations, devoted to man’s 
“Duty and Expectations,” as interdependent with 
the first. He seems to have perceived that his at- 
tempt did not wholly succeed, but he pressed the 
laws of mechanism as far as they would go. “All 
the evidences for the mechanical nature of body 
and mind,” he wrote, “are so many encouragements 
to study them faithfully and diligently, since what 
is mechanical may be both understood and reme- 
died” (1749, part 1, p. 267). Later psychologists 
have increasingly adopted this principle of method, 
even if some, like Hartley, have remained aware 
of elements in human consciousness that are re- 
sistant to it. 


KATHLEEN C. OLDFIELD 


[For the historical context of Hartley’s work, see the 
biography of LOCKE; for discussion of the subsequent
development of Hartley’s ideas, see NERVOUS
SYSTEM, article on STRUCTURE AND FUNCTION OF 
THE BRAIN; STRESS.] 
HAURIOU, MAURICE


Maurice Hauriou (1856-1929), French legal 
theorist, was born at Ladiville (Charente). He 
studied law at Bordeaux and, after taking first place 
in the 1882 competition for agrégation, was ap- 
pointed professor at Toulouse. He taught there for 
46 years, until his death. 

Hauriou taught the history of law for several 
years. He then went on to teach administrative law, 
and it was in this field that he was to make his most 
lasting contributions. He became deeply interested 
in sociology—in 1893 he wrote Les facultés de droit 
et la sociologie—but after his most important work 
in this area, La science sociale traditionnelle (1896),
aroused considerable opposition, he decided, as he 
wrote to Georges Renard, to present in juridical 
dress the ideas that he had at first tried to get ac- 
cepted in sociological dress (Renard 1931). Indeed, 
this approach is implicit in the commentaries on 
decisions that he made for the Sirey collection for 
37 years, from 1892 to 1929 (1929), and in his 
Précis de droit administratif (1892), which went 
through 11 editions during his lifetime. But it is
primarily in his Principes de droit public (1910),
his Précis de droit constitutionnel (1923), and in 
his article “La théorie de l'institution et de la fonda- 
tion” (1925) that his sociophilosophical views may 
be found. 

Hauriou’s thought is marked by liberalism, indi- 
vidualism, and attachment to tradition. He was 
strongly influenced by Bergson and by the doctrine 
of the Catholic church; he described himself, in the 
Principes de droit public, as a Comtean positivist 
turned Catholic positivist, that is, as a positivist who 
will go so far as to utilize the social, moral, and 
juridical content of Catholic dogma. As a tradi- 
tionalist, he saw himself restoring the classic doc- 
trines of French public law and defending natural- 
law conceptions from the menace of the aggressive 
objectivism of Hans Kelsen. 

However, Hauriou was not simply a conservative 
content to do battle from old positions. He did not 
regard the substance of law as purely conceptual 
but saw it as a concrete reality that embodies ob- 
jective ideas and is subject to their creative influ- 
ence. In this way he introduced into juridical 
thinking the sociological data that the normative 
(objective) method prescribed. Hauriou did not 
believe that the sociological study of the law meant 
merely the observation of social facts; instead, he 
was concerned with discovering how government 
de facto becomes government de jure (1892). As 
he saw it, this transformation occurs as ideas are 
incorporated into facts, thus creating what G. 
Gurvitch has called normative facts. 

What has become known as the “Hauriou theory 
of the institution” focuses on this process of the 
penetration of social reality by ideas and analyzes 
the way in which ideas order that reality. An insti- 
tution, by Hauriou’s definition, is an “idea of a 
project, or enterprise, that becomes a reality and 
persists juridically in a social milieu” (1925, p. 10). 
An institution is an objective reality, the product 
of a particular social equilibrium, and the source 
of legal rules. With this definition Hauriou elimi- 
nated all voluntaristic interpretations of the crea- 
tion of law. He introduced a dynamic factor into 
the realm of law, for although institutions give sta- 
bility to the law, this is not a stability that pre- 
cludes change. The continuity of an institution de- 
pends on its constant adaptation to new conditions 
of social life (1910, pp. 213, 250). 

On these bases Hauriou erected a theory of the 
state. Rejecting the theory of the social contract, 
he based the origin of the state on the institution: 
individuals do not delegate power to a governing 
minority; they assent, by custom, to that institution 
which is the state. While the subject consents only 
indirectly to the power of the law, he accepts di- 
rectly the institution to which that power is linked 
and in whose behalf it is exercised. Furthermore, 
Hauriou believed that the state exists only in cer- 
tain civilizations, that the institution of the state is 
taken seriously only by certain races, and that with 
certain others it becomes a mere facade behind 
which traditions of personal power or the organiza- 
tion of clans and clienteles are perpetuated. As he 
saw it, there are peoples that are incapable of put- 
ting into effect the idea of the commonweal (1929). 

The state, therefore, is distinct from the nation. 
It is subject to law only at its own will, and this is 
why Hauriou, on the political plane, believed firmly 
in the merits of democracy and, more particularly, 
of parliamentary democracy. But taking a wide his- 
torical perspective, he considered this type of gov- 
ernment to be nothing more than a temporary 
political form. For all its recognized merits, parlia- 
mentary democracy is only a stage in an irrevers- 
ible development that begins with aristocratic 
monarchies and ends with administrative empires 
(1929, p. 142). The successive types of govern- 
ment evolve as governmental institutions become 
nationwide; within the nation, more and more nu- 
merous elites are formed, which enter into the 
machinery of government and transform its opera- 
tion by their presence. 

So far as the functioning of parliamentary de- 
mocracy in particular is concerned, Hauriou ad- 
vocated the dominance of the executive over the 
legislative power. He believed that the best way to 
maintain the superiority of the executive over the 
legislative power is to limit the power of the depu- 
ties to initiate legislation. In his view, the concilia- 
tion of power and liberty is brought about by virtue 
of order, that order which follows if the people sup- 
port actions originated by the executive. This sup- 
port is almost always spontaneous; force enters into 
the picture only in exceptional cases, to keep refrac- 
tory minorities within bounds. 

Hauriou stressed that the character of the state 
is determined to a large extent by what he called its 
social constitution; that is, its socioeconomic struc- 
ture, the place of citizens within the state, and the 
rights and liberties of individuals. He was deeply 
committed to individual liberty as the prerequisite 
to law that is based on the juridical person and on 
subjective right, and he believed that individual 
initiative is the basis of social order (1929, p. vii). 
However, he did recognize that human fallibility 
makes the limitation of individualism necessary: 
liberty must be conditional on rules of morality 
and social sanctions. On the socioeconomic level, 
limitations on individual liberty can be achieved 
by certain modifications in the structure of power: 
“Freedom of association, decentralization, corpo- 
rative representation, here are three essential re- 
forms, all three interdependent, which will assure 
a harmonious separation between the state and 
positive society, and permit the development of col- 
lective intervention in economic conflicts without 
a monstrous increase in the power of the state” 
(1896, pp. 392-393). 

In the sphere of administrative law, Hauriou suc- 
ceeded in erecting an extremely solid body of doc- 
trine that is still suggestive, although it has be- 
come outmoded by changes in the extent and modes 
of action of the state. In general, to be sure, his in- 
fluence was less in his day than that of Duguit, the 
other master of French public law at the turn of 
the century. Yet despite certain obscure points, 
Hauriou’s thinking is still provocative, and his work 
foreshadowed the orientation of those French polit- 
ical scientists who are studying the nature of 
political institutions in general and of power in 
particular. 

G. BURDEAU 


[See also ADMINISTRATIVE LAW; STATE; and the biog- 
raphies of DUGUIT and KELSEN.] 
HAWTREY, R. G. 


Ralph George Hawtrey, English economist, was 
born in 1879. He was educated at Eton and Trin- 
ity College, Cambridge, where he obtained a first- 
class honors degree in mathematics. After leaving 
Cambridge he entered the administrative civil serv- 
ice and worked at the Treasury from 1904 until his 
retirement in 1945. He was knighted in 1956. 

Throughout his career he held only two academic 
(or quasi-academic) appointments: in 1928-1929 
he was given special leave by the Treasury to lec- 
ture in economics at Harvard University; and after 
his official retirement he was elected Price professor 
of international economics at the Royal Institute of 
International Affairs, a post which he held from 
1947 to 1952. 

Hawtrey is often wrongly regarded as one of the 
Cambridge academic economists and is grouped 
with Pigou, Keynes, Robertson, and other pupils of 
Marshall. But he acquired his knowledge of eco- 
nomics not at Cambridge (he never attended even 
one of Marshall’s lectures) but at the Treasury, 
where he came under the stimulating influence of 
Sir John (later Lord) Bradbury, permanent secre- 
tary of the Treasury. 

The first of Hawtrey’s books, Good and Bad Trade 
(1913), contains the germ of many of his later 
ideas on monetary theory. He subsequently devel- 
oped these ideas at greater length and with more 
precision in such major works as Currency and 
Credit (1919), The Art of Central Banking (1932), 
and Capital and Employment (1937). 

Hawtrey was one of the outstanding figures 
among those British economists who led the way 
in much of the rethinking of monetary theory after 
World War I, and he had a considerable influence 
on the early development of Keynes’s thought (Har- 
rod 1951, p. 357). The emphasis he laid on the 
concept of money as something abstract, though 
capable of numeration—a unit of account, divorced 
from any necessary link with a metallic or other 
substance—made a marked impact on the minds 
of a generation of economists who had grown up 
with the traditional nineteenth-century attitude 
toward the gold standard. 

He was one of the English pioneers of the “in- 
come” approach; thus, he wrote in Good and Bad 
Trade: “The total effective demand for all commod- 
ities per unit of time is the aggregate of all money 
incomes. The total cost of production of all com- 
modities per unit of time is the aggregate of all 
money incomes” ([1913] 1962, p. 7). He developed 
this idea further in his Currency and Credit by 
bringing in, for example, the notion of the income 
velocity of money, which he defined as the ratio of 
consumers’ outlay to the unspent margin. Con- 
sumers’ outlay (out of income) includes invest- 
ment, while the unspent margin is the supply of 
the means of payment, that is, the excess of the 
purchasing power people and businesses have ac- 
quired over what they have spent. 

Hawtrey based his monetary explanation pri- 
marily on the argument that credit is inherently 
unstable because of the working of the banking sys- 
tem and its effect on the holders of stocks in par- 
ticular. Dealers (merchants, wholesalers, and re- 
tailers) occupy a key position in the economy: 
“[They] are the economic leaders. They take the 
initiative in production, and the activity of the man- 
ufacturers depends on the orders they give” ([1919] 
1950, p. 427). Since dealers hold stocks largely 
with borrowed money, they are peculiarly suscep- 
tible to changes in the rate of interest they have to 
pay for the money they borrow. It is chiefly because 
of the importance which he attached to the role of 
dealers that Hawtrey, unlike Keynes and many 
other economists of recent times, always main- 
tained that the operative instrument for credit reg- 
ulation is the short-term, not the long-term, rate of 
interest. 

While Hawtrey agreed that a low short-term rate 
may be able to do little to assist an economy to re- 
cover from a deep depression, he was convinced 
that a sufficiently high bank rate (possibly even 
10 per cent or higher) can damp down and counter- 
act inflationary forces caused by credit expansion, 
however strongly they may be working. He recently 
contended that the 7 per cent bank rate, which 
operated for six months in 1964-1965, was largely 
ineffective because it was too low at a time when 
the long-term rate (the yield on consols) was itself 
7 per cent. 

His views as to the utility of public works under- 
went a good deal of modification in the course of 
time. In his earliest work he contended that “the 
Government by the very fact of borrowing for this 
expenditure is withdrawing from the investment 
market savings which would otherwise be applied 
to the creation of capital” ([1913] 1962, p. 260). 
Later he said: “It is not the Government expendi- 
ture [on capital works] that gives employment, but 
the Government borrowing. The borrowing would 
have the same effect if it were to meet a deficit due 
to a remission of taxation” (1928, pp. 112-113). 
The shift in his position is even clearer in the fol- 
lowing passage: “Now it is quite true that capital 
outlay undertaken by the Government may conduce 
under appropriate conditions to an expansion of 
general demand. It must be additional, that is to 
say, must not by using up the resources of the in- 
vestment market, prevent or delay an equal amount 
of privately initiated capital outlay. It need not nec- 
essarily be financed by avowedly inflationary meth- 
ods; if it merely sets in motion money held idle in 
balances for want of eligible investments, it will 
enlarge the flow of money” (1944, p. 95). But he 
went on to say that business ought never to be 
allowed to lapse into such a state of depression and 
stagnation that government expenditure is the ap- 
propriate remedy. If this is allowed to happen, it 
points to a failure on the part of those who control 
the banking system to exercise their proper func- 
tion. He consistently preached the doctrine that the 
best way to stop depression is to stop the preceding 
boom; and for this purpose credit regulation can be 
fully effective. 

Hawtrey’s position in the ranks of modern mone- 
tary theorists does not depend so much upon the 
validity of his views on the causation and control 
of the trade cycle as upon the deeper and clearer 
understanding which he made possible of both the 
operation of the banking system and the role of 
money in a modern economy. 

His name is associated in the minds of most 
economists with the doctrine that the trade cycle is 
essentially a monetary phenomenon, but it should 
not be overlooked that his writings include a care- 
ful analysis of the part played by the period of pro- 
duction and by the widening and deepening of 
capital in different phases of the cycle. 

No account of his work, however short, should 
omit mention of the contribution Hawtrey made 
at the International Monetary Conference held at 
Genoa in April 1922. He was one of the represen- 
tatives of the British Treasury there, and it was 
largely through his influence that agreement was 
reached on the desirability of cooperation between 
the central banks in the regulation of credit in 
order to prevent undue fluctuations in the purchas- 
ing power of gold. However, the Genoa Resolutions 
were never put into effect. 

When Hawtrey was 82 years of age, he pub- 
lished The Pound at Home and Abroad (1961), 
in which, inter alia, he upheld with customary 
vigor the thesis that the primary causes of the re- 
cent difficulties of the British economy were, first, 
the fixed parity of the sterling exchange with a 
depreciating dollar, and second, the undervalua- 
tion of the pound ever since it was devalued in 
1949 from $4.03 to $2.80—in this latter conten- 
tion his has been a lone voice among British 
economists. This book also contains his memo- 
randum of evidence submitted to the Radcliffe 
Committee on the Working of the Monetary Sys- 
tem and some trenchant criticisms of parts of the 
report of that body. 

Although Hawtrey adhered in general to the 
same fundamental tenets throughout his career, he 
modified their expression so as to take account 
both of historical developments and of the evolu- 
tion of contemporary economic thought, and it is 
by his later books and editions that his work should 
be assessed. 

C. W. GUILLEBAUD 
HEALTH 


“Health is a state of complete physical, mental 
and social well-being and not merely the absence 
of disease or infirmity,” according to the World 
Health Organization (1946). It will be the purpose 
of this article to develop some concepts about 
health and disease, exploring a few implications 
of the WHO definition in the context of both West- 
ern and non-Western medical ideas. First, notions 
of singular and multiple causation of disease will 
be contrasted. Second, three types of ideas about 
health will be defined. Finally, some applications 
of the health concept to units beyond the human 
individual will be mentioned. 


Disease and its causes 


Illness is a disvalued process that impairs the 
functioning or appearance of a human person and 
may ultimately lead to death. The definition of 
health given by the WHO includes social as well 
as physical and mental well-being. This reflects a 
concern with the person as a member of human 
groups—an entity certainly not limited to the body 
of that person. The components of an individual 
(e.g., blood, body, soul, spirit, shadow, name, etc.) 
are defined differently from one culture to the next. 
The death of the organism, however, is a biological 
constant which is taken into account conceptually 
in all cultures, and customs prescribe how the dis- 
position of the corpse is to be arranged. Different 
components of the individual may be thought to 
depart from the presence of the living at different 
times, and these various departures are marked by 
a series of ceremonies (van Gennep [1909] 1960, 
pp. 146 ff.). Some components, such as the “soul,” 
may be thought never to cease existing entirely but 
to remain near the living or in some locality spe- 
cially set aside for its kind. 

Disease, then, may involve a temporary or per- 
manent impairment in the functioning of any 
single component, or of the relationship between 
components making up the individual. An impair- 
ment of a person, furthermore, need not be re- 
stricted to a decrease in his ability to function in 
his ordinary ways: for example, among the Ashanti 
of West Africa, a congenital birthmark which leads 
to no discomfort or danger of death can be con- 
sidered a sufficiently severe fault to disqualify a 
man from the office of chief. In many cultures, 
theories of disease will include explanations of con- 
genital defects or imperfections, and the distinction 
between these and other illnesses may become rele- 
vant for further analysis (Polgar 1963). 

Explanations of illness are not only useful to 
reaffirm the values of a social unit or to make death 
psychologically more tolerable for the next of kin 
but serve most immediately to indicate courses of 
preventive and curative action. To effect prevention 
or cure one should identify a course of events 
which presumably has produced the impairment. 
Herein lies the rationale of diagnosis, which is one 
of the three basic elements of all medical systems 
(the other two being therapy and prophylaxis). 

Notions of singular causation. During the last 
decades of the nineteenth and the early part of the 
twentieth century, Western medicine was heavily 
dominated by the notion that most diseases are a 
result of infection caused by microorganisms. This 
type of conception—that disease simply results 
from the entry by a foreign agency into the body 
of the patient—is paralleled by the ideas found 
among many tribal people that illness is caused by 
“object intrusion” or “spirit possession.” Walther 
Riese has drawn attention to this similarity of ideas 
in stating that “ontologic” etiology (a conception 
of disease as caused by a monadic “alien-ferment”) 
“in its crudest form . . . identifies these agents, if 
not the diseases themselves, with demons, in its 
scientific form, with germs” (Riese 1953, pp. 66- 
67). He does not imply, of course, that demons 
and germs are equally valid concepts in an em- 
pirical sense. 

The emergence of the “doctrine of specific etiol- 
ogy of disease” (Dubos [1959] 1961, p. 90) as the 
dominant idea in medicine is related to the mech- 
anistic world view prevalent in the late nineteenth 
century. Far older features of Western thought, such 
as the grammatical dualism of subject and predi- 
cate, the Judeo-Christian and Platonic mind-body 
dichotomy, and the experimental approach of the 
alchemists, provided a suitable background for the 
development of this “doctrine.” Of the greatest im- 
mediate relevance to it were the discoveries of 
Pasteur and Koch in the realm of bacteriology. 
Instead of emphasizing the patient and his total 
environment, as Western medical traditions had 
done previously, proponents of this “doctrine” 
spread the notion that all important infections 
could be controlled by therapeutic serums and pre- 
ventive vaccines specific for all microbes (Dubos 
[1959] 1961, p. 130). Although a number of vac- 
cines and antitoxins had been developed before the 
turn of the century, it was not until the 1930s that 
the sulfa drugs were discovered, and it was another 
decade later that penicillin came to be used. The 
great decreases in the mortality of children and 
young adults, which are nowadays often attributed 
to clinical medicine and the use of specific drugs, 
actually preceded these discoveries and mostly re- 
sulted from better nutrition and the hygienic meas- 
ures carried out under the leadership of medical 
reformers, many of whom had even opposed the 
germ theory of disease (Rosen 1958, pp. 225 ff.; 
Dubos [1959] 1961, p. 131). 

In the contemporary practice of clinical medi- 
cine, the inadequate care often received by patients 
unfortunate enough to suffer from a disability for 
which no specific etiology can be identified is symp- 
tomatic of the legacy of the bacteriological era. 
Von Mering and Earley (1965) trace the difficulties 
of such problem patients to, among other factors, 
the hospital as the main locale for diagnosis and 
treatment, as well as to the “growth of medicine as 
a science of tests and measurements rather than 
an art involving the five senses.” These authors 
find that “the clinic physician and general practi- 
tioner share a kind of ‘molecular man’ orientation 
which seems to predispose them to be more con- 
cerned with the specifics of the presenting com- 
plaint, and to look eagerly for major disease in 
every bed or consulting room” (von Mering & 
Earley 1965, p. 199; see also Pflanz 1964). 

Multicausal conceptions of disease. The recent 
theoretical developments away from the doctrine 
of specific etiology are spearheaded by advocates of 
comprehensive medical care and psychosomatic 
medicine and by some epidemiologists. All three of 
these segments of the medical community regard 
illness as an interaction of many factors and, cor- 
respondingly, favor treatment of patients once 
more as total organisms in a complex setting. One 
of the foremost modern exponents of this view is 
the epidemiologist John Gordon, who has shown 
the interplay of the host, the agent, and the (physi- 
cal, biological, and social) environment in the 
spread of a good number of both infectious and 
noninfectious diseases (see, for example, Gordon 
1958). The studies of John Cassel, another noted 
epidemiologist, on the spectrum of health disorders 
resulting from independently documented socio- 
cultural processes exemplify a further step away 
from the one cause-one disease manner of think- 
ing (Cassel 1964). Although writers in the psycho- 
somatic tradition of medicine often use concepts 
like “stress” or “conflict” as if they were specific 
causes of illness, the emphasis in this school of 
thought is on the patient's physical and mental 
well-being, and consideration is often given to his 
social milieu as well (King 1963). Comprehensive 
medical care is more than a movement to improve 
the institutional means by which patients and 
sometimes families are medically supervised. The 
theory that underlies these arrangements includes 
rejection of both the dominant disease orientation 
of modern Western medicine and the organic— 
functional dichotomy, and it places a strong em- 
phasis on the patient as a person (Steiger et al. 
1962). 

Multicausal conceptions of disease are neither 
new in the Western medical tradition nor unique 
to it. One main theme in the Hippocratic writings 
is that disease is to be traced to an imbalance be- 
tween the person and his external environment; 
much emphasis is also given to the relationships 
among different environmental factors, such as 
exercise and diet, and to the connections between 
disturbances in an organ and the whole body 
(Sigerist 1951-1961, vol. 2, pp. 317 ff.; Dubos 
[1959] 1961, pp. 117 ff.). In non-Western socie- 
ties there are many multicausal ideas about 
disease. The distinction between conditions that 
make persons particularly susceptible and events 
that precipitate the onset of the disease is particu- 
larly common: for example, the Maori of New 
Zealand see “bad acts” by the patient as predispos- 
ing to, and external spirits or objects as the im- 
mediate cause of, an illness episode (Newell 1957); 
in the Middle East, a well-formed male child is 
identified as especially susceptible to attack by the 
“evil eye” (Shiloh 1958). 

Related to this division between predisposing and 
precipitating factors is the division between the 
reasons why a particular person becomes ill at a 
particular time and the explanation of the way in 
which it happens. These latter two types of causes 
may be termed incidence notions and etiological 
notions (Polgar 1962, pp. 166 ff.); they also bear 
some similarity to the Aristotelian efficient and 
material causes (Riese 1953, pp. 66 ff.). In some 
non-Western medical systems there are categories 
for “natural” diseases—usually minor ills such as 
the digestive problems of infants (Nurge 1958)— 
which do not require an explanation for the occur- 
rence of the disability in the particular instance 
and hence do not raise questions about who is “re- 
sponsible.” In urbanized as well as nonurban so- 
cieties, however, the search for the transgression 
of the patient himself or the malevolent action of 
another being (human or supernatural) is a major 
element of the diagnostic process. 

In small tribal or peasant communities, the as- 
signment of responsibility for illness to a relative 
or neighbor (whose departure from prescribed 
norms of behavior is pinpointed as a breach of 
taboo, witchcraft, irresponsibility, or sin) helps to 
bring latent interpersonal conflicts into the open 
where they are more easily resolved (Paul 1953; 
Firth 1959, pp. 135 ff.). Similarly in the Judeo- 
Christian tradition the attribution of illness to sin- 
ful behavior served to reinforce the mores of the 
society. With increasing secularization, this diag- 
nostic category became less satisfactory, and in 
scientific medicine it was replaced by “naturalistic” 
explanations. However, residues of this earlier con- 
cept of sin still affect attitudes toward disease; for 
patients and their families, a physician's diagnosis 
which fails to blame anyone for the occurrence of 
the illness also fails to deal with the sense of guilt 
they often have and leaves them vaguely dissatis- 
fied (Sigerist 1951-1961, vol. 1, p. 157). 


Three conceptions of health 


If disease is seen as an individual's departure 
from perfectly well-meshed social or physiological 
performance, health, by contrast, becomes an 
asymptote—an ideal that can be approached but 
never attained in actuality. In the WHO definition, 
the expression “complete physical, mental and so- 
cial well-being” [emphasis added] echoes this type 
of conception. 

Variants of the asymptotic concept. Two main 
variants of the asymptotic notion about health can 
be identified. One variant, the harmonious working 
together of disparate elements, is a dominant theme 
in the Indo-European tradition, antedating Galen’s 
notion of the “four fluids” and manifest today in 
the influence of Walter B. Cannon’s ideas about 
homeostasis. The yang and yin of Chinese philos- 
ophy also indicate a search for balance, the restora- 
tion of which is one of the healer’s primary goals 
(Huard & Wong 1959, pp. 105 ff.). Grand designs 
of physiological, physical, and metaphysical order 
—each replicating the elements of the other—are 
typical of classical times. 

The second type of asymptotic conception is a 
backward-looking romanticism, which has been 
described by Dubos in his chapter “The Gardens 
of Eden” ([1959] 1961, pp. 1-25). For Rousseau 
and his followers, the ills that beset Western society 
are consequences of the departure from a perfect 
state of harmony with nature that is entailed in 
the process of becoming civilized. Freud also ac- 
cepted the myth of a precursor of modern man who 
was exempt from the latter's neuroses, since this 
imaginary “savage” did not inhibit the biological 
drives toward aggression and sexuality (Riese 
1953, pp. 14 ff.). Remnants of ideas about “primi- 
tive man’s” closeness to “nature” remain today in 
such medical folklore as the myth of easier parturi- 
tion among American Indians and the “innate” 
superiority of their sense organs. When this theme 
is transposed to the life cycle of individuals, chil- 
dren may be seen romantically (for example, by 
the poet Wordsworth) as endowed with sensitive 
understanding which they gradually lose by ex- 
posure to the eroding influence of the “civilized” 
ways of adults. 

In operational terms, the asymptotic definition 
of health is mostly negative; it implies the absence 
of manifest disturbance. While this notion has 
advantages in focusing attention on the nonexist- 
ence of a clear break between the presence or 
absence of disease, by the same token it makes for 
difficulties in conducting health surveys and plan- 
ning for medical facilities (Lewis 1953; U.S. De- 
partment of Health . . . 1966). 

The elastic concept. Another set of notions 
about health centers on the accumulation of re- 
sistance to potential danger. This may be termed 
the elastic concept. Examples of health behavior 
derived from this manner of thinking include re- 
stricting the water intake of children to make them 
hardy, homeopathic medicine, and variolation (of 
differing empirical value, of course). Adversity is 
not regarded here as a disruption of some prior or 
ultimate harmony but rather as an ordinary and 
expected circumstance for which preparations 
can and should be made. This manner of regarding 
health seems to play a substantial part in modern 
preventive medicine. Another good contemporary 
example of an application of the elastic view of 
health is psychoprophylactic training for child- 
birth, by which women are taught to cope with the 
hardships of delivery through psychological condi- 
tioning together with certain exercises (Bing et al. 
1961). Some accumulated resistance potentials can 
be measured operationally in the scientific labora- 
tory by testing an individual’s capacity to produce 
specific antibodies when challenged by an antigenic 
substance or his capacity for continued adequate 
performance of sensory tasks under controlled 
changes in temperature, humidity, pressure, and 
other conditions. 

The open-ended concept. The outstanding dif- 
ficulty with the asymptotic notion of health (which 
is circumvented by elastic conceptions) is its un- 
attainability. By turning the argument around, one 
can start with death as a kind of absolute zero and 
fix no upper limit for human functioning (Bates 
1959, p. 59). This may be termed the open-ended 
conception of health. The outstanding example of 
this ideology is involved in the attempts to formu- 
late a philosophy of “positive mental health.” While 
some concepts used by the writers in this tradition, 
such as “self-actualization,” would fall in the cate- 
gory of asymptotic notions, the criteria of growth, 
zest, and creativity clearly belong under the open- 
ended rubric. The theorists of positive mental 
health share with the authors of the WHO defini- 
tion and others mentioned above the desire to 
construct a manner of looking at health which is 
based “not merely on the absence of disease or in- 
firmity.” However, they go beyond the WHO view 
of health, and beyond most of preventive medicine 
generally, in their search for positive goals which 
are independent of disease (Jahoda 1958). Health 
promotion in nutrition, for example, aims to pre- 
vent deficiency diseases (a goal which is of the 
“elastic” type) or persuade people to consume 
recommended daily norms of nutrients (an asymp- 
totic-type idea). By contrast, “zestful living” does 
not reach an optimum at certain levels of energy 
expenditure and could even make people occasion- 
ally more prone to injury or disease. 


Modern medical practice 


In terms of actual health behavior in urban 
societies, open-ended conceptions are more likely 
to be put into practice in national parks, beauty 
parlors, bathrooms, or athletic studios than in the 
offices of doctors or psychologists. Physicians may 
recommend vacations, walks in the “fresh air,” or 
other types of exercise, but this is usually pre- 
scribed for incipient illness or problems of over- 
weight rather than for promoting health as such. 
In non-Western societies one may find practices 
aimed at increasing supernatural power, physical 
strength, prosperity, wisdom, virility, or femininity, 
which are conceptually and behaviorally integrated 
with actions to prevent or cure disease. In indus- 
trialized societies, however, increased specializa- 
tion results in the separation of medical institu- 
tions from the religious, esthetic, recreational, and 
economic spheres. As mentioned above, the focus 
of Western medicine narrowed as the doctrine of 
specific etiology of disease became the dominant 
view. Thus, health promotion through such cus- 
toms as taking cold showers, swallowing vitamin 
pills to “pep you up,” giving laxatives routinely to 
children, taking walks, and the like is seldom trans- 
mitted as part of the professional medical system 
but rather is passed on through relatives, friends, 
or the mass media. 

The attempt of the mental hygienists to develop 
a new and positive content for the concept of 
health is further limited by concern for the possi- 
bility of their encroachment on other institutions. 
Brewster Smith (1961, p. 301) has commented on 
the difficult position of the psychologist who is 
asked to provide notions of mental health as sub- 
stitutes for weakened religious values; and Freidson 
(1961/1962, pp. 125 ff.) has warned about the 
dangers of bringing questions of nonconformity to 
moral, legal, or political norms under the umbrella 
of medicine. In spite of these problems, it may be 
predicted that scientific medicine will gradually 
adopt a more open-ended conception of health as 
the technological tasks of health maintenance in a
population with increasing proportions of older 
people are accomplished and as the relationship 
between people and their environment once again 
becomes the central arena of medical concern. 


Health beyond the individual 


The WHO definition does not specify whether 
its terms apply only to the health of the individual. 
In the Greek medical system of the fifth century 
B.C. and that of some modern Western physicians, 
as described above, health is seen as an interaction 
between a person and his surroundings. This type 
of conception is carried even further in the ideas 
of many non-Western peoples. Margaret Mead 
(World Federation for Mental Health 1953, pp. 
217 ff.) mentions several examples of “continuity” 
between the well-being of man and of the soil and 
between the body and “other bodies of the social 
unit.” It is but a short step from a focus on these 
interrelationships to a consideration of the larger 
unit itself, without necessarily looking at the in- 
dividual within it at all times. 

As the student in schools of public health is 
often reminded, his “patient” will usually be a com- 
munity. Public health is thus not only the name of 
a medical specialty but also refers to the well-being 
of various publics (Brockington 1958, pp. 19 ff.). 
The health of other entities, such as families, so- 
cieties, the human species, or the entire ecosphere 
of this planet, has also been discussed. 

The resistance potential of a human collectivity 
to an epidemic of infectious disease cannot be de- 
scribed as the sum or the average of individual 
immunity: the degree of resistance in different 
age groups or the spatial dispersion of the popula- 
tion are crucial in estimating the level of “herd 
immunity” (Gordon 1958). Mental illness is regarded
by a number of psychologists and psychiatrists
as a pathological state of an entire family.
The illness may be discovered through the request 
for treatment of a single member who acts as the 
“messenger boy,” carrying the information about 
the trouble to the outside world, although he is 
neither the only one sick nor necessarily the one 
most seriously disturbed (Gruenberg 1957). There
are also some writers who consider it appropriate 
to label entire societies (for example, Nazi Ger- 
many) as pathological and to wonder if any 
“healthy adjustment” is possible for individuals 
living in them (Devereux 1956). 

Western medical practitioners almost inevitably 
put a higher value on prolonging individual life 
than on the health of the social unit; witness the
grotesque situation where catheters, sedatives, exorbitant
hospital bills, and oxygen tents prevent a
dying man from making a decent and meaningful 
departure from his relatives. Under different cul- 
tural circumstances the reverse evaluation may 
predominate, as among the Navajo Indians of the 
southwestern United States, who are more con- 
cerned with the well-being of the entire kin group 
than with the maximum comfort of, say, a con- 
genitally malformed infant (Levy 1962). 

For an entire species, health may be regarded 
as a matter of Darwinian “fitness” for continued 
survival. Unless a species is approaching death 
through extinction, however, it may be impossible 
to diagnose its current degree of fitness. The pos- 
sibility of using modern medicine to keep alive 
individuals with genetically inherited diseases and 
the higher reproductive rate of the impoverished 
classes have been a focus for alarm by some eugen- 
icists. Whether any real danger of “deterioration” 
exists for the gene pool of the whole human species 
is debatable (Medawar 1960); but, of course, con- 
ceptions of health which regard the proliferation 
of a “chosen people” as good and their relative 
submergence by “heathens” or other out-groups as 
bad are not a recent development (Haller 1963). 

The health of the entire ecological system that 
exists on the surface of the earth can also be 
evaluated in terms of the survival potential of 
“life.” Evolution on this planet—inorganic, bio- 
logical, and social—has in the past moved toward 
increasing degrees of entropy retardation (Polgar 
1961). The catastrophe of nuclear war or the 
slower but equally irreparable consequences of ac- 
celerating population growth are threats to the 
survival not only of “civilized” man but also of the 
energy balance of our entire terrestrial ecosystem. 
According to this view, our future well-being in 
this world as we know it depends on mankind’s act- 
ing deliberately to safeguard and to continue ac- 
cumulating the ordered energy and information 
that evolution represents.

STEVEN POLGAR 


[See also ILLNESS; MEDICAL CARE; MENTAL HEALTH; 
PUBLIC HEALTH. Other relevant material may be 
found in the articles on CREATIVITY; EPIDEMIOLOGY; 
EUGENICS; PSYCHOSOMATIC ILLNESS; SOCIAL DARWINISM.]
HEARING


Hearing is an especially important avenue by 
which we gain information about the world around 
us; one reason is that it plays a primary role in 
speech communication, a uniquely human activity. 
Clearly, then, our ability to perceive our environ- 
ment and therefore to interact with it, both in a 
physical and verbal or abstract sense, is dependent 
in large measure upon our sense of hearing [see 
LANGUAGE, article on SPEECH PATHOLOGY; PERCEPTION,
article on SPEECH PERCEPTION].

The study of the auditory system is carried on 
by a variety of disciplines, including psychology, 
physics, engineering, mathematics, anatomy, physi- 
ology, and chemistry. This article deals primarily 
with work in psychology, although it will be neces- 
sary to refer to other areas for a more complete 
understanding of certain phenomena. The periph- 
eral hearing mechanism is reviewed from the 
standpoint of anatomy, hydromechanical action, 
and electrical activity. The basic subjective corre- 
lates of auditory stimuli are discussed, together 
with current research and theory. In all cases, we 
are concerned with the normal rather than patho- 
logical auditory system. 

The peripheral hearing mechanism. When one 
speaks of the ear, the image that first comes to 
mind is the flap of cartilaginous tissue, or the 
pinna, fixed to either side of the head. The presumed
function of the pinna is to direct sound
energy into the ear canal, or external auditory 
meatus. In some animals, such as the cat, the 
pinna may be directionally oriented independently 
of the head. For all practical purposes, however, 
man does not possess this ability. It has been 
shown that because of the particular shape of 
man’s pinnae, sound arriving at the head is modi- 
fied differentially depending on its direction of 
arrival. This may well provide a cue for the localization
of a sound source in space.

The external meatus and the eardrum. The ex- 
ternal meatus is a tortuous canal about one inch 
in length, closed at the inner end by the eardrum, 
or tympanic membrane. The meatus forms a passageway
through which sound energy may be
transmitted to the inner reaches of the ear. The
meatus has the general shape of a tube closed at 
one end; it tends to resonate at a frequency of 
about 3,000 cycles per second. Because of this 
resonance the pressure of sound waves at the ear- 
drum, for frequencies in this vicinity, is twenty 
times greater than that at the pinna. The meatus, 
therefore, serves as a selective amplification de- 
vice, and, interestingly enough, it is primarily in 
this frequency range that our hearing is most 
sensitive.
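
The resonance figure quoted above can be checked against the textbook quarter-wavelength formula for a uniform tube closed at one end, f = c / (4L). The sketch below is an idealization only: the speed of sound (343 meters per second) and the treatment of the meatus as a straight, rigid tube exactly one inch long are assumptions made for illustration, not values given in this article.

```python
# Idealized check of the ear-canal resonance: a uniform tube closed at one end
# resonates where its length equals a quarter of the wavelength.
SPEED_OF_SOUND_M_PER_S = 343.0  # assumption: air at roughly room temperature
INCH_IN_METERS = 0.0254


def quarter_wave_resonance_hz(tube_length_m, speed_of_sound=SPEED_OF_SOUND_M_PER_S):
    """Lowest resonance of an ideal closed tube: f = c / (4 * L)."""
    if tube_length_m <= 0:
        raise ValueError("tube length must be positive")
    return speed_of_sound / (4.0 * tube_length_m)


# Assumption: the meatus is modeled as a straight tube one inch long.
meatus_resonance_hz = quarter_wave_resonance_hz(1.0 * INCH_IN_METERS)
print(f"idealized meatus resonance: {meatus_resonance_hz:.0f} cycles per second")
```

Under these assumptions the estimate falls somewhat above 3,000 cycles per second, which is in reasonable agreement with the approximate figure in the text given how crude the tube model is.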

The middle ear. The eardrum marks the bound- 
ary between the outer and middle ear. At this point 
variations in sound pressure are changed into 
mechanical motion, and it is a function of the 
middle ear to transmit this mechanical motion to 
the inner ear, where it may excite the auditory 
nerve. This transmission is effected by three small
bones, the auditory ossicles, which form a bridge 
across the middle ear. The ossicles are named for 
their shapes: the malleus (hammer), which is 
attached to the eardrum; the incus (anvil), which 
is fixed to the malleus; and the stapes (stirrup), 
which articulates with the incus and at the same 
time fits into an oval opening of the inner ear. The 
ossicles not only provide simple transmission of 
vibratory energy but in doing so actually furnish a 
desirable increase in pressure. The ossicles are 
held in place by ligaments and may be acted upon 
by two small muscles, the tensor tympani and the 
stapedius. The function of these muscles is not 
clear, but it has been suggested that their contrac- 
tion, together with changes in mode of ossicular 
motion, serves at high levels of stimulation to 
reduce the effective input to the inner ear. 

The inner ear. The “foot plate” of the stapes 
marks the end of the middle ear and the beginning 
of the inner ear. The inner ear actually consists of 
two portions that, although anatomically related, 
serve essentially independent functions. Here we
are concerned only with the cochlea, which contains
the sensory end organs of hearing. The
cochlea, spiral in shape, is encased in bone and 
contains three nearly parallel, fluid-filled ducts 
running longitudinally. The middle of the three 
ducts has as its bottom a rather stiff membrane 
known as the basilar membrane. On this mem- 
brane is the organ of Corti, within which are the 
hair cells, or the sensory receptors for hearing. The 
hairs of the hair cells extend a short distance up 
into a gelatinous plate known as the tectorial mem- 
brane. When the basilar membrane is displaced 
transversely, the hairs are moved to the side in a 
shearing motion and the hair cells are stimulated. 

Displacement of the basilar membrane is brought
about by fluid movement induced by the pistonlike
action of the stapes in the oval window. Since fluid 
is essentially noncompressible, its displacement is 
possible because of the existence of a second open- 
ing between the cochlea and the middle ear, the 
round window. When the stapes moves inward, 
a bulge is produced in the basilar membrane and 
the round window membrane moves outward. The 
bulge or local displacement of the basilar mem- 
brane is not stationary but moves down the 
membrane away from the windows. If the move- 
ment of the stapes is periodic, such as in response 
to a pure tone, then the basilar membrane is dis- 
placed alternately up and down. Thus, when a pure 
tone is presented to the ear, a wave travels con- 
tinuously down the basilar membrane. The ampli- 
tude of this wave is not uniform but achieves a 
maximum value at a particular point along the 
membrane determined by the stimulus frequency. 
High frequencies yield maxima near the stapes; 
lower frequencies produce maxima progressively 
farther down the membrane. 

Electrical potentials. Many of the mechanical 
events just described have an electrical counter- 
part. The cochlear microphonic, an electrical poten- 
tial derived from the cochlea, reflects the displace- 
ment of the basilar membrane. The endocochlear 
potential represents static direct current voltages 
within various portions of the cochlea, whereas 
the summating potential is a slowly changing di- 
rect current voltage that occurs in response to a 
high-intensity stimulus. Also observable is the 
action potential, which is generated by the auditory 
neurons in the vicinity of the cochlea. Neural 
potentials reflecting the activity of the hair cells 
are transmitted by the eighth cranial nerve to the 
central nervous system. 

Psychoacoustics. Although it is true that one of 
the principal functions of man’s auditory system 
is the perception of speech, it does not necessarily 
follow that the exclusive use of speech stimuli is 
the best way to gain knowledge of our sense of 
hearing. In the study of hearing, the use of simple 
stimuli predominates and the common stimulus is 
the sine wave, or pure tone. 

Traditionally, psychophysics has investigated 
problems in (1) detection of stimuli, (2) detec- 
tion of differences between stimuli, and (3) rela- 
tions among stimuli. Psychoacoustics has followed 
a similar pattern. 

Threshold effects. It has been shown that a 
number of factors are influential in determining 
the minimum magnitude (often called intensity 
or amplitude) of an auditory signal that can be 
detected. Specifically, absolute thresholds are a 
function primarily of the frequency and duration
of the signal. Under optimum conditions, sound 
magnitudes so faint as to approach the random 
movement of air molecules, known as Brownian 
movement, may be heard; the displacement of the 
basilar membrane in these cases is a thousand 
times less than the diameter of a hydrogen atom. 
Masked thresholds, those obtained in the simul- 
taneous presence of a signal and other stimuli that 
mask its effect, depend on the frequency and 
relative magnitude of each stimulus. 

In addition, previous auditory stimulation will 
affect subsequent absolute thresholds. Generally 
the effect is to lower auditory sensitivity, although 
in some instances sensitivity may be enhanced. 
Pertinent factors here include the magnitude, fre- 
quency, and duration of the “fatiguing” stimulus 
as well as the interval between the presentation of 
the fatiguing and test stimuli. 

Differential thresholds. There are as many 
studies dealing with the detection of differences 
between two stimuli as there are ways in which 
stimuli may be varied. Only a few examples, therefore,
will be cited here. With pure tones, thresholds
for hearing frequency differences become greater 
as frequency is increased, and smaller as magni- 
tude is increased. Differences as small as one part 
in a thousand are detectable. Differential thresholds 
for magnitude depend upon the same parameters, 
but in a more complex way. 

Noise stimuli, those sound waves lacking a peri- 
odic structure, may be varied with respect to 
magnitude, bandwidth, and center frequency, but 
differential thresholds with noise are generally 
predictable from the pure tone data. 

Signal detection theory. Recently, there has 
come into psychophysics, principally by way of 
psychoacoustics, a new way of thinking about de- 
tection data. This new approach makes use of 
signal detection theory and offers some novel ideas. 
First, it offers a way to measure sensory aspects of 
detection independently of decision processes. That 
is, under ordinary circumstances, the overt re- 
sponse to the signal depends not only upon the 
functioning of the receptor but on the utility of 
the response alternatives. If it is extremely im- 
portant that a signal be detected, under ordinary 
circumstances a subject is more likely to give a 
positive response regardless of the activity of the 
receptor. 

Second, it rejects the notion of a threshold; that 
is, a threshold in the sense that a mechanism exists 
which is triggered if some critical stimulus level is 
exceeded. One basis for such rejection is clear from 
the previous paragraph. 
The theory of signal detectability substitutes for
the concept of a threshold, the view that detection 
of a stimulus is a problem in the testing of statis- 
tical hypotheses. For example, the testing situation 
can be so structured that two stimulus conditions 
exist: the signal is present and the signal is absent. 
Clearly, there are four alternatives: (1) the listener 
can accept the hypothesis that the signal was 
present when, in fact, it was; (2) he can reject 
this hypothesis under the same conditions; (3) he
can accept the hypothesis that the signal was ab- 
sent when, in fact, it was; (4) he can reject this 
hypothesis. By making certain assumptions about 
the characteristics of the stimulus and proceeding 
under the ideal condition that the observer makes 
use of all information in the signal, the probabili- 
ties associated with these alternatives may be 
mapped out. It has been shown that an actual ob- 
server behaves as if he were performing in this 
fashion, and his performance may therefore be 
compared to that ideally obtainable. 
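
The four alternatives listed above, and the separation of sensory from decision factors, can be illustrated with a short sketch. It assumes the common equal-variance Gaussian model, in which sensitivity (d') and response criterion are computed from the hit and false-alarm rates; the labels "hit," "miss," "correct rejection," and "false alarm," the function names, and the trial counts are illustrative choices rather than anything specified in this article.

```python
from statistics import NormalDist


def classify_trial(signal_present, said_yes):
    """Name the four outcomes of a yes/no detection trial."""
    if signal_present:
        return "hit" if said_yes else "miss"
    return "false alarm" if said_yes else "correct rejection"


def sensitivity_and_criterion(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) under an assumed equal-variance Gaussian model.

    Rates of exactly 0 or 1 have no finite z-score, so inv_cdf raises an error
    for them; real analyses usually apply a correction first.
    """
    hit_rate = hits / (hits + misses)
    false_alarm_rate = false_alarms / (false_alarms + correct_rejections)
    z = NormalDist().inv_cdf
    d_prime = z(hit_rate) - z(false_alarm_rate)                # sensory component
    criterion = -0.5 * (z(hit_rate) + z(false_alarm_rate))     # decision component
    return d_prime, criterion


# Hypothetical counts: 100 signal trials and 100 no-signal trials.
d_prime, criterion = sensitivity_and_criterion(
    hits=84, misses=16, false_alarms=16, correct_rejections=84
)
print(f"d' = {d_prime:.2f}, criterion = {criterion:.2f}")
```

In this model a listener who is rewarded for never missing a signal would shift the criterion, raising hits and false alarms together, while d' would stay roughly the same; that is the sense in which the sensory measure is independent of the decision process.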

Suprathreshold phenomena. With signals that 
are easily audible, it is generally conceded that 
there are three primary perceptual dimensions to 
hearing: pitch, loudness, and quality. Considerable 
effort has been expended in searching for the 
stimulus correlates and physiological mechanisms 
associated with these dimensions. 

Pitch and theories of hearing. With a simple 
pure tone, pitch is usually associated with the fre- 
quency of vibration of the stimulus. High fre- 
quencies tend to give rise to high pitches. His- 
torically, there have been two general types of 
theories of hearing: “place” theories and “volley” 
theories (often called frequency theories). 

The most commonly suggested mechanism of
pitch is based on a place hypothesis, which holds 
that the pertinent cue for pitch is the particular 
locus of activity within the nervous system. It
seems likely that stimulation of specific neurons 
or groups of neurons is related to the displacement 
patterns of the basilar membrane. The chief alter- 
native to the place hypothesis is the volley or rate 
of neural discharge hypothesis, which holds that 
the rate or frequency with which neural discharge 
occurs within the auditory nerve is the determinant 
of pitch; the higher the frequency, the higher the 
pitch. The frequency of neural discharge, in turn, 
is synchronous with the stimulus frequency. 

Any result in which pitch is influenced by any
parameter other than frequency is not in accord
with the neural discharge hypothesis. Such results
include changes in pitch brought about by differences
in the magnitude of the stimulus, by masking,
fatigue, or auditory pathology. On the other
hand, the place hypothesis cannot readily explain
how a pitch corresponding to a particular fre- 
quency is perceived, when, in fact, little or no 
energy exists in the stimulus at that frequency. 
Such a situation exists for several pitch phenom- 
ena: the residue, periodicity pitch, time separation 
pitch, and Huggins’ effect. 

Loudness. Loudness is related to the magnitude
of the stimulus, but not exclusively so. Frequency
and duration of the stimulus are secondary
factors in determining loudness. The loudness of a 
stimulus depends upon prior acoustic stimulation 
in somewhat the same manner that absolute 
threshold does. Generally, loudness decreases fol- 
lowing adaptation, and the pertinent parameters 
are the same as those that influence threshold 
shifts. 

Quality. Quality, or timbre, is a complex per- 
ceptual quantity that appears to be associated with 
the harmonic structure of the sound wave. The 
greater the number of audible harmonics, the 
richer or fuller the sound will appear. The converse 
also appears to be true. Relatively little work has 
been done in this area. 

Scaling and harmonics. Psychophysical scaling,
or the assessment of the relation between the mag- 
nitude of the stimulus and the magnitude of the 
sensation, has been of interest for many years. 
New methods, whose chief virtues are simplicity 
and relative freedom from bias, have recently 
stimulated additional research. Auditory dimensions
that have been studied include loudness,
pitch, duration, volume, density, and noxiousness. 

The principal finding is that in nearly all cases 
the relation between stimulus and sensation is a 
power function. 
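
A power function has the form sensation = k × stimulus^n, which plots as a straight line of slope n on logarithmic axes. The following sketch shows one simple way such an exponent might be estimated, by least squares on the logarithms. The data are synthetic, generated from an exponent of 0.6 and a constant of 2.0 chosen purely for illustration; neither value comes from this article.

```python
import math


def fit_power_function(stimulus, sensation):
    """Least-squares fit of sensation = k * stimulus**n on log-log axes.

    Returns (k, n). All values must be positive so that logarithms exist.
    """
    xs = [math.log(s) for s in stimulus]
    ys = [math.log(r) for r in sensation]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx                          # slope of the log-log line
    k = math.exp(mean_y - exponent * mean_x)      # intercept, back in linear units
    return k, exponent


# Synthetic, noise-free judgments (illustrative values only).
stimulus_levels = [1.0, 2.0, 4.0, 8.0, 16.0]
judged_magnitudes = [2.0 * s ** 0.6 for s in stimulus_levels]

k_hat, n_hat = fit_power_function(stimulus_levels, judged_magnitudes)
print(f"k = {k_hat:.3f}, exponent n = {n_hat:.3f}")
```

With real magnitude estimates the points scatter around the line, and the fitted exponent differs from one perceptual dimension to another.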

Nonlinear effects. In simple systems in which 
response magnitude is a nonlinear function of 
stimulus input, harmonics are generated when the 
system input is in the form of a simple sinusoid. 
When the input is two sinusoids, or pure tones, 
then in addition to harmonics, components exist 
whose frequencies are equal to the sum and the difference
of the input frequencies. Such effects are
seen when the auditory system is driven at moder- 
ate and high intensities. That is, additional tones 
called aural harmonics and sum and difference 
tones are perceived corresponding to the predicted 
frequencies. This seems to indicate that the audi- 
tory system behaves in a nonlinear fashion over 
the upper part of its dynamic range. 
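
The appearance of harmonics and of sum and difference components can be demonstrated numerically with any simple nonlinear system. The sketch below passes two pure tones through a quadratic distortion, y = x + a·x², and measures the amplitude present at selected frequencies. The quadratic form, the coefficient a = 0.1, the tone frequencies (700 and 1,000 cycles per second), and the sampling rate are all assumptions chosen for illustration; nothing here is a model of the ear itself.

```python
import cmath
import math

SAMPLE_RATE_HZ = 8000
N_SAMPLES = 8000  # one second, so every whole-number frequency fits an integer number of cycles


def two_tone(f1_hz, f2_hz):
    """Sum of two unit-amplitude pure tones."""
    return [
        math.sin(2 * math.pi * f1_hz * n / SAMPLE_RATE_HZ)
        + math.sin(2 * math.pi * f2_hz * n / SAMPLE_RATE_HZ)
        for n in range(N_SAMPLES)
    ]


def quadratic_distortion(signal, a=0.1):
    """Hypothetical nonlinear system: output = x + a * x**2."""
    return [x + a * x * x for x in signal]


def amplitude_at(signal, freq_hz):
    """Amplitude of the sinusoidal component at freq_hz (single-bin Fourier sum)."""
    acc = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * n / SAMPLE_RATE_HZ)
        for n, x in enumerate(signal)
    )
    return 2.0 * abs(acc) / len(signal)


distorted = quadratic_distortion(two_tone(700, 1000))
for label, freq in [
    ("first input tone", 700),
    ("second input tone", 1000),
    ("difference tone", 300),
    ("sum tone", 1700),
    ("harmonic of first tone", 1400),
    ("harmonic of second tone", 2000),
    ("unrelated frequency", 500),
]:
    print(f"{label:>24} ({freq} cps): {amplitude_at(distorted, freq):.3f}")
```

The undistorted two-tone input contains energy only at 700 and 1,000 cycles per second; the additional components appear only after the nonlinearity is applied.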

Binaural hearing. Under most conditions, stim- 
uli arising from a single sound source are repre- 
sented somewhat differently at each of the two 
ears. The auditory system makes use of these subtle
differences in such a fashion that we are able to
localize the sound in space. The binaural system is 
especially sensitive to small temporal disparities 
in the two inputs, being capable of discriminating 
differences as small as 0.000008 second. Intensity 
differences at the two ears also play a role in 
localization. 
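
To give a sense of scale for the temporal figure quoted above, the sketch below uses a deliberately simplified geometry: two point receivers a fixed distance apart and a distant source, so that the difference in arrival time is d·sin(θ)/c. The ear separation of 0.2 meter and the speed of sound of 343 meters per second are assumptions, and the model ignores the bending of sound around the head.

```python
import math

EAR_SEPARATION_M = 0.2          # assumption: straight-line distance between the ears
SPEED_OF_SOUND_M_PER_S = 343.0  # assumption: air at roughly room temperature


def interaural_delay_s(azimuth_deg, separation_m=EAR_SEPARATION_M,
                       speed=SPEED_OF_SOUND_M_PER_S):
    """Arrival-time difference for a distant source at the given azimuth.

    Azimuth is measured from straight ahead; simplified path-difference model.
    """
    return separation_m * math.sin(math.radians(azimuth_deg)) / speed


def azimuth_for_delay_deg(delay_s, separation_m=EAR_SEPARATION_M,
                          speed=SPEED_OF_SOUND_M_PER_S):
    """Inverse of interaural_delay_s: the azimuth that produces a given delay."""
    ratio = delay_s * speed / separation_m
    if abs(ratio) > 1.0:
        raise ValueError("delay is larger than this geometry can produce")
    return math.degrees(math.asin(ratio))


smallest_shift_deg = azimuth_for_delay_deg(0.000008)
print(f"0.000008 second corresponds to about {smallest_shift_deg:.2f} degree of azimuth")
```

Under these assumptions the quoted temporal disparity corresponds to a shift of a little under one degree from straight ahead, and the largest possible disparity (a source directly to one side) is well under a thousandth of a second.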

Although localization effects are the most dramatic
events in binaural hearing, other interesting
binaural phenomena occur. For example, less en- 
ergy is required for threshold if both ears, rather 
than just one ear, are stimulated. Similarly, the 
same loudness may be achieved binaurally with 
less energy than it could be monaurally. 


Our sense of hearing provides us with informa- 
tion relative to vibratory or acoustic events. This 
information relates to the magnitude, frequency, 
duration, complexity, and spatial locus of the 
event. The peripheral auditory system is an ele- 
gantly designed hydromechanical structure. The 
sensory cells themselves and complexities of their 
innervation are of considerable importance, but are
less well understood. In total, hearing is an ex- 
tremely versatile sensory process with exquisite 
sensitivity. 

ARNOLD M. SMALL, JR. 


[Other relevant material may be found in ATTENTION; 
NERVOUS SYSTEM; PSYCHOPHYSICS; SENSES; and in 
the biography of HELMHOLTZ.] 


BIBLIOGRAPHY 

CONFERENCE ON THE NEURAL MECHANISMS OF THE AUDI- 
TORY AND VESTIBULAR SYSTEMS, BETHESDA, MD., 1959 
1960 Neural Mechanisms of the Auditory and Ves- 
tibular Systems. Springfield, Ill.: Thomas. > The first 
16 chapters deal with the auditory system. 

GELDARD, FRANK A. 1953 The Human Senses. New York: 
Wiley. 

HARVARD UNIVERSITY, PSYCHO-ACOUSTIC LABORATORY 1955 
Bibliography on Hearing. Cambridge, Mass.: Harvard 
Univ. Press. > Contains more than ten thousand 
titles. 

HELMHOLTZ, HERMANN L. F. von (1862) 1954 On the 
Sensations of Tone as a Physiological Basis for the 
Theory of Music. New York: Dover. > First published 
as Die Lehre von den Tonempfindungen als physiologische
Grundlage für die Theorie der Musik. A
classic which in many ways is as important today as 
when it was written. 

JERGER, JAMES (editor) 1963 Modern Developments in
Audiology. New York: Academic Press. > Especially 
valuable for its readable review of signal detection 
theory and its coverage of the effect of acoustic stimu- 
lation on subsequent auditory perception. 

VON BÉKÉSY, GEORG (1928-1958) 1960 Experiments in
Hearing. New York: McGraw-Hill. > A compilation 
of Georg Von Békésy’s writings on cochlear mechanics, 
psychoacoustics, and the ear’s conductive processes. 


WEVER, ERNEST G. 1949 Theory of Hearing. New York:
Wiley. > A review of theories of hearing with a spe- 
cial attempt to show the cogency of a volley theory. 

WEVER, ERNEST G.; and LAWRENCE, MERLE 1954 Physiological
Acoustics. Princeton Univ. Press. > Emphasis
on the mechanics of the middle ear. 


HECKSCHER, ELI 


Eli F. Heckscher (1879-1952) was from the be- 
ginning of his academic studies at the University 
of Uppsala (Sweden) both an economist and a 
historian. A distinctive feature of his life work was 
his insistence on the importance of both economic 
theory and statistical evidence to economic history. 
He first stated this position in his dissertation Till 
belysning af järnvägarnas betydelse för Sveriges
ekonomiska utveckling (1907; “The Contribution 
of the Railroads to Swedish Economic Growth”) 
and later expressed it in the acknowledgment of his 
debts to Alfred Marshall and William Cunningham, 
one a theorist and the other a historian. 

In 1909 Heckscher became the first professor of 
economics and statistics at the new business school 
in Stockholm, and twenty years later a personal 
chair was created for him as the head of the Stock- 
holm Institute for Economic History. 

Heckscher’s passionate involvement with the 
central issues of Swedish economic and social pol- 
icy throughout the first half of the twentieth cen- 
tury produced a steady flow of pamphlets and 
articles, and even an incomplete bibliography con- 
tains 1,148 items (Ekonomisk-historiska Institutet, 
Stockholm 1950). As a publicist he was independ- 
ent of political parties and spoke from a position 
of laissez-faire liberalism. He considered state in- 
tervention proper to few areas other than education 
and basic social legislation, although he did advo- 
cate strong government measures to maintain com- 
petition. He was orthodox in monetary and fiscal 
matters; he championed the restoration of the gold 
standard in the 1920s and objected to deficit finance 
in the 1930s. 

The application of economics to problems of 
policy and history was more congenial to Heck- 
scher than the development of pure theory. His 
most important theoretical contribution, an article 
entitled “The Effect of Foreign Trade on the Dis- 
tribution of Income” (1919), was, in fact, pro- 
voked by Knut Wicksell’s review of a collection of 
his essays on Swedish economic policy. In this 
article Heckscher stressed the role of factor endow- 
ments in giving rise to comparative advantage. His 
purpose was to establish that free trade, possibly 
combined with a policy of redistributive taxation, 
was preferable to protection, which would give rise
to economic loss for the country as a whole. Elab- 
orated by Bertil Ohlin, the argument came to be 
known as the Heckscher-Ohlin theorem about the 
tendency, on certain assumptions, for the prices 
of factors of production to move toward equality 
as the result of trade in commodities. 

In his teaching, articles, and essays, Heckscher 
roamed the entire span of Western economic his- 
tory, but he made his main scholarly contributions 
in two specific fields. His active concern with pub- 
lic affairs led him first to the historical study of 
economic policy. His Continental System (1918),
written during World War I concurrently with a
study of war economics, remains the classic inter- 
pretation of Napoleonic policy, although he has 
been charged with dismissing too lightly the poten- 
tial danger to the English economy. It was, how- 
ever, as the author of the monumental study of 
Mercantilism (1931) that Heckscher secured a 
lasting international reputation. Conceived as a 
study of the intellectual history of European eco- 
nomic policy between the Middle Ages and the 
coming of laissez-faire, this extensive analysis of 
the objectives and the rationale of mercantilism 
united strands of earlier appraisals. Like Schmoller
and Cunningham, Heckscher regarded the pursuit 
of power and national unification as the main- 
spring of mercantilist policy; like Adam Smith he 
found protectionism and erroneous monetary con- 
ceptions to be central and objectionable features 
of the “mercantile system” of commercial policy. 

In spite of his exposure of the intellectual crude- 
ness of mercantilist doctrine and despite his own 
distrust of regulatory policy, the doctrine emerged 
in Heckscher’s treatment as a comprehensive view 
of economic life rather than a system of random 
fallacies. Indeed, since he believed that the concept 
of “mercantilism” explains the general character- 
istics of European history at that time, he stressed 
the uniformity rather than the diversity of national 
economic policies. Later criticism and controversy 
arose from Heckscher’s emphasis on economic 
ideas, i.e., his refusal to explain economic policy 
in terms of particular conditions and interests, 
which he regarded as of secondary importance 
compared to the prevailing conception of economic 
facts and relationships. 

Heckscher’s second field of major achievement 
was his pathbreaking economic history of Sweden. 
His magnum opus on Sveriges ekonomiska historia 
från Gustav Vasa (1935-1949) was to remain unfinished;
but his shorter history of Swedish economic
growth (1941a) constitutes the first survey
of its kind. The backwardness of the medieval
Swedish economy relative to the European continental
countries and the lateness and rapidity of
Swedish industrialization were the problems that 
Heckscher attacked, with a rare attention to both 
details and the grand design. Although his human- 
ism and erudition led him into every branch of 
historical inquiry, his scholarly credo stressed the 
importance of quantifiable evidence. The relatively 
rich material on Swedish demographic conditions 
since the early eighteenth century provided him 
with especially valuable material, but he also stud- 
ied every aspect of the economy by his creative 
statistical use of archival sources.

Heckscher’s view of Swedish economic history 
has on several points been revised by later research. 
(See the appendices to the second Swedish edition 
of Svenskt arbete och liv 1941b.) This does not 
detract from its significance as a point of departure 
for any interpretation of the growth of the Swedish 
economy. Some of Heckscher’s statistical analyses 
—for instance, his studies of the role and prev- 
alence of “natural economy” in fiscal administra- 
tion and of the structure of Swedish foreign trade 
in the sixteenth century, and his Malthusian analy- 
sis of Swedish population movements in the eight- 
eenth century—have, of course, been modified or 
replaced. However, such revisions are in the spirit 
of Heckscher’s own achievement and are therefore 
a tribute to it rather than a rejection. 


GÖRAN OHLIN


[Directly related are the entries ECONOMIC THOUGHT,
article on MERCANTILIST THOUGHT; INTERNATIONAL
TRADE; and the biographies of CUNNINGHAM; MAR- 
SHALL; SMITH, ADAM.] 
Georg Wilhelm Friedrich Hegel (1770-1831),
who left his deepest mark upon the philosophy of 
history, is commonly regarded as the representative 
philosopher of German idealism in the post-Kantian 
era. To his contemporaries, however, he appeared 
rather as the continuator of a mode of thought 
begun by Kant (1724-1804) and amplified by 
Fichte, Schelling, and the romantic school, which 
responded to certain logical and metaphysical prob- 
lems originally raised by the natural sciences. 
Born in 1770, the year in which Kant inaugurated 
his professorship at Königsberg with his disserta- 
tion De mundi sensibilis atque intelligibilis forma 
et principiis (“The Form and Principles of the Sen- 
sible and Intelligible Worlds”), Hegel grew up in 
the relatively liberal milieu of refugee Protestantism
in the south German Duchy of Württemberg.
As a student in Tübingen, 1788-1793, he participated
in the then widespread enthusiasm for the
French Revolution. He was to spend the next seven 
years in Bern and Frankfurt, tutoring the children 
of patrician families. During these years he im- 
mersed himself in the philosophy of religion, was 
influenced by his reading of Spinoza, and wrote 
(but did not publish) some highly critical studies 
of Christian theology, which remained unknown 
for over a century. His appointment to a post at 
the University of Jena in 1801 set him free for 
systematic teaching in philosophy; and this phase 
of his activity was crowned by the publication of 
his first major work, The Phenomenology of Mind 
(1807), completed during the battle of Jena. 

The upheaval that followed the triumph of
Napoleon over the Prussian army gave Hegel the
personal satisfaction of seeing the emperor—that 
“world-soul on horseback,” as he described him— 
but it also deprived him of his teaching position. 
After editing a newspaper in Bamberg (Bavaria) 
for a year, he was appointed rector of a Gymna- 
sium at Nuremberg, a post he held from 1808 to 
1816. During these years (which also witnessed 
his marriage to Marie von Tucher) he completed 
his second great work, the Science of Logic 
(1812-1816). Appointed to a chair of philosophy 
at Heidelberg (1816-1818), he there published his 
Encyclopedia of Philosophy (1817), a work which 
carried into effect a scheme already propounded 
by him in 1801: that of a tripartite philosophical 
system embracing a logic, a philosophy of nature, 
and a philosophy of spirit. The last mentioned 
formed the basis of the Philosophy of Right (1821), 
the last major work published during his lifetime, 
which was written during his early years at the 
University of Berlin, where he taught from 1818 
to 1831. By that time he had become famous, and 
his courses attracted students from all over Ger- 
many. The lectures he gave during those years on 
history, religion, aesthetics, and the history of 
philosophy were published after his death and 
helped to spread his fame to a wider public. During 
these last years he had become a confidant of 
the Prussian minister of education and something 
of a conservative, but he retained his theological 
rationalism and some of his early enthusiasm for 
the French Revolution and Napoleon. His death 
in November 1831, during a cholera epidemic, 
precipitated the dissolution of his school into con- 
flicting liberal and conservative factions. 

The extreme complexity of Hegel’s thought, and 
the all-embracing character of his system, along 
with a certain ambiguity in his political utterances, 
made it possible for exponents of very different 
schools to lay claim to his authority; and through- 
out the remainder of the century his name was a 
battle cry for opposing parties in philosophy and 
politics. Contrary to a widespread misconception, 
Hegel was never in his lifetime associated with 
German nationalism; he gave guarded support to 
the Prussian state but remained firmly committed 
to the principle of equality before the law. His 
mature political philosophy has affinities with that 
of Edmund Burke, and even his notorious utter- 
ances on the subject of the Prussian monarchy 
and the loyalty due to its representatives maintain 
a balance between Hobbesian authoritarianism 
and conventional Toryism: they are not in the 
slightest degree “totalitarian” and bear only an 
indirect relation to the doctrines of those twentieth-century
German and Italian ideologists who invoked
his authority.

While the philosophy of history (notably in the 
form it assumed in nineteenth-century Germany)
bears Hegel’s mark, the impact of Hegelianism on 
the social sciences is more difficult to assess. Social 
thought in Hegel’s own day was subsumed under 
political theory. The study of economics was still 
in its infancy (although Hegel had become ac- 
quainted with it during his Frankfurt years), and 
social theorizing in general turned upon constitu- 
tional problems. These indeed had been the subject 
of Hegel’s first publication, in 1798, a critical study 
of Swiss constitutional developments. His last 
major work, the Philosophy of Right, develops a 
political philosophy which holds a precarious bal- 
ance between rationalism and authoritarianism, 
somewhat in the manner of Fichte, Kant, and the 
Enlightenment theorists who preceded them. By 
the 1840s this book had furnished a target for the 
first major broadside directed by the youthful Marx 
against Hegel in his Kritik des Hegelschen Staats- 
rechts (1843). In general the theory of civil so- 
ciety is subsumed by Hegel under the theory of 
the state; the latter is viewed as the embodiment 
of rationality, as against the conflicts of material 
interests that make up the daily life of the civil 
society. Hegel counterposes the rational universality
of the whole to the particular desires and
strivings of the individuals in a manner reminiscent
of Hobbes and contrary to Rousseau, for
whom the “general will” springs from the fusion 
of individual wills. Such a fusion, and with it the 
notion of a social contract, is denied by Hegel on 
the grounds that civil society is too anarchic to 
generate a true consensus. The “actuality of the 
ethical Idea” is attained in the state, which is 
“absolutely rational,” “mind objectified,” “the actu- 
ality of concrete freedom” ([1821] 1942, pp. 155- 
160). Hegel differs from Hobbes in holding that 
the state is not to be viewed as the guarantor of 
civil society, but as an end in itself. It is not simply 
the guardian of personal freedom and property, 
for in that case loyalty to the state would be op- 
tional, whereas according to Hegel it is only as a 
member of the political realm that the individual 
has objective reality and an ethical life (ibid., 
p. 156). This exaltation of the state appears to 
have been a fruit of Hegel’s youthful enthusiasm 
for the classical polis. It led him into what even 
his contemporaries regarded as an extravagant 
glorification of the Prussian monarchy; but since 
it was coupled with conservative distrust of pop- 
ular movements and indeed of the people, of that 
part of the body politic “which does not know
what it wants” (ibid., para. 301), Hegel’s doctrine
cannot be described as totalitarian even by impli- 
cation. Its nature is traditionalist rather than 
romantic and belongs to the eighteenth century 
rather than to the twentieth. 

Hegel did, however, repudiate traditional natural 
law and—by implication—international law. His 
influence on European (notably German) thinking 
thus ran counter to the gradual acceptance of 
liberal doctrines throughout the nineteenth cen- 
tury. The Hegelian view of interstate relations, 
which entered the consciousness of the German 
educated classes by way of the Prussian bureauc- 
racy and the educational system it controlled, is 
Hobbesian and subversive of much that is regarded 
as fundamental in Anglo-American jurisprudence. 
On Hegelian principles, contracts between states 
are not valid, since sovereignty cannot be abro- 
gated by treaty. The test of sovereignty is war, 
which discloses the truth that nations are not 
subordinate to law but operate in a Hobbesian 
“state of nature.” War is necessary and may even 
be regarded as beneficial, since it is the test of a 
people's willingness to maintain its freedom and 
independence. It also makes it possible to achieve 
a degree of social integration which civil society 
by itself cannot secure (ibid., para. 324). These
doctrines became part of the official credo of 
Bismarckian Germany and may be said to have 
deepened the ideological gulf between Germany 
and the West which was first made manifest in 
the war of 1914-1918. They clearly run parallel 
to the critique of natural law doctrine implicit in 
the “pure theory of law” (reine Rechtslehre) asso- 
ciated with H. Kelsen and his followers. This influ- 
ential school of “legal positivism” maintains that 
the substantive concept of law as the embodiment 
of prelegal rules of a moral nature is metaphysical 
and irrelevant to the actual practice of lawmaking 
(Kelsen 1955; cf. Kelsen 1945). The displacement 
of the older doctrine by this positivist doctrine, 
which implicitly sanctions the abrogation of “so- 
called fundamental liberties,” may be regarded as 
a belated triumph of Hegelianism, although out- 
side central Europe similar ideas were developed 
without reference to Hegel’s philosophy. So far as 
the U.S.S.R. is concerned, its legal theory may 
perhaps be described as Hegelian; but there are 
elements of natural law doctrine both in classical 
Marxism and in the Russian Populist tradition, and 
since the late 1950s there has been a tendency to 
revive them. 

Hegel's ideas also reached the social sciences by 
a different channel: via the philosophy of history, 
and in particular through the growing influence of
Marxism. The relationship of Marx to Hegel is,
however, more complex than appears from the 
popular notion that Marx merely inverted the 
Hegelian system by “standing it on its head,” i.e., 
by substituting a supposed “materialist dialectic” 
for Hegel’s idealist one. Marx took over from Hegel 
the conception of history as the self-creation of 
man and the idea (first expounded by Hegel in 
the Phenomenology) that the prime motive force 
of the historical process is human labor, or the 
practical activity of men in society. Marx did not, 
however, subscribe to the notion that this process 
can be reduced to a logical schematization, and his 
approach to empirical history is more in tune with 
French materialism than with German idealism. 
(This part of his work in some respects anticipates 
modern sociology.) He also repudiated Hegel's political
doctrine. The state appears in the Marxian
corpus as an arbitrary external power superim- 
posed upon human society; and it is thus a form 
of “human self-alienation” [see ALIENATION].
Hegel's authoritarian political philosophy did have 
some influence on Ferdinand Lassalle (1825-1864),
and through him on German socialism, but
was antipathetic to Marx. 

The relevance of Hegel’s general philosophy to 
what is often called “historicism” is difficult to 
assess. The notion (which Whitehead restated and 
applied to the natural sciences) that the peculiari- 
ties of reality are only local and temporary con- 
cretions of a process stretching beyond them has 
been popularized by Marxian writers, but it is not 
held by them alone. Its conservative counterpart 
is the doctrine that cultures are to be regarded as 
“organic wholes” rather than as casual accretions 
of unrelated features. The Hegelian concept of 
dialectical change can be, and has been, reformu- 
lated as a description of processes whereby social 
organisms create their own environment and are 
in turn influenced by it. Hence it has been said 
that “a philosopher-scientist like Whitehead can 
restate Hegel’s theory, not knowing that it is 
Hegel's” (Collingwood 1946, p. 128). Although 
originally intended by Hegel to account for natural 
processes, the idea of a “dialectical” interrelation- 
ship between man and his environment is clearly 
of general application, and it may be that the 
long-run significance of Hegel's philosophy for the 
social sciences will be found to lie in this kind of 
approach. Hegel is unquestionably the chief orig- 
inator of what is sometimes called “process 
thought.” His philosophy finds room for the efflo- 
rescence of the higher forms of culture and for the 
values we attach to them by postulating a series 
of “levels of organization,” rising from the lower
to the higher through historically conditioned
transformations which introduce new qualitative 
changes. This concept has proved fertile in induc- 
ing historians and sociologists to look upon history 
not as a field governed by immutable “laws” but 
as a process in which something fresh is created 
at every moment. 

GEORGE LICHTHEIM 
Hermann Heller (1891-1933), German political
scientist, was born at Teschen in Austria. He 
taught at Kiel, Leipzig, Berlin, and Frankfurt. In 
1933, he emigrated and died in Madrid in the 
same year. 

Heller revived political theory in Germany, 
where it had been emptied of content by both 
legal positivism and political voluntarism. The cen- 
tral problem of order, Heller taught, is the tension 
between will and norm, which must not be re- 
solved in favor of either pole. Man’s nature is 
essentially utopian, looking toward what ought to 
be. What ought to be, however, cannot order hu- 
man existence without the acts of a public will that 
in any society makes binding decisions about legal 
rules and assures their regular enforcement. The 
modern state is essentially a hierarchically organ- 
ized unity of will that performs the function of 
deciding and enforcing positive law within a given 
territory. In relation to the legal system, the state 
is sovereign, that is, a supreme will subject to no
other will. For the sake of the law, the state must 
in an emergency maintain itself even against the 
law. The sovereign will, however, can posit law 
only within the range of transpositive fundamental 
principles of law (Rechtsgrundsätze), which are 
either formal (logical) or substantive (ethical) 
norms. 

Heller construed this higher law not as unchang- 
ing but as shaped by various historical civiliza- 
tions. Political theory must explain the state in 
world-immanent terms but not reduce it to nature. 
Rather, the state is a part of human culture (de- 
fined as facts suffused with meaning). It exists in 
and through human activities, among which it has 
reality as a gestalt, a structure retaining its identity 
in the flow of its changing parts. The proper meth- 
od of political theory is neither normative nor 
causal but, rather, sociological. It can never be
“value-free” in Max Weber's sense, although it can
free itself from bias. An explanation of the state 
must take into account underlying social entities, 
for example, historical entities of national culture 
(Kultureinheiten). Human nature, a constant in 
any political order, must be seen as shaped by his- 
tory. Man, society, and culture are basic conditions 
of any state and cannot be artificially produced. 

Much of Heller’s work dealt with the political 
crisis of the West. He conceived of it as a collapse 
of political order stemming from the destruction 
of political theory by positivism. The antipolitical 
notion of the automatically self-regulating natural 
order produced modern legal positivism, which 
construed the concept of order without the state 
and apart from any concrete content. As a reaction 
there arose the twentieth-century cult of sheer 
force—or the strong man—and the notion of the 
intrinsic value of a command. Heller blamed this 
development on the bourgeoisie, which, when it 
was in power, disavowed its former moral convic- 
tions about the necessity of order and embraced 
positivist legality instead. 

Heller was a leading German socialist, although 
not a follower of Marx, whose materialism and his- 
torical messianism he rejected. Socialism was 
justified, he felt, by the desirability of social homo- 
geneity, of a community of interests and values 
that alone can form the basis of a democratic state. 
Class antagonism endangers this community, 
which can be restored only by including the work- 
ers in its economic and cultural benefits. No such 
community is possible apart from the historically 
grown unity of common national language, tra- 
ditions, values, and ways of life. Socialism and the 
national community are interdependent. Heller's 
socialism was national although not nationalistic. 
He favored the unification of Europe as the only 
way of safeguarding national cultures in an age 
of superpowers.

Heller's thought was eclectic rather than derived
from any one master. Although he rejected
the major conclusions of Hegel, Marx, Weber, 
Dilthey, and Kelsen, he made use of some of the 
ideas and methods of each to compensate for the 
biases of the others. The result was not a historical 
dialectic but a dialectic that synthesized elements 
which political theorists had often separated. Man 
consists of both body and soul, society includes 
physical as well as spiritual meaning, legal order
presupposes both norm and will, political theory 
must use not only causal but also normative analy- 
sis, and so forth. It was his ability to hold such 
tensions in his mind without trying to resolve them
for the sake of a unified system that secured
Heller’s position as one of the strong voices of 
sanity in an age of political disorder. 


GERHART NIEMEYER 
From the perspective of posterity, Hermann Ludwig
Ferdinand von Helmholtz (1821-1894) made
his most significant contributions to the fields of 
sensory physiology and psychology. In particular, 
he laid the foundations for the experimental in- 
vestigation of the sensory processes in audition and 
vision. His successes resulted from his unusual
mathematical and mechanical abilities and from 
his skill in assembling, sifting, and logically inter- 
relating large quantities of information. 

Helmholtz was born in Potsdam, the eldest of 
five children. The atmosphere of his home, although
not scientific, was intellectual. Helmholtz
mentioned that he frequently listened to the phil- 
osophical discussions between his father, August, 
a distinguished teacher of languages and philoso- 
phy in the Gymnasium, and his colleagues. August 
Helmholtz was most important in guiding his son’s 
early education. He taught him Greek and Arabic,
encouraged him to study poetry, art, and music, 
and trained him in the writing of expository essays. 
Even at an early age, however, Helmholtz was 
attracted more to physics and mathematics than to 
the humanities. Speaking of his youth, Helmholtz
recalled, in 1891, that while the rest of his class 
in the Gymnasium read Cicero and Vergil, he him- 
self would work out optical and geometrical prob- 
lems under his desk. 

Helmholtz represented a scientific ideal to his 
contemporaries. He was a man of remarkable tal- 
ents, universal interests, and indefatigable energy. 
He published over 200 papers and books making 
fundamental contributions to the fields of anatomy, 
medicine, physics, physiology, and psychology. He 
invented scientific and medical instruments. A con- 
templative mind led him to consider the philosophic 
questions raised by developments in science. A 
sense of social responsibility led him to deplore the 
gulf separating those trained in the humanities and 
those trained in the sciences; this gulf he sought to 
bridge by delivering popular lectures explaining 
the developments of science to laymen. 

For the social sciences, Helmholtz’ views on the 
methodology of science and the theory of knowl- 
edge are of most immediate relevance. The method 
of studying and interpreting biological and psycho- 
logical phenomena was a crucial issue for the sci- 
entists and philosophers of the nineteenth century. 
Helmholtz was convinced that the same scientific 
method is appropriate for the analysis of vital and 
mental phenomena as for the analysis of physical 
phenomena. However, he did not advocate a reduc- 
tive materialism. For him, memory and learning 
are facts, the laws of which can be investigated 
by experimental procedures, even if they cannot be 
reduced to known physiological processes (Koenigsberger
[1902-1903] 1906, p. 305). Rather, he emphasized
that the basic method of all scientific
inquiry is to discover facts through systematic ex- 
perimentation and to go beyond these facts to 
general laws by analysis and abstraction. His lec- 
ture on Goethe (see Helmholtz [1853-1870] 1962, 
pp. 1-21) contains an interesting presentation of 
his views. After acknowledging Goethe’s insightful 
contributions on the morphology of the skull and 
the plant, Helmholtz brought up the matter of 
Goethe’s obstinate animosity to Newton's analysis 
of color. The difference between them, Helmholtz 
suggested, lay in their attitudes toward the acqui- 
sition and conception of knowledge. Goethe, the 
artist, believed that direct confrontation with a 
phenomenon is sufficient to provide an understand- 
ing of it. He was “accustomed to look as it were, 
right into the subject, and to reproduce his intui- 
tion. . . . His success [was] proportionate to the 
vividness of the intuition” (ibid., p. 16). Helmholtz
acknowledged that intuitive insight is the
source of the creative idea for both the scientist
and the artist but insisted that, for the scientist, 
careful observation and reflection are an initial 
step sufficient only to suggest a hypothesis. The 
establishment of scientific truth depends on experi- 
mental verification. Moreover, the artist is satisfied 
simply to describe and hesitates to abstract. “A step 
into the region of abstract conceptions which must 
be necessarily taken if we wish to penetrate into 
the causes of phenomena, scares the poet away” 
(ibid., p. 16). Goethe objected to Newton’s theory 
of color because it substitutes quantitative meas- 
urements of the color-mixing relationships for di- 
rect naturalistic observation of colors. Helmholtz 
correctly described how observational intuition led 
Goethe badly astray when he asserted that it is 
absurd for white light to be the result of a fusion 
of colored lights. On the other hand, Goethe’s ap- 
peal to experience did reveal important phenomeno- 
logical facts (the distinction between warm and 
cold colors, the existence of a psychologically unique 
red, yellow, green, and blue, and the coupling of 
the colors red and green and yellow and blue) 
whose potential physiological significance Helm- 
holtz apparently overlooked. Helmholtz’ intellec- 
tual attitude was that of the nineteenth-century 
physicist: it was the source of his acute and pene- 
trating analyses, although it occasionally led him to 
miss the point. 

Work in physics. At the age of 16, Helmholtz 
announced to his father that he wanted to study 
physics. His father, however, could not support the 
cost of an academic course and enrolled him in 
the Friedrich Wilhelm Medical Institute in Berlin. 
This institute gave free instruction on the condi- 
tion that those enrolled later spend some years as 
surgeons in the Prussian army. In Berlin, Helm- 
holtz came under the influence of Johannes Müller, 
professor of physiology at the University of Berlin, 
and Müller’s influence is apparent throughout his 
physiological research. In 1842, at 21, he received 
his doctorate for a thesis in anatomy confirming 
Müller’s conjecture that the nerve fibers originate 
in the ganglion cells. Yet Helmholtz was to join 
Du Bois-Reymond, Brücke, and Virchow, all Müller’s
students, in a pact to abolish Müller’s notion
that within the living organism there exists a sep- 
arate vital force distinct from chemical and physi- 
cal forces. These men realized that physiology 
needed to be securely established as a branch of 
chemistry and physics. Liebig had placed the ques- 
tion of vitalism within the domain of experimental 
science by transforming the philosophic question 
into the experimental problem of whether or not 
the mechanical energy and the heat produced by
an organism result entirely from its own metabolism.
Helmholtz realized that this problem is
closely connected to the law of the conservation 
of energy and turned his attention to that law. 
Joule had already established the equivalence be- 
tween mechanical work and heat, and Helmholtz 
sought to extend the principle of the conservation 
of energy to physiology. He began by investigating 
the relation between muscular activity and heat. 
For this purpose he devised the myograph, a device 
which records the twitching of a muscle, a thermal 
galvanometer sensitive to temperature changes of 
.001° centigrade, and a means by which a muscle
can be stimulated by electric shocks of short dura- 
tion. He found that a single muscle contraction 
can give rise to an increase in temperature of from 
.001° centigrade to .005° centigrade, showing that
the chemical processes in muscle contraction pro- 
duce heat. Helmholtz then turned to determining 
the equivalence between the heat produced by an 
animal and the caloric value of the food taken in. 
He calculated the amounts of heat given off in 
various ways and the efficiency of an animal's 
metabolism and concluded that the total heat given 
off corresponds to the heat generated by the oxida- 
tion of the food consumed. Unknown to Helmholtz, 
the extension of the law of the conservation of 
energy from inanimate to animate phenomena had 
already been suggested by Robert Mayer. Helm- 
holtz next set himself the task of showing the 
universal application of the conservation of energy 
principle by theoretically deriving it from more 
fundamental physical principles. He showed that 
if all matter consists of particles, and if these inter- 
act according to forces acting along the lines join- 
ing the particles, and if all forms of energy depend 
upon the motion and position of the particles, then 
the conservation of energy must hold. These results 
he announced in a lecture he gave (at the age of 
26) to the Physical Society of Berlin on July 23, 
1847.
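
The argument summarized above can be sketched in modern notation. The symbols below (masses m_i, positions r_i, pair potentials phi_ij) are assumptions introduced for illustration and are not Helmholtz' own notation; the sketch assumes that each pair force derives from a potential depending only on the distance between the two particles.

```latex
% Assumed setup: particles i = 1..N with mass m_i and position \mathbf{r}_i,
% interacting through central forces along the lines joining them.
\begin{align*}
  r_{ij} &= \lvert \mathbf{r}_i - \mathbf{r}_j \rvert, \qquad
  m_i \ddot{\mathbf{r}}_i
    = -\sum_{j \neq i} \phi_{ij}'(r_{ij})\,
      \frac{\mathbf{r}_i - \mathbf{r}_j}{r_{ij}},
    \qquad \phi_{ij} = \phi_{ji}, \\
  T &= \sum_i \tfrac{1}{2} m_i \lvert \dot{\mathbf{r}}_i \rvert^2, \qquad
  V = \sum_{i<j} \phi_{ij}(r_{ij}), \\
  \frac{dT}{dt}
    &= \sum_i \dot{\mathbf{r}}_i \cdot m_i \ddot{\mathbf{r}}_i
     = -\sum_{i<j} \phi_{ij}'(r_{ij})\,
       \frac{(\mathbf{r}_i - \mathbf{r}_j)\cdot
             (\dot{\mathbf{r}}_i - \dot{\mathbf{r}}_j)}{r_{ij}}
     = -\sum_{i<j} \phi_{ij}'(r_{ij})\,\frac{dr_{ij}}{dt}
     = -\frac{dV}{dt}, \\
  \frac{d}{dt}\,(T + V) &= 0 .
\end{align*}
```

Under these assumptions the sum of the energy of motion T and the energy of position V stays constant, which is the sense in which conservation "must hold" once all forces are central.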

Work in physiology. Helmholtz’ scientific abil- 
ity now became generally recognized, and with the 
aid of Alexander von Humboldt, president of the 
Academy of Sciences in Berlin, he was released 
from further military service. After a year as a 
lecturer in anatomy at the Academy of Arts in 
Berlin, Helmholtz became professor of physiology 
at Königsberg. He remained in Königsberg from 
1849 to 1855. 

Nerve impulses. The first important physio- 
logical problem to concern Helmholtz after his 
move to Königsberg was the rate of nerve trans- 
mission. Müller had stated that the speed of the 
nerve impulse is comparable to the speed of light
and unmeasurable. Du Bois-Reymond’s theory of
the polarization of molecules in nervous tissue sug- 
gested to Helmholtz that the nerve impulse, al- 
though electrical in nature, is not simply an electric 
current. Helmholtz stimulated nerves near and far 
from a muscle and measured the time it takes for 
the muscle to contract. The distance between the two
points divided by the difference in time
showed that the rate of the neural impulse in the 
motor nerve of a frog is about 27 meters per second. 
Helmholtz also devised the reaction time experi- 
ment to measure the speed of nerve transmission 
in humans. An electric shock was given to a point 
on the skin and the observer was directed to move 
his hand as quickly as he could. Although variable, 
the results showed that the rate of transmission of 
the impulse is between 50 and 60 meters per sec- 
ond. The novelty of these studies provoked opposi- 
tion from physiologists. In fact, both Müller and 
von Humboldt refused at first to believe the results 
and it was only through the continued arguments 
of Du Bois-Reymond that they were convinced of 
the correctness of Helmholtz’ work. 
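
The two-point measurement described above amounts to a single division, sketched here in Python. The function name and the numerical values are hypothetical and chosen only so that the result matches the order of magnitude reported for the frog motor nerve; they are not Helmholtz' recorded data.

```python
def conduction_velocity(distance_m, latency_far_s, latency_near_s):
    """Estimate nerve conduction velocity in meters per second.

    distance_m     -- distance along the nerve between the two stimulation points
    latency_far_s  -- delay before contraction when stimulating far from the muscle
    latency_near_s -- delay before contraction when stimulating near the muscle
    """
    extra_time = latency_far_s - latency_near_s
    if extra_time <= 0:
        raise ValueError("the far stimulation point must give the longer latency")
    # The muscle's own response time is common to both latencies and cancels,
    # leaving only the time spent travelling the extra length of nerve.
    return distance_m / extra_time


# Illustrative (assumed) values: 5.4 cm of nerve, 2 ms of extra delay.
velocity = conduction_velocity(0.054, 0.0040, 0.0020)
print(f"{velocity:.1f} m/s")
```

Taking the difference of the two latencies is the essential step: it removes the unknown delay in the muscle itself, so only the travel time along the nerve remains.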

Optics and visual perception. In 1851 Helmholtz
invented the ophthalmoscope, the most celebrated
of his inventions. He suggested that the
blackness of the pupil results from the fact that the 
head of an observer looking at the eye of a person 
blocks the rays of light that would illuminate the 
retina. The ophthalmoscope is a device that illumi- 
nates the retina through the pupil, rendering it 
visible. In its modern form, constructed by H. Reute 
in 1852 (M’Kendrick 1899, p. 78), the ophthalmo- 
scope enables one to look at the retina through a 
small opening in a reflecting plate. In 1852 Helm- 
holtz invented the ophthalmometer, a device for 
measuring the radii of curvature of the anterior 
and posterior surfaces of the cornea and the lens. 
At this time Helmholtz turned to the problem of 
accommodation. A Dutch physiologist, Cramer, had 
shown that accommodation consists in an increas- 
ing curvature of the lens when the eye changes 
from a far point to a near point. Helmholtz, un- 
acquainted with the work of Cramer, arrived inde- 
pendently at the same conclusion. Helmholtz’ new 
contribution to the theory of accommodation was 
his hypothesis about the way in which the curva- 
ture of the lens is altered. Using his anatomical 
knowledge, he suggested that when the fibers of 
the ciliary muscles contract, the tension of the lens 
capsule is reduced, and the lens’s own elasticity 
causes the anterior surface of the lens to bulge 
forward. 

From 1851 to 1856 Helmholtz was mainly en- 
gaged in research in physiological optics. The result 
of this work, together with a survey of the germane
literature, is presented in his monumental
three-volume handbook on physiological optics 
(Helmholtz 1867). The handbook demonstrates 
Helmholtz’ mastery of the physical, mathematical, 
and physiological problems involved in vision and 
visual perception. The worth of the handbook is 
attested to by its continued scientific pertinence a 
hundred years later. Volume 1 describes the anat- 
omy and optical properties of the eye. Volume 2 
deals with problems of visual sensation. In 1852 
Helmholtz undertook to test the laws of color mix- 
ture. Newton had failed to distinguish between the 
mixture of colored lights and of colored pigments. 
Thomas Young, in his famous paper of 1801, had 
suggested (in analogy to the mixture of pigments)
that red, yellow, and blue are the three primaries 
from which all colors can be derived by suitable 
mixtures. But to his surprise, Helmholtz discovered 
that the combination of yellow and blue lights pro- 
duces not green, as was supposed, but white. Helm- 
holtz was the first to differentiate clearly between 
the addition of lights, which produces a color because
of the superposition of excitations of different
wave lengths stimulating the eye, and the addition
of pigments, which produces a color that is a
result of the wave lengths reflected by both pig- 
ments. Yellow and blue pigments when mixed give 
green because neither pigment absorbs green light. 
After clearing up the confusion between light mix- 
tures and pigment mixtures, Helmholtz proceeded 
to develop a comprehensive theory of color vision. 
He based his theory on the extension of Müller’s
law of specific nervous energies to the sense organs 
and on the assumption that the facts of color 
matching accurately mirror underlying physiologi- 
cal processes. Just as the difference between the 
sensations of light and sound depends on the kind 
of nerve fibers stimulated, so the sensation of color 
depends on the kind of nerve fiber excited in the 
eye. The data on color mixture, moreover, indicate 
that all hues except the most saturated can be 
obtained by suitable combinations of three pri- 
maries: red, green, and violet. Helmholtz made ex- 
plicit that part of Young's theory which dealt with 
nerve activity, by postulating that the nerve fibers 
must respond in varying degrees to all wave 
lengths. Hue depends on the relative frequencies 
of the impulses set up in the three types of fibers, 
brightness on the total excitation of the fibers, and 
saturation on the amount of white produced by 
fusion. He also extended the theory to account for 
color blindness, negative afterimages, and successive
contrasts. The Young-Helmholtz theory of
color vision is still widely accepted, although certain
facts of color blindness and color thresholds
pose difficulties for it. 
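
The distinction between mixing lights and mixing pigments can be illustrated with a deliberately crude three-band model, sketched below in Python. Everything in it is an assumption made for illustration: the visible spectrum is reduced to long, middle, and short bands (roughly red, green, and blue), and the band values assigned to each light and pigment are idealized rather than measured.

```python
# Toy model: a spectrum is a tuple (long, middle, short), roughly (red, green, blue).

def add_lights(a, b):
    """Superposed lights: the energy in each band adds."""
    return tuple(x + y for x, y in zip(a, b))


def mix_pigments(a, b):
    """Mixed pigments: a band survives only to the extent both pigments reflect it."""
    return tuple(x * y for x, y in zip(a, b))


# Idealized lights (relative energy per band).
yellow_light = (1.0, 1.0, 0.0)
blue_light = (0.0, 0.0, 1.0)

# Idealized pigments (fraction of each band reflected); neither absorbs the middle band.
yellow_pigment = (1.0, 1.0, 0.0)
blue_pigment = (0.0, 1.0, 1.0)

light_mixture = add_lights(yellow_light, blue_light)        # all three bands present
pigment_mixture = mix_pigments(yellow_pigment, blue_pigment)  # only the middle band left

print("yellow + blue lights:  ", light_mixture)
print("yellow + blue pigments:", pigment_mixture)
```

In this sketch the summed lights stimulate all three bands equally, which corresponds to white, while the mixed pigments return only the middle band, which corresponds to green.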

Volume 3 deals with problems of space percep- 
tion and object perception. Central to Helmholtz’ 
theory of space and object perception is his doctrine 
of unconscious inference. Helmholtz argued in his 
inaugural lecture at Königsberg in 1852 that the 
law of specific nerve energies implies that sensa- 
tions serve only as the signs of external objects. 
That is, sensations tell us as much about the real 
nature of external phenomena as the name of a 
man tells us about the man. Accordingly, perceptual
experience involves a construction on the part
of the observer. A problem of great concern for 
Helmholtz was to determine what in the nature of 
perceptual experience is innate and what is derived 
from experience. He asserted that what in the per- 
ceptual image can be modified by experience must 
itself be considered a product of experience ([1867] 
1924-1925, vol. 3, p. 13). Applying this criterion, 
he concluded that the only innate properties are 
the intensity and the quality of a sensation. Thus, 
conscious experience consists initially of a mass of 
sight, sound, taste, and smell sensations, each dif- 
ferentiated only with respect to quality and inten- 
sity. How does one come to experience coherent 
objects ordered in space and time? The answer lies 
in the distinction between sensation and perception. 
Perception involves a two-stage process: (1) sensation
and (2) unconscious inference. Sensations
are the immediate products of stimulating sense 
organs and associated neural systems. Unconscious 
inference is a central reasoning-like process which 
utilizes the information provided by sensation to 
infer the properties of external objects and events. 
Thus, perception is not simply an aggregate of 
sensations. Perception goes beyond the sensations 
provided by the receptor organs and entails a judg- 
ment about the environment. The unconscious 
judgments in perception are automatic and irre- 
sistible. The knowledge gained in perception, based 
on sensations and the memory of sensations, is a 
knowledge of acquaintance (kennen). In contrast, 
the conscious judgments in thought are flexible 
and deliberate, and the knowledge gained, based 
on concepts, is a knowledge of awareness (wissen).
For Helmholtz these differences, however, are only 
superficial. Both the unconscious judgments in per- 
ception and the conscious judgments in thought 
involve the same mental operations and have the
same logical status ([1853-1870] 1962, pp. 178- 
181). The view that perception inherently involves 
a judgment has played an important role in psycho- 
logical theory. 


Acoustics. In 1855 Helmholtz was appointed 
professor of anatomy and physiology at Bonn, 
where he remained until 1858, when he became 
professor of physiology at Heidelberg. While still
immersed in the preparation of his Physiological 
Optics Helmholtz had decided to write a similar 
work on acoustics. In 1856 he reported the dis- 
covery of summation tones. These tones, whose 
frequencies are the sum of two input frequencies, 
are generated because of the nonlinear response 
of structures in the external ear. In 1857 Helmholtz
formulated his well-known theory of vowel
quality. Differences between vowel sounds arise be- 
cause the mouth cavity when shaped for different 
vowels serves as an adjustable resonator which 
differentially strengthens the harmonic components 
of the fundamental voice tone. Papers quickly fol- 
lowed on sound production, tone quality, harmony, 
and the musical scales. In 1862 Helmholtz’ classic 
work, On the Sensations of Tone, appeared. This 
book brings together his many experimental and 
historical studies in acoustics and music theory. 
The aim of the work is to provide connecting links 
between the fields of physical acoustics, physio- 
logical acoustics, and music. The most famous of 
Helmholtz’ contributions to audition is his theory 
of pitch perception. Applying the law of specific 
nerve energies to the ear, Helmholtz suggested that 
there exist specific fibers in the inner ear that 
respond by resonance to the different frequencies. 
Each fiber excites its own nerve so that the neural 
impulses transmitted to the brain correspond to 
the component frequencies of a sound wave. Thus, 
the resonance theory of pitch provides an explana- 
tion of Ohm’s law which asserts that it is possible 
to hear the simple harmonic components into 
which a complex sound may be analyzed. Although
such a simple resonance theory is no longer ten- 
able, experiments have demonstrated that a locus 
of maximum vibration which changes with sound 
frequency does occur on the basilar membrane in 
the way Helmholtz suggested. Helmholtz extended
his theory of pitch to provide an account of timbre, 
dissonance, consonance, and melody. Timbre de- 
pends upon the relative intensities of the harmon- 
ics present, dissonance on the beats generated 
when two frequencies affect overlapping segments 
of the basilar membrane, consonance and melody 
on the identity of harmonics produced by simultaneous
tones or a succession of tones. Helmholtz’
views on the musical relationships have been much 
discussed and criticized but are still relevant today. 

Aesthetics. Since his early student days, Helmholtz
had been concerned with problems of aesthetic
significance. Although science and art differ
in their methods, he believed them to be intimately 
connected because both seek to acquaint us with 
reality (Koenigsberger [1902-1903] 1906, p. 172). 
While the scientist expresses truth by means of 
abstract concepts, the artist expresses truth by 
means of images. Since art has a cognitive func- 
tion, the articulation of meanings that cannot be 
conveyed by words, it must conform to the laws 
governing human intelligence. The function of the 
scientist is not, however, to instruct the artist. This 
is essentially impossible since neither the artist 
nor the observer is consciously aware of the laws 
on whose fulfillment beauty depends ([1862] 1954,
p. 366). Rather, the function of the scientist is to 
provide the artist with the information which may 
aid him in his artistic invention. The artist can 
succeed in his work only when he is aware of the 
subtle relations present within a phenomenon and 
of their effects upon an observer (Koenigsberger 
[1902-1903] 1906, p. 106). In a lecture on anat- 
omy and sculpture delivered in 1848, Helmholtz 
considered the methods and benefits of teaching 
anatomy to artists (ibid., pp. 51-57), and in a 
lecture on optics and painting, Helmholtz ([1853-1870]
1962, pp. 250-286) discussed the complex
problems faced by the painter who seeks to produce 
an accurate representation of the forms, sizes, dis- 
tances, and illuminations of objects in a scene. To 
cope successfully with these problems, the artist 
must translate rather than copy nature, taking into 
careful consideration the operation of the sensory 
system. For Helmholtz the relations between art 
and science are particularly clear and striking in 
music. Musical composition depends more on the 
immediate sensations than is the case with the 
other arts, like sculpture, in which the object represented
has a much greater importance ([1862]
1954, p. 3). Part 3 of the Sensations of Tone deals
with the agreeableness of different chords, the 
formation of scales, and the elementary rules of 
musical composition. Helmholtz cautioned that the
history of music shows that the construction of 
scales and musical composition is by no means 
determined by the properties of the ear but depends 
on artistic invention and aesthetic principles which 
change with time (ibid., p. 235). 

Epistemology. Helmholtz’ interest in sensory 
physiology naturally led him to the problems of 
epistemology. Following the English tradition, he 
appealed to psychology for an understanding of 
epistemology. He argued that a knowledge of the 
sensory mechanisms is necessary for determining 
the status of objects. In a letter to Carl Ludwig in
1855, Helmholtz wrote, “I believe that philosophy
will only be reinstated when it turns with zeal and 
energy to the investigation of epistemological proc- 
esses and of scientific methods. . . . Most essential 
of all in this critical investigation is the exact 
knowledge of the processes of sense-perception” 
(Koenigsberger [1902-1903] 1906, p. 139). Kant 
had established 12 categories of a priori concepts. 
These represented the operation of inherent char- 
acteristics of the mind, which condition experience 
and do not arise from experience. Examples of 
a priori concepts are space and time, the geomet- 
rical axioms, and the law of causality. Helmholtz 
first questioned the correctness of Kant’s position 
concerning the inherent nature of space. He con- 
cluded that the experience of an ordered space is 
not innate but is derived from movements of the 
eye and body ([1853-1870] 1962, p. 156). Helm- 
holtz’ empirical theory of perception led him to an 
empirical theory of cognition. In 1866 he turned 
to the epistemology of mathematics. He undertook 
to show that the geometric axioms are also the 
products of experience and that they depend upon 
the notion of congruence, which is based upon 
mechanical movements and is itself acquired 
through experience. Thus, the axioms of geometry 
are not innate but have their origin in an individ- 
ual’s experience with objects and with the space 
in which he lives. To show that the axioms of 
Euclidean geometry are not examples of a priori 
intuitions, Helmholtz independently developed Rie- 
mannian non-Euclidean geometry. In 1888 Helm- 
holtz tried to show that the axioms of arithmetic 
can also be derived from experience. He accepted 
only the law of causality as a priori and transcen- 
dental and asserted that all thought presupposes 
the law of causation, which is therefore not deriv- 
able from experience (Koenigsberger [1902-1903] 
1906, p. 142). For example, the law of causation 
underlies the unconscious inference by which we 
pass from sensation to perception, since this proc- 
ess assumes the existence of external objects that 
cause our sensations. Whether the mechanism of 
unconscious inference is itself innate or is derived 
from experience, Helmholtz left as an open ques- 
tion (ibid., p. 110). 


In 1871 Helmholtz became professor of physics 
in Berlin. He spent the last years of his life pri- 
marily in physical research. He was concerned with 
experiments in electrodynamics, physical optics, 
meteorological physics, and the physical signifi- 
cance of the principle of least action. Helmholtz 
also encouraged Heinrich Hertz, perhaps his most
famous student, who verified Maxwell’s theory by
producing electromagnetic waves. In 1887 the 
Physical-Technical Institute was founded in Berlin,
with Helmholtz as director. In 1893 he visited the 
Chicago World’s Columbian Exposition and toured 
the United States and Canada. While returning 
home, he fell down the ship’s stairs and suffered 
a concussion and severe loss of blood. He recov- 
ered, but those who saw him reported that he 
tired easily and found it difficult to work. In July 
1894 he suffered a cerebral hemorrhage and died 
in September of that year. 

JACOB BECK


[See also HEARING; PERCEPTION, article on DEPTH PERCEPTION;
PSYCHOPHYSICS; VISION, especially the
article on COLOR VISION AND COLOR BLINDNESS.]

HENDERSON, L. J.


Lawrence Joseph Henderson (1878-1942), 
American chemist and sociologist, was born in 
Lynn, Massachusetts. His father was a ship chan- 
dler of Salem, with business interests in the French 
islands of St. Pierre and Miquelon; and it was 
there, of all strange places, that Henderson ac- 
quired his life-long devotion to the civilization of 
France. 

His regular academic career lay in the field of 
biological chemistry. He graduated from Harvard 
College in 1898 and from Harvard Medical School 
in 1902. There followed two years’ work in chem- 
ical research at the University of Strasbourg. Hen- 
derson then returned to Harvard, rising through 
the academic ranks to become professor of chemistry.
In the light of his later interests in social
science, it is significant that his early research was 
devoted to the mechanisms of neutrality regulation 
in the animal organism and that in 1908 he 
achieved a precise mathematical formulation of 
the acid-base equilibrium. There was much inter- 
est in research on problems of physiological equi- 
librium at Harvard at the time. Henderson had 
some influence on, and was always interested in, 
the work on homeostasis carried out by his col- 
league Walter B. Cannon, professor of physiology 
(Cannon 1932). Henderson’s later research turned 
to similar problems in the chemistry of blood. Here
his investigations were of fundamental importance, 
leading, through the work of his students, to such 
applications as the use of blood plasma to save the 
lives of the wounded in World War II. He was one
of the most original and distinguished biological 
chemists of his time. 

Henderson also kept up an irregular interest in 
the wider problems suggested by his chemical research,
problems in the philosophy of science and
the methodology of inquiry into systems of vari- 
ables in complex relations of mutual dependence. 
For years he took part in seminars in the philosophy
department at Harvard, and in 1911 he
offered the first course given there on the history 
of science. He wrote two general books on the 
relationships between the organism and its environ- 
ment and between determinism and teleology: The 
Fitness of the Environment (1913) and The Order 
of Nature (1917). His general question might be 
put thus: How do nature’s efficient causes cooper- 
ate with her final ones? 

Henderson’s interest in sociology came late in 
his life but was a natural development of what had 
gone before. About 1926 William M. Wheeler of 
Harvard, whose study of insect societies led him to 
read widely in human sociology, suggested to Hen- 
derson that he read Pareto’s Sociologie générale 
(see Pareto 1916). At once Henderson became an 
enthusiast. He felt that Pareto’s treatment of sci- 
entific methodology and equilibrium phenomena 
was excellent and that Pareto’s substantive views 
on human society made explicit conclusions he 
himself had arrived at more intuitively. In 1932- 
1933 Henderson conducted a seminar on Pareto’s 
sociology, the first such seminar in an English-speaking
country. One of the products of the seminar
was An Introduction to Pareto (1934) by
Homans and Curtis, the first book on the subject 
in English. Henderson published his own book, 
Pareto’s General Sociology (1935), emphasizing 
such matters as equilibrium, the social system, the
mutual dependence of variables, and the problems
of induction and abstraction. Through his book, his 
seminar, and a later course he offered called “Con- 
crete Sociology,” he made these ideas part of the 
thinking of a number of sociologists present at 
Harvard at the time, including Talcott Parsons and 
Robert K. Merton. Henderson was also the first 
chairman of the Society of Fellows at Harvard, 
established in 1933, whose members, besides re- 
ceiving scholarships to study whatever they wished, 
dined together once a week. Among the early mem- 
bers were C. M. Arensberg, B. F. Skinner, W. F. 
Whyte, and G. C. Homans. They soon became fa- 
miliar, through their conversations, with Hender- 
son’s views. 

One final connection of Henderson with social 
science should be mentioned. In 1926 he became 
director of the Fatigue Laboratory of the Harvard 
Graduate School of Business Administration. In the 
adjoining office was Elton Mayo, then beginning his 
researches in the Western Electric Company. The 
plan was that Henderson should study the physi- 
ology of work, while Mayo studied its psychology. 
Henderson had no direct influence on the conduct 
of the Western Electric researches, but his ideas 
became part of the intellectual atmosphere in which 
the research team—Mayo, T. N. Whitehead, and 
F. J. Roethlisberger—did their work. Through the 
Harvard Business School Henderson also came into 
contact with Chester I. Barnard and encouraged 
him to write The Functions of the Executive (1938). 
In 1939 Henderson was appointed chairman of the 
Committee on Work in Industry of the National 
Research Council, and under his leadership the 
committee surveyed and evaluated much research 
in this field, both physiological and sociological 
(National Research Council 1941). 

Henderson never carried out any empirical re- 
search in social science, nor did he become widely 
read in its literature, but his ideas about methodol- 
ogy in the broadest sense of the word were deeply 
influential at a time and place of more than ordi- 
nary significance in the development of sociology. 


GEORGE CASPAR HOMANS

HERING, EWALD


Ewald Hering (1834-1918) was a sense physi- 
ologist who played an important role as a pioneer 
in the new experimental physiological psychology 
that got under way as a separate discipline in the 
1860s. The founder of this new branch of psychol- 
ogy was, it is said, Wilhelm Wundt, who was 
Hering’s contemporary. Hering’s fame was based 
specifically upon his contributions to the understanding
of the phenomena of vision: the psychophysiology
both of visual space perception and of
color vision. Hering was a nativist; that is to say,
he believed that the spatial ordering of points on 
the retina and the skin is innate, that it is not 
learned, as the empiricists, led by Hermann von 
Helmholtz, believed. Hering is, however, best
known to posterity for his theory of color vision,
a theory that for many years remained the alternative
to Helmholtz’ theory. Moreover, he designed
many beautiful pieces of apparatus to illustrate 
crucial points in his theories and is more respon- 
sible than anyone else for this new psychology’s 
being dubbed “brass-instrument psychology.” In- 
deed, he participated in the new experimental 
movement in many ways; he was, for instance, one 
of the first editors of the Zeitschrift für Psychologie,
the first independent journal of experimental psy- 
chology in Germany. 

Hering was born in Altgersdorf, a small town 
south of Berlin. At the age of 19 he went to Leipzig 
to study medicine; there he was influenced by 
E. H. Weber, the physiologist who established the 
law that bears his name, and G. T. Fechner, the 
founder of psychophysics. The dean of physiologists,
Johannes Müller, at Berlin, influenced him
from afar, and Hering would have liked to work 
under him but never did. In 1860 Hering began 
practicing as a physician and also writing in the 
field of physiology. He had published the five parts 
of his treatise on space perception by 1864, a year
before Helmholtz published the third volume of a 
handbook on psychophysiological optics—the part 
that included what Helmholtz had to say about the 
perception of space. Thus began a long-continued 
opposition between Hering and Helmholtz: Hering,
the nativist; Helmholtz, the empiricist. Hering
was following Johannes Müller, whose thought had
been influenced by Kant. Helmholtz, an empiricist, 
was holding to the tradition of Hermann Lotze and 
found himself supported by Wundt. Hering’s views 
were taken up presently by Carl Stumpf and later, 
after the turn of the century, by the gestalt psy- 
chologists, who have readily acknowledged their 
debt to him. There was a bitter and indecorous 
quarrel between Hering and Helmholtz about the 
true shape of the visual horopter, the locus of points 
perceived as single by the two eyes. 

Hering’s publication on space perception was 
enough to effect a call to Vienna to succeed the 
distinguished physiologist Carl Ludwig, who him- 
self went to Leipzig. Five years later Hering was 
invited to Prague to succeed J. E. Purkinje, another 
well-known physiologist. At Prague his attention
turned to the problems of color. His volume of
1878 presented his theory of color vision and again 
brought him into conflict with Helmholtz, who held 
to the three-element theory originally suggested by 
Thomas Young. Hering argued for three different
color substances in the retina, each one capable of
two reversible, opposite reactions, giving respectively
the colors red or green, yellow or blue, and
white or black, and their combinations. Both types 
of theory were still alive eighty years later. Hering
also, in 1879, suggested a theory of the temperature
sense that was similar to his color theory, cold
and warmth being elicited by a pair of antagonistic 
reactions in the thermal sense organ. 

After 25 years at Prague, Hering was called to 
Leipzig in 1895, to succeed for a second time the 
now very famous Ludwig. Here he undertook his 
more mature work on visual sensation, which came 
out in four fasciculi from 1905 to 1920, the last 
published posthumously, for Hering died at Leipzig 
in 1918, at the close of World War I.

At the turn of the century every undergraduate 
student of psychology was taught Hering’s and 
Helmholtz’ theories of vision. For a while Helm- 
holtz’ conception seemed to prevail over Hering’s, 
but subsequently Hering’s view of antagonistic 
biological processes gained plausibility as a result 
of new research, and both types of theory con- 
tinued to receive consideration at a sophisticated 
level. Hering, the nativist, also influenced gestalt 
psychology and the scientific movement toward 
phenomenology and existentialism. With respect to
phenomenology, Hering himself showed the influ- 
ence of two of his great predecessors in visual 
theory—Goethe and Purkinje. 

EDWIN G. BORING

HERSKOVITS, MELVILLE JEAN


Melville Jean Herskovits (1895-1963) was an 
American anthropologist. Two points are central in 
understanding his conception of anthropology: 
first, he was a humanist concerned with the total 
range of cultural behavior and, second, he believed 
that the inductive method is the only valid method- 
ology for anthropology. In both he revealed the per- 
vasive influence of Franz Boas, his teacher at 
Columbia University. 

The most complete presentation of Herskovits’ 
views is found in Man and His Works (1948), 
revised and abridged as Cultural Anthropology 
(1955). Culture was the inclusive concept for him, 
comprehending the behavioral and the ideational, 
the group and the individual. His humanistic orien- 
tation was apparent in his interest in the indi- 
vidual, whom he viewed as an active participant in 
the shaping of culture. As the individual was im- 
portant in his view of culture, history was impor- 
tant in his ideas of culture change. Acculturation, 
reinterpretation, retention, and syncretism are con- 
cepts which he helped formulate and each has its 
historical dimension. 

The continuity of past and present was a fre- 
quent theme of Herskovits’ studies of Africa and 
of the Negro in the New World. It was his thesis 
in The Myth of the Negro Past (1941) that New 
World Negroes reveal their west African heritage 
in motor habits, codes of behavior, social institu- 
tions, family organization, religion, language, and 
art. In opposition, sociologists argued that syn- 
chronic factors adequately explain American Negro 
behavior and that it is unwise to link the Negro so 
directly to his origins in west Africa [see the 
biography of Frazier]. Although the book stimu- 
lated interest in Africa among some students of 
the New World Negro, it did not achieve general 
attention for almost two decades, when some of the 
facts about Africa became better understood. 

Herskovits’ approach to cultural change was the- 
oretical as opposed to practical in its orientation. 
He was always critical of applied anthropology and, 
in this as well as in his historical approach, was 
at variance with much of the work of British anthropology
in Africa. His views on these points
were summarized in Acculturation: The Study of 
Culture Contact (1938a), which was the subject 
of a rejoinder by Malinowski (1939) in behalf of 
functionalism and administration-oriented anthropology
that appeared in the journal Africa.

Herskovits’ opposition to applied anthropology 
was to a large extent based in cultural relativism. 
Although he in no sense originated the concept, in 
Man and His Works and in subsequent writing he 
became one of its most uncompromising spokes- 
men. He derived relativism from the enculturative 
experience through which standards of judgment 
are learned, and from this he concluded that judg- 
mental evaluations of cultures are culture-bound. 
This, of course, does not abrogate comparison but 
implies that the bases of comparison are culturally 
determined and should be made explicit. He in- 
sisted that the concept of relativism had relevance 
to all cultural learning and called attention to the 
influence of culture on perception. He felt that it 
was important to distinguish between “absolutes,” 
which vary from culture to culture, and “uni- 
versals,” which are consequences of the human 
condition; and he concluded, “To say that there is 
no absolute criterion of value or morals, or even, 
psychologically, of time or space, does not mean 
that such criteria, in differing forms, do not com- 
prise universals in human culture” (1948, p. 76). 
Cultural relativism led him to reject the term 
“primitive,” though he had used it extensively, as a 
pejorative, to indicate a lack of unity of custom, 
tradition, belief, or institution. He subsequently 
used “nonliterate,” a term which he felt was more 
neutral and descriptive. 

Consistent with his humanistic interests, Hersko- 
vits was interested in religion, music, graphic and 
plastic arts, and folklore, particularly in African 
cultures. Dahomean Narrative: A Cross-cultural 
Analysis (1958), written with Frances S. Hersko- 
vits, who collaborated with him throughout his 
career, is a detailed group of tales assembled 
during their first African field trip. It is an im- 
portant work because it carefully relates the narra- 
tives to their cultural matrix and abstracts from 
them items of comparative theoretical interest. 
Herskovits was interested in the ways culture in- 
fluences the arts, as well as the ways the arts vali- 
date culture. He used both religious and aesthetic 
data to document African retentions among New 
World Negroes. 

If Herskovits was often humanistic in his orientation,
he was not narrowly so, and he did not neglect
the social aspects of culture. One of his earliest
interests was in the field of economics, and The
Economic Life of Primitive Peoples (1940) was 
the first general text in that field. It stressed the 
importance of the cultural context of economic be- 
havior and the limitations of a conception of eco- 
nomics based exclusively upon experience in West- 
ern society. However, he adopted the conventional, 
“formal” definition of economics, the application 
of scarce means to given ends, and, in the main, 
tried to follow the categories of academic eco- 
nomics in ordering his data. This compromise is 
most apparent in the first chapter of the revised 
edition, entitled Economic Anthropology (1940). 
While Polanyi and his followers (Polanyi et al. 
1957) proposed a more sociological, “substantive” 
definition of economics, Herskovits, typically, ob- 
jected that their approach seems to deny the role 
of individual choice and concluded that “we must 
not reject Economic Man only to substitute Society 
as an exclusive formula for understanding eco- 
nomic behavior and as a base-point for analysis” 
([1940] 1952, p. 8). Polanyi was even more skeptical
than Herskovits about the application of traditional
economics to nonliterate behavior. His
followers have suggested that the characteristics 
of markets, money, and surpluses in industrial so- 
ciety are so peculiar that the terms are scarcely 
applicable in nonliterate societies. While Hersko- 
vits saw the difference between the economies of 
nonliterate and industrial society as being one of 
degree, followers of Polanyi, like George Dalton, 
see them as differing in kind. It is worth noting that 
the field of economic anthropology has developed 
along the general directions of Herskovits’ pioneering
efforts. [See ECONOMIC ANTHROPOLOGY; EXCHANGE
AND DISPLAY; TRADE AND MARKETS; and
the biography of POLANYI.]

Africa was an early interest of Herskovits; his 
doctoral thesis was on the cattle complex of east 
Africa. His culture area mapping (1924; see also 
Bascom & Herskovits 1959) has been used ex- 
tensively. Many early American Africanists were 
trained in the program of African studies that 
he founded and directed at Northwestern Uni- 
versity. As interest in Africa expanded, the African 
Studies Association was formed with Herskovits as 
its first president. One of his last books, The Hu- 
man Factor in Changing Africa (1962), is a sum- 
mary of his encyclopedic knowledge of the con- 
tinent, his interest in culture change, and his 
humanistic orientation. 


JAMES H. VAUGHAN, JR.


[For the historical context of Herskovits’ work, see the 
biography of Boas; for discussion of the subsequent
development of Herskovits’ ideas, see FOLKLORE;
HISTORIOGRAPHY, article on AFRICAN HISTORIOGRAPHY;
PRIMITIVE ART.]

HETTNER, ALFRED


Alfred Hettner (1859-1941) was an important 
leader in the development of German geography 
and was particularly influential in defining for ge- 
ographers generally the scope and methods of their 
subject. 

He was born in Dresden and grew up there in a 
large family with intellectual and artistic interests. 
He became interested in geography while he was 
in secondary school and declared his intention of
specializing in that field as soon as he began his
university training. He studied at four different Ger- 
man universities and traveled and did field work for 
four years in South America. After teaching for ten 
years in Leipzig, he was appointed in 1897 to the 
newly established chair of geography at Tübingen 
and then, less than two years later, to the new chair 
at Heidelberg, which he held until his retirement 
in 1928. 

Although his training and his early work were 
principally in geomorphology, Hettner, like Ritter 
and Ratzel, was more interested in human geog- 
raphy. Moreover, he became increasingly involved 
in establishing a methodology for geography, an 
enterprise he had initially regarded as auxiliary to 
work on other problems but which ultimately be- 
came a primary aspect of his life’s work (1927, 
p. iii). 

The opportunity to present his methodological 
position came in 1895, when at the age of 36 he 
founded the Geographische Zeitschrift. The new 
journal was to be devoted to “the advancement 
of genuine geographical learning,” and Hettner 
opened his introductory article with the questions, 
“What is geography? What does it seek to accom- 
plish and what is it able to accomplish?” (1895, 
p. 1). During the following two decades he pre- 
sented detailed answers to these questions in a 
series of essays that came to be regarded as classics 
in geographic methodology (1927). During his 
career he also published numerous substantive 
studies in different aspects of systematic geography, 
which elaborate or illustrate his methodological 
principles (see his four-volume work in systematic 
physical geography, Vergleichende Länderkunde, 
1933-1935; and his three-volume Allgemeine Geo- 
graphie des Menschen, 1947-1957). Important as 
these studies were, Hettner’s principal influence 
was exerted through the Geographische Zeitschrift, 
over which he maintained close control for forty 
years (Schmitthenner 1941, p. 453; “Drei auto- 
biographische Skizzen,” pp. 22-26). 

The concepts of geography that Hettner promot- 
ed were not derived by logical deduction from any 
a priori philosophical position; they were developed 
empirically from the study of the history of the de- 
velopment of the field since antiquity (1927, pp. 
110 ff.). Among geographers he was influenced par- 
ticularly by Humboldt, Ritter, Kirchhoff, Marthe, 
and Richthofen (“Drei autobiographische Skiz- 
zen”; Hartshorne 1939). His distinctive function 
was to justify the conclusions drawn from the his- 
torical development of geography in terms of the 
methodology and philosophy of science. His gen-
eral philosophical viewpoint was based, he said,
neither on Kant nor on Comte but on Lange's
Geschichte des Materialismus; and his scientific 
methodology, on Wundt's Logik (Hettner 1926, 
p. 306). But in explaining the relation of geography 
to other fields, Hettner formulated a scheme of the 
division of the sciences which, as he later learned, 
had been presented a century earlier by Kant and 
by Humboldt, also, apparently, independently 
(Hartshorne 1958). 

In examining the historical development of ge- 
ography, Hettner found that major misunderstand- 
ings had arisen through the failure to identify cor- 
rectly the object of study. Geography is not, as its 
name suggests, the over-all science of the earth but 
the study of the earth shell, more commonly called 
earth surface, as it varies in the character or con- 
tent of its areal parts. The basic approach is choro- 
logical: to describe and interpret the varied char- 
acter of the earth surface. This areal variation is 
formed by many diverse elements, which vary from 
place to place, interrelated in any particular place 
and interconnected between different places. 

If geography is basically chorological, Hettner 
reasoned, it is not dualistic; rather, in any inhabited 
area the physical and human features are so intri- 
cately interlaced as to form a single subject for 
study. Geography is, therefore, not to be divided be- 
tween the natural and the social sciences or de- 
fined as the study of relations between natural and 
human features of the earth surface. It is a unitary 
discipline, and the reality it studies is composed of 
heterogeneous but interrelated elements. 

Two theoretically different approaches are use- 
fully combined in the study of areas: the approach 
of regional geography, which analyzes the full com- 
plex of features in individual areas, and the ap- 
proach of general (or systematic, or topical) geog- 
raphy, which compares areas in terms of particular 
kinds of features. While Hettner regarded the for- 
mer, the analysis of individual regions, as the 
“crowning” product of geographic study, he consid- 
ered comparative systematic studies no less essen- 
tial. Geography, then, is both nomothetic and idio-
graphic (1927, pp. 217-218, 221-224, 398-404).

Hettner’s work helped to produce a marked de- 
gree of agreement among German geographers in 
the early decades of the twentieth century; such 
agreement does not exist in many other countries. 
Outside of German-speaking countries Hettner’s 
ideas were not widely known until they were ex- 
pounded at length in English (Hartshorne 1939). 
They have since become familiar and influential 
in many countries. In establishing his basic metho- 
dological structure, Hettner did not attempt to build 
something new; rather, he believed that he “had
clearly expressed and methodologically established
what was actually present in the development” of 
geography (1934b, p. 382). This may well be the 
reason why the influence of his work has been so 
strong and can be expected to endure. 


RICHARD HARTSHORNE 


[For the historical context of Hettner’s work, see the
biographies of HUMBOLDT; RITTER; WUNDT. For
discussion of the subsequent development of his
ideas, see GEOGRAPHY, especially the article on SO-
CIAL GEOGRAPHY.]

Bruno Hildebrand (1812-1878), economist and
one of the founders of the German historical
school, was born in Naumburg (Thuringia), the
son of a civil servant. He entered the University 
of Leipzig as a student of theology but soon shifted 
to history. He very early joined the liberal-nation-
alist student movement, and since this affiliation
made him the object of police attention, he fled
to Breslau to escape harassment; however, he was
imprisoned there before being allowed to continue 
his studies. In 1836 the University of Breslau 
awarded him his doctorate and shortly afterward 
the rights of Dozent. In 1839 he was promoted to 
acting professor of history. But Hildebrand’s rest- 
less and intensely political nature found the scope 
of historical study too confining; increasingly his 
interests and his lectures focused on political phi-
losophy and economics.

The most productive decade of Hildebrand’s aca- 
demic career began with his appointment, in 1841, 
to the chair of Staatswissenschaften (government) 
at the University of Marburg. Here, once again, he 
clashed with the authorities, especially when as 
university rector he championed the liberties of 
students and staff; in 1845 he was relieved of his 
administrative position, and in 1846 he was charged 
with lèse-majesté and dismissed from his teaching
post. Hildebrand was eventually acquitted, but he 
was not reinstated in his professorship until the 
1848 revolution had swept a liberal government 
into office. 

Hildebrand was elected deputy of the Frankfurt 
National Assembly, where he distinguished himself 
as an uncompromising protagonist of constitutional 
government and as an indefatigable member of 
the parliamentary commission on economic and 
social affairs. With the triumph of absolutism, how-
ever, his cause was doomed. The Diet was abol-
ished, and Hildebrand, as one of the outspoken 
liberals, was charged with high treason and forced 
to flee into Switzerland where, in 1851, he was 
appointed to a chair at the University of Zurich. 
Hildebrand spent five years in Zurich, active not 
only as a professor but also as a company director 
promoting and building a railway. He then moved 
to the University of Bern and helped establish the 
first Swiss statistical office at Bern. 

In 1861 Hildebrand accepted an invitation ex- 
tended by the University of Jena and returned to 
his native Thuringia. In 1863 he started the Jahr-
bücher für Nationalökonomie und Statistik. In the
following year he was instrumental in founding 
the statistical office of the Thuringian states. As 
its first director, he planned the work of the new 
bureau in such a way as to link it closely with the 
research and teaching activities of the university 
program in economics. 

Although Hildebrand was neither a fluent nor 
an exciting lecturer, good students found him an
inspiring teacher, and some of the better econ-
omists of the German historical school were prod-
ucts of his “Jena seminar.” Because of this impact,
Hildebrand’s position in the history of German
economic thought cannot be assessed in terms of
his writings alone.

One of Hildebrand’s earliest works, Die National-
ökonomie der Gegenwart und Zukunft (1848), is
also his most important; it is clearly the work of a 
youthful homme engagé. The first chapter is de- 
voted to an evaluation of Adam Smith. Acknowl- 
edging the greatness of the Scotsman’s contribution, 
Hildebrand proceeded to criticize the philosophy 
of natural law and the instinct of self-interest 
underlying Smith’s reasoning. Hildebrand denied 
the universality that Smith claimed for his theory 
and asserted that Smith’s basic assumptions were 
ahistorical; inasmuch as they derived from the 
English scene, they were irrelevant to an under- 
standing of the problems in other eras and other 
lands. Above all, Hildebrand felt that the premises 
did not fit the peculiarities of German conditions. 

Hildebrand admitted that his view of classical 
economics had been influenced by Adam Müller. 
Yet he added that he could accept neither the 
feudal and medieval notions of the German roman- 
tic school nor its reactionary conclusions. Even 
Friedrich List, though allowed some credit for his 
insights and for his contributions to the important 
public debate on international trade policy, was 
given short shrift by Hildebrand. 

Significantly, Hildebrand devoted the largest 
part of Die Nationalökonomie der Gegenwart und 
Zukunft to a review of socialist literature, in par- 
ticular to an examination of the recently published 
Condition of the Working Class in England by the 
then unknown Friedrich Engels. He saw the social- 
ist writers as “up-to-date,” with a historical per- 
spective and above all a focus on the predicament 
of the proletariat in industrial society. At the same 
time, Hildebrand rejected all socialist values, warn- 
ing that the socialist society would be the “grave 
of individualism and civilization,” and his stage 
theory of economic progress was surely intended 
as a polemic against Marxist socialism. According 
to Arthur Sommer, Hildebrand’s vision of society 
as advancing from barter to monetary exchange 
before reaching its highest synthesis in a credit 
economy was meant as an anticommunist mani- 
festo. In this scheme the fully developed credit 
economy would give the propertyless wage earner 
access to capital and thereby resolve one of mod- 
ern society's most pressing problems without re- 
course to socialism. 

In the last 15 years of his life, Hildebrand turned 
away from theoretical issues and concentrated on 
directing statistical studies bearing upon important 
social problems. He thus anticipated the program 
of the Verein für Sozialpolitik, which he joined as
a charter member in 1873. (He was the only one
of the founders of the German historical school to 
do so.) 

Hildebrand failed to develop a coherent system 
of economics. The source of his failure lies (see 
Eisermann 1956) in his petty bourgeois parochial- 
ism, with all its fears and prejudices regarding the 
rapidly industrializing world. This was the cause 
of his fuzzy thinking and of his pathetic efforts 
to reconcile basically irreconcilable points of view. 
Fighting, as it were, a two-front battle against the 
feudal aristocracy on the one hand and the emerg- 
ing proletariat on the other—while simultaneously 
concerned about the precariousness of its own eco- 
nomic status—the German middle class was 
doomed to political impotence. Its intellectual lead- 
ers, including Hildebrand, reacted to their conflict 
by abandoning clarification of the social process in 
favor of politically innocuous empirical research 
and social engineering. 

HERBERT KISCH 


[For the historical context of Hildebrand’s work, see 
ECONOMIC THOUGHT, article on THE HISTORICAL 
SCHOOL; and the biographies of MÜLLER, ADAM
HEINRICH; SMITH, ADAM.]

HINDUISM


Hindus are found living in many parts of the 
world, but the vast majority of them (approxi- 
mately 376.5 million) are concentrated in the Indo- 
Pakistan subcontinent. Of this population, approxi- 
mately 366.5 million are in India and ten million 
in Pakistan. Hindus are also found in the Hima- 
layan states of Nepal (the only contemporary 
Hindu state), Sikkim, and Bhutan; in Burma, 
Ceylon, Malaysia, and other countries of south- 
east Asia; and in east and south Africa, the Carib- 
bean islands, Guyana (British Guiana), Fiji, and 
the United Kingdom. 


The study of Hinduism 


The doctrines of Hinduism, unlike those of 
Christianity and Islam, are not embodied in any 
one sacred book, nor does Hinduism have a single 
historical founder. There are not one but innumer- 
able gods, and it is not essential to believe in the 
existence of God in order to be a Hindu. Hinduism 
is rich in contradictions, there being no particular 
beliefs or institutions that are common to all 
Hindus. Every belief considered basic to Hinduism 
has been rejected by one Hindu group or another.

A major problem in the study of Hinduism, as 
in that of any world religion, is to understand the 
interaction between the theological and popular 
levels. There is a vast body of sacred literature in 
Hinduism, including the Vedas, Brahmanas, Upan- 
isads, Vedangas, Dharmaśāstras, Nibandhas, 
Purāṇas, Itihāsas, Darsanas, Agamas, and Tan-
tras. These texts contain elaborate and abstract
philosophies and theologies, mythologies, manuals
for the performance of sacrifices and other
rites in temples and homes, and codes of conduct
for daily life. Generally speaking, until recently
Indian and foreign scholars concentrated on this
literature, while the description of actual institu- 
tions, rites, and beliefs was left to missionaries, 
travelers, and administrators. It is only in the last 
twenty years that bibliocentricism has been re- 
placed by a more rounded view of Hinduism and 
the relation between the texts and actual behavior. 
The social scientist’s concern for understanding 
any religion in its social context is likely to be 
satisfied more for the modern than for the earlier 
periods of history. Source materials are almost 
entirely lacking for the study of the history of pop-
ular Hinduism; even in the study of the history
of literary Hinduism, data are not available for
the reconstruction of the social context. For exam-
ple, the date, provenance, and authorship of texts
are not certain. And finally, the student of contem-
porary Hinduism is faced with the problem that
the systematic reconstruction of Indian history,
which began with the coming of the British, has
brought to light material that has since become an
active part of the Hindu religion. In the reinterpre-
tation of Hinduism that has been occurring since
the nineteenth century, the philosophical and
literary levels have been emphasized, to the neglect
of actual institutions, rites, and beliefs.


Hinduism and the social order 


Hinduism, lacking a centralized church, is so
inextricably entangled with Hindu society that it 
is very difficult to say where one ends and the 
other begins. This is particularly true of caste, 
which according to creation beliefs expressed in 
the Rg Veda has a divine origin. The four varnas, 
or caste orders, emerged from the limbs of prime- 
val man, who is a victim in the divine sacrifice 
that produced the cosmos. The brahmans emerged 
from his mouth, ksatriyas from his arms, the 
vaisyas from his thighs, and the sudras from his
feet. (The untouchables are not mentioned in the
hymn.) There are in reality not four but innumer-
able castes, called jatis, each of which claims to
belong to one of the four varnas. When the Hindu
sacred or legal texts discuss caste, it is mostly
varna that they have in view and very rarely jati.

Certain ideas regarding pollution and purity are 
cardinal in Hinduism, although there are differ- 
ences among the various castes in the strictness 
with which rules deriving from these ideas are 
adhered to and the degree of elaboration found in 
behavior governed by them. Intercaste relations 


are generally defined by ideas of pollution. Nor- 
mally, each caste is endogamous and complete 
commensality prevails only within it. Thus, there 
are many kinds of restrictions between castes— 
on the free acceptance of food and drink, on inter- 
marriage and sex relations, on touching or going 
near a member of another caste, etc.—and they 
are expressed in terms of pollution. This means 
that failure to observe the rules makes the upper- 
caste person impure, and he has to perform a 
purificatory rite, simple or elaborate, according to 
the seriousness of the violation. 

While caste is central, it does not entirely deter- 
mine Hindu religious behavior. There are other 
aspects of the social structure that embody reli- 
gious behavior. The village community and the 
family also function as cult groups. There are 
deities—usually goddesses—in every village who, 
if suitably propitiated, keep out epidemics and 
drought and look after the villagers. There is an 
elaborate complex of rites de passage, including 
wedding rituals and funeral rites that may take 
several days to perform. Calendrical festivals and
vratas, or ritual austerities carried out for specific
periods to attain particular ends (e.g., birth of a
son), consume a good part of people's energies,
time, and money.

It is important to demarcate those aspects of 
religious behavior that are affected by caste from 
those that are not. The relation between sect and 
caste, in particular, offers a fruitful area for re- 
search. 


Dominant theological ideas of Hinduism 


Hinduism does not have a body of clearly de- 
fined dogma, but some theological ideas may be 
considered basic. And while the many sects and 
schools have taken different standpoints on theo- 
logical issues, the issues themselves are common 
to most. Since the time of the Upanisads, which
laid the foundations of Hindu philosophical think- 
ing, certain concepts recur again and again. A 
major issue has been the nature of Brahman 
(universal soul) and its relation with Atman (the 
individual soul). One view is concerned only with 
this dichotomy, does not posit the existence of 
God, and considers Brahman as absolute and 
attributeless. (There was also the Charvaka school, 
which was atheistic and hedonistic.) Most other 
views, however, recognize the existence of God and 
consider the issue of his relation with Brahman, 
on the one hand, and Atman, on the other. The 
Atman is considered to be indestructible and 
passes through an endless migration, or series of 
incarnations. The character of any incarnation,
human, animal, or superhuman, is influenced by
karma, the net balance of good and bad deeds in 
previous births. Goodness or badness is defined by 
reference to dharma. The reward for a saintly life 
is moksa, which releases the individual from the 
chain of births and deaths and brings him into 
contact with God. 

The ideas of karma, dharma, and moksa are 
intimately related to the caste system. The Dharma- 
sūtra states that if a man does good deeds, he will 
be reborn in a high caste and well endowed, while 
if he does sinful acts, he will be reborn in a low 
caste or even as an animal. Dharma is thus identi- 
fied with the duties of one’s caste, and birth in a 
particular caste becomes an index of the soul's 
progress toward liberation. 

The nature of moksa and how to achieve it are 
major issues in Hindu theology. The main ways of 
achieving moksa are through knowledge, deeds, 
and love and devotion toward God. Generally, the 
way of knowledge requires an individual to re- 
nounce the world, including caste and family, and 
lead the life of an ascetic. This way has been 
followed by only a few. It was the Bhagavad Gita 
that first emphasized the way of works and devo-
tion and thus brought liberation within reach of
the “man-in-the-world,” including women and the
lower castes. The most popular form of devotion,
however, is the worship of one’s chosen god accord-
ing to tradition. In the last hundred years the
Bhagavad Gita has been reinterpreted by Indian
political leaders, including Gandhi and B. G. Tilak,
to provide the basis for a life devoted to altruistic 
action. 

Discussion of these issues by theologians has 
been in Sanskrit and in the context of ideas devel- 
oped in logic, metaphysics, astronomy, grammar, 
literature, law, and other branches of traditional 
learning. The basic theological positions have, 
however, reached the common people through 
myths and stories narrated in local languages. How 
influential these ideas were and the nature of their 
relation to strictly local or sectional ideas and 
beliefs are still subjects for research (see, in this 
connection, Srinivas 1952, p. 227). 

Sanskritic deities. Those deities whose attri- 
butes and modes of worship are described in myth- 
ological, liturgical, and other texts may be 
called Sanskritic. The Vedic pantheon reflects the 
syncretism that resulted from the conquest by 
nomadic Indo-European Aryans of the ancient 
urbanized civilizations of the Indus Valley and a 
continuing contact with the aboriginal tribal peo- 
ples of the subcontinent. Most of the deities, major 
and minor, are nature gods: Indra, the most prom-
inent of all, is the sky god; Agni, the fire god;
Varuna, the water god; Sūrya, the sun god; and 
so on. Visnu, who later became a high god, began 
as only a minor figure, a mere aspect of the sun 
god. The Vedic god of thunder, Rudra, was at first 
associated with Siva, who eventually became the 
dominant partner. The chief Vedic gods were 
gradually transformed into the trinity of Brahma, 
the creator; Visnu, the protector; and Siva, the 
destroyer. Brahma does not appear in the Vedas 
but seems to have developed during the period of 
the Brahmanas. His importance subsequently de- 
clined, and nowadays Visnu and Siva are the two 
most important gods. 

Every major deity in Hinduism has many forms, 
and around each form there is a myth. Visnu has 
a number of incarnations, the chief of which are 
Rama (man), Krishna (man), Narasimha (man- 
lion), and Varaha (boar). The idea behind the 
many forms is that God periodically allows himself 
to be reborn on earth, to overcome evil and restore 
righteousness. In addition to incarnations, Visnu 
has one thousand names, according to the Maha- 
bharata, and many more according to other texts. 
Rama and Krishna, originally incarnations of Visnu, 
became important gods in themselves, each with 
many forms and names. The idea of incarnation 
is not associated with Siva, but he, too, has many 
names. In addition, each deity or each form of a 
deity has a wife, who is usually worshiped along 
with her husband. 

Sakti, the personification of the female principle 
in the creation of the universe, occupies almost 
as important a place in the Hindu pantheon as 
Visnu and Siva. In the Sakti cult a female deity is 
sometimes worshiped independently of association 
with a male deity, but when a male deity alone is 
worshiped, generally he is some form of Siva rather 
than of Visnu. Further, Skanda and Ganesa, the
sons of Siva, and Hanumant, the chief of the mon- 
key army of Rama, are also popular deities. The 
birds and animals on which the gods sit are called 
vahanas (“vehicles”) and are worshiped. The sun, 
moon, stars, fire, mountains, lakes, animals, snakes, 
trees, and plants continue to be objects of worship. 
Frequently river deities are anthropomorphized. 
For example, Ganga, or Ganges, is a form of the 
goddess Parvati, and many smaller rivers are be- 
lieved to be manifestations of Ganga. The cobra 
cult in southern India is identified with Skanda. 
There are also deities symbolizing the synthesis of 
different deities, such as the three-headed Trimūrti 
and Dattatreya, representing the unity of Brahma, 
Visnu, and Siva. The union of Siva and Visnu is 
expressed in the composite god Harihara; Ardha-
narisvara represents an attempt to symbolize the
unity of Siva and Parvati. 

The henotheistic tendency is important in Hindu 
mythology and ritual: the deity who is being wor- 
shiped is praised above all others. Pantheism pre- 
vails, but all deities, from Visnu or Siva to the 
lowest village deity, are considered to be manifes- 
tations of the same god. These ideas have enabled 
Hinduism to absorb local cults and deities and even 
accept all other religions as true. 

A Hindu temple embodies the henotheistic idea. 
There is, accordingly, one principal deity, from 
whom a temple derives its name and whose image 
occupies a prominent place in the temple, and 
there are also a few minor deities, represented by 
smaller images in different parts of the temple.
Thus, in a Siva temple Siva would be the principal
deity, and Parvati, Ganesa, and the bull Nandi
would be minor deities; whereas in a Sakti temple, 
Sakti would be the principal deity and Siva would 
be one of the minor deities. Not all Hindu deities 
are associated with temples, however. Some of the 
Vedic deities, such as Varuna and Agni, are invoked 
mostly during sacrifices, while Brahma and Surya 
seem to have had temples in the past but do not 
have them nowadays. Some deities (e.g., Ganesa)
have temples only in certain regions. 


Sectarianism 


Like other religions, Hinduism has given birth 
to many sects in the course of its history, and it 
is not always easy to say whether a sect is within 
the Hindu fold or outside. Buddhism and Jainism 
had emerged as distinct sects by about the fifth 
century B.C., and both spread over wide areas, 
Buddhism, in particular, spreading over almost 
the entire country. But over the centuries their 
influence declined, and Buddhism had almost en- 
tirely disappeared from the country of its origin 
by about A.D. 1000. It is only in recent years that
large numbers of Untouchables, in particular the 
Mahar of Maharashtra, became converted to Bud- 
dhism in protest against the indignities they were 
subjected to under the caste system. There is a 
sizable Jain population in India today, and Jains 
are very similar to Hindus. Not only are there 
castes among them, but some trading castes of 
Gujarat have Jain and Hindu subdivisions, and 
marriage occurs across sect lines. Islam has pre- 
sented a serious challenge to Hinduism. There 
are about 129.5 million Muslims (nearly 47 million
in India and 82.5 million in Pakistan) in the sub- 
continent. While a small proportion of them came 
from the Middle East, the majority were converts 
from among the Hindus. They have a caste system
in some ways similar to that of the Hindus, and
the converts have retained many Hindu practices 
—so much so that in the case of some groups it 
is even now extremely difficult to say whether 
they are Hindu or Muslim. There are also sects 
(Kabirpanthi, Sikhism) and cults that combine 
both Hindu and Muslim traits. One of them, 
Sikhism, has claimed to be a distinct religion, but 
this does not mean that Sikhs do not have anything 
in common with Hindus. The Sikhs are divided 
into castes, with even an Untouchable division, and 
have veneration for Hindu holy places. Until re- 
cently, in many families in rural Punjab one son 
would become a Sikh while the others remained 
Hindu. Many Hindu castes became Sikhs in an 
effort to improve their status. The Pirana sect in 
Gujarat and the recent Saibaba cult have both 
Hindu and Muslim followers. In its later phase the 
Bhakti movement was influenced by Sufism. 

At the present time there are a very large num- 
ber of sects, a few major and many minor. Each 
sect has a founder, a cult, a body of doctrine, and 
a social organization of its own. 

In most sects one deity is considered to be 
supreme and is identified with the supreme Brah- 
man. While Visnu, Siva, and Sakti are the most 
important nuclei for the formation of sects, they 
are not, however, the only nuclei, sects having 
also arisen around Surya, Ganesa, and Dattatreya.
It is wrong to speak of a single, homogeneous sect 
associated with any of these deities. The many 
Vaisnavite sects, for example, are distinguished 
from each other, first, by the particular form of 
Visnu and his consort that they worship; and 
second, where the same form and consort are 
worshiped, by differences in the mode of worship 
and body of theological doctrine; and finally, by 
their internal organization. There are elaborate 
rules regarding the making of idols, and there is 
a systematized iconography. The Sri-Vaisnavas 
worship Visnu and his consort Laksmi; the Ma- 
dhvas worship Krishna but not Radha; the Nim- 
barkas, Vallabhacharis, and Chaitanyaites worship 
both Krishna and Radha but differ in several other 
respects; and the Ramanandis worship only Rama 
and his associates. Comparable differences exist 
among Siva and Sakti sects. 

Each sect recognizes several minor deities, in- 
cluding its chief deity’s spouse, but they rarely 
include the entire Hindu pantheon. In each sect 
the founder and the things associated with him 
are objects of special veneration. Each sect has an 
elaborate complex of rituals for temple and domes- 
tic worship and for life-cycle ceremonies. It has 
its own specially emphasized festivals and sacri-
fices and its own identifying word or sentence of
great religious potency. A sect mark put on the 
forehead easily distinguishes a member of one sect 
from that of another. 

The major sects are known for their distinctive 
philosophical standpoints, as for example, the pure 
monism of the Smartas, the qualified monism of 
the Sri-Vaisnavas, and the dualism of the Madhvas. 
Minor sects do not have elaborate philosophies, 
although they do have their own special ideas and 
beliefs. While the philosophical and ethical posi- 
tion of a sect is important in understanding its 
religious practices, other elements are influential. 

Each sect has not only its own sacred literature, 
written by its founder and other leaders, but also 
a selective attitude toward the great texts of 
Hinduism. 

Another major problem in the study of sects is 
understanding the nature of their relation to ascet- 
icism. Sects composed entirely of ascetics repre- 
sent a bizarre element in Hinduism. The members 
of these sects go about scantily clothed, smear 
their bodies with funeral ashes, wear long, matted 
hair, and perform a number of physical feats. 
They maintain monasteries (akhadas, literally
“gymnasiums”), where they are reputed to carry 
on occult practices, and they also manage temples, 
which enable them to keep in touch with the 
masses and recruit members. 

Sects composed entirely of householders and 
those consisting of both ascetics and householders 
are the most numerous and popular. The ascetics 
in the latter sects are grouped into different mon- 
asteries, each having its own core of hereditary 
adherents and its corporate property in temples, 
land, etc. Many ascetics are found to be involved
in intersectarian rivalry and politics. When a sect 
is composed only of householders, the patrilineal 
descendants of the founder preside over the sect. 

There are many small sects, whose membership 
is confined to a single linguistic region or to a 
small area within a linguistic region, but the mem- 
bership of the major sects cuts across language 
barriers. In the case of a major sect, it is necessary 
to distinguish between areas with a high concen- 
tration of its members and areas with relatively 
few members. Thus, while the majority of the 
Madhvas are found in southern India, there are 
small groups of them in Gujarat, Uttar Pradesh, 
Bihar, and Bengal. While the majority of Valla- 
bhacharis are found in Gujarat and Rajasthan, 
there are small groups of them all over northern, 
western, and eastern India. Another noteworthy 
feature is that, while the majority of the members, 
temples, and monasteries of a sect may be found
in one part of the country, it may have a temple
or a monastery in each of the major pilgrim centers 
in the country. The founder of every major sect 
traveled about the country, first in search of knowl- 
edge, then to win dialectical battles, and finally, 
to give discourses and recruit followers. Frequently 
the founder and his followers came from different 
regions. There were centers of religious learning 
in different areas, and there was a convergence of 
schools of learning at each center; finally the cen- 
ters were woven into networks. Some of these 
centers, such as Banaras, Vrindaban, and Sriran- 
gam, enjoyed high prestige, and a scholar’s victory 
in religious disputation may have taken place at 
the court of the king or at a religious fair. 

Normally, membership in a sect, unlike that in 
a caste, is not hereditary but comes through initia- 
tion. And there is hardly any sect that is composed 
of only one jati. Even the Lingayat sect of the 
Kannada region, which is commonly regarded as 
a caste, is composed of a number of jatis, or endog- 
amous units. And even when a whole caste is 
included in a single sect, membership in the sect 
is not automatic but by initiation. Sometimes the 
members of a caste will be distributed among more 
than one sect, and some may not belong to any 
sect at all. Some castes in Gujarat contain not 
only members of two or three Hindu sects but also 
Jains. Sometimes the members of a single family 
have different sectarian affiliations. The rise and 
fall of various sects over the centuries indicates 
that religious positions were not always determined 
by birth. 

No sect recruits members from all castes. Untouchables 
are very rarely admitted into sects 
including the high castes; even a sect admitting 
“touchable” castes would cover only a certain span 
in the caste hierarchy. Generally speaking, Un- 
touchables have produced their own sects. The 
older sects recruited brahmans and higher non- 
brahmans but not lower non-brahmans, and there 
are sects founded by non-brahmans that do not 
include brahmans. Even though a sect includes 
members from more than one caste, caste distinc- 
tions are not entirely obliterated. 


Nonsectarian Hinduism 


Nonsectarian Hinduism is found both in towns 
and villages; it is largely Sanskritic in towns and 
non-Sanskritic in villages. 

Non-Sanskritic Hinduism is, however, an ideal 
type and has the following characteristics: the 
deities have non-Sanskritic names and oral myths 
attached to them; they are represented by unhewn 
stones or crude images; the modes of worship are 
local and do not follow any liturgy; offerings include 
meat and liquor, and the priests, or shamans, 
as well as the devotees, are generally drawn from 
the lower castes. All these conditions rarely occur 
simultaneously, and it is more common for the 
Sanskritic and non-Sanskritic elements to be mixed 
in varying proportions. Thus, a deity’s name may 
be a corrupt form of a Sanskritic name or a com- 
pound of a Sanskritic and a non-Sanskritic name. 
Usually village goddesses are regarded as manifes- 
tations of Parvati, and village gods of Siva. Such 
identification makes possible the acquisition of 
Sanskritic characteristics by non-Sanskritic deities, 
and it is not unknown for a deity with a single 
name to be worshiped according to non-Sanskritic 
modes in one village and Sanskritic modes in 
another. In one village the deity’s image may be 
housed in a fine temple, while in another it is 
embedded in the earth at the foot of a tree. Fre- 
quently, there are institutionalized links between 
a village deity and the pilgrim center of the San- 
skritic deity with which he is identified. 

There are also temples to Sanskritic deities, 
where brahmans are priests, the offerings vege- 
tarian, and the mode of worship Sanskritic. But 
on certain occasions, such as the deity’s festival, 
the brahman withdraws and animals are sacrifi- 
cially decapitated by a non-brahman. The brahman 
priest re-enters the temple only after purifying it. 
In temples where non-brahmans are priests, brah- 
mans may propitiate such a deity during an epi- 
demic or other disaster. In exceptional situations 
a brahman might even make an offering of a 
fowl to a non-Sanskritic deity through a non- 
brahman friend. 

It is important to note that the attitude of 
brahmans and other high castes toward non- 
Sanskritic deities is not fixed and unalterable. 
There are different types of brāhmans, high as 
well as low, and learned as well as ignorant. 
(Among the Smarta brahmans of Tamilnad, the 
priests, Kurukkals, are regarded as lower than 
other brahmans. In Gujarat, the priestly Tapodhan 
brahmans are rated very low indeed.) A learned 
brahman may have to oblige his powerful non- 
brahman patrons by manufacturing a myth in 
Sanskrit for one of their deities. Hindu mythology 
has grown in this manner (see the Appendix on 
“The Kavéri Myth” in Srinivas 1952, pp. 241-246). 

A temple is sectarian only when it is part of a 
sectarian organization. In this sense a large ma- 
jority of Hindu temples, including some of the 
biggest, are nonsectarian. Many of these are ex- 
tremely wealthy, having vast land estates, large 
amounts of jewelry and precious metals, and also 
a considerable income from offerings by devotees. 
They employ many people and have an elaborate 
and complex body of ritual, calendrical festivals, 
special pūjās, etc. Although nonsectarian, these 
temples are subject to regional sectarian influ- 
ences. For example, the modes of worship in non- 
sectarian temples of Krishna in Gujarat are influ- 
enced by the modes of worship prevalent in Krishna 
temples of the dominant Vallabhachari sect of the 
area. 

In the majority of temples dedicated to San- 
skritic deities, the priests are brahmans; only vege- 
tarian and nonalcoholic offerings are made, and 
the rituals are conducted according to a liturgical 
text. Even in some Siva temples, where priests are 
Lingayats (southern India) or Gosais (Gujarat), 
they perform liturgical rituals and make only 
vegetarian and nonalcoholic offerings. In many 
Sakti temples, particularly those influenced by 
the left-hand Sakti sects, the deity is worshiped 
according to the Tantric texts and offerings of 
meat and liquor are made. However, although 
Sanskritization has had a widespread effect, some 
temples continue to sacrifice animals and make 
liquor offerings on certain occasions. In Bengal, 
Bihar, and Assam blood sacrifice still remains a 
normal mode of worship. 

Nonsectarian Hindus generally worship many 
deities, although there are some who are devotees 
of a single deity, sometimes a deity in a particular 
temple. It is common to see a devout Hindu going 
on a daily round of the principal temples in a 
village or in a ward of a city. They observe the 
festivals of Siva, Visnu, and others, and they go 
on pilgrimages to great shrines all over the country. 


Hinduism and the polity 


Devout Hindus regarded their king as a repre- 
sentative of God on earth. This belief was common 
to all, including brahmans, who themselves claimed 
to be gods on earth. The social order, as repre- 
sented by the caste system, was also believed to 
be divinely created. The rules of the social and 
moral order were subsumed under the ethicoreli- 
gious concept of dharma. In his role as the guard- 
ian of dharma, the king had to maintain the caste 
system. This meant that the idea of inequality expressed 
in the caste system had the king’s support 
and sanction. Different castes had different rights, 
duties, and privileges, and punishment had to take 
into account the caste of the offender and that of 
the victim. The disabilities traditionally imposed 
on Untouchables also had the king’s sanction. His 
powers included the right to promote or demote 
individual castes, and he was the final court of 
appeal in any matter pertaining to caste. This 
power was so integral to kingship that it was 
exercised by the Mughal rulers and also by the 
British in their very early days in India. 

It is important to remember that the king had 
this power, inasmuch as uncritical reliance on the 
sacred literature conveys the impression that the 
brāhmans were all-powerful and that kings only 
carried out their decisions. The privileges enjoyed 
by brāhmans and religious personages were in 
fact conferred on them by the king, and they 
could be withdrawn. As recently as 1892, in the 
princely state of Mysore, the government passed 
an order that all nominations to the headship of 
monasteries must have the prior approval of the 
maharaja, and failure to obtain this approval 
would involve the retraction of grants of land and 
money made by the state (Smith 1963, pp. 302-303). 
A swāmi, or head of a monastery, is revered, 
and the maharaja even performs the ritual of 
washing the feet of some swamis; but he also has 
the power to determine who becomes a swami. 

The Hindu king had the same beliefs and values 
and took part in the same ceremonies as his peo- 
ple, although the manner of his celebrating a festival 
or his devotion to a particular deity or temple often 
set the religious style of the kingdom. Temples 
favored by royalty (e.g., Tirupati, Tanjore, and 
Madurai) developed into great pilgrimage centers. 
They were generously endowed with land and 
jewelry; famous sculptors were invited to lavish 
their skill on them; and great musicians sang 
there on certain occasions. The conversion of a 
prince to a sect was an important event in its 
history, and a large number of people followed 
their king into the new faith. And while there is 
a tradition of tolerance in Hinduism, discrimina- 
tion against the members of a rival sect was not 
unknown. 

It is clear that no conceptual separation between 
the state and the church was possible in the Hindu 
system of ideas. Nor was such a distinction 
very necessary. First, Hinduism did not 
possess a powerful, centralized church, with a sin- 
gle pontiff and a hierarchy of officials, which 
would constitute a potential threat to kingly su- 
premacy. Second, the castewise division of func- 
tions confined brāhmans to the religious realm, 
while the ksatriyas had the political realm to them- 
selves. That a separation between the two did not 
always obtain should not surprise us. In fact, the 
development of sacrifice during the late Vedic 
period marked an increase in brahman power and 
arrogance, and Buddhism and Jainism both appealed 
to ksatriyas and vaiśyas, partly because of 
their rejection of brahman pretensions (Ghurye 
1932, pp. 65, 69, 70, in 1950 edition). Speaking 
generally, it was not so much the throne that at- 
tracted brahmans as the power behind it. 

Hinduism has a tradition of tolerance, and 
Hindu rulers in general seem to have been hos- 
pitable to different sects and religions. Hindu 
tolerance is, however, related to the caste system 
in several ways. First, each caste has its own 
style of life, and from childhood onward people 
accept diversity as a basis for relationship. Second, 
caste, along with village and extended kin groups, 
ensured conformity in practice, and a stable society 
could afford to give its members intellectual free- 
dom. The other source of such freedom was the 
institution of the holy man, who ritually renounced 
the world—his relatives even performed funeral 
rites for him at his initiation into the order—and 
who could then preach as he wished. Max Weber 
has rightly said, “The freedom of thought in 
ancient India was so considerable as to find no 
parallel in the West before the most recent age” 
(quoted in Smith 1963, p. 62). 

Hindu tolerance of other religions and its hospi- 
tality to new ideas provided a favorable soil for 
the eventual declaration of India as a secular 
state. There were, however, other tendencies, and 
certain nineteenth-century attempts (e.g., the 
Arya Samaj) to purge Hinduism of its many evils 
by advocating a return to the Vedas contained 
frankly revivalist elements. Moreover, Indian na- 
tionalism also expressed itself occasionally in the 
Hindu idiom, and this had the effect of alienating 
the Muslims. But during British rule there emerged 
a highly westernized Hindu elite, which, while 
rooted in the country and its traditions, was com- 
mitted to independence, democracy, egalitarianism, 
and secularism. It is this elite that not only de- 
clared India a secular state but also attempted 
wholeheartedly to establish the principle of the 
equality of man. 

Hinduism and economic development. Weber 
(1921) thought that the Hindu belief in the trans- 
migration of the soul and the related doctrines of 
karma and dharma, seen in the context of caste, 
produced an irrational, otherworldly social ethic 
that prevented the development of industrial capi- 
talism. Weber’s thesis has gained wide popularity, 
and Hinduism is now believed to be one of the 
major obstacles in India’s economic development. 
This belief, however, rests on a partial view of 
Hinduism. Weber himself noted a few elements 
of a “rational” ethic in Hinduism—the existence of 
this-worldly asceticism and positive economic motivation 
among Jains, Lingayats, and Madhvas, and 
an occupational ethic among merchants and artisans. 
There are elements in Hinduism favorable 
to economic development (Singer et al. 1958; 
Lambert & Hoselitz 1963). The very ascetics whom 
Weber considered disseminators of irrational and 
otherworldly ideas among the masses are often 
the heads of large and wealthy monasteries and 
temples, the management of which calls for con- 
siderable administrative ability. 

Hindu reform and modernization. Hinduism 
has, in the course of its long history, undergone 
many and radical changes, and several diverse 
forces have contributed to making Hinduism what 
it is today. The establishment of the Pax Britannica 
released many new forces, affecting Hinduism at 
every point. The disruption by the British of some 
of the social institutions of Hinduism, such as 
caste, untouchability, satī, human sacrifice, female 
infanticide, infant marriage, etc., made it clear to 
the orthodox that the state could use its power to 
alter their religious institutions. European mis- 
sionaries who came to India for evangelical pur- 
poses sharply criticized Hinduism, and Hindus 
were made to realize poignantly that some influ- 
ential outsiders thought that everything was wrong 
with their religion. Reformist Hindus—many of 
whom had attended mission schools—could not 
help remembering missionary criticisms of their 
religion, and in creating institutions to bring about 
changes in their society, they naturally emulated 
the organizations and work of their critics. From 
a long-term point of view the most important ele- 
ment in the reinterpretation and reformulation of 
Hinduism was the emergence of a westernized 
Indian elite, which eventually took over power 
from the British and which used that power to 
introduce fundamental changes, such as the aboli- 
tion of untouchability, the legalization of inter- 
caste marriage, widow marriage, and divorce, and 
the enforcement of monogamy. It is this elite that 
after a century and a half of Western influence 
declared India to be a secular state. Secularism 
does not mean that evil social institutions will be 
allowed to flourish, just as the principle of equality 
has not prevented the state from giving a variety 
of special privileges to Scheduled Castes and Tribes 
for a specific period of time, in order to enable 
them to catch up with the others. 

One of the great reforming leaders of the new 
elite was Raja Ram Mohan Roy, who founded a 
religious society, the Brahma Samaj, in 1828. A 
daring religious thinker, scholar, and educator, 
Ram Mohan Roy was influenced by Vedanta and 
Islam before he studied Christianity; he was a 
monotheist and opposed to idolatry. He was an able 
and courageous controversialist and fought the orthodox 
pandits with arguments they could appreciate. 
For instance, in his efforts to purge Hinduism 
of the idolatrous accretions it had acquired over the 
centuries, he advocated a return to Vedic Hinduism: 
“Like Luther, who appealed to the Bible as an 
authority against medieval corruptions, he took his 
stand on the Vedas, the earliest Hindu scriptures, 
in which he found a form of pure Hinduism, of 
which the basis was a belief in one God, which 
was not vitiated by idolatry, and which gave no 
sanction to distinctions of caste or such practices 
as suttee” (O'Malley 1941, p. 67). (Dayananda 
Saraswati, who founded the Arya Samaj, a reli- 
gious brotherhood, in 1875, was only following 
Ram Mohan Roy in his efforts to introduce radical 
changes in Hinduism by championing a revival 
of Vedic Hinduism.) Ram Mohan Roy was aware 
that an appeal to the authority of the Vedas would 
carry weight with the orthodox pandits, and he 
set the style for a debate that went on for nearly 
a hundred years between reformists and diehards, 
both of whom quoted the scriptures in support of 
their views. 

Ram Mohan Roy also discovered that a prin- 
ciple of reason lay at the basis of the classical 
Indian philosophy of Vedanta, as set forth in the 
Upanisads. “Ram Mohan Roy abandoned the tradi- 
tionally accepted bases of Hindu religion and 
Brahminic authority in favor of reason. Hinduism 
could be justified in its essentials on the ground 
that it provided a reasoned explanation of reality. 
Everything from the West could be considered in 
the same light. There could be assimilation and 
not merely borrowing at random” (Spear 1961, 
p. 296). He thus laid the foundation for a reinter- 
pretation of Hinduism freeing it not only from 
social institutions such as caste and untouch- 
ability but also from a welter of beliefs, ideas, 
myths, and ritual. He did not regard these as the 
“essence” of Hinduism. The essence was selected 
portions of the Vedanta, the Bhagavad Gita, and 
bhakti or devotional literature. Subsequent re- 
formers, such as Vivekananda, Saraswati, Auro- 
bindo Ghose, Tilak, Gandhi, and Radhakrishnan, 
carried on the work of reinterpretation. Over the 
years the reformers, the greatest of whom was 
Gandhi, built up a body of public opinion in favor 
of introducing drastic changes in Hinduism. It 
was this opinion which later enabled the state to 
take legislative action against certain Hindu prac- 
tices that were repugnant to the modern outlook. 

The second half of the nineteenth century saw 
the beginnings of the growth of nationalist senti- 
ment among Hindus, and nascent patriotism drew 
upon Hindu sentiment and traditions. A concern 
for the people and the country inevitably meant 
activism, and the Bhagavad Gita provided a reli- 
gious source for political activism and altruism. 
The karma marga, or “path of works,” received 
emphasis at the expense of the other two paths. 

Two contrary processes have been gaining 
strength in modern India. The first one is 
Sanskritization, a process by which the rites, cus- 
toms, beliefs, and style of life of the higher castes, 
and in particular the brahmans, are taken over by 
the lower groups, including the Untouchables and 
the tribal peoples. But while the lower groups in- 
creasingly Sanskritize their style of life, the higher 
strata become increasingly westernized. Western- 
ization, like Sanskritization, is a multilayered 
process, including the acceptance of Western tech- 
nology, Western political, legal, and social institu- 
tions, and Western literature, philosophy, and 
science. The spread of Sanskritization and west- 
ernization across the country and to different struc- 
tural levels is beginning to produce nationwide 
uniformities in religion and culture. Everywhere 
village deities traditionally associated with epi- 
demics of diseases such as plague, smallpox, and 
cholera seem to be losing ground, while the pres- 
tigious Sanskritic deities are becoming more 
popular. Blood sacrifices and offerings of liquor 
to deities are also becoming less popular. The 
horizon of the peasant is widening, and the richer 
peasants now visit pilgrimage centers several 
hundred miles away from their villages. Films, 
radio, textbooks, newspapers, journals, and paper- 
back books are strengthening “regional” and “all- 
India” Hinduism, at the expense of strictly “local” 
forms. Life-cycle rituals are becoming abbreviated, 
while the purely social aspects of such rituals get 
elaborated; this seems to be particularly true of 
educated Hindus in towns. In fact, the forms 
Hinduism is taking among the educated urban 
Hindus are only beginning to be explored by social 
scientists (see, for example, Singer 1959). The 
search for a satisfying philosophy leads many 
members of this class to become devotees of one or 
another spiritual leader. Some of these leaders 
are traditional heads of monasteries, while others 
are modern figures. Hinduism has in the past de- 
pended upon caste, village, joint family, Hindu 
kings, monasteries, and centers of pilgrimage for 
its perpetuation. Radical changes have occurred 
in all of them. Moreover, there has been a growth 
in secularization, egalitarianism, and rationalism. 
But new organizations such as the Ramakrishna 
Mission, the various hermitages of religious leaders, 
the Bharatiya Vidya Bhavan, and finally, departments 
of the central government such as the 
All-India Radio, and in some states departments 
supervising temple administration, are reinterpret- 
ing Hinduism in a modern direction. 


M. N. SRINIVAS AND A. M. SHAH 


[See also ASIAN SOCIETY, article on SOUTH ASIA; CASTE, 
especially the article on THE INDIAN CASTE SYSTEM; 
POLLUTION.] 
HINTZE, OTTO 


Otto Hintze (1861-1940) spent the first twenty 
years of his life in his native Prussian province of 
Pomerania. After studying for two years at the 
University of Greifswald, he transferred in 1880 
to the University of Berlin, where as a student of 
J. G. Droysen he became deeply imbued with the 
ethical implications and political principles of the 
Prussian tradition. He early impressed his teachers 
at the university with his capacity for relating 
minutiae to generalizations as well as with his in- 
dependent, critical judgment (Hartung [1941] 
1961, pp. 500-501). At a later point in his studies 
Hintze became associated with Gustav Schmoller, 
and this association greatly influenced his develop- 
ment as a professional historian, an influence re- 
flected in his publications during the next decades. 

Hintze’s first major work was a study of the 
Prussian silk industry in the eighteenth century. 
It consisted of two volumes of documents and a 
third volume that analyzed the effects on the industry 
of the mercantilist policies of Frederick II. 
At the completion of this work Hintze, at the age 
of 34, was given an appointment at the University 
of Berlin, and thereafter he was rapidly promoted. 
His position entailed lecturing on constitutional, 
administrative, and economic history as well as on 
politics, a rather unusual combination at the time. 

As a collaborator of Schmoller’s in the publica- 
tion of the Acta borussica, Hintze next turned to 
the study of Prussian administration from 1740 to 
1756. The six volumes of documents published as 
a result of this work carried an introduction by 
Hintze (1901) that was a penetrating analysis of 
this administrative system and of the major politi- 
cal decisions it embodied. Subsequently, Hintze 
continued to be associated with the Acta borussica 
in a leading capacity, while also becoming editor- 
in-chief of the Forschungen zur brandenburgischen 
und preussischen Geschichte. 

Hintze’s writings on Prussian history—a series 
of essays published at various times between 1896 
and 1931 and collected as Geist und Epochen der 
preussischen Geschichte (1896-1931) and his 
summary volume Die Hohenzollern und ihr Werk 
(1915)—show a high order of critical, scholarly 
judgment. Although he was an avowed admirer of 
the Prussian monarchy and an avowed conservative, 
his approach to Prussian political institutions 
was objective and comparative. He came to see the 
Prussian case as a “paradigm for the formations 
and transformations of the modern state generally” 
(Hartung [1941] 1961, p. 506). The case of Prus- 
sia, as Hintze saw it, illustrates J. R. Seeley’s 
dictum that the degree of internal freedom in a 
country is inversely proportional to the political and 
military pressures on its frontiers. More generally, 
this means that the internal social structure of a 
country—its class conflicts and constitutional 
framework—and its international position must be 
analyzed together. Such an approach shows that 
Prussia’s militarism and monarchical constitution 
were by-products of her central European location; 
no attempt to form a nation-state in this area could 
succeed without the discipline and centralized au- 
thority of Prussian institutions (1915, pp. vi-vii). 
Applying this analysis to other times and places 
Hintze showed that in the sixteenth and seven- 
teenth centuries the great struggle between France 
and the Hapsburg empire, together with the im- 
potence of Germany, left countries like Poland in 
a power vacuum, which greatly facilitated such 
internal conflicts as those between powerful fam- 
ilies and thus militated against the formation of a 
strong political structure ([1902-1932] 1962, pp. 
515-521). Again, in England, early political cen- 
tralization, together with relative military security, 
strongly militated against absolutism, bureaucracy, 
and militarism of the Continental type, so that the 
modern English state of the eighteenth and nine- 
teenth centuries depended on the ruling agrarian 
and industrial classes rather than on the crown 
([1902-1932] 1962, pp. 362-368). 

Hintze’s nationalist and monarchist persuasion 
was severely shaken by Germany's defeat in 1918. 
That experience brought greater acceptance on his 
part of democratic tendencies nationally and fed- 
erative tendencies internationally, although so great 
a change of view was naturally difficult for a con- 
vinced conservative (Oestreich 1964, pp. 21-22). 
Impaired health forced him to discontinue univer- 
sity teaching, and during the years of the Weimar 
Republic he was a private scholar and publicist. 
Hitler's rise to power completely destroyed the 
world in which he had grown to maturity, and in 
1938 Hintze, whose wife was Jewish, resigned from 
the Prussian Academy of Science. When his col- 
lected essays were published shortly after his death 
in 1940, all his discussions of the work of Jewish 
authors were excised. 

Despite disillusionment and personal adversity, 
after World War I Hintze turned to a fuller elaboration 
of his projected comparative, constitutional 
history. There are indications that he completed a 
major work in this field but that it was lost during 
World War II. What remains is a series of essays 
on administrative history, feudalism, the estate- 
constitutions of Europe, the world-historical con- 
ditions of representative institutions in Western 
European societies, and Polish constitutional his- 
tory, as well as brief synopses of his views concern- 
ing the major phases of European political history. 
Taken together, these writings do for the study of 
political institutions what Max Weber's sociology 
of religion did for the study of religious beliefs. By 
comparative analysis they reveal the institutional 
preconditions that shaped the Western system of 
constitutional states and the major types of con- 
stitutions that emerged within this system. 
Hintze’s work, which is informed with his wide 
knowledge of history, has a developmental empha- 
sis that is not evolutionist and a conceptual ap- 
proach that, like Weber's, brings out the singularity 
of a historical sequence or configuration by means 
of strategic contrasts with other civilizations. He 
had no use for biological analogies in social analy- 
sis, since he was much concerned with the role of 
ideas and political decisions in social change. For 
him as for Weber, functional relations are the end 
products of human actions that may produce inno- 
vations that are no less genuine because they are 
conditioned. Although it remained fragmentary, 
Hintze’s work is a major contribution to the com- 
parative analysis of social and political institutions. 


REINHARD BENDIX 


[For the historical context of Hintze’s work, see the 
biographies of SCHMOLLER; WEBER, MAX.] 
I 
THE RHETORIC OF HISTORY 


In this article the word “historiography” will 
be taken to mean the craft of writing history 
and/or the yield of such writing considered in its 
rhetorical aspect. The term “history” will be used 
to describe the study of the past as a systematic 
discipline. It will not be used to refer to the past 
as such. Unnecessary ambiguities created by using 
the term “history” to identify both the past and the 
systematic study of it have occasionally led to 
gratuitous confusion. The terms “historiography,” 
“the rhetoric of history,” and “history writing” will 
be used as synonyms. Historiography is different 
from the collection of historical evidence, the editing 
of historical sources, the exercise of historical 
thought and imagination, the criticism of histori- 
cal writing, and the philosophy of history, but it 
is related to all of them and overlaps some of them. 
It is also different from the history of history 
writing. This distinction must be kept in mind 
because in recent decades the term “historiography” 
has increasingly been used to mean the history of 
history writing—in effect, a branch of intellectual 
history or a subbranch of the sociology of knowl- 
edge. It will not be used in that sense here. 

Perhaps the drift of the term “historiography” 
from its initial moorings of meaning resulted from 
the neglect of systematic study of the rhetoric of 
history and from the view, usually implicit rather 
than explicit, that that rhetoric is not specific 
to history, or not specific enough to warrant sys- 
tematic study; for if the rhetoric of history is 
generic rather than specific, then historiography 
so defined has no separate identity and merges 
into general rhetoric. Although many historians 
have said wise and witty things about the writing 
of history, none has focused more than intermit- 
tent attention on the problems it raises. The casual- 
ness with which historians have investigated the 
structure of historiography, compared with the care 
and exacting scrutiny to which they subject the 
nature of data, evidence, and inference in works 
of history, indicates that in their role as critics 
they regard the latter as the historians’ legitimate 
central preoccupation and the former as a second- 
ary and independent matter in which excellence, 
although desirable, is dispensable and without 
effect on the validity of the finished work. 

To test this implied judgment—which is very 
hard to reconcile with the care and pains which 
many of the best historians lavish on their own 
history writing—this article will attempt an analy- 
sis of historiography. 

The models of history writing that will be dealt 
with will be those provided in recent times by the 
better historians in their better moments. “In recent 
times” because, although the writing of history 
is old, the general professional commitment 
of historians always to write it with the maxi- 
mum verisimilitude to the past is relatively new— 
scarcely 150 years old. The focus will be on “the 
better historians in their better moments” because 
they should and often do set the standards at 
which the profession aims. It is the practice of 
these historians when actually writing history, 
not their explicit theoretical or quasi-philosophical 
views, that is of concern here; for it is in their 
practice of historiography rather than in peripheral 
excursions into the problems of methodology that
they reveal their effective commitments. Although 
much that follows will apply to all historiography, 
the article is only incidentally concerned with his- 
torical writings intended to codify knowledge of 
history already available, such as textbooks; pri- 
marily it is focused on history writing aimed at 
extending the bounds of historical knowledge. 
Since historiography communicates what the his- 
torian knows or thinks he knows, a consideration 
of the historian’s way of writing that does not 
relate it to his way of knowing is doomed to trivi- 
ality. Finally, as an overarching principle this 
article will seek to relate the rhetoric of history to 
the rhetoric of the mathematizing natural sciences, 
taking physics and chemistry as models, in order 
to make clear the similarities of historiography to 
these other ways in which men seek to communi- 
cate what they think they know, and its differences 
from them. 


The language of historians 


The most cursory comparison of any profes- 
sional journal of history with an issue of the 
Physical Review will carry the conviction that 
the rhetoric of history differs grossly from that of 
physics. Ideally the vocabulary of physics is exact 
and denotative, and its syntax is mathematical, 
expressing quantitative relations between entities 
defined with the minimum ambiguity possible, 
given the current state of knowledge of the science. 
Now a part of the distinctive vocabulary of histori- 
ography is also denotative and unambiguous—the 
terms “pipe roll” or “writ of trespass” no less 
so, for example, than the term “specific gravity,” 
although most if not all of the vocabulary of his- 
tory that is at once distinctive and unambiguous 
is likely to refer to universes more constricted in 
time and space or less homogeneous and uniform 
than the ones the vocabulary of physics refers 
to. Moreover, historians often use quantitative 
data: grain yields per acre, rates of population 
growth, average rainfall; and sometimes they em- 
ploy the syntax of mathematics to determine such 
matters as median income, ratio of cargo weight 
to number of sailors, range of probable error in 
equating number of manors owned with gross 
income from land. Thus, they occasionally mathe- 
matize, they often quantify and enumerate, and 
part of their vocabulary is wholly denotative. When 
operating in a sector of history writing where they 
deem rhetorical devices of this sort alone to be 
appropriate, they assimilate the form of their vo- 
cabulary very closely to that of the natural sciences 
and exercise a like care to decontaminate it of con-
notative and evocative overtones. Yet very few his-
torians consistently write history this way, and
scarcely any of that very few would insist that all 
history ought to aim at the rhetoric of the natural 
sciences as an ideal goal; they do not seek to make 
the rhetorical form which they find convenient 
into a prescriptive rule for the entire profession, 
as they would be obligated to do if they thought 
that only with such a rhetoric could one come close 
to an account of things past that approaches maxi- 
mum verisimilitude. 

In this respect historians differ notably from 
many social scientists. Both historians and social 
scientists claim, as natural scientists do, that by 
their method of investigation and formulation ap- 
propriately applied, they are able to produce cu- 
mulative increments to men’s knowledge. Both 
historians and social scientists acknowledge that 
the form of their rhetoric is not always coincident 
with that of the natural sciences. But since most 
social scientists take the rhetoric of the natural 
sciences as their goal and ideal, they make a major 
intellectual effort to assimilate their way of writing 
to that of the natural sciences. In this matter they 
regard any nonconformity on their part as a de- 
formity, to be either overcome or lamented. In 
practice, historians rarely see historiographic prob- 
lems in this light. Far from always seeking the 
forms of language which will enable them to make 
historiography into the closest possible replica of 
the language of the natural sciences, they choose 
—often unself-consciously, but sometimes well 
aware of what they are doing—to write in a way 
that the rhetoric of the sciences forecloses. They 
deliberately choose a word or a phrase that is im- 
precise and may turn out to be ambiguous, be- 
cause of its rich aura of connotation. Without 
compunction they sacrifice exactness for evocative 
force. Since the only common purpose to which 
historians are bound by their calling to commit 
themselves is to advance understanding of the past, 
the only possible justification for such a sacrifice 
is that it serves to increase knowledge of the past, 
that sometimes an evocative rhetoric is the best 
means a historian has for formulating and com- 
municating what he knows. Whether or not the 
historians who in practice follow this rhetorical 
strategy are fully aware of it, the implication of 
this strategy is very radical. It entails the claim 
that historians can produce cumulative increments 
of knowledge without consistent resort to the rhet- 
oric which scientists have found indispensable for 
formulating and communicating what they know. 
Positively it implies that for communicating what 
the historian knows, a rhetoric more like that em-
ployed in the fictive arts than like that employed
in the sciences is not only permissible but on 
occasion indispensable. 


Knowing and communicating 


To analyze historiography, we must first deter- 
mine its place in the general process of knowing 
and communicating. In order to identify and com- 
prehend this general process, we will start with a 
concrete proposition: Willie Mays knows baseball. 
To deny this proposition is to fly in the face of the 
unanimous and considered judgment of several 
million people who have seen Mays play ball. Al- 
ternatively, it is to restrict the meaning of the term 
“know” so drastically as to impede the flow of dis- 
course and snarl up a channel of communication. 
For a man who can convince some five million 
observers out of five million, and his employers to 
the tune of a salary of $125,000 per year, that he 
knows has come as close to ironclad demonstra- 
tion as one can get in matters involving human 
judgment. To generate this conviction, Mays em- 
ploys the rhetoric of action, the most common and 
universal method of demonstrating that one knows. 
(The author realizes that he is rather stretching 
the common sense of the term “rhetoric” in the 
present context.) The adequacy of such demon- 
stration is measured by consistent appropriateness 
of visible (or perceivable) response. Briefly, Willie 
Mays shows that he knows by what he does. While 
he demonstrates almost perfectly that he knows, 
for two reasons he would have great difficulty in 
saying how he knows what he knows. (1) Since 
much of what he knows lies in the area of action, 
he probably knows no way of putting it into words. 
(2) Much of what he knows, he knows from long 
experience, and although some of what he knows 
in this way can be rendered accessible in verbal 
form, and is known by others in that form, that 
is not in fact how he knows it. If we went further 
and insisted that he so exactly communicate in 
words what he knows that others could test its 
validity by replication, we would clearly be asking 
the impossible, because he does not know what he 
knows with some abstraction from himself called 
his discursive intellect but with his whole person; 
and in every game he plays, he demonstrates its 
validity, not by verbal or mathematical formula- 
tion but by the unique and unreplicable perfection 
of his response.

In one area of human knowing, investigation 
in the natural sciences, however, formal conven- 
tion has established as a minimal evidence of 
knowing the ability to state the results in a 
wholly denotative rhetoric of verbal signs; all con-
sistently appropriate responses are required ulti-
mately to assume this form. This convention has
been adopted because scientists set a very high 
value on generalizability of statement, replicability 
of experiment, and logical entailment as tests of 
knowing; and the rhetoric they have chosen en- 
forces entailment and makes generalization and 
replication possible with maximum efficiency and 
certainty. Scientists have no doubt adopted this 
particular rhetorical stratagem because they have 
found it very useful for eliciting and testing 
answers to the kinds of questions they habitually 
ask. Correlatively, however, it prevents them, as 
scientists, from asking questions that are recalci- 
trant to formulation in the rhetoric they have 
chosen, and a fortiori from answering such ques- 
tions. As a result of this set of circumstances, in 
the sciences it is by definition meaningless to say 
that one knows or understands a result, a law, 
or a hypothesis but cannot state or communicate 
what one thus claims to understand. If I cannot 
state the binomial theorem, or the law of inverse 
squares, or the valence rule, I do not know it, and
any claim to the contrary is simply nonsense. In 
effect, the strategy of the sciences in the codifica- 
tion of their results is to reduce to a minimum in 
practice and to nothing in theory the gap between 
cognition and written communication. 

The range of knowing, then, extends from what 
Willie Mays knows about baseball, which we would 
accept as demonstrated whether or not Mays can 
put together a single coherent sentence on the 
subject, to what a physicist knows about the re- 
sults of investigations in physics, which he can 
demonstrate only with a denotative vocabulary and 
a mathematical syntax. The common factor is the 
capacity of each to render a consistently appro- 
priate response; the difference lies in the nature of 
the response that is deemed appropriate; and the 
criterion of appropriateness is efficacy in encom- 
passing the purpose of the activity to which the 
actor has committed himself.

In this general setting of the relation between 
knowing and communicating, what is the relation 
between knowing history and its communication 
in writing, that is, historiography? In the first 
place, among professional historians there is a 
rough consensus about the responses appropriate 
for writers of history when they set out to com- 
municate what they know. This consensus enables 
one to speak of “the better historians” with a sense 
that one’s judgment is not merely personal, ran- 
dom, or arbitrary. The consensus is approximately 
registered by the price a particular historian fetches 
in the current job market. (Only approximately, 
however, since a number of items other than the 
profession’s estimate of his history writing go to 
make his price—his age, his reputed aptitude in
face-to-face teaching, his personal qualities, the 
relation of supply to demand in his special field of 
interest, and so on.) Second, in writing history, as 
we have seen, some historians consistently use a 
vocabulary and a syntax similar to those which 
are standard in the natural sciences and some do 
not. There is, however, no significant correlation 
between the list of those who do so and the list of 
“better historians.” This implies that in histori- 
ography, historians do not accept the scientists’ 
conception of what constitutes a consistently ap- 
propriate response in the public presentation of 
the results of scientific investigation. An inquiry 
of no very pressing sort would reveal that in prac- 
tice they have refused fully to commit themselves 
to a scientific rhetoric because they concern them- 
selves with questions and answers which are not 
wholly tractable to the kind of formulation that 
scientists aim at, and because they assume in prac- 
tice a relationship between cognition and com- 


munication in history different from that which is - 


currently acceptable in the sciences. 

Historians acquire their command of the data 
that they ultimately deploy in writing history 
through reading and considering what men have 
written in the past and about the past and by 
giving attention to remnants and traces of men’s 
handiwork surviving from the past: buildings, 
tools, pictures, field systems, tombs, pots—arche- 
ological data in the broadest sense. The process 
of learning starts in a haphazard way with the 
student’s earliest interest in history, and for the 
professional historian becomes more systematic 
and more sharply focused in the course of his 
training. In general, the training of a historian aims 
at the complementary but sometimes conflicting 
goals of simultaneously extending the range of his 
knowledge and of bringing it intensively to bear
on some limited constellation of past happenings, 
in writing about which he is expected to make 
some contribution to the advancement of historical 
knowledge. 

This process of knowing which goes on through- 
out the historian’s active professional life is not 
identical with the knowing through experience 
which enables most men to meet most of the 
contingencies of their days on earth without con- 
tinual bafflement, frustration, and disaster; but it 
is similar to it. It is not identical because in some 
measure the historian chooses what he will con- 
front, while much of what men know through day- 
to-day experience comes at them haphazardly by 
no choice of their own. 

Second, the layman’s kind of knowledge is in 
large part acquired through face-to-face confronta-
tion with persons and situations, while the histo-
rian’s confrontation with men and situations of the
past is mainly (in most cases, wholly) indirect, 
mediated by the surviving documentary and archeo- 
logical record. Nevertheless, the difference ought 
not to be exaggerated, for there is considerable over- 
lap. The printing press and more recent media of 
communication have vastly extended the role of 
indirect confrontation in day-to-day experiential
knowledge. Moreover, in his quest for understand- 
ing of the past, the historian relies to a considerable 
extent on the cautious and qualified analogical ap- 
plication of experiential knowledge which he has 
accumulated in personal, face-to-face transactions 
during his own lifetime. 

Finally, the difference between face-to-face con- 
frontation and confrontation mediated by the 
historical record is one of mode but not neces- 
sarily of quality, or intensity, or depth, or coher- 
ence, or completeness. Any man who reads and 
meditates on the 12 volumes of the correspondence 
of Desiderius Erasmus, extending over a span of 
four decades, plus his massive literary output, may 
justly argue that in quality, intensity, depth, co- 
herence, and completeness his experiential knowl- 
edge of that sixteenth-century intellectual, acquired 
through mediated confrontation, is more firmly 
based than the experiential knowledge that any 
contemporary of Erasmus had of him. Indeed, it 
is as firmly based as the knowledge available today 
about any intellectual now living. The similarity 
between the two kinds of knowledge we have been 
considering is that for many purposes the consis- 
tently appropriate response which indicates their 
presence is not and need not be verbal, and when 
verbal, it need not and quite possibly cannot take 
the form of wholly denotative statement, much 
less of mathematical formulation. For such pur- 
poses it lies closer to the Willie Mays pole of re- 
sponse by effective action than to the scientific 
pole of response by communicating results in un- 
ambiguous statement. 

In a large part of his work, the historian has no 
need at all for such statement or for any coherent 
verbal statement at all. He is massively engaged 
in finding out what happened and how it happened. 
To do these things he must formulate rough hy- 
potheses, often very rough, about what happened 
and how it happened, and then examine the avail- 
able record to verify or correct his hypotheses. 
But at the outset, from an almost limitless range 
of conceivable hypotheses he must select for in- 
vestigation the very few that lie somewhere in 
the target area; he must select only those for 
which the surviving records hold forth some hope 
of verification; and he must have a sense of what 
records among a multitude are likely to provide
the evidence he needs. A historian unable to do 
any of these things would remain an inept novice 
all his life. Relying on their knowledge of the past, 
historians successfully do these things day after 
day, yet most of them would be at a loss to explain 
their particular choices. They tend to ascribe their 
general aptitude for making better rather than 
worse choices to “knowing the ropes” and “having 
been around a long time in this period.” In effect, 
the knowledge that a historian relies on for a very 
considerable part of his work is experiential and 
results from a long and extensive familiarity with 
the historical record. (There is a legitimate doubt 
whether much of the foregoing does not apply as 
readily to the actual work of scientists as to that 
of historians.) For the historian the link between 
knowing and communicating is loose and weak; 
on the basis of his own experience of this loose- 
ness, he inclines to give some, if not full, faith and 
credit to colleagues who claim to know about the 
past much that they cannot adequately express 
in writing. Communication through historiography 
requires historians to put into written words what 
they know experientially and diffusely about the 
past, to organize it into coherent and sequential 
statements in order to make it fully accessible first 
to themselves and then to others. Their communi- 
cation with others, the history they end up writing, 
thus starts four removes from the episodes in the 
past that concern them. Between the two lie the 
historical record, the historians’ experiential knowl- 
edge acquired through their exploration of that 
record, and their attempts to communicate to 
themselves what they know. 

Through these interposed layers historians, in 
and by their writing, seek, along with many other 
things, to enable their readers to follow the move-
ment and to sense the tempo of events; to grasp
and do justice to the motives and actions of men; 
to discern the imperatives that move men to action; 
and to distinguish those imperatives from the 
pseudo imperatives that have become mere exer- 
cises in pious ejaculation; to recognize the impact 
on the course of events of an accident, a catas- 
trophe, or a bit of luck; and to be aware of what 
the participants in a struggle conceived the stakes 
to be. (This particular set of items which historians 
sometimes feel called on to communicate through 
the three layers has purposely been selected be- 
cause (1) none of them are explanatory in the 
scientific sense and (2) none can be effectively 
communicated in a purely scientific rhetoric.) The 
historian’s ability both as an investigator of the 
past and as a writer of history is measured nega-
tively by the extent to which he lets the layers that
intervene between the episode in the past and the 
reader of his work insulate the reader from the 
past, positively by the extent to which he is able 
to make these layers serve as a conductor of 
knowledge of the past to his readers. Historians 
give faith and credit to their fellows who protest 
that they know and yet confess that they cannot 
communicate what they know, because in some 
measure every historian is aware how far he has 
failed in his writing of history to penetrate those 
layers in his effort to communicate what he knows 
of the past. 

Given this sense of the inadequacy of their use 
of language to their task, historians would surely 
welcome as an alternative a wholly denotative 
universal vocabulary which would narrow to a 
scarcely discernible crevice the perilous chasm 
that for them separates cognition from communi- 
cation and sets them ransacking the whole store- 
house of their mother tongue instead of relying on 
a manageable number of well-designed symbolic 
structures to overcome it. The fact that no such 
alternative adequate to communicate what his- 
torians know about the past has up to now emerged 
suggests that the relation between knowing the
past and the writing of history is such as to pre- 
clude that alternative, that in practice historians 
believe that the sacrifice of the knowledge of the 
past which it would entail renders inappropriate 
the universal imposition in historiography of the 
denotative rhetoric of scientific discourse. 

Of course historians can avoid their rhetorical 
difficulties (1) by attempting to communicate 
about the past only that knowledge which can be 
expressed in a rhetoric nearly like that of the 
natural sciences or (2) by attempting to know 
about the past only what can be so communicated. 
In fact, historians have pursued both these courses. 
Some historians have taken the first course either 
because of a special interest in the sorts of his- 
torical problems manageable within the confines 
of a quasi-scientific rhetoric or because their special 
aptitude for that rhetoric has turned their atten- 
tion to the sorts of problem with which it can deal. 
Others have taken the second course either out 
of allegiance to a conception (or misconception) 
called scientific history or because by calling their 
thinking and writing about the past scientific his- 
tory, they thought they could sanctify their incom-
petence and dullness.

On the whole, however, historians have not been 
willing to truncate their knowledge of the past 
to fit the special aptitudes of a few historians or 
the misconceptions or painful ineptitude of a num-
ber of others. Instead, to make their experiential
knowledge of the past accessible to readers who 
cannot recapitulate the processes by which that 
knowledge was acquired, they have used almost 
every device of rhetoric compatible with their 
commitment to a clear and intelligible presenta- 
tion of the evidence on which their knowledge is 
based. That the language they use is frequently 
evocative and even metaphorical and that much 
of its vocabulary is not that of scientific demon- 
stration but of the ordinary discourse of educated 
men, testifies to their conviction—rarely explicit, 
sometimes not wholly conscious—that these are 
the appropriate means for bringing their readers 
into that confrontation with events long past and 
men long dead which is an indispensable condition 
of knowing them. To the extent that the historian’s 
rhetoric falls short of communicating what he be- 
lieves he has discovered about the past, what he 
thinks he knows does not become generally avail- 
able and cannot be tested publicly by other his- 
torians. To that extent, therefore, there is a loss 
of potential knowledge of the past. Conversely, to 
the extent that he succeeds in communicating 
anything that hitherto he alone has known, there 
is gain. Therefore, the advancement of historical 
knowledge depends to a considerable extent on the 
quality of the historian’s rhetoric, on the efficacy 
of his historiography, and is almost inseparable 
from it; far from impeding the advancement of 
historical knowledge, language evocative rather 
than wholly denotative in intent and character be-
comes a means and a condition of that advance-
ment. 


Modes of explanation 


Historiography is the means for communicating 
in writing what the historian thinks he knows 
about the past. Efficient and effective communica- 
tion requires him, in writing history, to array what 
he knows according to some principle of coherence. 
The principle of coherence traditionally and still 
most generally employed by historians is narrative. 
Usually, but not always, they communicate what 
they know by telling a story or stories. Despite its 
venerable antiquity, narrative has recently come 
under attack as a means for providing coherence 
in history. The most general ground for attack 
seems to be the contention that the coherence it 
provides is nonexplanatory or inadequately ex-
planatory. In this respect it is compared invidiously
with the principle of coherence by subsumption 
under general laws supposedly standard in the 
rhetoric of the sciences, which is said to provide 
adequate explanation. If subsumption under gen-
eral laws is the principle of coherence standard in
the sciences, if by the criteria of the sciences it 
alone provides adequate explanation, and if the 
provision of adequate explanation is the sole or 
prime function of the sciences, then clearly narra- 
tive does not meet the scientific standard of co- 
herence, nor does it provide adequate scientific 
explanation. It remains to ask, however, why his- 
torians should prefer a principle of coherence and 
criteria of explanatory adequacy borrowed from the 
rhetoric of the sciences both to narrative, their own 
traditional principle of coherence, and to the view 
of the nature and conditions of historical explana- 
tion which their use of narrative implies. 

The ascription of adequacy to explanations of 
the general-law type seems to be based on both 
aesthetic and practical considerations. (1) Ele- 
gance, precision, clarity, internal consistency, and 
structural tidiness—a place for everything and 
everything in its place—in the domain of knowing: 
all these are aesthetically attractive concomitants 
of general-law explanations and are the generator 
and the justification of the denotative vocabulary 
and the mathematical syntax at which scientists 
aim. (2) In many or most cases, such explanations 
afford scientists the opportunity to replicate the 
experiment that was offered as evidence of the 
applicability of the general law, and thus to test 
the validity either of the law or of its application. 
(3) The rhetoric of the sciences facilitates the 
rapid and efficient identification of new problems 
and problem areas. (4) The expansion of the range 
and precision of general-law explanations has 
roughly coincided on the temporal scale with an 
extraordinary expansion of man’s control of vast 
tracts of his environment over which hitherto he 
had exercised no control at all. Although by the 
general-law canon, or indeed by any canon of ex- 
planation that has not been in disrepute since 
Aristotle, such a coincidence taken alone does not 
adequately explain the expansion of control, still 
nothing succeeds like success, and the aura of 
prestige acquired by the natural sciences in the 
past three centuries has rubbed off on the criteria 
of adequate explanation ascribed to them, espe- 
cially in the eyes of intellectually insecure social 
scientists and historians. 

No one has ever made clear why the criteria of 
adequate explanation acceptable to scientists in 
their work should also be acceptable to historians 
in theirs, or why adequacy of explanation should 
be the sole criterion of consistently appropriate 
response for the historian engaged in the work 
of communicating his knowledge in writing. Ade- 
quacy of explanation is clearly relative to that 
which is to be explained. A guidebook to a city, for
example, explains the location of a “point of in- 
terest” by designating what streets it is on and 
nearest to and by indicating the means of access 
to it from other points by various means of trans- 
portation. A real estate survey explains its loca- 
tion by designating the frontage and length of the 
lot, the street it faces on, and the distance of the 
lot from the nearest intersecting street. Both ex- 
planations are accurate, exact enough for their 
respective purposes, and therefore adequate; each 
communicates the knowledge likely to be sought 
by one particular sort of seeker and thus pro- 
vides the appropriate response to his questions; 
neither invokes any general law, nor need it do 
so; neither is a scientific explanation, nor need it 
be so. Moreover, the notion that the sole appro-
priate response of the historian to his commit-
ment to communicate what he knows is something 
designated “explanation” is wildly arbitrary. It 
involves either consigning a large part of that 
response to the domain of irrelevance or so ex- 
tending the meaning of “explanation” as to render 
it unrecognizable by scientists, philosophers, ordi- 
nary readers, and historians themselves. 

Historical narrative. Narrative, which is the 
rhetorical mode most commonly resorted to by
historians, is also their most common mode of 
explanation. It is not in fact scientific by the cri- 
teria just indicated; it cannot be rendered scien- 
tific because it is formally not reducible to the 
general-law type of explanation; and no more than 
the “explanations where” set forth above need it 
be scientific in order to be adequate, unless one 
insists on applying the scientific criteria of ade- 
quacy to a nonscientific explanation. Narrative is 
the most common mode of historical explanation 
because it is often the kind of explanatory answer 
solicited by a kind of question that historians very 
often ask and that is very often asked of them. 
Two ordinary forms of this question are “How did 
it come about that... ?” and “How did he (or 
they) happen to... ?” For example, “How did it 
come about that a Labour government took power 
in England in 1964?” or “How did the New York 
Giants happen to play in the World Series in 
1951?” The following discussion will be organized 
around a treatment of this second question. The 
writer has selected it because it leads quickly into 
so many of the topics of this section and because 
its evidential base is one on which he is more than 
ordinarily well informed. 

The call for an explanation of how the Giants 
happened to play in the World Series of 1951 can 
be so construed as to make it amenable to explana-
tion of the general-law type.


A. The particular facts 

1. In 1951 the New York Giants were a baseball 
team in the National League. 

2. In 1951, during the official National League base- 
ball season, the New York Giants won more games 
from the other teams in that league and lost fewer to 
them than any of the other teams in the league won 
or lost. 

B. The general law 

Whenever during the official National League season 
a National League team wins more games and loses 
fewer than any other team in that league, it plays in 
the World Series. 

The answer perfectly fulfills all the requirements 
of the general-law type of explanation, including 
denotative univocal vocabulary and strict deduc- 
tive entailment. Yet from the point of view of the 
writer and reader of history, such an answer is 
patently unsatisfactory. The reason is that in the 
context of the National League season of 1951, 
the appropriate response to the question “How did 
the Giants happen to play in the World Series of 
1951?” is the historical story of how the Giants 
came to lead the National League at the end of 
the official season that year. A general-law ex- 
planation cannot tell that story; indeed, it cannot 
tell any story. It is not built to tell stories. From 
this very simple instance an important conclusion 
follows: general law and narrative are not merely 
alternative but equally valid modes of explanation. 
In the above instance the general-law explanation 
does not tell the questioner what he wants to know; 
for him it is neither a good nor a bad, neither an 
adequate nor an inadequate, explanation—it is no 
explanation at all. And conversely, to other ques- 
tioners asking scientists and historians for an 
“explanation,” a response in the form of a story 
would be quite inappropriate and therefore no ex- 
planation at all. The validity of either mode of 
explanation is determined by the appropriateness 
and adequacy of its response to a particular ques- 
tion. In effect, the validity of modes of explana- 
tion is not something that exists in vacuo, but only 
in relation to what particular inquirers at particular 
moments seek to know. 

In view of the frequent irrelevance of pure gen- 
eral-law explanation to past situations that require 
the telling of a story, attempts have been made to 
adapt the general-law type of explanation to narra- 
tive. Narrative explanation is usually presented as 
a series of statements of continuous causal linkages 
between events such that in the chains of causa-
tion (1) each effect is imputed to precedent causes
and (2) the imputation implies either the actuality
or the possibility of a general law or laws such that, 
taken with the precedent causes, they entail the 
effect. 

For present purposes it is to our advantage that 
the official rules of baseball provide us with a vo- 
cabulary almost as purely denotative as that of the 
sciences. In that vocabulary we can produce a 
narrative explanation of how the New York Giants 
won the 1951 National League pennant and thus 
played in the World Series; this explanation con- 
forms to the foregoing model. 


National League Standings as of September 30, 1951 


Won Lost 
Brooklyn Dodgers 96 58 
New York Giants 96 58 


Because of the tie at the end of the regular season 
Brooklyn and New York were required to play addi- 
tional games, the first team to win two games to be 
designated as the National League entry in the World 
Series. 

First additional game, Oct. 1, 1951: final score, 
New York 3, Brooklyn 1; games won, New York 1, 
Brooklyn 0.

Second additional game, Oct. 2, 1951: final score, 
Brooklyn 10, New York 0; games won, New York 1, 
Brooklyn 1. 

Third additional game, Oct. 3, 1951: inning-by- 
inning score to second half of the ninth inning: 
Brooklyn 100 000 030
New York 000 000 10
Score to second half of the ninth inning, Brooklyn 4, 
New York 1. New York at bat. 

The first batter singled. The second batter singled. 
Because the first batter was a reasonably fast runner, 
he advanced to third base. Because the third batter hit 
a short fly ball which was caught, he was out. Be- 
cause the fourth batter doubled, the first batter scored 
a run, and the second batter advanced to third base, 
where he was replaced by a substitute runner because 
he had hurt his leg. The Brooklyn pitcher was replaced 
because three New York players out of four had made 
safe hits off his pitching. Because the fifth batter hit 
a home run the substitute runner, the fourth batter, 
and the fifth batter scored runs. Because New York 
scored four runs in the second half of the ninth in- 
ning, making the score 5 to 4, they won the game. Be- 
cause they won two games of the play-off before 
Brooklyn did, they won more games and lost fewer 
than any other team in the National League. Because 
of this they played in the World Series of 1951. 


About the preceding narrative explanation a 
number of highly instructive points are worth 
noting.

(1) It almost perfectly conforms to the pro- 
posed structure of narrative explanation outlined 
above; that is, it is a series of sentences in which
the causal connections between the events men-
tioned are explicit or clearly implicit, and into
which the relevant possible general laws may 
readily be inserted. Any number of such laws are 
not merely possible but actually available, e.g., if 
the two leading teams in the National League have 
won and lost the same number of games at the end 
of the regular season, the rules require that they 
resolve the tie by playing against each other until 
one of them has won two games. 

(2) All the facts as stated are verifiably true 
and all the causal inferences are valid, and there- 
fore the whole narrative explanation is historically 
true and accurate in every respect. 

(3) Nevertheless, at a number of crucial points 
it is hard to see how particular effects were strictly 
entailed by a combination of antecedent causes and 
general laws. What, for example, is the general 
law which with the precondition (three hits and 
one out, among four men at bat) entails the re- 
placement of one pitcher by another? Even if one 
elaborated further on the boundary conditions— 
and that can be done—it is difficult to see how gen- 
eral laws can be invoked and a strict entailment 
made to work here. 

(4) It is true that once the substitute Brooklyn 
pitcher, Ralph Branca, whose mere presence in the 
game seems not amenable to narrative explanation 
(see above), released the ball, and once the fifth 
New York batter, Bobby Thomson, began his swing 
of the bat, a combination of a few special cases 
(mainly ballistic) of the general laws of motion 
with the National League ground rules on home 
runs suffices strictly to entail that Thomson hit a 
home run. It is hard, however, to envision the com- 
bination of conditions and laws that would strictly 
entail a decisive precondition of that home run: to 
wit, that Thomson decided to swing at Branca’s 
pitch in the first place. 

(5) Even if these problems of the logic of nar- 
rative explanation can be resolved, the account as 
presented raises a number of difficulties and ques- 
tions. (a) Why does the explanation begin with the 
play-off at the end of the regular season? On the 
face of it, in a regular season that ends in a tie, 
every game played throughout the season by the 
tied teams is of equal causal importance and there- 
fore should receive equal treatment. (b) By the 
same token, why is a fuller account (inning-by- 
inning score) given of the last game of the official 
season than of the two previous games, and a still 
fuller account of the last half of the last inning of
the last game?

(6) Most important, the explanation is historio-
graphically pitiful, and the historian who offered it
would immediately lose the historian’s moral equiv-
alent of a union card.

Figure 1 — The positions of the New York Giants in relation to the Brooklyn Dodgers in the
1951 pennant race, with standings shown at end of play on given date

Given the problem with which we started, these 
difficulties go to the heart of the trouble. They make 
it clear that offering an answer in the form of a 
narrative explanation which is structurally deter- 
mined solely by the logic of causal ascription is not 
an appropriate response to the difficulties or an ade- 
quate solution to the problem. Within the bounds of 
the logic of causal ascription there is no solution for 
them. That logic cannot justify the shifts in the 
scale of the story. Yet it is reasonable to suspect 
that one of the few things which most readers 
would intuitively regard as appropriate about the 
above dreary but true narrative response to the 
question about New York being in the World Series 
in 1951 would be precisely the successive expan-
sions of the scale of the story. The reason for this
is that the appropriate response to the question is 
not a true narrative explanation determined by the 
logic of causal ascription but the historical story 
truest to the past, determined by the rules of histor- 
ical evidence and the rhetorical rules of historical 
storytelling. Of this larger context a true narrative 
explanation is a part, but only a part. If this is so, 
then the true historical story rightly determined by 
the rules of historical rhetoric will be preferable to 
a true narrative explanation because it communi-
cates more knowledge and truth about the past
than such an explanation does. But if that is so,
then the rhetoric of history writing, not its logic 
alone, is implicated in providing increments of 
knowledge and truth about the past. 

Let us continue with the example under exami- 
nation, keeping in mind the problems of where to 
start the historical story and on what scale to tell 
it. Figure 1 describes the relative positions of the 
two contenders in the National League pennant 
race of 1951.

The first things to note in Figure 1 are the shifts 
in scale and the considerations which determined 
them. The over-all consideration is that of telling a 
historical story in such a way as to maximize the 
increment of knowledge and truth communicated. 
That within a framework identical with the one 
in which the figure is constructed (the 154-game 
baseball season) it may be desirable to have no 
change of scale and not to tell a story at all be- 
comes clear on considering the description of the 
American League season of 1939 in Figure 2. 

Figure 2 is constructed on uniform scales for 
each axis, plotting the games won by the New York 
Yankees against the games won by the team in the 
league that was in second place. That this season 
calls for no narrative explanation is manifested by 
the nonconvergence of the lines in Figure 2, which
shows (1) that by June 1, the Yankees were seven 
games ahead, (2) that thereafter the minimum gap
between them and their nearest rival was six
games, (3) that by the end of the season the
Yankees led by 17 games. In short, for more than
the latter two-thirds of the season there was never
a moment when it even looked like a pennant race,
so that on the face of the record, to answer the
question “How did the Yankees come to be in the
World Series?” with a historical story would be a
historiographic error: the even-paced, dull, and
trivial chronicle which the effort would yield would
itself demonstrate the inappropriateness of such a
rhetorical response.

Figure 2 — The positions of the New York Yankees and the second-place American League
team in the 1939 baseball season

By the same token, the climbing line in Figure 1 
indicates that the record of the 1951 National 
League season calls for a historical story, and that 
to write about its history without telling such a 
story is to fail to make the appropriate historio- 
graphic response. The data in Figure 1 start at the 
point where the extended historical story should 
begin: August 11, 1951, at the end of play. At that 
point New York was at its maximum distance be- 
hind Brooklyn, 13 games; and the next day New 
York began a series of 16 consecutive wins. For the 
next extension of the narrative and for the expan-
sion of the scale of the graph, the directive of the
record is more ambivalent. The options lie between 
(1) September 14, when, still six games behind, 
New York began a series of five consecutive wins 
which by September 18 moved it to within three
games of Brooklyn and (2) September 21, when,
four games behind, New York won the last seven 
games of the regular season. It had moved into a 
tie at the end of play September 28, kept the tie 
by winning the last two games of the regular sea- 
son, and was again tied at the end of the second 
game of the play-off. It is to be noted that (1) 
although there are two options for starting this 
expansion, there are only two serious options, and 
(2) they have an identical terminus ad quem, the 
point at which the next expansion of scale begins 
in the final game of the play-off. Either of the above 
alternative solutions is historiographically correct; 
any other is incorrect. There may be two or more 
right answers to some historiographic, as to some 
mathematical, problems. This does not imply or 
entail that there are no wrong answers. This simple 
observation and distinction, evident to any mathe- 
matics student who has gotten as far as quadratic 
equations, seems to have escaped most historians. 

The expansions of scale, then, are not arbitrary; 
each can clearly be justified from the historical rec- 
ord on historiographic grounds, and each expan- 
sion coincides with a period in telling about which 
the historical storyteller would extend the dimen-
sions of his story. Three further points must be
made.

(1) On the basis of true narrative explanation
determined by the logic of causal connection, it
proved impossible to determine where to begin the 
historical story of the 1951 pennant race or what 
dimensions to give to any of its parts. Indeed, since 
causal connection is subject both to infinite regress 
and to infinite ramification, and since that histor- 
ical story and any other must have a beginning and 
finite dimensions of its parts, it is in principle im- 
possible on the basis of the logic of narrative ex- 
planation alone to tell a historical story at all. On 
the other hand, the rhetoric of historical storytell-
ing provided us with the means of recognizing
whether there was a historical story to tell, where 
the story should start, and roughly what the rela- 
tive dimensions of its parts should be. If this is so, 
(a) in the telling of a historical story, increments 
of historical knowledge and truth are unattainable 
on the basis of the logic of narrative explanation 
alone, for on that basis alone it is impossible so 
much as to begin such a story; and (b) for achiev-
ing such increments, insights into the rhetoric of
historical storytelling, whether experiential and im- 
plicit or discursive and explicit, are indispensable. 

(2) The clock and the calendar provide no 
guidance to the appropriate dimensions of a his- 
torical story. Between those dimensions and mere 
duration, measured in homogeneous scaled incre-
ments, there is no congruence. The historical story-
teller's time is not clock-and-calendar time; it is
historical tempo. The problems involved in reason- 
ably accurate determination of historical tempo 
have never been systematically studied, although 
results of the disaster of not studying them strew 
the historiographic landscape. But two points are 
clear. (a) Disproportions in historical stories in-
duced by failure correctly to appraise historical
tempo result in the telling of distorted stories about 
the past. To that extent they diminish, and correct 
perception of tempo increases, available knowledge 
of the past. (b) The logic of narrative explanation 
has nothing to say on the subject of historical 
tempo; it is a question that can be dealt with only 
in the area of the rhetoric of historical storytelling. 
And so once again the communication of incre- 
ments of knowledge and truth about the past hinges 
on the correct solution of problems of historiog- 
raphy. (c) Correct determinations of historical 
tempo and the appropriate correlative expansions 
and contractions of scale in a historical story de-
pend on the examination in retrospect of the his-
torical record. That is to say, when the historian 
tells a historical story, he must not only know some- 
thing of the outcomes of the events that concern 
him; he must use what he knows in telling his 
story. 


In the case of writers of history, Gallie’s interest- 
ing analogy between historical understanding and 
following a game or story breaks down (1964, pp.
22-50). It applies to consumers of history, the
readers, not to its producers, the writers. The 
readers need not know the outcome of the story; 
and it is well if, at least, they do not know the 
writer's construal of the outcome, since not know-
ing it whets their curiosity and intensifies their en-
gagement and vicarious participation in the story,
thus augmenting their knowledge of the past. But 
unless the writer has the outcome in mind as he 
writes the story, he will not know how to adapt the 
proportions of his story to the actual historical
tempo, since that is knowable only to one who 
knows the outcome. For example, the decisive point 
for transforming the proportions of the historical 
story of the 1951 pennant race was entirely unob-
served, unpredictable, and unpredicted by any con-
temporary observer. On August 11, at the point of
maximum distance between Brooklyn and New 
York, no one foresaw or could have foreseen that 
New York was on the point of beginning a 16-game 
winning streak that transformed the baseball sea- 
son into a pennant race in which New York was 
the ultimate victor. Indeed, the perspective of the 
historical storyteller throughout should be double 
—that of a contemporary observer and that of one 
who knows about Bobby Thomson's home run; but 
it is from the latter perspective, not the former,
that the historian can perceive the historical tempo 
and thus determine the appropriate proportion of 
the historical story. 

Historical analysis. Telling a historical story is 
not the only way in which a writer of history can 
increase knowledge of the past, as we have seen in 
the case of the American League season of 1939. 
Even to the question “How did it happen that . . .?” 
it is not always an appropriate response. Yet in the 
case in point, the response must have a historical 
character. The structure of Figure 2 suggests that 
it should take the form of historical analysis; for 
with its nonconverging lines it indicates that New 
York was a team so much better than any other in 
the league that it was beyond effective challenge. 
Consequently, to increase historical understanding 
there is nothing to do but analyze that betterness 
of New York, to seek out its ingredients and render 
them intelligible to the reader. Here, of course, the 
abundant surviving statistics of baseball provide
a useful historical record to start with—the base- 
on-balls, strike-out, and earned-run averages of 
the pitchers; the batting and total base averages, 
the stolen bases, runs scored, and home runs of 
the hitters. (Fielding statistics, however, do not
provide a satisfactory statistical basis for evalu-
ating defensive performance in baseball.)

If the rhetoric of historical storytelling has re- 
ceived little attention from historians or others, the 
rhetoric of historical analysis has received none. 
When considering historical storytelling, the rhet- 
oric of the fictive story offers a useful model; when 
considering historical analysis, the rhetoric of sci- 
ences in which the subject matter is less compat- 
ible with universal generalization than in physics 
might be appropriate. The almost complete lack of 
any serious concern with the problem may be due 
to the notion that the sciences have no rhetoric; 
but if one conceives of rhetoric as the organization 
of language appropriate to that particular kind of 
communication which is relevant to a particular 
activity, then any activity which is committed to 
verbal communication of its results has a rhetoric. 

In recent years, instead of giving serious consid- 
eration to the serious problems of historical rhet- 
oric, historians have engaged in considerable, and 
sometimes somewhat rancorous, discussion of the 
nonproblem of the relative merits of analysis and 
narrative in history writing. The discussion is foot- 
less because of two false assumptions: (1) that 
regardless of the character of the historical record, 
the historian has a wholly free option between anal- 
ysis and narrative; (2) that these two historio- 
graphic modes mutually exclude each other, so that 
historians in all their work must opt wholly for the 
one or wholly for the other. With respect to the 
first assumption, we have seen in the instance of 
the two baseball seasons that the historical record 
presents us with constellations of events in which 
there is no serious option, where in one case to 
choose analysis, in the other to choose narrative, as 
the predominant mode would be a historiographic 
error and would prevent the historian from commu- 
nicating what he knows about the past, and even 
from knowing it adequately. 

In the second place, these modes are not mu- 
tually exclusive, and this again is evident with re- 
spect to our two “models.” Although the mode of 
writing the history of the 1939 season should be 
predominantly analytical, still the force of the anal- 
ysis would be strengthened by the resort to exem- 
plary stories; and this is particularly the case with 
respect to fielding, in which the New York Yankees 
indeed excelled but which, because of limitations 
in the statistical record, lends itself better to anec- 
dotal treatment, that is, to telling short illustrative 
stories. The case for the use of any analysis at all
in the instance of the 1951 season might seem more 
dubious. The very fact of the tie at the end of the
regular season could be taken as fair proof from
a large sample of 154 games that overall the two
teams were so well matched as to make analysis
an exercise in futility. Actually, an examination of
the record yields a quite different result.

Table 1 — Percentage of games won by the New York
Giants and the Brooklyn Dodgers in the 1951
baseball season

                                      New York   Brooklyn
                                      Giants     Dodgers
Beginning of season to August 11*     54%        66%
August 12 to end of season            84%        54%
Total season (154 games)              62%        62%

* As of August 11 the Giants had played 110 games and the Dodgers 106.

Table 1 shows the percentage of games won out 
of games played by each team up to August 11, and 
from August 12 to the end of the regular season. It 
clearly poses two analytical questions: (a) how to
account for the marked superiority of Brooklyn in 
the first hundred-odd games of the season; (b) how 
to explain the overwhelming superiority of New 
York in the last forty-odd games. For such an un- 
dertaking, as we have seen, analysis is the proper 
historiographic mode. Note that the selection of 
analysis as the dominant mode for the first seven- 
tenths of the season does away with the need for 
telling an inevitably thin story and thus enables 
the historian to maintain the proportion called for 
by the demand of historical tempo. One further 
complication: The analysis would fail to reveal a 
part of what made the difference, the part, told by 
Eddie Stanky more than a decade later, about a 
battered battalion with pulled muscles, bad throw- 
ing arms, and cracked bones that still could not 
lose for winning. And thus, to do the analysis itself 
justice, the historian would have to afforce it with 
historical stories about Alvin Dark, Wes Westrum, 
Sal Maglie, and Stanky's own slides-into-bases that 
were hard to distinguish from overt assault and 
battery. Our two examples were themselves care- 
fully selected extreme cases of records that call re- 
spectively for analysis and storytelling. In most 
history writing, the need for a mix of the historio- 
graphic modes of storytelling and analysis is even 
more obvious. The serious historiographic problem 
is not how to avoid the mix in order to maintain 
the superiority of one mode over the other, but how 
to proportion it and how to manage it. 

In the foregoing “model” of the pennant race of 
1951, the discussion has slipped—the author has 
intentionally allowed it to slip—insensibly from the 
problem of narrative explanation to the problems 
of historical storytelling. In this it has followed the 
curve, as it were, of historical curiosity itself, both 
in the reader and in the writer of history. The orig-
inal question, “How did it come about that. . .?,”
has become the more amorphous “Tell me (or let
me find out) more about. . . .” The demand is no 
longer merely further explanation. A reasonably 
full explanation is presumably already in hand. 
That explanation itself has led reader and writer 
of history alike to shift the ground of their interest. 
Because of it they have become aware that they 
have stumbled onto one of the great events in base- 
ball history, the event that culminated in Bobby 
Thomson's home run—the equivalent (in its 
sphere) of the defeat of the Armada, the battle of 
Stalingrad, the Normandy landings. What they 
want under these circumstances is not more or 
fuller explanation; what they want is confrontation 
with the riches of the event itself, a sense of vi- 
carious participation in a great happening, the sat- 
isfaction of understanding what those great mo- 
ments were like for the ordinarily cool Russ Hodges, 
Giant radio announcer, who, as the ball arched 
from Thomson's bat into the stands, went berserk 
and screamed into the microphone, “The Giants 
win the pennant! The Giants win the pennant! 
THE GIANTS WIN THE PENNANT!” And what 
those moments were like for those who saw what 
he saw and for those who heard him. Confronta- 
tion and vicarious participation are not historical 
explanation or explanation of any sort in any ordi- 
nary sense of the word. Yet clearly they are some- 
times a part, and an indispensable part, of under- 
standing the past as it actually was. Therefore, to 
argue that they have no place in historiography is 
at once arbitrary and absurd. 

Finally, when the historian needs to bring those 
who seek to understand the past into confrontation 
with and vicarious participation in some part of it, 
he often finds the rhetoric of the sciences wholly 
inadequate for his purposes. In this sector of his- 
toriography it is hard to imagine a response to the 
proper demand on historians to render an accurate 
and effective account of the past that would be less 
appropriate than one couched in scientific rhetoric. 
In this sector, indeed, to do his work properly, to 
tell the truth about the past, the historian must 
marshal resources of rhetoric utterly alien to the 
rhetoric of the sciences in order to render his ac- 
count forceful, vivid, and lively; to impart to it the 
emotional and intellectual impact that will render 
it maximally accessible and maximally intelligible 
to those who read it. 


The analysis of historiography 


The attitude of the historical profession to the 
writing of history has been ambivalent. Compared 
with the systematic attention historians have given 
to the techniques of historical investigation, their
attention to the problems of historiography has
been casual, and in their public judgments of the
work of other historians they have tended to regard
the rhetoric of history as at best a peripheral con- 
cern. On the other hand, some very able historians 
take far greater pains with their writing than would 
be warranted if the rhetoric of history were a mere 
pleasing embellishment not substantially involved 
in the advancement of the understanding of the 
past; and the consensus of the profession has rat-
ified their practice by conferring on the most skill-
ful writers of history rewards in prestige and pay 
that would be exorbitant if the yield of that skill 
were judged to be merely an amusing but super- 
erogatory display of verbal pyrotechnics. 

If the preceding arguments about the insepa- 
rability of the communication, and therefore the 
advancement, of historical knowledge from the rhet- 
oric of history have any merit, then it would seem 
that a concerted effort to develop useful methods 
of analyzing historical rhetoric should stand high 
on the agenda of historians. In fact, however, their 
general concerns seem to be directed mainly toward 
two other areas, indeed, toward two nonproblems: 
(1) generalization and (2) the application of new 
knowledge in the social sciences to the study of 
history. The first is a nonproblem because in fact
historians generalize and have generalized fruit- 
fully at many levels for at least a couple of cen- 
turies, so that to raise at this late date the question 
of whether they do so or whether they ought to do 
so seems a little useless. Second, the application to
the historical enterprise of any viable new tech- 
nique for knowing is always desirable and is con- 
ditional only on the mastery of the technique and 
the identification of historical problems to which 
it can be usefully applied. In this respect the social 
sciences do not constitute a special case distinct
from other techniques. Some historians have found 
some of the work of the social sciences useful for
their particular purposes; others have not; still 
others have been preoccupied with other legitimate 
professional concerns. Their resistance to the de- 
mand for immediate and universal application to 
history of quantitative methods and of psychoana- 
lytic insight does not seem to warrant the concern 
that it elicits from those who regard it as a chronic 
and possibly fatal disease of the historical profes- 
sion as a whole. It does not appear to stand much 
above the level that reasonable professional pru- 
dence, sensible and limited skepticism, and resent- 
ment of new encroachments on limited resources 
of time, energy, and ability ordinarily generate. 

Indifference to the problems of analyzing histor-
iography, however, is easy to understand. In the
first place, such analysis may well turn out to be
a sterile exercise. In the second place, the need for 
analysis is evident only if one accepts the view that 
such attributes of historiography as accessibility, 
force, vividness, and depth are not merely decora- 
tive but have true noetic value. Although in unself- 
conscious practice many historians in fact accept 
this view, it remains submerged because of the 
counterthrust of an equally unself-conscious and 
incoherent assent to the ascription of noetic value 
only to the rhetoric of the sciences, especially to its 
denotative vocabulary and to its attributes of pre- 
cision, simplicity, univocality, and so on. 

In order to justify presenting the sketchy pro- 
gram for the analysis of historiography which this 
section will offer, it may be well to indicate the 
noetic bearing of at least one of the potential and 
requisite traits of historiography mentioned above 
—accessibility. In On the Origin of Species Charles 
Darwin says, “Nothing is easier to admit in words 
than the truths of the universal struggle for life, 
or more difficult—at least I have found it so—than 
constantly to bear this conclusion in mind.” The 
rhetorical problem that Darwin here points to is 
that of accessibility. It is a persistent problem for 
explanation in the narrative mode. The writer of 
history needs to be always watchful to see that 
pertinent previous generalizations, pertinent pat- 
terns of action previously identified, and pertinent 
parts of the story, already told, come to bear for the 
reader at the places where they are enlightening and 
revelatory. Even where it is technically accurate, 
dull history is bad history to the extent to which 
it is dull. By subjecting all the historian knows to 
the homogenizing and flattening operations of his 
own mechanical rhetoric, dull history blurs his find- 
ings for himself and for those who read his writing. 
Those findings then fail to become, or rapidly cease 
to be part of, the “workable reserve,” the readily ac- 
cessible knowledge, of the writer and reader, which 
remains concurrently present in their minds as the 
one composes and the other tries to follow the nar- 
rative or analysis. Consequently, in the course of 
events neither will see a partly ordered and pat- 
terned, and therefore partly intelligible, procession 
of change but a disjointed and arbitrary and there- 
fore unintelligible one—just one damn thing after 
the other. A reader to whom almost nothing is com- 
municated may reasonably suspect that the writer 
had almost nothing to communicate; but, as we 
have seen, because of the gap between knowing 
and communicating in history, this is not necessar- 
ily so. Rather, unintelligible communication is not 
communication at all; uncommunicated knowing 
can add no increment to the available body of 


knowledge, and frequently the failure to produce 
such an increment is a failure in historiography, 
the absence of accessibility. 

Accessibility has been treated here as an absolute 
trait of history writing, and of course it is not so. 
It is relative not only to the historian’s rhetorical 
capacities but also to the absorptive capacities of 
the historian’s audience, which depend on their 
prior knowledge. An amount of detail necessary to 
render what he wishes to communicate accessible 
to one audience would simply clutter the text for 
another audience and stultify their imaginations, 
thus diminishing for that second audience the 
range of conceptions that the historian wants them 
to have in mind. The problem that this situation 
poses for the writer of history is a complex one; 
it is another of the many places where the rhetoric 
of the historian intersects and is entwined with the 
knowledge he communicates and the truth-value 
of what he has to say. There is not space to treat 
the matter of accessibility further here, but what is 
said below on the matter of word lists indicates 
some of the ramifications of the problem. 

The analysis of historiography can conveniently 
be divided into macroanalysis, microanalysis, and 
analysis of structure. Macroanalysis is the analysis 
of an individual piece of history writing as a whole; 
microanalysis is the analysis of any fragment of 
historical rhetoric without primary regard to and 
out of relation to the historiographic whole of which 
it is a part. Analysis of structure deals with his- 
toriographic traits, devices, and practices which are 
common to all or to a very considerable number of 
historical works. Hitherto we have keyed our dis- 
cussion of historiography to the rhetoric of scien- 
tific statement and explanation in order to make 
and keep clear the likenesses and differences be- 
tween the two. We have suggested, however, that 
in at least one trait which it requires in order to 
communicate some of the things the historian 
knows—its reliance on a connotative and evocative 
vocabulary—the rhetoric of history is nearer to that 
of the fictive arts than to that of the natural sci- 
ences. So before examining the types of historio- 
graphic analysis, it will be appropriate to point out 
a major difference between historical and fictive 
rhetoric—the overriding commitment of historians 
to fidelity to the surviving records of the past.

Fidelity to records. The difference between his- 
torical and fictive rhetoric blurs slightly at the ex- 
treme limits, where, on the one hand, a novelist 
tells a story which he intends to reflect his concep- 
tion of historical actualities and, on the other, a 
historian makes a story of a “typical case” that he 
imaginatively constructs out of his long experience 
with the historical record. The blurring itself be-
comes visible in a comparison of Conrad’s Nostromo
and Oscar Handlin’s The Uprooted, but so does the 
difference that is blurred. For the worth of Nostromo 
as a novel would not diminish if the patterns of 
life Conrad ascribes to Costaguana, the imaginary 
Latin American republic which provides its setting 
and the substrate of its characters, were shown to 
be quite remote from extrinsic actuality. On the 
other hand, unless the record suggested that the 
immigrants’ emotional response to their move to 
America was something like what The Uprooted 
imputes to them, the historiographic worth of The 
Uprooted would be nil, regardless of any merits 
that literary critics might ascribe to Handlin’s prose 
style. 

The standard of judgment of a fictive work does 
not depend on its compatibility with external ac- 
tuality. The work as such depends for its authen- 
ticity or validity only on its relevance to the sector 
of general human experience which its author in- 
tends it to explore, describe, and render accessible. 
It can be true or false only to itself; and the knowl- 
edge which it communicates is independent of any 
particular in the record of man’s past (though not, 
of course, independent of human experience in gen- 
eral). Or as A. J. Liebling put it, in treating the 
problems of a newspaperman, “To transmit more 
than half of what you understand is a hard trick, 
far beyond the task of the so-called creative artist 
who if he finds a character in his story awkward 
can simply change its characteristics. (Even to sex,
vide Proust and Albertine. Let him try it with Gen-
eral de Gaulle.)”

It is precisely with Charles de Gaulle and his sort 
that reporters like Liebling and historians often 
have to deal. The standard of judgment of a his- 
torical work is ultimately extrinsic. Its authenticity, 
validity, and truth depend on the effectiveness with 
which it communicates knowledge (not misunder-
standing) of the actual past congruent with the
surviving record. The quality of its rhetoric is to 
be measured solely by its success in communicating 
such knowledge. 

Macroanalysis. It follows from what has just 
been said that the unit of macroanalysis in histori- 
ography differs from the unit of macroanalysis in 
fictive studies. In the latter it is the entire particular 
work—novel or drama, ode or sonnet—considered 
as a self-contained unit. The macroanalyst can 
therefore demand of himself an examination of the 
whole relevant documentation and can reasonably 
expect those for whom he is writing to have the 
core element of that documentation (the work un- 
der analysis) before them. Ordinarily the macro- 


analyst of historiography cannot demand so much 
of himself, still less expect so much of his readers.
For him the relevant documentation is the work 
itself plus the historical record of the episodes with 
which the author concerned himself, not merely 
the part he used but any important part that 
through errors of omission he failed to use. It is 
improbable that in most instances the analyst will
command the full range of documentation; it is 
practically impossible under ordinary circumstances 
to expect the reader of the analysis to have the doc- 
umentation in front of him. 

Despite these limitations, some experiments in 
detailed macroanalysis seem desirable because only 
in such analysis does one deal with the actual unit 
of historiography—the historical work. Whether 
that work be a long treatise or a short article, its 
presentation is the means by which by far the 
largest part of the increments of historical knowl- 
edge is communicated. It is also the place where 
historians meet their worst failures—from the nov- 
ices who, having researched their subject, have not 
a notion how to organize it for effective communi- 
cation to those senior historians who have so com- 
pletely surrendered to their own ineptitude as to 
transform verbosity into a criterion of excellence. 
By selecting a relatively short piece of historical 
writing based on a record of manageable dimension 
and reproducing both the piece and the record, it 
would be possible to perform the sort of detailed 
macroanalysis in historiography that is a common- 
place in the field of literary criticism. 

Until a few such analyses are attempted, it is 
impossible to estimate what gains, if any, in the 
understanding of historiography may accrue from 
them; but it is hard to see why the macroanalysis 
of a historical study would be less fruitful of knowl- 
edge than the analysis of, say, Waiting for Godot. 
It is evident that unless such analysis is attempted, 
some aspects of the writing of history are bound
to remain wrapped in mystery. For example, the 
present writer is a reasonably competent practi- 
tioner of history writing, and he has done a reason- 
able amount. One of the most effective sentences 
he has ever written in a historical essay is the fol- 
lowing: “It was just the right thing for him to do.” 
To understand why that simple declaratory state- 
ment composed of flat monosyllables should be ef- 
fective would depend on a careful macroanalysis 
of the entire essay of which it is the last sentence. 

To the best of the writer's knowledge, up to the 
present no macroanalysis of any historical work 
has even been attempted on the scale and in the 
way above proposed. Only after it has been at-
tempted several times will any estimate of its value
be more than an idle guess. In the meantime, the
discussion of historical story writing and historical 
analysis in previous sections of this article points 
to a very few of the problems—proportion of the 
story, historical tempo, balance of analysis and nar- 
rative—with which macroanalysis would have to 
concern itself. 

Microanalysis. Although microanalysis is pri- 
marily concerned with single small items of histor- 
ical rhetoric, the radical severance of it from ma- 
croanalysis is not practically possible. Some sense 
of the whole framework remains essential, because 
only through that sense can one arrive at a judg- 
ment on the ultimate efficacy and appropriateness 
of a given small item of historical writing which is 
a part of a historiographic whole. “The Army of the 
Covenant of the Scots with their God marched 
across the Tweed to rescue their sore beset English 
brethren” is historiographically sound, if its general 
context requires at the point of its introduction a 
quick communication of the spirit in which the 
Scots took the field in 1644. If the total context is 
the logistical problems created by the presence of 
a considerable military force in the agriculturally 
unproductive north of England in the mid-seven- 
teenth century, however, a less allusive statement, 
detailing the number of Scots who entered the 
northern counties and their daily requirement of 
food and forage, would be rather more appropriate. 

For the examination of any single element of 
historical rhetoric, macroanalysis (although not
necessarily on the scale above suggested) is desir- 
able. Only by means of it can one finally judge 
whether that element is appropriate, for its appro- 
priateness is a function of the whole context of 
which it is a part. Ostensible grotesquerie—Alexan- 
der in full plate armor—may be appropriate enough 
if we see the whole picture, as Panofsky showed in 
discussing medieval representations of that hero. 
Because, however, the single historical statement 
has a dual context—both the work of history in 
which it is embedded and the actuality of the past 
to which it refers—it is possible to clarify some of 
the specific characteristics of the rhetoric of history 
by microanalysis considered with minimal refer- 
ence to the total structure of the historical work of 
which the fragment under microanalysis is a part. 
In effect, given a five-page account of the battle of 
Waterloo embedded in a historical work, by refer- 
ring the account to the historical record, it is usu- 
ally possible to say within the limits the historian 
set for the account whether it is historiographically 
sound at the level of microanalysis. On examining 
it in connection with the whole work, however, we 
might alter our judgment on the grounds that in its 


macroanalytic context it is disproportionately long 
or short, that it is dissonant with the rest of the 
book, or even that it is wholly irrelevant. 

Microanalysis of historiography is therefore pro- 
visional in the judgments it yields on the material 
it deals with, but it does at least yield provisional 
judgments. Here we have space to treat only one 
hypothetical example of microanalysis. Let us sup- 
pose a historian faced with the problem of dealing 
in two pages with the character and administration 
of U.S. President Warren G. Harding. One can con-
ceive of his choosing to do it in the style of the late
Ring Lardner and with a rhetoric—a vocabulary 
and syntax—as close to Lardner’s as his own sense 
of historiographic proprieties and that of his editor 
would permit. Or one can conceive of a character- 
ization the whole tone of which was heavily heroic 
in vocabulary and syntax—so long as the under- 
tone made it evident that the verbal heroics were 
mock heroics. What would be wholly inappropriate 
to a brief characterization of Harding and his en- 
tourage would be a rhetoric of intentional, unrelent- 
ing, and unremitting solemnity. On the other hand, 
briefly to characterize Abraham Lincoln in either 
of the former rhetorical modes would not only be 
bad taste, it would be bad historiography; and the 
historian who employed either would promptly be 
marked by his peers as inept and incompetent. For 
Lincoln was a serious man (which did not prevent 
him from being a very humorous one) and a serious 
historical figure, and any attempt to present him in 
a short sketch which failed to reflect this fact would 
to that extent fail to communicate to the reader 
something he needed to understand about the reali- 
ties of a part of the past. It would thereby not only 
fail to advance but perhaps would even diminish 
his knowledge and understanding of the past, his 
grasp of part of its meaning, his store of historical 
truth. 

The implications of this excursus on the use of 
microanalysis of historiography in connection with 
characterizing actual persons of the past are worth 
a little further attention, since one of the persistent 
problems of history writing, calling for microanal- 
ysis, is that of characterization. In effect, in many 
kinds of historical investigation the historian en- 
counters persons in the record of the past. He can 
disregard them as persons and transform them in- 
to, say, numbers; and a demographic historian 
quite rightly does just this, simply because that is 
in fact the aspect under which he encounters them. 
He is like a man trying to get on a full elevator who 
encounters the persons already aboard merely as 
“a full elevator.” But that is not always, or even 
often, the way a historian encounters persons in 
the record of the past. If he encounters them as
persons, an attempt to avoid characterizing those 
implicated in an important way in the account he is 
rendering is a refusal to deal faithfully with the 
record of the past. 

No one whose judgment is worth serious con- 
sideration has ever suggested that historians must 
never characterize people they encounter in the 
past; and it is at least arguable that the normal 
rhetoric of history is such that a historian dealing 
with extensive data on the deeds and words of a 
person of the past cannot avoid characterizing him, 
that the only question is whether he characterizes 
him well or ill, whether he does him justice or in- 
justice. Nor has anyone ever argued that it is 
desirable or indeed even possible adequately to 
characterize a man in the wholly denotative rhet- 
oric that is appropriate to scientific discourse. In- 
deed, the very phrase “do justice,” which is quite 
appropriate to describe the goal of characterizing 
a man, is itself so massively connotative, so indis- 
pensably imprecise, as to render nugatory any hope 
of accomplishing with a sterilized denotative vocab- 
ulary and syntax a mission so vaguely described 
and imprecisely delimited. And yet it is possible by 
microanalysis of historical writing to arrive at judg- 
ments not merely of “bad” and “good” but also of 
“false” and “true” (or at least “truer”) with respect
to the connotative rhetoric which a historian
chooses to employ in fulfilling his commitment to 
do justice to the character of a man. Nor is there 
any great mystery about this in the case of men 
concerning whom the historical record is reason- 
ably ample. Considering the rhetorical possibilities 
as a very broad spectrum and also as a complete 
spectrum within the bounds of the rhetorical poten- 
tialities of the common language structure, there 
will be areas of that spectrum into which what is 
known about a particular man cannot be fit with- 
out manifest distortion of the record and areas into 
which it fairly fits, although in both cases there 
may be several such areas. This was manifestly the 
case in the instances of Harding and Lincoln dealt 
with above. But to distort the record is precisely to
communicate ignorance rather than knowledge, 
misunderstanding rather than understanding, 
falsehood rather than truth.

The only necessary qualification here is that no 
historian does, and no sensible historian claims to, 
communicate the whole truth about a man, since 
there are many things about any man living or 
dead which no human being, not even the man 
himself, knows. The full knowledge on which alone 
a final judgment is possible exists only in the mind 
of God. The facts remain that in certain reaches of 


historiography the characterization of men is ines- 
capable, that the rhetoric of such characterization 
is inescapably nonscientific, and that the knowl- 
edge, understanding, and truth communicated by 
the history of which the characterization is a part 
will in some measure depend on how well or ill the 
historian deploys the resources of this inevitably 
nonscientific rhetoric, on the appropriateness of his 
response to the demands that the historical record 
makes on his ability to use nonscientific language 
in delineating a character. The curious problems 
that this situation implies deserve further exami- 
nation. 

Analysis of structure. The general analysis of 
historiography deals with those traits and devices 
of historical rhetoric which are unique to the writ- 
ing of history, or, more frequently, with those traits 
and devices which historians use in a unique way, 
a way which differentiates them from their use in 
the sciences or in the fictive arts. In the following 
treatment of structural analysis of historiography 
we will focus attention on three devices of the rhet- 
oric of history: (1) the footnote, (2) the quota- 
tion in the text, (3) the word list; and we will con- 
centrate on what differentiates the historian’s use 
of these devices from the use scientists make of 
their homologues in scientific rhetoric. 

Footnotes. Historians and scientists both use 
footnotes, and for one purpose they use them in 
about the same way: they use them to cite to the 
“literature” of the subject or problem about which 
they are writing. Historians, however, also use foot- 
notes in a variety of other ways. One way historians 
use them and physicists do not is to cite to the his- 
torical record, the substrate of evidence on which 
historians erect their accounts of the past. Citation 
to that record is the way a historian makes his 
professional commitment clear in action, as the 
report on the experiment is the way a physicist 
makes his commitment clear. In both instances it
is a commitment to maximum verisimilitude (which 
does not mean exact replication in every detail). 
For the physicist it is maximum verisimilitude to 
the operations of nature as glimpsed through con- 
sideration of the experimental cluster; for the his- 
torian, verisimilitude to the happenings of the past 
as glimpsed through consideration of the surviving 
record.

The well-nigh universal use of footnotes to the 
record by historians indicates that they are all still 
committed to writing about the past, as Ranke put 
it, wie es eigentlich gewesen, as it actually hap- 
pened. In today’s somewhat more sophisticated lan- 
guage, we might say that historians are concerned 
and committed to offer the best and most likely 
account of the past that can be sustained by the
relevant extrinsic evidence. Let us call this state- 
ment about the historian’s commitment the “reality” 
rule.

Historians employ the footnote for a host of re- 
sidual matters other than citations to the record— 
lists of names, minor qualifications of assertions 
made in the text, polemic criticisms of other histo- 
rians, short statistical tables, suggestions for future 
historical investigation, and many more. This raises 
two questions. (1) Amid the apparent chaos of 
“residual” footnotes, are there any rules at all regu- 
lating their use? (2) What is the relation of any 
rules found to the “reality” rule? 

As to the first question, the application of any 
rule about footnotes requires an act of judgment in 
each case, and among historians judgment about 
the uses of residual footnotes differs. It might seem 
that in matters of judgment, as in those of taste, 
there is no disputing. But is this so? Let us consider
an example. 


At Shilbottle, in the case of three separate parcels of 
meadow, 31, 20 and 14 acres respectively, the first 
rendered 42s. in 1415-16 and 30s. in 1435-6, the sec- 
ond 28s. in 1420-1 and 23s. in 1435-6, and the third 
24s. in 1422-3 and 14s. in 1435-6. At Guyzance 6½
husbandlands each rendered 13s.4d. in 1406-7, but 
10s. in 1435-6. 

At Chatton and Rennington, on the other hand, the 
situation was more stable. At Rennington the clear 
revenues were £17.8s.3d. in 1435-6 and £17 in 
1471-2 and at Chatton £40.18s.7d. in 1434-5 and 
£36.18s.7d. in 1472-3. At Chatton the decline was due
to a fall in the value of the farm of the park, from
£6.13s.4d. to £2.13s.4d. . . .


The above passage is embedded in the text of a 
study of the wealth of a magnate family in the fif- 
teenth and early sixteenth centuries and the effect 
on that wealth of concurrent changes in the econ- 
omy, the military apparatus, and the political sit- 
uation in England. Can anyone suggest that em- 
bedding it in the text instead of quarantining it in 
a footnote was not an error of judgment? But to 
say it was one is to imply a rule from which the 
erroneous judgment was a deviation. Can such a 
rule, a “law” of historical rhetoric or historiography, 
be stated? Approximately the rule might go: “Place 
in footnotes evidence and information which if in- 
serted in the text diminish the impact on the reader 
of what you, as a historian, aim to convey to him.” 

So although in the matter of the use of residual 
footnotes judgment is inescapable, we are not at all 
confronted with mere arbitrariness but with a rea- 
sonably precise rule or law. We may name it the 
“maximum impact” rule. Inevitably, marginal situa-
tions exist in which historians disagree about how
to achieve maximum impact or the success of a 
particular rhetorical presentation. The existence of 
such marginal situations, however, does not mean 
that all situations are marginal and that there- 
fore there is no rule, or that any rule is as good as 
any other. Lawyers have a saying that hard cases 
make bad law, but they do not feel impelled there- 
upon to argue that there are no easy cases and no 
good law. Because there are some matters both sub- 
stantive and procedural concerning which they are 
very uncertain, some historians have fallen victim 
to the notion that everything about the past and 
about writing about it is infected with a total un- 
certainty. Yet this is clearly not so in the case of 
the residual footnote, where there was no difficulty 
in finding a rule not heavily infected with uncer- 
tainty. 

What, then, is the relation of the two rules—the 
“reality” rule and “maximum impact” rule—to each 
other? In the example of data that, by the second 
rule, ought to be withdrawn from the text and con- 
signed to a residual footnote, those data are in- 
formative and relevant with respect to the substan- 
tive historical argument the historian is presenting, 
and they are as complete, as explicit, and as exact 
as possible. But the historian is also committed 
to conveying to the reader with maximum impact 
his conception and understanding of the past as 
it actually happened, the “reality” of the first rule. 
And paradoxically, this implies that in the interest 
of conveying historical reality to the reader with 
maximum impact, the rules of historiography may 
require a historian to subordinate completeness, 
explicitness, and exactness to other considerations. 
If this is so, it indeed separates historiography 
from the rhetoric of the sciences as currently 
conceived. 

Quotation in the text. Again, although both 
may quote in the text, there is a major difference 
here between the historians and the physicists. If 
physicists could not quote in the text, they would 
not feel that much was lost with respect to advance- 
ment of knowledge of the natural world. If histo- 
rians could not quote, they would deem it a 
disastrous impediment to the communication of 
knowledge about the past. A luxury for physicists, 
quotation is a necessity for historians, indispen- 
sable to historiography. 

The kind of quotation that historians deem indis- 
pensable is quotation from the record. Again two 
questions arise. (1) Is there any rule governing
quotation from the record? (2) How does that rule 
relate to the “reality” rule? 

Consider a hypothetical case of inept quotation. 
Suppose in writing the history of the Civil Rights
Act of 1964, a historian were to quote verbatim 
from the Congressional Record the entire debate 
on the act in both the House and the Senate. The 
result would be relevant, exact, and accurate—and 
not only the judgment but the sanity of the histo- 
rian would fall under serious question. Again the 
paradox: maximum completeness, accuracy, and 
exactness are not always essential or even desirable 
in the historian’s work of trying to tell the reader 
what really happened. Now consider an adept quo- 
tation taken from E. Harris Harbison’s The Chris- 
tian Scholar in the Age of the Reformation: 


Erasmus had absorbed [Lorenzo] Valla’s historical per- 
spective, his sense of the historical discontinuity be-
tween pagan antiquity and the Christian era . . . a
sensitivity to anachronism. On one occasion he ridi- 
culed the absurdity of the practice . . . of using Cice- 
ronian words to describe an utterly different modern 
world: “Wherever I turn my eyes I see all things 
changed, I stand before another stage and I behold a 
different play, nay, even a different world.” The world 
of Cicero (or of Paul) can be understood and even in 
a sense relived—but only if we recognize that it had 
its unique existence, once, in a past now dead. (1956, 
p. 93) 


The function of Harbison’s brief but apt quota- 
tion from Erasmus is not mere validation or proof 
of his assertions; he could as well have effected that 
by citation or quotation in a footnote. By using 
Erasmus’ own words in the text, he sought and won 
a response not merely of assent but of conviction, 
not just “Yes,” but “Yes, indeed!” Nothing Harbison 
could have said about Erasmus’ sense of history 
could produce the conviction about it that Erasmus’ 
own assertion of his intense feeling of distance 
from antiquity produces. 

The quotation aims at something in addition to 
conviction, however. The quotation communicates 
the historian’s own view of what happened in the 
past by the particular means of confrontation. It 
says in effect, “In my judgment the most econom- 
ical way at this point to tell you what I believe 
Erasmus meant and to convince you that he meant 
it is to confront you directly with what Erasmus 
said.” This provides us with a third general rule of 
historiography—an “economy-of-quotation” rule: 
Quote from the record of the past only when and to 
the extent that confrontation with that record is 
the best way to help the reader to an understanding 
of the past wie es eigentlich gewesen. It is evident, 
however, from the instance of the hypothetical 
case of the Congressional Record that mere con- 
frontation with the record of the past is not 
necessarily the best way to achieve this understand- 


ing or even to achieve historical confrontation. In-
deed, far from being a clear glass window through
which the reader may capture an image of the past, 
quotation from the record injudiciously used can be 
a thick opaque wall that cuts him off from it. 
Granted that confrontation is an appropriate means 
for a historian to avail himself of in his efforts to 
convey to the reader an understanding of what ac- 
tually happened, it then becomes possible to tran- 
scend the paradox previously noted. It opens up 
the possibility that the microscopic means of his- 
toriography have to be adapted to its macroscopic 
ends and that it is part of the task of the writer of 
history to mediate understanding and confrontation 
by devices of the rhetoric of history less direct 
but more compelling, more to the purpose than 
a simple maximizing of completeness, accuracy, 
and exactness. 

The word list. The word list is a device useful 
both in the rhetoric of history and in the rhetoric of 
the sciences. (It is a sequence of words, usually 
nouns, whose relations as members of a set are 
often made evident by a sequence of commas and/ 
or semicolons, the conjunction “and,” or typograph- 
ical arrangement in a table.) Consider the follow- 
ing lists: 


An inert element will not react or enter into chemi- 
cal combination with any other element. In order of in- 
creasing atomic weight the inert elements are helium 
(4), neon (20), argon (39), krypton (84), xenon 
(131), and radon (222). 

The average incomes of only three of the learned 
professions fall into the first quartile of all average 
incomes. In descending order of quartile and rank, the 
average incomes of members of the learned profes- 
sions were as follows: surgeons (1,2), physicians 
(1,4), dentists (1,7), college professors (2,23), high 
school teachers (3,41), clergymen (3,47), grade school 
teachers (3,52). 


The first list is scientific; the second, historio- 
graphic. They are in many respects similar. In in- 
tent the words composing them are wholly denota- 
tive. They are not supposed to cast any shadow, to 
connote or evoke anything. Their arrangement 
(ascending order, descending order) is dictated en- 
tirely by considerations of rational utility. They 
both implicitly relate to an informational frame- 
work equally denotative in intention—the periodic 
table of all chemical elements, the table of average 
incomes of the total population classified by pro-
fession and trade. Both listings aim to achieve a
purpose universal in the rhetoric of the sciences,
common but not universal in the rhetoric of his-
tory. The scientist always wants the state, process,
and set of entities he is dealing with so labeled that 
the labels unambiguously and unequivocally point
to that state, process, and set only. For the scien-
tist’s purpose when he is formally communicating
what he knows, words need to be free of contami- 
nation, of connotation, evocation, and emotive force, 
as sterile as the apparatus in an operating room. 
Otherwise he may find the wires of communication 
snarled and, as a consequence, have to rectify 
avoidable confusion. In this matter the historian’s 
purpose often coincides exactly with that of the 
scientist. It is only under the conditions and with 
a vocabulary of the kind above specified that he can 
to his own satisfaction transmit some of the kinds 
of information and understanding that he intends 
to communicate. Yet even the very close approxi- 
mation to scientific rhetoric exemplified by the fore- 
going historiographic word list deviates from the 
scientific standard in ways that help to differenti- 
ate both the problems and the purposes of history 
from those of the sciences. Consider the question 
“Is not zinc (65) also an inert element?” To answer 
this question one can pour hydrochloric acid over 
zinc. Since one of the yields of this operation is zinc 
chloride (ZnCl₂), a chemical combination or com-
pound, zinc is not an inert element.

The taxonomic system of chemical elements— 
the periodic table—is thus free of ambiguity. Sup- 
pose, on the other hand, the question were raised 
whether clergymen and elementary and high school 
teachers should be included as members of the 
learned professions when the executives of large 
corporations are excluded. The question points to 
doubts about a system of classification that might 
include store-front preachers and graduates of ret- 
rograde teacher-training colleges among members 
of the learned professions while excluding the prod- 
ucts of the better graduate schools of business ad-
ministration. These doubts thus revolve about the
identifying traits of the learned professions and the 
expediencies involved in the selection of any one 
set as against alternative sets of traits for classifi-
catory purposes.

In any developed natural science, expediency in 
the choice of traits for a taxonomic system depends 
on the “importance” of the traits within the bounds 
of that science—e.g., in chemistry, valence and 
atomic weight as against color and taste. And im-
portance is graded by applicability within the
framework of generalizations or “scientific laws” 
that articulate the structure of the science in ques- 
tion and form the basis of its dominant mode of 
explanation. The dominant mode of historical ex- 
planation, narrative, emits no such clear, uniform 
signal for determining importance, and therefore 
in historiography the expediencies of alternative
taxonomic systems often remain equivocal and de-
batable. It is this situation which generates the in- 
terminable discussions among historians about 
whether sixteenth-century monarchies were really 
absolute, whether the Indians in the encomiendas 
in the Spanish colonies were really in servitude, 
whether the owner-operator of a small newsstand 
is really a capitalist. Such discussions seem futile 
because they purport to deal directly with the actual 
character of the past, a historical problem, when in 
fact they are concerned with the relative expedien- 
cies of alternative taxonomic devices for communi- 
cating knowledge of the past, a historiographic 
problem. The problem of taxonomy so considered, 
however, is anything but trivial (1) because classi- 
fication systems both condition effectiveness of 
communication and channel the course of histor- 
ical thinking, and (2) because in the very nature 
of the rhetoric of history, terms like “capitalist,” 
“absolute,” and “learned profession” cannot be ren- 
dered wholly denotative to the consumer of history 
writing. Given the nonscientific values pursued in 
historiography, a historian using such terms will 
have to decide, for the purposes of the story or nar- 
rative explanation engaging him at the moment, 
how much time and effort he should expend in 
separating the connotative values from those terms 
and how important those connotative values are 
for advancing the historical understanding of the 
matter at hand. It is evident, in any case, that the 
general analysis of historical rhetoric involves a 
study of problems of taxonomy in history closer 
than any undertaken up to now. 

One further trait of the above historiographic 
word list needs to be noted: it is either elliptical or 
meaningless. It acquires meaning only if time and 
place are specified, whereas no such specification is 
necessary in the above scientific list. A statement 
whose formal structure and manifest purpose seem 
very close to those that characterize the natural 
sciences illustrates the dominant time-place speci- 
ficity of the rhetoric of history as against the domi- 
nant time-space generality of the rhetoric of the 
sciences. Thus the analysis of a historical word list 
reinforces the conclusion that has emerged time 
and again in the course of this discussion of the 
rhetoric of history: despite occasional likenesses, 
historiography is radically unassimilable to the 
rhetoric of the natural sciences. 

This can be even more effectively illustrated by 
another example of the historiographic use of a list. 


The Christian Revival, that intensification of religious 
sentiment and concern that began long before 1517 
and extended long beyond, in its full span had room 
for Cardinal Ximenes and Girolamo Savonarola; Mar-
tin Luther and Ignatius Loyola, the Reformed churches
and the Jesuits, John of Leiden and Paul IV, Thomas
Cranmer and Edmund Campion and Michael Servetus. 


The names constitute a historiographic list, intend- 
ed to serve a particular purpose of the rhetoric of 
history. It emits a signal, and what the signal says 
to all who hear it is: “Draw on the reservoir of your 
knowledge of the times in which these men lived 
to give meaning to this list.” If that reservoir is al- 
together empty, then inevitably the list will itself 
be historiographically empty, meaningless, a mere 
collection of sounds, just as the sentences about the 
inert gases are empty of meaning to any who have 
no notion of what a chemical element or a chemical 
reaction or atomic weight is. The reason for this 
similarity is that in the present case both the his- 
toriographic rhetoric and the scientific rhetoric pre- 
suppose that the reader already possesses a body of 
precise and exact knowledge of the particular uni- 
verses to which they refer. The scientific and the 
second historiographic statement both conform to 
the “reality” rule; they are meaningless unless there 
are such elements as helium, neon, and argon; and 
unless there were such men as Loyola, Cranmer, 
and Paul IV. Yet the second historiographic list
serves a rhetorical function quite different from 
that served by the scientific list. First, consider the 
order of the two lists. Given the gases’ common trait 
of inertness, the order of the scientific list indicates 
the scientist's normal preoccupation with establish- 
ing scalable differences of homogeneous traits—in 
this case, weight. In the historiographical list, on 
the other hand, no such preoccupation is discern- 
ible, yet the arrangement of the names lies at the 
very heart of the matter. 

Note that there are three alternative ways of 
writing the historiographic list, all of which main- 
tain the essential arrangement, to convey whatever 
information it contains. 


(1) Cardinal Ximenes and Girolamo Savonarola, 
Martin Luther and Ignatius Loyola, the Reformed 
churches and the Jesuits, John of Leiden and Paul IV,
Thomas Cranmer and Edmund Campion and Michael
Servetus. 


(2) The pre-Reformation cardinal who reformed 
the church in Spain, and the pre-Reformation monk 
who was burned at the stake for his reforming efforts 
in Florence; the first great figure of the Reformation 
and the first great figure of the Counter Reformation; 
the cutting edge of the Protestant attack and the cut- 
ting edge of the Catholic counterattack; the most fanati- 
cal prophet of the radical Reformation and the most 
fanatical pope of the era of religious strife; the Prot- 
estant martyred by the Catholics, the Catholic martyred 
by the Protestants, and the martyr who escaped death
at the hands of the Catholics only to receive it at the
hands of the Protestants. 


(3) Cardinal Ximenes, the pre-Reformation cardinal 
who reformed the church in Spain, and Girolamo 
Savonarola, the pre-Reformation monk who was burned 
at the stake for his reforming efforts in Florence; 
Luther, the first great figure of the Reformation, and 
Loyola, the first great figure of the Counter Reforma- 
tion; the Reformed churches, the cutting edge of the 
Protestant attack, and the Jesuits, the cutting edge of 
the Catholic counterattack. . . .


The persons balanced in tension with one an- 
other are the same for all three versions of the list, 
and the arraying is identical in all three. On mathe- 
matical principles, a member of any of the lists 
should be freely substitutable for the correspond- 
ing member of either of the other two, but in writ- 
ing history this is not so. Each list must retain its 
integrity. On what grounds can a historian choose 
among the three? One might argue that the second 
list is preferable to the first since it explicates the 
rationale upon which the persons in the first list 
were arrayed and that, in point of information 
about the past, the third is best of all, since it both 
names the persons and explicates the rationale of 
their array. Yet a reasonably experienced historian 
committed to communicating what he understands 
about the past actually chose the first option—the 
bare list of names with no indication as to his 
grounds for choosing them or for ordering them 
as he did. His choice is explicable when related 
to an earlier observation about the signal ema- 
nated by the list: “Draw on the reservoir of your 
knowledge of the times in which these men lived 
to give meaning to the list.” The writer assumed 
that most of his readers could and would in fact 
draw from their particular reservoirs the items of 
general information in the second and third lists. 

The effect of spelling out that information, how- 
ever, is to emit another kind of rhetorical signal, 
a stop signal: “Stop drawing on the reservoir of 
your knowledge. I have already told you how I 
want you to think about these men.” And this stop 
signal is just what the writer did not want the list 
to emit. The third version of the list is more exact, 
more overtly informative than the bare names in 
the first list, and just for that reason it is more 
empty, less ample. It dams up the informed 
reader’s imagination instead of letting it flow 
freely, bringing with it the mass of connotation 
and association that those names have for him. 
Therefore, to prevent such a blockage the writer 
chose the first list. In doing so, he made a judg- 
ment. He judged (or gambled) that the connota-
tive, evocative list would communicate a fuller
meaning than the exact one, that it would more 
effectively confront the reader with the reality of 
the Christian revival, and that therefore it was the 
more appropriate device for advancing the reader's 
understanding of it. Whether he was correct in his 
judgment or not is immaterial. In setting forth his 
findings, a scientist never needs to make such a 
judgment at all. Scientific rhetoric is purposefully 
constructed to free him of that need by barring 
connotative terms and evocative devices. To a 
scientist the idea that he had to choose between 
a rhetoric of clarity and precision on the one hand 
and one of evocative force on the other would 
be shocking. The idea that the writer of history 
has to select between mutually exclusive ways of 
setting forth the same data and that the knowl- 
edge of history that he conveys depends in some 
measure on his judgment in selecting among al- 
ternative rhetorical devices is perhaps as disturb- 
ing and perplexing. But one is impelled to the latter 
conclusion by an investigation of the peculiarities 
of the way writers of history use footnotes, quota- 
tions, and word lists. 


Codification of historiographic principles 


The whole preceding article may be regarded 
as a prolegomenon to a codification of principles 
of historiography. Its aim has not been to produce 
such a codification but (1) to indicate that it might 
be possible to produce one and (2) to educe a few 
of the rules that would have place there. It has 
been concerned time and again to mark the irre- 
ducible differences that separate the rhetoric of 
history from that of the natural sciences because, 
given the prestige of those sciences and the strik- 
ing similarity of some of the objectives of history 
writing and science writing, there is a danger that 
an attempt to codify the principles of historiog- 
raphy might take the form of a systematic effort 
to reduce as far as possible those principles to the 
ones current in the natural sciences. This indeed 
has already been the outcome of attempts by an- 
alytic philosophers from Carl Hempel to Morton 
White to codify the rules of historiography. The 
outcome of such an effort would be catastrophic, 
not because it would be an utter failure but be- 
cause it would be a partial success. It would suc- 
ceed in codifying rules about a great deal of what 
historians write in a way that would relate it closely 
to what scientists write. It might then be inferred 
that only the part of the rhetoric of history which 
can be articulated with that of the sciences is 
fit for communicating what in the course of 
their researches historians learn about the past,
or that only that part is amenable to codification. 
In the foregoing discussion, however, we have 
already seen that in the writing of history it is often 
necessary to employ language in ways that scien- 
tists quite properly reject for communicating the 
results of their investigations. Therefore, instead 
of extending our knowledge of the past, to limit 
historiography to those statements about the past 
which can be formulated in the rhetoric of the 
sciences would sharply constrict it. 

The rational procedure in attempting to elicit 
general rules of historiography would be, rather, 
to make a series of analyses of the kinds classified 
and discussed in the previous section, taking as 
their subject pieces of historical writing which on 
the basis of a broad consensus of historians have 
been extraordinarily successful in transmitting 
what their writers knew about the past. It is im- 
possible, of course, to predict in substance the out- 
come of such an analytical effort. Because in the 
section on general analysis we were able to elicit 
a few sample rules, it may, however, be possible 
to hazard a conjecture about its form. On the basis 
of that small sample one might conjecture that 
a codification of the rules of historiography would 
resemble a manual of military strategy more than 
a handbook of physics. It would consist of a num- 
ber of maxims generally applicable to the solution 
of recurrent problems in writing history, leaving 
the identification of his particular problems and 
application of the maxims to the experience of the 
trained historian. 

The most important professional use of such 
codification would be precisely in the training of 
historians. Historians do not lack the ability to 
discriminate between historiography which badly 
and inadequately communicates what a historian 
knows and historiography which communicates it 
well. Unfortunately, because that ability is now 
acquired almost wholly through experience rather 
than through a combination of experience and 
systematic knowledge, it is rarely and inefficiently 
transmitted from teacher to pupil. Indeed, the sys- 
tematic training of historians is almost solely given 
over to the transmission of competence in the op- 
erations they must perform before they engage in 
the activity that defines their craft—the writing 
of history. Many historians receive the doctoral 
degree, which is supposed to certify their compe- 
tence in their craft, without ever being compelled 
to rewrite anything they have written after having 
it subjected to rigorous and systematic criticism. 
The chronic ineptitude that hosts of historians dis-
play in their attempts to communicate what they
know is a testimonial to the inadequacy of their 
training in this respect, or to its complete neglect. 
This ineptitude may suggest the desirability of an 
attempt to state coherently at least part of what 
the better historians know experientially about 
writing history and demonstrate visibly in the con- 
sistent appropriateness of their responses to the 
problems of historiographic statement. 


Theoretical implications 


The examination of historiography in this article 
has at various points suggested what becomes quite 
evident in the treatment of narrative explanation 
as against historical storytelling and in the section 
on the analysis of the form of a historical work 
—that the practices of historians in writing history 
may have some peculiar and serious implications 
in that wide area of human concern in which men 
struggle with the difficult problems of the mean- 
ing and nature of knowledge, understanding, and 
truth. The principal relevant points that have 
emerged may be summarized as follows. 

First, historiography is a rule-bound discipline 
by means of which historians seek to communicate 
their knowledge of the past. 

Second, the relation of writing history, of its 
rhetoric, to history itself is quite other than it 
has traditionally been conceived. Rhetoric is ordi- 
narily deemed icing on the cake of history, but 
our investigation indicates that it is mixed right 
into the batter. It affects not merely the outward 
appearance of history, its delightfulness and seem- 
liness, but its inward character, its essential func- 
tion—its capacity to convey knowledge of the past 
as it actually was. And if this is indeed the case, 
historians must subject historiography, the process 
of writing history, to an investigation far broader 
and far more intense than any that they have 
hitherto conducted. 

Third, there is an irreducible divergence be- 
tween the rhetoric of history and the rhetoric of 
science; the vocabulary and syntax that constitute 
the appropriate response of the historian to his 
data are neither identical with nor identifiable 
with the vocabulary and syntax that constitute the 
appropriate response of the scientist to his data.
But the historian’s goal in his response to the data 
is to render the best account he can of the past 
as it really was. Therefore, by his resort to the 
rhetoric of history, regardless of its divergence
from that of the sciences, the historian affirms 
in practice and action his belief that it is more 
adequate than the latter as a vehicle to convey 
the kind of knowledge, understanding, truth, and
meaning that historians achieve. Indeed, instances
were discovered in which, in order to transmit an 
increment of knowledge and meaning, the very 
rules of historiography demand a rhetoric which 
sacrifices generality, precision, control, and exact- 
ness to evocative force and scope—a choice entirely 
out of bounds according to the rules of scientific 
statement. And this implies that in the rhetoric 
of history itself there are embedded assumptions 
about the nature of knowing, understanding, mean- 
ing, and truth and about the means of augmenting 
them that are not completely congruent with the 
corresponding assumptions in the sciences, at least 
insofar as the philosophy of science has succeeded 
in identifying them. 

Historiography has generated a crisis in the cur- 
rently dominant Anglo-American school of phi- 
losophy, the school that has as its main subgroups 
logical positivism, the philosophy of science, and 
language analysis. That it has done so is evident 
from a cursory examination of the index of one 
of the more recent works on history writing by an 
analytical philosopher (Danto 1965). Besides the 
philosophical magnates, living and dead, tangen- 
tially involved in the dialogue—Ayer, Bradley, 
Dewey, Hume, Kant, Lewis, Peirce, Ryle, Russell, 
Wittgenstein—the index mentions Agassi, Danto, 
Donegan, Dray, Gallie, Gardiner, Gellner, Hempel, 
Mandelbaum, Nagel, Passmore, Popper, Scriven, 
Walsh, and Watkins, all of whom since 1940 have 
directly confronted the problems that in their view 
historiography poses for philosophy; and the list 
is by no means complete. The close attention that 
this group of philosophers has directed to history 
writing is especially significant because of their 
central preoccupation with the way in which lan- 
guage communicates knowledge, understanding, 
meaning, and truth. In the broadest sense this 
large collective enterprise has been trying to define 
the relationship between the practices of writers 
of history and the nature of knowledge, under-
standing, meaning, and truth, especially as re-
vealed in the structure of scientific rhetoric. 

The preceding prolegomenon to an inquiry into 
the rules of the rhetoric of history provides a clue 
to the character of the crisis (symptomatically 
marked by the profusion of their output on the 
subject) with which the writing of history has 
confronted the analytical philosophers. History has 
posed for them a very difficult puzzle. Most his- 
torians in theory, all in practice, treat their subject 
as if through their current methods and their 
current rhetoric they were achieving and trans- 
mitting increments of knowledge about it. That is
to say, they declare that if a piece of historical 
work is well done and properly set down, readers
will know more about the past after they have read 
it than they did before. And for practical purposes 
very few people have seriously doubted the pro- 
priety of this claim (the few that do, appear to 
have read very little history). And yet historiog- 
raphy—the forms of statement historians adopt, 
their rhetoric—does not seem to fit into the sign 
structure suitable for scientific explanation, the 
classical rhetoric for communicating increments 
of knowledge, and most historians have been either 
indifferent or actively hostile to the notion that 
in the interest of rendering an account of the past 
as it actually was, they ought to elaborate and 
consistently employ such a sign structure. It is 
with this paradox that so many analytical philoso- 
phers have tried to deal systematically since 1940. 

The course of this large collective effort is far 
too complex and has had too many ramifications 
to be dealt with here in detail. Briefly, the initial 
supposition (Hempel 1942), set forth above, was 
(1) that the universal valid model of explanation 
is that of the natural sciences, (2) that this con- 
sists in linking an event to general laws in such 
a way that the event is entailed by the laws through 
strict deduction, (3) that any activity of a historian 
that does not achieve this end does not explain 
anything, (4) that although in some instances 
historians can perform the necessary operations, 
they rarely do so, and therefore (5) that by and 
large in most of their actual operations historians 
explain nothing. Twenty-five years of intensive dis- 
cussion by analytical philosophers has taken off 
from, and resulted in a number of proposals for 
modifying, this rigorist position. It seems likely 
that the dis-ease which some of these modifications 
manifest is in part the consequence of an often 
unarticulated sense on the part of analytical phi- 
losophers who have read history books that the 
actual procedures of historians, for knowing, un- 
derstanding, and giving an account of the past 
as it actually was, do achieve their explicit or 
implicit purpose. In the course of bringing about 
a partial confrontation of the general-law theory 
of explanation with historiography, the analytical 
philosophers discovered a number of facts about 
the latter which because of their apparent devia- 
tion from the general-law or scientific model caused 
them perplexity. Among these were (1) that for 
many purposes of “explanation why,” historians 
do not resort to general laws but to truisms; (2) 
that when historians are confronted with the 
question “Why?” their frequent, indeed normal, 
impulse is not to recite or seek relevant general 
laws, as a scientist would do, but to tell a story, 
and that such a story often seems to provide a satis-
fying answer to the question “Why?” while a gen- 
eral law does not; (3) that the questions historians 
are often most heavily engaged in answering are 
not why-questions at all but what-questions (and 
also, one might add parenthetically, who-, when-,
and where-questions); and (4) that a great deal 
of the activity of historians can be construed as 
having explanation as its aim only by so far ex- 
tending the meanings of explanation current in 
analytical philosophy as to destroy even the ap- 
pearance of synonymy and to impose well-nigh 
unbearable strains even on analogy. 

In the process of coping with these problems, 
the analytical philosophers have produced a series 
of solutions, not always coherent with each other, 
of considerable interest to themselves but appar- 
ently of very little interest to historians as such. 
The character of most of these answers (and 
perhaps the explanation of their lack of interest 
for historians) may properly be described as “as- 
similationist.” The common characteristics of these 
assimilationist answers are first to seek out all traits 
of historiography that can reasonably be identified 
with or assimilated to the model of scientific ex- 
planation by means of general laws; then to make 
epicyclic modifications of the general-law structure 
of explanation to accommodate some of the more 
evident deviations of historiography from its pat- 
tern, always holding such modifications to the 
minimum; and finally wholly to prescind from 
some of the most evident traits of historiography 
on the ground that they are irrelevant to the quest 
for knowledge, understanding, and truth. The last 
procedure, which for present purposes is of the 
most importance, is illustrated by a passage in 
Morton White’s Foundations of Historical Knowl- 
edge, a passage of special interest because among 
the practitioners of analytical philosophy White 
alone is also a practitioner of historiography at a 
very high level of excellence. 


The historical narrative, the extended story, is so large 
and rambling by contrast to the single sentence treated 
by the logician that any effort to treat it as a repeat- 
able and identifiable pattern of language may give an 
impression of remoteness and distortion well beyond 
what might be felt by the historian who finds his 
causal statements cast in a single syntactical mold. On 
the other hand, the very qualities of narrative which 
might lead a historian to think that logical analysis 
distorts it are those that might inhibit a logician from 
trying to discern its structure. The complexity and va- 
riety of narrative, the fact that one story seems so dif- 
ferent in structure from another, may give both the 
romantically minded historian and the classically 
minded logician pause. Yet the vast differences that 
human beings exhibit do not prevent us from X-raying
them in an effort to discern the skeletal structure that 
each of them possesses. . . . History is a literary art as 
well as a discipline aimed at discovering and ordering 
truth, and if we neglect some of the narrative’s literary 
qualities in order to clarify certain epistemological 
problems connected with it, our procedure is like that 
of the sane roentgenologist, who searches for the skull 
without denying that the skin exists and without deny- 
ing that the skin may vary enormously in color, tex- 
ture, and beauty. (1965, pp. 220-221) 


The equation here is at once interesting, dubious, 
and exemplary of the assimilationist posture above 
described. Before presenting it, by the addition 
of a couple of rather large epicycles, White had 
already assimilated several common traits of his- 
toriography as closely as possible to the general- 
law model of scientific discourse. The quotation 
announces his intention to do the like with another 
major trait, the storylike character of much history 
writing. But he knows this is going to leave him 
with a very large residue of what historians write 
still on his hands. This residue consists in part of 
matters that the analysis of historical rhetoric in 
this article has called attention to. He disposes of 
this uncomfortable residue by assigning it to his- 
tory as “a literary art” rather than history as “a 
discipline aimed at discovering and ordering truth.” 
In this connection his analogy with roentgenology 
is not wholly fortunate. For it is at least arguable 
that the knowledge that history makes accessible 
is no more fully revealed by the mere skeletal 
structure of its narrative than the knowledge of 
the human head is fully exhausted by what an 
X-ray plate shows about its mere ossature. Just as 
it may be suggested that while the human head 
is partly a bony structure and partly (sometimes)
a thing of beauty, it is also a number of other 
things, too, so it is possible to grant that history 
is a literary art while denying that all those aspects 
of history writing which White consigns to that 
function and that function alone are actually irrele- 
vant to the function of history as a discipline aimed 
at discovering and ordering truth. It has been one 
purpose of the foregoing article to suggest some 
grounds for such a denial. 

The mention of epicycles in the preceding para- 
graphs provides us with a clue to the dis-ease 
of analytical philosophy with historiography, a 
dis-ease so acute that it has become an intellectual 
disease which may be a prelude to a general 
and deep intellectual crisis. Epicycles suggest the 
Ptolemaic system of astronomy, and examination 
of the ultimate crisis of that system may by analogy 
help toward an understanding of the current crisis 
with respect to knowing, understanding, explain- 
ing, and truth that the study of historiography has
induced in analytical philosophy. The crisis in the
Ptolemaic system came in the sixteenth century, 
when it was destroyed by the Copernican revo- 
lution. For centuries before, it had been normal 
science. Following the terminology of Thomas 
Kuhn, it was structured about several paradigms, 
among which were (1) the earth is at the center 
of the cosmos; (2) the earth does not move; 
(3) the orbits of all heavenly bodies are circular; 
(4) the circular motion of each heavenly body is 
uniform in rate. These paradigms were invoked 
to support an area of pre-Copernican science far 
more extensive than celestial mechanics; and to 
save this science, the observed deviations of the 
planets from presumed circular orbits and uniform 
speeds were dealt with by an ingenious but exceed- 
ingly intricate system of epicycles, eccentrics, and 
equants. The Copernican revolution was initiated 
by Copernicus’ allegation that this system could 
be greatly simplified by assuming that the earth 
was not at the center of the cosmos and that it 
moved around the sun annually and rotated on its 
own axis daily. 

Certain facts about the Copernican revolution 
are worth noting in the present context. (1) Coper- 
nicus’ own work by no means provided a wholly 
satisfactory solution to the difficulties it sought to 
deal with. (2) Although the third and fourth para- 
digms of Ptolemaic astronomy—those dealing with 
the orbit, shape, and speed of the planets—ulti- 
mately crumbled under the impact of the revolu- 
tion Copernicus started, he had no intention of 
displacing them and in fact held firmly to them. 
(3) Copernicus’ heliocentrism and geomobilism 
implied the destruction not merely of Ptolemaic 
celestial mechanics but of other large tracts of the 
science of his day; what he offered in place of what 
he destroyed was, however, unsatisfactory. In 
some matters he offered nothing in place of it, and 
in others he does not seem to have been aware that 
he had destroyed it, so that overall for a long time 
the old normal science provided better explanations 
of many phenomena than the Copernicans did. 
(4) For all the above reasons the marginal ad- 
vantage of the Copernican over the Ptolemaic 
celestial mechanics was not at all clear, and the 
conservatism of those who continued to adhere to 
the older scientific paradigms for a long time is 
quite intelligible. 

Let us now apply this analogy to the crisis that 
has confronted the analytical philosophers as a 
result of their explorations of historiography. Until 
and except for that confrontation, their para- 
digms—essentially the modes of rhetoric they 
ascribed to the sciences—provided them with a
reasonably satisfactory way of understanding and 
rendering intelligible the syntactical structure and
vocabulary of a language capable of conveying 
frequent increments of knowledge, meaning, and 
truth—the language of the natural sciences. From 
this fact of experience they assumed that all know- 
ing, meaning, and truth can be incorporated into 
statements in their paradigmatic rhetoric and that 
nothing that cannot be reduced to that rhetoric 
can claim a place in the region of knowing, mean- 
ing, and truth. During their 25-year confrontation 
with historiography they have discovered one 
anomaly after another in the rhetoric of history, 
place after place where it appears to deviate from 
the language of the sciences. Their most general 
response has been to try to save their normal view 
of the nature of knowing, meaning, understanding, 
and truth and of the proper rhetoric for communi- 
cating them by constructing a complex structure 
of the logical equivalents of epicycles, eccentrics, 
and equants in order to assimilate to it as much 
of the rhetoric of history as possible and thereby 
save the paradigms which support that structure. 
This procedure has been less than satisfactory, 
since it requires them quite arbitrarily and without 
evidence to assign to many traits of the rhetoric 
of history an altogether aesthetic rather than a 
noetic function. This has been especially the case 
with respect to those aspects of the work of the 
writer of history which concern themselves with 
the telling of a historical story and with the dis- 
position and arrangement of his evidence and the 
choice among alternatives, all connotative rather 
than purely denotative, for the communication of 
what he knows. The philosophers have proceeded 
as they have for the very good reason that to 
do otherwise would be to raise extremely perplex- 
ing questions about the nature of knowing, under- 
standing, meaning, and truth to which, as of now, 
neither they nor anyone else has any very plausible 
answers. 

One of the aims of this article is to suggest that 
nevertheless a paradigm shift which would raise 
such questions may now be desirable and even 
necessary. The first step would be to assume that 
the rhetoric of history, including much to which 
analytical philosophers assign only aesthetic value, 
constitutes an appropriate response on the part 
of historians to their commitment to advance the 
knowledge and understanding of the past as it 
actually was. There are no better reasons for rejecting
this assumption than for making it; logically
to reject it or to accept it involves decisions
equally arbitrary. It has, however, a certain prima- 
facie empirical plausibility; it is based on an un- 
invidious view of the consistent refusal of some 
of the very best historians dedicated to communicating
the truth about the past wholly to adopt
the rhetoric of the sciences. To start with this fa- 
vorable assumption would provide room among the 
means of knowing for certain rules of historiog- 
raphy concerned with the advancement of knowl- 
edge, for which there seems to be no room within 
the present structure of knowing as the analytical 
philosophers conceive it. 

It would by no means open the way for the 
sort of intellectual slatternliness that analytical 
philosophers rightly object to and oppose. On the 
contrary, it would assist in the introduction of 
some much-needed conscious intellectual rigor 
into regions in which rigor has often been sadly 
lacking or in which its presence has been due 
to the experience and temperament of particular 
historians rather than to thoughtfully codified pro- 
fessional standards of performance. But to do all 
this requires an acknowledgment and acceptance 
(1) that in some areas of human inquiry the pur- 
suit of truth can be effectively carried on only by 
means of a rhetoric which diverges from that of 
the sciences and (2) that this is not wholly due 
to the peculiarities and perversities of those who 
pursue the truth in those areas but in part to the 
very nature of the terrain over which they must 
pursue it. Once analytical philosophers fully recog- 
nize that there may and indeed must be more than 
one style, one rhetoric, for communicating the 
things that are both knowable and communicable, 
and that the problem is not that of reducing all 
styles to one but of carefully investigating what 
style is appropriate to the particular problems of 
communication inherent in a particular kind of 
knowing, it will be possible to bridge the now 
ever-widening gaps that separate analytical phi- 
losophers, historians, and rhetoricians. They might 
then join in trying to discover whether a thorough 
exploration of historiography, the rhetoric of his- 
tory, can teach them anything worth finding out 
about knowing, understanding, meaning, and truth. 


J. H. HEXTER 


[See also SCIENTIFIC EXPLANATION; HISTORY; PERIODI- 
ZATION; SCIENCE.] 
II
AFRICAN HISTORIOGRAPHY 


A belief in the continuity of life, a life after 
death, and a community of interest between the 
living, the dead, and the generations yet unborn 
is fundamental to all African religious, social, and 
political life. Thus, although the serious writing 
of African history has only just begun, a sense of 
history and tradition has always been part of the 
African way of life. 

The ancient Egyptians were very conscious of 
the continuity of life and death. They were con- 
scious not only of the overriding importance of an 
after-life but also of the continuing relevance of 
the dead to the living. They prepared the burial 
chambers and preserved the bodies of the dead 
with great care. The essence of the Horus myth 
was that the dead, particularly the kings, con- 
tinued to influence the life of the living by af- 
fecting the annual inundations of the Nile and 
the germination of crops. A good deal of Egyptian 
religion revolved around the commemoration of the 
dead. Impressive monuments were erected, and the 
priests who ministered in those shrines became 
very knowledgeable about the traditions and folklore
of the past.

This fundamental belief in the continuity of life 
is found among all African peoples. It is an es- 
sential element in traditional African historiog- 
raphy. Throughout sub-Saharan Africa one finds 
the continuing relevance of the dead to the life 
of the present and of future generations. It is ex- 
pressed in the belief that each community was 
founded by an ancestor or a group of ancestors, 
that whatever the status or possessions of the com- 
munity, they were owed to the ancestors, and that 
the ancestors had established for all time the basic 
charter of life, which could be adapted and modi- 
fied but which could not be completely changed. 
The ancestors and the tradition they represented 
were a living reality. Reverence for the ancestors 
sometimes became worship. The fear of “what 
the ancestors would say” was an ever-present con- 
sideration and one of the most powerful sanctions 
in African societies. 


Traditions of origins 


Each community—family, clan, village, town, 
or state—however large or small, had an estab- 
lished tradition concerning its origins. The com- 
munity might split up, migrate, and assimilate new 
elements, or be conquered by others and absorbed 
by new immigrants. At each stage of transforma- 
tion, the tradition was recrystallized to accommo- 
date changed conditions, and a new tradition of 
origin was formulated by the new community. 
These traditions became the core of the commu- 
nity’s view of history. The very process of tradition- 
making and acculturation in the community, and 
of transmitting tradition to succeeding generations, 
developed a consciousness of history that became 
widespread in Africa. 

These origin traditions did not attempt a histori- 
cal explanation in the modern European sense of 
verifiable texts and chronology. They promoted 
understanding of and a respect for the institutions 
and practices of the community. They offered ex- 
planations of the world as the community con- 
ceived of it—the origin of land and sea, man and 
the other species, the origin of the state, the basis 
of different laws and customs, the title of the 
community to the land it held, how and why it 
differed from its neighbors in the gods it wor- 
shipped and the customs it upheld—but the ex- 
planations were not so much historical as philo- 
sophical, literary, and didactic. Accurate chronology 
and causation were of little relevance. To a large 
extent, history was merged with myth and was 
part of the general philosophy of life. In this way 
traditional African historiography resembled that
of Europe before the scientific revolution split phi- 
losophy into its component parts. The making and 
transmission of tradition was not the work of 
historians in the modern sense, but of priests and 
diviners, elders, and wise men in general. Tradition 
explained not only the relationship between the 
ancestors of different communities but also the re- 
lationship between the existing community, the 
ancestors, and the gods. It was expressed not only 
in narrative but also in sacred poetry, in ritual re- 
enactments, and other religious manifestations. 
Tradition was part of the philosophy of the com- 
munity, part of its own peculiar way of life. Thus 
there could be no concept of universal history ex- 
tending beyond the life of the particular commu- 
nity. 

The making and the transmission of tradition 
varied from place to place. It depended on the size, 
nature, beliefs, and resources of the particular 
community. In segmented societies where roles were 
often not differentiated, it was part of the functions 
of the clan head as he fulfilled specific political and 
religious roles. In organized states, however, par- 
ticularly those with centralized monarchies, such 
as Benin, Ashanti, or Dahomey, where the political 
and legal implications of tradition were of every- 
day importance, the making and transmission of 
tradition became a controlled and well-regulated 
specialism. 


Oral transmission 

The commonest method of transmitting tradi- 
tion was through stories, fables, and proverbs told 
by the elders as part of the general education of 
the young. In those story-telling sessions, after the 
evening meal in the family compounds or during 
festivals of the full moon when people stayed up 
late, traditions telling of the origin of the whole 
community or of the particular family or clan were 
related. More recent events that had occurred 
within remembered history, particularly those of 
the previous two or three generations, were also 
talked about. 

Traditions were transmitted more formally when 
there were organized educational institutions such 
as those connected with puberty rites, initiation 
into age-grades and secret societies, or during the 
training or apprenticeship of priests and diviners. 
The rites by which the king-elect was initiated into 
the kingship were of particular interest. As the 
successor to and representative of the ancestors, 
the king became the custodian of the traditions of 
the community. One of the most important functions
of the rites preceding his coronation, therefore,
was to initiate him into the secrets of his
ancestors and the traditional lore of his people. 
More than that, partly as ritual offerings to the 
ancestors, partly as entertainment and education 
for the people at large, the traditions of the people 
were recited publicly; symbolic events from the 
past were dramatized; the names, genealogies, 
cognomens, titles, and praise songs of the ancestors 
were chanted. The new king announced his own 
title. The title frequently was intended to charac- 
terize the expectations of his reign. The praise 
singers responded, and a new chapter in the peo- 
ple’s tradition opened. 

Thus the process of transmitting tradition can- 
not be separated from the process of creating tradi- 
tion. Tradition was made by those who transmitted 
it—the village and clan elders, the singers and 
drummers who assisted at coronations and other 
public festivals, the officials of age-grades and 
secret societies who conducted the puberty rites or 
initiated new members. These persons were ap- 
pointed sometimes from among eligible members 
of a single family, sometimes from a wider circle 
of candidates who had to demonstrate their mas- 
tery of past traditions and folklore and their wit 
at entertaining people and making tradition out of 
contemporary affairs. In the daily life both of the 
community and of individuals, tradition was in- 
voked, challenged, modified, and created in the 
processes of litigation, of settling disputed succes- 
sions to all types of offices, of recording new events 
and new situations that resulted when masterful 
individuals or communities made innovations in 
the guise of merely fulfilling tradition. 


Factual versus literary traditions 


It is necessary to distinguish between different 
types of tradition, or oral tradition, as it is now 
generally described (Vansina 1961). The first dis- 
tinction is between traditions of a factual and his- 
torical type and those of a literary and philosophi- 
cal type. Of the more factual type, it is important 
to distinguish between events of the recent past 
that can be recalled from the personal experience 
of grandfathers and fathers and events of the more 
distant past. The authenticity of these remote tra- 
ditions is in large part dependent on the special 
institutions that may have existed for making and 
transmitting tradition and on how efficiently these 
institutions functioned. Such factual traditions 
include formal lists of kings and other office 
holders, chronicles of each reign, appellations and 
praise verses (which often included either direct 
or implied criticism) of each king or other leading 
chiefs, genealogies, and certain laws or customs. 


But even traditions that appear well preserved and 
factual may turn out on examination to be sym- 
bolic; apparent biographical data may represent 
summaries of the life of the whole community over 
one or more generations. References to years, gen- 
erations, and periods may in fact refer to struc- 
tural and not chronological time. 

The more literary types of tradition include 
proverbs and sayings, songs and lyrics, of which 
some are general and others peculiar to particular 
guilds, age-grades and other associations. The more 
philosophical traditions are enshrined in the sacred 
chants of different religious organizations and 
cults, such as the praise verses of the gods, divina- 
tion poems, funeral dirges, liturgies, and hymns. 
Finally, one should distinguish between traditions 
narrated in the words of the speaker and those with 
set and formal texts. 


Ethiopian influence 


There have been, of course, other historical traditions
in Africa whose influence on African historiography
is difficult to evaluate at the present state
of our knowledge. One important example is the 
Ethiopian historical tradition, partly African and 
partly of Judaeo-Christian inspiration. The supremacy
of the Solomonid dynasty, the unity of church
and state, and the integrity of the monophysite 
church were the dynamic historical forces. As in 
other parts of Africa, in the twelfth century Ethio- 
pia developed a legend that linked the ruling 
dynasty with the Holy Land. But it was a written 
tradition, enshrined in the Book of Kings that be- 
came a major feature of coronation rites. The 
monasteries recorded the annals of each reign and 
preserved important texts and charters. Yet the 
primary interest of Ethiopian intellectual life was 
theological, not historical; there was little attempt 
until recent times to analyze and interpret the 
annals and the chronicles to produce history. Of 
more relevance to African historiography are the 
traditions of the Berbers. Like other Africans, the
Berbers were very conscious of the continued rele- 
vance of the past. In their reaction both to Roman 
Christianity and Arabian Islam they manifested an 
attitude of mysticism and dissent combined with 
the veneration of ancestors. It may be said that 
this attitude produced hagiography and not critical 
history, but hagiography itself was a method of 
enshrining and immortalizing the idealized social 
and religious virtues of the people. It was, in a 
sense, the literary expression of respect for the 
norms and virtues of ancestors, similar to the 
traditions found in other parts of Africa. The trac- 
ing of real genealogy, always of great importance 
to the Arabs, has remained a marked feature of
Muslim Africa, but, in addition, the Berbers intro- 
duced veneration of ancestors in the form of trac- 
ing spiritual genealogies of Muslim leaders. This 
was expressed in the tariqa chains and the handing 
down of the wird among Muslim Brotherhoods. 
Initiation into the tariqa conferred spiritual knowl- 
edge and benefits; this paralleled initiation into 
traditional African associations, guilds, and cults. 


Muslim influence 


Muslim influence was important not only in 
north Africa but also in east Africa, the whole of 
the Sudan, and even in a few places within the 
forest areas. In addition to real and spiritual gene- 
alogies, Muslim writers produced a number of 
tarikhs and chronicles, especially in the eleventh 
to the seventeenth centuries. These embodied eye- 
witness accounts, oral tradition, and evidence from 
earlier accounts by geographers, travelers and 
traders. Muslim writers were particularly interested 
in the spread and influence of Islam and in the 
religious and the economic life of the main centers 
of Islam. These factors were singled out of the
totality of African life and traditions and given 
undue emphasis. In important centers of Muslim 
learning like Timbuktu, Gao, Djenné, Kano, Kat- 
sina, and Bornu in the west and central Sudan, or 
Kilwa, Malindi, and Mombasa in east Africa, tradi- 
tions of the people were written down, mostly in 
Arabic but sometimes also in the Arabic transcrip- 
tions of the vernacular. The accounts centered on 
leading personalities of the Muslim community 
rather than traditional states or clans. Rulers were 
judged good or bad insofar as they extended or 
hindered the influence of Islam and the privileges 
of Muslim scholars. 

It is now recognized that the Prolegomena of 
Ibn Khaldūn, the famous Tunisian scholar of the 
fourteenth century, is one of the world’s most 
important works on historiography. He emphasized 
the importance of sociology to history. He sought 
to study the past not only in terms of the actions 
of individuals but also by an analysis of the laws, 
customs, and institutions of the different peoples 
as well as the interaction of state and society. This 
approach would have provided a basis for synthe- 
sizing the many traditions in Africa into a history 
of the continent, but until recently, his work had 
little influence. The contemporary medieval world 
was more interested in the study of theology, law, 
and rhetoric. Within Africa itself, Islamic scholar- 
ship declined and was at its lowest ebb in the 
eighteenth century. It was in this period that the 
study of history was revived in Europe, but because 
of the abundance of written documentary evidence,
European scholars did not adopt the sociological 
approach of Ibn Khaldūn, but concerned themselves
mainly with the decrees, wars, and politics
of kings. In the nineteenth century the legal and 
biographical approach was broadened to take social 
and economic factors into account, but documen- 
tary evidence had become so overwhelmingly im- 
portant for the European scholar that he tended 
to equate written documents with history. The ab- 
sence of documents was thus taken to mean the 
absence of any events worthy of historical study. 


European influence 


In the nineteenth century, when European influ- 
ence intruded into Africa, it did not build on the 
existing historical traditions but challenged and 
supplanted them. The European view of documen- 
tary history supported the propaganda of the 
colonial rulers: Africa had no history worthy of 
record; therefore the history of European traders, 
missionaries, explorers, conquerors, and rulers con- 
stituted the sum total of African history. European 
history and the history of European expansion 
began to displace local history and tradition in the 
education of the African youth, although some 
attention was given to Arabic and other sources. 
European historians of the late nineteenth and 
early twentieth centuries sought to explain the 
Atlantic slave trade, the European technological 
supremacy, and the subjugation of Africa not in 
terms of any historical study of the continent but 
in terms of racial and psychological prejudices 
about the inherent inferiority of people with dark 
skins. Missionary circles even introduced the reli- 
gious explanation that Africans were the children 
of Ham and were under Noah’s curse to be hewers 
of wood and drawers of water for their lighter- 
skinned brethren. African historiography thus be- 
came nothing more than a justification of European 
imperialism. 

Africans who became literate in the European 
languages at first accepted these theories, but be- 
cause the traditional African historical conscious- 
ness remained alive, they soon began to challenge 
the absurdities of the European historians. Some 
of them who did not necessarily question the basic 
European standpoint began, nevertheless, to record 
the laws, customs, proverbs, sayings, and historical 
traditions of their own communities. They recorded 
also the major events of the nineteenth century, 
especially in the period just preceding the establish- 
ment of European rule. Notable among such 
writers may be mentioned James Africanus Horton 
of Sierra Leone, Carl C. Reindorf and John M. Sarbah
of Ghana, Otomba Payne and Samuel Johnson
of Nigeria, and Apolo Kagwa of Uganda. For a long 
time the work of these men and the material in 
missionary and government records on specifically 
African peoples escaped the attention of historians. 
It was the anthropologists who discovered them, 
but they were then not interested in history. With 
a few notable exceptions like Franz Boas and Emil 
Torday, their primary concern was to describe the
quaintness and the peculiarities of “tribes,” to jus- 
tify as well as to facilitate the imposition of colo- 
nial rule. Other literate Africans, more conscious 
and more resentful of the European standpoint, 
began to protest. But since they were restricted by 
the axiom that only documentary history is real 
history, they could only reply with polemics and 
counter-theories. Uncritical use of the few written 
sources available, particularly those of the ancient 
Greek writers on Africa, the Muslim geographers 
or travelers, and the European traders on the west 
and east African coasts, laid the foundation of new 
myths. An example of such a myth is the so-called
“Hamitic hypothesis,” the view that all light and 
civilization in sub-Saharan Africa had come from 
north Africa and the Middle East and that the 
civilizations of Africa are thus civilizations of the 
Hamites; another myth regards the influence of 
the Atlantic slave trade as so all-pervasive that it 
can explain all major trends in African history 
since the fifteenth century. 

World War I, the Russian Revolution, the rise of
the Indian National Congress, the increasing dis- 
satisfaction with the facile theory of European 
superiority—all these encouraged a new appraisal 
of the African past. African works of art, looted 
from Africa and scattered all over Europe and 
North America, inspired new art forms in Europe. 
New masterpieces of sculpture and terracotta were 
discovered, and their African origin became more 
difficult to deny. Anthropologists became less tied to 
the colonial regimes, less ethnocentric, and conse- 
quently more appreciative of African culture and 
historical traditions. The International Institute of 
African Languages and Cultures in London began 
to publish Africa, and the French established the 
Institut Français d'Afrique Noire in west Africa.
The Journal of Negro History, in the United States, 
primarily concerned with the history of the Negro 
in America, drew attention to the possibilities of
African history. Anthropologists like Melville J. 
Herskovits began to take African culture and its 
survivals in the New World seriously and to seek 
understanding of the African past. 

The impulse toward a new African historiography
came with the movement toward independence,
which gathered pace in Africa during and
immediately after World War II. This nationalist
movement firmly rejected the European appraisal 
of the African past and demanded a new orienta- 
tion and improved educational facilities to produce 
this reappraisal. With the establishment of new 
universities in Africa, it was inevitable that the 
teaching of history and the training of African 
historians would receive special attention. The old 
theories were maintained at first: besides European 
history, there were courses only on “European 
Activities in Africa” and some postgraduate re- 
search on British or French policy toward this or 
that territory at such and such a period. By the 
late 1940s, however, African research students 
were insisting that African history must be the 
history of Africans and not of Europeans in Africa, 
that local records and historical traditions must be 
used to supplement European metropolitan archives 
—in short, oral tradition must be accepted as valid 
material for historical research. This new approach 
produced works like Trade and Politics in the Niger 
Delta (Dike 1956) and The Egba and Their Neigh- 
bours (Biobaku 1957). 


Research problems and prospects 


No doubt the validity of nonwritten sources for 
historical research had been pointed out before, 
but it was new for university departments of his- 
tory to accept it. Even then not everyone was 
happy about it. Anthropologists replied cautiously 
that oral tradition, even when seemingly factual, 
was not history and could only be interpreted in 
terms of its function in society and within the 
particular culture. But this did not destroy its valid- 
ity as material for history; it only argued for a 
return to the link between history and sociology 
advocated in the fourteenth century by Ibn Khaldūn.
This interdisciplinary approach has been the
most fruitful trend in African historiography in the 
last decade. 

There have been three major developments to 
promote this interdisciplinary approach. The first 
has been the creation of special centers or insti- 
tutes of African studies within which historians, 
anthropologists, linguists, and archeologists can 
cooperate, both in research and in the training of
future historians. The second consists of specific 
culture-history projects like the Benin and Yoruba 
schemes, in which teams of people from different 
disciplines cooperate under the direction of one 
person to throw light on the culture history of a 
particular culture. The third is the formation of 
associations and the convening of periodic conferences
or congresses on African history or African
studies in general, bringing experts together from 
many disciplines to review progress in different 
fields and bring their joint consideration to bear 
on specific problems of African history. This cooperation
has extended beyond the social and humanistic
studies; the experimental sciences are also
applying the benefit of their techniques to resolving 
problems in African history. 

The interdisciplinary approach has been very 
fruitful in the collection and evaluation of material 
for African history. Since the end of the war, many 
African states have founded national archives. 
This in turn has stimulated the exploitation of 
government, missionary, commercial, and private 
archives in Europe. Materials relevant to African 
history are now being copied and transferred to 
archival centers and places of learning in Africa, 
where they are supplemented by local records in 
European, Arabic, and other African languages. 
Similarly, more care is now taken of ancient mon- 
uments, and museums of art and crafts are being 
established. The recording of oral tradition has 
become a major preoccupation. Evidence from all 
these sources is cross-checked and supplemented 
by the study of linguistics and African languages, 
social anthropology, archeology, and other sciences. 
Gaps in our knowledge are being filled, and a spe- 
cific chronology is beginning to emerge. Historians 
can now attempt a synthesis of the history of the 
whole continent, particular regions, or of the newly 
independent states. While insisting on the validity 
of traditional African historical material, the new 
African historiography has been able to accept the 
European concept of continental and universal 
history. 

African historians have also adopted rigorous 
standards and methodology in the collection of 
data. Indeed, the emphasis in the last decade has 
been placed so much upon this scientific metho- 
dology that it is a question worth raising whether 
the proper interpretation and synthesis of the ma- 
terial has not tended to lag behind. While the inter- 
disciplinary approach has been very successful in 
the collection of data, teams or conferences of ex- 
perts from different disciplines do not write history. 
The interdisciplinary approach has alerted the his- 
torian to the validity of nonwritten sources; it has 
not superseded him or relieved him of the neces- 
sity to sift the diverse evidence, synthesize, and 
write history. Moreover, it would appear that the 
new historiography is still so tied to the universities 
that the proper functions of history in the new 
Africa, to educate and entertain society as a whole,
have received little attention. We have still to create 
a new philosophy of African history to replace that 
of the old palace historians and singers. 


K. O. DIKE AND J. F. A. AJAYI


[See also AFRICAN SOCIETY; HISTORIOGRAPHY, article 
on ISLAMIC HISTORIOGRAPHY; and the biographies 
of HERSKOVITS and IBN KHALDŪN.]
III
CHINESE HISTORIOGRAPHY 


China can claim the oldest continuous historical 
tradition on earth. The appointment of court his- 
torians has been attributed to the Yellow Emperor, 
one of the legendary founders of a Chinese order. 
Modern archeology has shown that the court diviners
of the Shang kings (1751-1111 B.C., according
to Tung Tso-pin) maintained “archives” of
their divinations, inscribed on bone and shell: here 
at the beginning was an association between 
magico-religious operations and record keeping 
that was to have long-term effects on the historio- 
graphical tradition. 

Fragmentary records from the early part of the 
Chou dynasty (1111-221 B.C.)—principally certain
chapters of the Shu-ching, or “Classic of History”—reflect
a continuing interest in royal genealogy,
ritual operations, and political legitimacy.
With the gradual emergence of semi-independent 
states, from the eighth century B.C. on, traditions
of recording and compilation developed in several 
courts. The Warring States period (481-221 B.C.) 
was China’s first great age of systematic thought. 
Scholars searched the records of the past for ex- 
amples and precedents to sustain their arguments; 
this habit of appealing to history figures impor- 
tantly in all subsequent Chinese thought. Con- 
fucius (551-479 B.C.) and his followers laid great 
stress on the moral content of history, maintaining 
that the records of the past, if properly studied, 
would reveal the operation of moral norms in the 
affairs of men. The Ch’un-ch’iu chronicle of the 
state of Lu, allegedly edited by Confucius to ex- 
press his judgments, came to be regarded as the 
prototype of moralistic-normative history. 

The unified empire created in 221 B.C. was to
be the model for subsequent Chinese political de- 
velopment. The imperial court continued and 
elaborated earlier traditions of historian-officials 
who kept records of the actions of the emperor and 
of events in the realm. These historians were deep- 
ly influenced by Confucian orthodoxy of the Han 
period, with its interest in the moral and didactic
uses of history. In the Former Han dynasty (206 
B.C.-A.D. 8) the Grand Historian Ssu-ma Ch’ien,
continuing the work of his father, drew on court 
records, on chronicles and inscriptions from earlier 
times, and on oral traditions to produce the Shih- 
chi, the first great history of the Chinese ecumene 
from its shadowy beginnings to about 100 B.C.
During the Later Han dynasty (A.D. 25-220) an- 
other great court historian, Pan Ku, wrote the first 
of the “dynastic histories” (tuan-tai shih), his History
of the Former Han Dynasty. These two Han
works provided models that were returned to again 
and again by later historians. 

The breakup of the Han empire was followed by 
a long period of political disunion, from 220 to 
589. During much of this period north China, the 
heartland of Chinese civilization, was under the 
domination of “barbarian” invaders, while Bud- 
dhism gradually became a pervasive force in Chi- 
nese thought and life. The early part of this period 
was the second great age of creative thought— 
thought that ranged with greater depth and imagi- 
nation than ever before over the problems of man, 
society, and the cosmos. This is reflected in his- 
torical writings, which became more self-conscious 
and more critical. Historiography began to achieve 
its independence. Liu Hsieh (465-522) devoted 
part of his great treatise on literature to problems 
of historiography: the need for general principles, 
criteria for the selection of particulars, standards 
of credibility, and problems of objectivity and bias. 
The autonomy of the Chinese historical tradition 
was strikingly affirmed in this age of Buddhist 
dominance; Buddhism had slight effects on Chi- 
nese historical thinking, and Buddhist historians 
shaped their works after approved secular models. 

The great reunified empire of T’ang (618-906)
was notable for its artistic and literary brilliance. 
History became, for the first time, a standard in- 
gredient of the curriculum for state examinations. 
The noted T'ang official Tu Yu (735-812) sought 
to break free of dynastic chronicles and wrote an 
encyclopedic compilation, the T’ung-tien, that may
be regarded as the first institutional history of 
China. In the early T'ang the bureaucratic ap- 
paratus for recording events, processing docu- 
ments, maintaining archives, and writing history 
was greatly elaborated. At this time imperial com- 
missions replaced individual authors in the com- 
pilation of dynastic histories. This marked the 
inception of a division between official and un- 
official historiography that persisted until the end 
of the imperial order. A severe critic of this change 
was Liu Chih-chi (661-721), whose “Conspectus 
of History” (Shih-t'ung) is the first major work of
historical criticism, signaling the emergence of history
as a separate and independent branch of
study. In it Liu dealt with such problems as the 
credibility of evidence, historical style and form, 
the qualities of historians, the effects of bias and 
political pressure, and the moral dilemmas of 
historians. 

Toward the end of the T’ang, Confucianism, 
which had long seemed intellectually anachro- 
nistic, began to be revived and modernized. The 
resulting system of Neo-Confucianism had far- 
reaching effects on the writing of history. The 
rationalism of the new system was reflected in les- 
sened attention to supernatural events and inter- 
pretations. It fostered, in the centuries that fol- 
lowed, a strong tradition of historical empiricism 
and critical scholarship. The historical writings of 
the Neo-Confucianists of the Sung (960-1279) 
show a new meticulousness in historical inquiry, 
a disposition to use unofficial sources, a devotion 
to rational explanation combined with a profound 
belief in moral dynamics. Perhaps the greatest 
historian of this age was Ssu-ma Kuang (1019- 
1086). His Tzu-chih t'ung-chien was a history of 
the Chinese world from 403 B.C. to A.D. 959, ar- 
ranged in annalistic form. The author drew on an 
immense variety of works and appended his study 
of doubtful points (k’ao-i) to the completed his- 
tory. He emphasized in his preface his moral- 
political purpose: “. . . taking in all that a prince 
ought to know—everything pertaining to the rise 
and fall of dynasties and the good and ill fortune 
of the common people, all good and bad examples 
that can furnish models and warnings.” 

The development of printing in the Sung dynasty 
and the spread of literacy among an enlarged elite 
meant a great proliferation of private historiog- 
raphy, an expanded reading public, and vastly 
increased possibilities for preserving historical 
records. These trends and the dominant Neo-Con- 
fucian view of history—most authoritatively ex- 
pressed by Chu Hsi (1130-1200) in his “epitome” 
(kang-mu) of Ssu-ma Kuang’s history—continued 
through the age of Mongol rule (1279-1368) and 
the Chinese dynasty of Ming (1368-1644). These 
centuries are better known for monumental com- 
pilations and exhaustive commentaries than for 
creative historical works. An exception is the second
major institutional history of China, the Wen-hsien
t’ung-k’ao, by Ma Tuan-lin (c. 1250-1325).

The shock of the imposition of Manchu rule in 
1644 plus a growing impatience with Neo-Con- 
fucian orthodoxy as it devolved into pedantry on 
the one hand and vapid philosophizing on the other 
provoked a new critical movement of great importance.
A new rational empiricism produced new
principles of textual criticism and new work in 
historical geography, epigraphy, archeology, and 
other fields. Ku Yen-wu (1613-1682) was one of 
the pioneers in this movement and began the de- 
velopment of the inductive method of research 
that had far-reaching effects on historical and 
philological studies. Chao I (1727-1814) wrote of 
recurrent patterns and forces in Chinese history 
with unusual acumen. Chang Hsüeh-ch’eng (1738-1801),
ignored in his own time, presented an enlarged
view of history, a conception of historical
synthesis, and ideas about the qualities of a his- 
torian that were new to the Chinese tradition. 
These men and the other historians of this autum- 
nal flowering never—for all their new ideas and 
rigorous methods—renounced their allegiance to 
Confucian morality, to the belief that history 
should be written to illuminate moral truths and 
thereby to reform the world. When the imperial 
order and Confucian orthodoxy had collapsed, the 
methods and skeptical spirit of these men figured 
largely in the modernization of Chinese histori- 
ography. 


The Chinese view of history 


The usual Chinese term for history, shih, has 
several meanings: writings about the past, one 
who (usually under official appointment) records 
events, and, in early times, “astrologer” or “astron- 
omer.” The term does not have the second meaning 
of the English word “history,” the past as such. 
The Chinese conception of history was shaped by 
certain elements in the Chinese world view. One 
of these was ethnocentrism, derived from the rela- 
tive isolation of Chinese civilization. History was 
primarily concerned with the “Central Kingdom” 
(chung-kuo), with the world (t’ien-hsia) in which
China was “central.” Thus peoples peripheral to 
China were treated as “barbarians,” or peoples to 
be contained, “chastised,” or converted to Chinese 
culture. Since the Chinese had only fragmentary 
knowledge of other great civilizations, there was no 
trace of comparative history, such as one finds in 
Ibn Khaldūn. A second element is holism, the view
that human and natural events are interrelated in 
a coherent whole; symptoms of dislocation in one 
order are interpreted as signs of malfunction in 
the other. Especially in histories written before 
about A.D. 1000, much attention is given to natural
catastrophes, portents, and the like. Yet the force 
for harmony or dislocation originates in the actions 
of men, and a rational-secular view of historical 
causation gradually becomes dominant. A third 
element is a view of history as a devolution from
a golden age: the sage-kings of the distant past
had presided over an ideal order; in later times 
men moved further and further from that ideal. 
Change was considered desirable if it promised a 
return to ancient ideals. This gives to all Chinese 
histories—even those of periods of innovation—a 
pronouncedly archaistic tone. A fourth element is a 
cyclical conception of political history; polities, like 
men, have sequences of birth, youth, maturity, 
senescence, and death. The habit of holistic think- 
ing led historians to see symptoms of the same 
cyclical stages in all the spheres of culture: litera- 
ture, the arts, the ethos of the villages, the cus- 
toms and habits of the elite. The practice of dating 
all events by dynasty and era name (nien-hao) 
reinforced the conception of political change as the 
dominant force in cyclical change. 

A fifth element is the view that there is a moral 
dynamic in the affairs of men; the Confucian sages 
had specified the principles. History, if properly 
written, would lay bare the working of those prin- 
ciples in a nexus of events or in an individual life. 
This belief tended to give a moralistic coloration 
to all cause-and-effect statements and to reduce 
biography to exemplary or minatory stereotypes. 

Scope and purposes of historiography. From 
the earliest times the keeping of records and the 
writing of history were official functions. Every 
dynasty had its history office. It was staffed by men 
who had been educated in the standard curriculum 
and had passed the state examinations. The com- 
mon experience of all officials included a great 
deal of history: study of the style and content of 
the ancient chronicles, memorization of historical 
sequences, the use of historical allusion and his- 
torical precedent in the most routine official com- 
munications, the marshaling of historical cases in 
policy argument. Thus, history was an integral 
part of training and of official life. This meant that 
a high percentage of officials at some point in 
their careers would be assigned for a period to the 
history office, where they might work at daily 
record keeping or at one of the several types of 
compilation periodically ordered by the throne. 
From the T’ang dynasty (618-906) on, high-ranking
officials were appointed to chair or to serve
on the commissions in charge of major historical 
compilations. All works compiled in the history 
office were presented to the throne for approval. 
Private historians were usually members of the 
official class, were largely dependent for their ma- 
terials on official records, and often felt it politic 
to present their works to the throne for approval. 

Historians of all kinds were thus deeply involved 
in official life and shared the general concerns of
the official class: the maintenance of stability and 
order through governmental operations and the 
institutions of social control, the preservation of 
Confucian orthodoxy and the maintenance of Con- 
fucian ethical standards in society, the upholding 
of inherited standards of excellence in literature 
and the arts, the defense of the privileged posi- 
tion of the literate gentry against threats from 
autocratic emperors and upstart power groups. 

These interests, working together with the world 
view discussed above, determined the scope of 
Chinese history writing. The overriding concern 
with order produced a concentration of attention 
on political history and the lessons about stability 
and change that could be drawn from it. Similarly, 
the history of institutions was viewed from the 
capital and from the official viewpoint; thus, for 
example, the “economic” monographs of the dynas- 
tic histories are focused on the regulatory function 
of government, and the “geographical” monographs 
are concerned with what we would call adminis- 
trative geography. Biographies deal less with indi- 
vidual character than with the subjects’ official 
posts and social role; they tend to be grouped ac- 
cording to a common social role or according to 
certain moral standards that the subjects exempli- 
fied, for example, “loyal ministers,” “literary men,” 
“virtuous women,” “harsh officials,” “partisans.”

Slight attention was given to all the groups that 
challenged the literate gentry’s power: military 
leaders, merchants, eunuchs, court favorites, and 
members of the empresses’ families. (The last 
three were treated when their machinations were 
adduced as prime causes of dynastic weakness or 
collapse.) Heterodox religions tended to be given 
little attention. Although Buddhism dominated 
Chinese culture for five hundred years or more 
(from 350 to 850) and was a major force in life 
and thought in later centuries, references to it in 
official histories are few; only one monograph in 
one dynastic history is devoted to Buddhism and 
religious Taoism. References to these and other 
heterodoxies are usually pejorative and emphasize 
their socially disruptive effects. Unsuccessful rebels 
are given scant attention beyond the account of 
measures taken to suppress them. Common life 
and popular culture are seldom given detailed 
treatment, except when natural calamities created 
problems of relief and rehabilitation or subversive 
movements appeared among the oppressed peas- 
antry. 

The whole picture of life that appears in the 
standard histories and official compilations tends
to minimize tensions, cultural differences between 
one region and another, clashes of opinion, and
other phenomena that in the modern Western view 
seem most significant in the development of Chi- 
nese civilization. Yet these defects are in part 
counterbalanced by the sheer volume of the record; 
it would require 45 million English words to trans- 
late the 25 dynastic histories. And from the sixth 
or seventh century on the dynastic histories can be 
checked and supplemented by an enormous wealth 
of published official and private sources: diaries, 
memoirs, collected papers, essays, poems, collec- 
tions of inscriptions, stories, travelogues, and mis- 
cellanies. Surviving manuscripts, notably those 
discovered at Tun-huang, provide further raw ma- 
terial and additional checks. The shih-lu, or “veri- 
table records,” of the last two dynasties have been 
preserved and provide a daily record of court and 
government from 1368 to 1912. Thus the modern 
historian can, by using all these varieties of ma- 
terial, reconstruct substantial segments of the 
Chinese past. 

Historical method. The methods used by Chi- 
nese historians fall into two closely related groups. 
One consists of the methods of recording contem- 
porary happenings, the other of the methods of 
compiling a coherent account from such records. 
Court historians were charged with writing up, day 
by day, the events at court: audiences, rituals, 
memorials to the throne, imperial edicts, digests 
of reports received from outside the capital. Pri- 
vate historians would usually record events they 
witnessed, journeys they took, or the lives of their 
relatives and friends. The exercise of the recording 
function, especially at the official level, was a 
solemn obligation, for the historian had the moral 
duty to record accurately, without fear or favor, 
and this usually in a power situation fraught with 
tensions. Heroic examples of historians who died 
rather than distort the record were constantly 
before them. 

The second type of method is compilation. Court 
historians from time to time edited and digested 
the daily records into chronological accounts of a 
dynastic period (kuo-shih) or of a single reign 
(shih-lu). These accounts in turn formed the basis
for the “annals” (pen-chi) section of the dynastic 
history written by the historians of the succeeding 
dynasty. The bare chronological accounts in the 
dynastic histories were supplemented with biogra- 
phies prepared from both official and nonofficial 
records and with monographs pieced together from 
official papers dealing with particular topics, for 
example, imperial edicts on judicial administration, 
ceremonies, economic and fiscal affairs, etc. The 
procedure of compilation was to select integral passages
from the records and piece them together
with words or sentences of transition, usually with- 
in a chronological framework. Thus, official his- 
tories were compiled rather than written afresh, 
and the same may be said of most private his- 
toriography. 

In the whole process of recording and compila- 
tion, the historian was circumscribed by a variety 
of attitudes and conventions, many of which had 
come down from the distant past. The world view 
and interests of the Chinese elite limited the his- 
torian’s horizons, and Confucian morality governed 
his choices. Moreover, the Chinese attitude of re- 
spect for the written word meant that he handled 
documents from the past with care and circum- 
spection. He did not alter them lightly, and when 
two accounts of an event were in conflict he generally
chose the one that showed greater consistency
with his other materials and inserted it
integrally into his text. The discussion of discrep- 
ancies in evidence appeared rather late in the 
tradition and was then relegated to separate sections
of a history or to separate works.

The chronological arrangement of historical par- 
ticulars derived naturally from the system of court 
record keeping. It also had an ancient and authori- 
tative prototype in the “Spring and Autumn Annals” 
(Ch’un-ch’iu), allegedly edited by Confucius. This
method of compilation (pien-nien) was dominant 
in the tradition, and even the institutional his- 
torians who sought to transcend dynastic chro- 
nology arranged the materials in their topical chap- 
ters chronologically. The devolutionary view of 
history precluded the full development of ideas of 
history-as-process. In general, “befores” and “afters” 
were thought to indicate to the discerning reader 
the elements of moral causation that were believed 
to inhere in any sequence of events. 

But the historian was enjoined by tradition to 
make clear the moral lessons of history. There 
were certain approved means of doing this. Two 
of these were methods attributed to Confucius: 
“appropriate concealment,” which meant omission
or disguise by euphemism of particulars that might 
impair the image of a worthy individual or group, 
and the selection of terminology in such a way as 
to apportion moral praise or blame, for example, 
“succeed to” or “usurp” the throne. More explicit 
judgments by the historian were to be found in 
brief comments appended to a section or chapter 
of his work; there the moral point was made, the 
lesson drawn. Furthermore, the historian could
express degrees of approval and disapproval by his
arrangement of material. For instance, in a collection
of biographies the lives were grouped according
to certain types. The historian’s judgment was
reflected in the group to which a given biography
was assigned and in the position of the group in 
the collection: the most worthy were placed first; 
the least worthy came at the end. 

Both recording and compilation were subject to 
verbal and ritual conventions. Drawing up an im- 
perial edict was a solemn and complex task that 
involved choosing the right references to the clas- 
sics and the appropriate allusions to historical prec- 
edent, the observance of the official taboos in refer- 
ring to members of the ruling house, choosing the 
rhetorical flourishes proper to the matter in hand. 
When the historian used such an edict in compiling 
a history, he had to understand what it meant and 
fit it into his narrative with only those minor verbal 
changes dictated by the lapse of time. There were 
proper and improper ways of recording all events: 
the selection of a crown prince, the death of an 
emperor or empress dowager, the reception of a 
tributary envoy. The recorder used these conven- 
tions, and the subsequent historian reproduced 
them in his account. To all these formulas were 
added standardized literary locutions for describing 
certain types of events: famine in the provinces, 
incursions along the frontiers, and so forth. Much 
of Chinese history, particularly official history, is 
thus cast in standardized ritualistic or literary for- 
mulas. 

Biography was also subject to formulas. This had 
two roots in historiographical tradition. One was the 
dependence of the biographer on funerary writings 
of all kinds, which presented individual lives in the 
conventional formulas of filial piety and the cult of 
ancestors. The other was the conception of the 
individual life as having an exemplary or minatory 
purpose; if history was to teach moral lessons, 
biographies must serve to illustrate virtue and vice 
in individual lives. The historian-compiler, as we
have noted, grouped his biographies and thus 
tended to stereotype the lives classified according 
to any one type. Particulars concerning the life of 
a man that were at variance with the type to which 
his biography was assigned were sometimes in- 
cluded in other parts of the same history. 

The weight of convention was heaviest upon the 
compilers of dynastic histories and other official 
works. In general, the historian who recorded 
or compiled accounts of nonofficial personages 
(monks, recluses, certain literary figures) or of 
events of marginal concern to the court (ethnography
of certain peoples, popular festivals) enjoyed
greater freedom. It was always in the sector
of private historiography that innovation occurred: 
for example, the invention of a new type of “unofficial
biography” (pieh-chuan) in the Ch’ing dynasty,
the development first of the “life chronicle”
(nien-p'u) and subsequently of the autobiographi- 
cal life chronicle, the creation of institutional his- 
tory (which later atrophied under official spon- 
sorship), and the development of local histories 
(fang-chih) given a fresh impetus by Chang Hsüeh-ch’eng
and his successors. But if some of the official
conventions were less burdensome in such cases as 
these, the method of compilation, governed as it 
was by tradition and the world view of the elite, 
remained very much the same. One of the effects 
of this is that the reader seldom—except in certain 
kinds of prefaces—glimpses the personality or 
point of view of the historian in the objective, fac- 
tual flow of the prose. Another effect—especially 
pronounced after the bureaucratization of history 
—was that nearly all personal and evocative writ- 
ing was relegated to “nonhistorical” categories: 
novels, anecdotes, and the like. This deprived the 
historiographical tradition of the vividness and 
color found in the early chronicles. 


The modernization of historiography 


The breakup of the millennial traditions of Chi- 
nese historiography occurred gradually in phases 
paralleling the dissolution of the imperial order of 
which it had been a part: a period of resistance to 
the intrusive forces from the West, followed, from 
about 1860 to 1905, by a phase of attempted compromise
and accommodation, followed in turn by a
period of increasing acceptance of Western ideas 
and institutions. The abolition of the examination 
system in 1905, accompanied by sweeping educa- 
tional reforms after the models of the Japanese 
modernizers, ushered in revolutionary changes in 
the organization of learning. The old academies 
were swept away and replaced by state-supported 
schools; the long-gowned man of learning who 
divided his time between official service and schol- 
arly activities gave way—as scholar and teacher— 
to the new generation of Western-trained and 
Japanese-trained professors in the newly estab- 
lished universities and research institutes. History 
was institutionalized in new careers and in new 
institutions, where it struggled to re-establish itself 
as one of the new disciplines among the many 
transplanted from the West.

The universities were the first of the new insti- 
tutions for the study of history. They grew from 
small beginnings under the empire to a nationwide 
system of state, provincial, and accredited private 
universities under the Nationalist government 
(1927-1949). Although the scale was small (only 
about 25,000 students attended accredited universities
in 1930-1931), the universities provided
positions for historians engaged in teaching and
research and the intellectual forums in which they 
developed their ideas. The principal institution for 
advanced research was the Institute of History and 
Philology, one of the branches of the Academia 
Sinica established by the government in 1928. The
institute supported a variety of research activities 
aimed at the reorganization of the historical heri- 
tage and the application of new methods of an- 
alysis. This included an approach to validating 
historical documents through archeological investi- 
gation. Its excavations of the late Shang capital at 
Anyang revolutionized the study of early Chinese 
history. National libraries in Peking and Nanking 
and libraries attached to universities grew in size 
and were organized along Western lines. After a 
considerable lapse of time, the Nationalist govern- 
ment took over the archives of the defunct dynasty, 
and the collection and organization of archival 
materials were gradually modernized; substantial 
archival collections were published, notably by the 
Palace Museum in Peking. 

The senior historians who worked within this 
new framework of institutions were scholars trained 
under the old order. The younger men—those who 
wrought the modernization of historiography— 
were, typically, deeply influenced by Western ideas 
and in many cases by study in Japan, Europe, or 
the United States. 

The movement for a new history, although it 
had earlier beginnings, may best be seen in the 
context of the May Fourth Movement of 1919, 
which was basically a movement in search of a 
new Chinese culture adapted to the needs of a 
modern society. In the intellectual ferment of Pe- 
king in the 1920s, ideas and programs were vigor- 
ously discussed. Western ideas from Diderot to 
Dewey (and from Plato to Lenin) found their 
champions and interpreters. Historians, like other 
intellectuals, argued the great issues: Why had the 
Chinese order weakened and fallen victim to West- 
ern imperialism? What should be the ingredients 
of a modern Chinese civilization, and how could 
this develop in the midst of political chaos and 
continuing foreign pressures? What elements from 
the Chinese past could be used as explanations of 
China’s present and as guides to its future? And 
which of these elements might be worked into the 
fabric of a new Chinese culture? 

Although there were last-ditch defenders of the 
older scholarly traditions, the net effects of these 
controversies on historiography were revolutionary. 
The classics ceased to be regarded as sacred and 
were rigorously re-examined for authenticity and 
credibility as historical documents; “heterodox”
works—Taoist, Buddhist, and others—were reap- 
praised for their historical and philosophic content; 
the vast range of popular stories, novels, dramas, 
and other works disdained by the older literati as 
“vulgar” were seriously studied for the light they 
might shed on China's social past; field-work tech- 
niques from the West were put to use in the study 
of village life, popular cults, and folklore, and the 
findings were used to further understanding of tra- 
ditional popular culture. The comparative method 
was introduced, and the age-old habit of regarding 
everything Chinese as sui generis was broken. The 
moral-political emphasis of traditional history was
challenged from every side, and materials were 
collected, regrouped, and studied for the light they 
might shed on social and economic history or the 
history of material culture and the arts. Archeology,
although underfinanced, produced rich new
finds and new ways of understanding the life of 
the past from surviving objects; analytical history 
based on stated hypotheses replaced the time- 
honored forms of compilation. Exact citations and 
footnotes began to replace the ancient method of 
piecing together passages from earlier works. 

The publications of the new historians of the 
1920s and early 1930s reflect all these changes. 
Research societies, with their quarterly bulletins and 
monograph series, were a typical medium of com- 
munication, and their publications reflect the effort 
to reorganize China’s past. Thus, for example, 
there were new journals of economic history and 
geography, the journal and the monographs of the 
Society for Research in the History of Chinese Archi- 
tecture, journals of Buddhist studies, and numer- 
ous journals of general historical studies, many of 
them the organs of new university departments 
and institutes. Semipopular magazines and news- 
paper supplements carried the findings of histori- 
ans to a wider reading public. The great publishing 
houses of Shanghai (notably Chung-hwa and the 
Commercial Press) commissioned new works, or- 
ganized series on the national past, and reprinted 
large collections of choice editions of the literary 
and historical works of the past. Punctuated edi- 
tions and vernacular translations of classical works 
made the heritage more accessible to the young. 
Modern reference works, indexes, and concor- 
dances of traditional sources were developed. De- 
spite the social and political instability of these 
years, the vitality of the new historical scholarship 
promised well for the future. 

These prospects were dimmed, first by the Japa- 
nese attack of 1931 and then by the Japanese inva- 
sion of north China in the summer of 1937. 
Centers of learning were destroyed, and faculties
and students fled to the southwest, where the universities
continued valiantly to function, but in
dire poverty and without libraries or other facilities 
for scholarly research. Despite these hardships, im- 
portant works of scholarship were published in the 
years 1937-1945. The return of the universities to
north China in 1945 was followed by catastrophic 
inflation and the onset of civil war. The Nationalist 
government was increasingly intolerant of dissent, 
and the atmosphere became steadily more tense 
and oppressive. 

In the deepening crisis of the civil war, histori- 
ans, like other intellectuals, had to make a choice: 
to remain in China under the Communist party or 
to flee with the Nationalists to Taiwan. The great 
majority remained. After 1949 there were two prin- 
cipal centers of historical studies: the People’s Re- 
public of China and the Republic of China on Tai- 
wan. 

Historical scholarship in the People’s Republic is 
institutionalized in academic posts at the universi- 
ties and in several of the institutes of the Academy 
of Sciences (Chung-kuo K’o-hsüeh Yüan). Voluminous
publications of many kinds have appeared
since 1949. Large documentary collections have 
been published, for instance, on the peasant revolts 
of Chinese history, on China’s wars with the West- 
ern powers, on the beginnings of capitalism, on 
economic history, on reform movements. Excellent 
punctuated editions of traditional works and ver- 
nacular translations have been published on a large 
scale, and many of the important historical studies 
of the period from 1920 to 1949 have been re- 
printed, often with few changes other than a new 
preface. As the ground has been cleared for public 
construction, archeologists have conducted new 
excavations; publication has lagged behind discov- 
ery, but it is clear that the new finds now being 
assembled and classified in national and provincial 
museums will permit the rewriting of much of 
China’s history. 

Like other intellectuals, historians in the Peo- 
ple’s Republic have been subjected to “thought re- 
form,” so that their thinking and writing would 
wholly conform to the dominant orthodoxy. They 
have been constantly exhorted to perfect their the- 
oretical grasp of Marxism, to use Marxist theories 
to bring forth a new history that will serve the new 
order. Each article may bring upon the historian 
the wrath of a party theorist who finds something 
in it that does not conform to the party’s view of 
the past and future. The historian enjoys less pres- 
tige than the scientist and is far more exposed to 
ideological pressures. The older generation that 
came to maturity under the Republic have not
maintained their earlier creativity, and Peking
critics have complained that after ten years of the 
“New China” the younger historians are poorly 
trained in language and Chinese and Western his- 
tory and that there are no satisfactory textbooks 
or general histories. 

Since 1949 the problems and results of historical 
study have been laid down by the government and 
the party according to the doctrines of Marxism- 
Leninism-Stalinism and Maoism. The historian 
has been ordered to document and validate, not to 
explore and question. The long period from 770 
B.C., or perhaps from 481 B.C., to 1840 has been
authoritatively categorized as “feudal”—a neces- 
sary stage in the Marxist evolution of society. The 
only sustained attention given to this long period 
has been in the study of peasant rebellions, each 
of which, as Mao Tse-tung has said, “dealt a blow 
to the existing feudal regime and more or less 
furthered the development of the social productive 
forces.” Mao Tse-tung has also prescribed the study 
of the “sprouts of capitalism,” for it is his thesis 
that China would herself have progressed from 
“feudalism” to her own “capitalism” if it had not 
been for the intrusion of foreign imperialism. The 
discussion of this thesis has produced polemics, but 
it has also led to the publication of important historical
documents and a few usable monographs.
Modern history from 1840 to 1919 is characterized 
as semifeudal and semicolonial, and much effort 
has been expended on documenting imperialist ag- 
gressions and on attempting to periodize these 
years in terms of their “contradictions,” changes in 
the “mode of production,” and so on. The concen- 
tration on universal Marxist determinants has, by 
depriving China of a distinctive past, come into 
conflict with the intense nationalism of the Peo- 
ple’s Republic. Recently there have been efforts to 
re-examine the key figures of Chinese history rather 
than to restrict historical emphasis to peasant 
rebels and impersonal social forces. This is at least
a step toward the re-creation of a distinctive Chi- 
nese history. In general, however, dogmatic con- 
cerns have made sterile the historical studies since 
1949; indeed, one noted Peking historian exclaimed 
in 1957 that they had brought historiography to 
“the brink of death.” 


On Taiwan, the government re-established the
Academia Sinica, which now has its old Institute
of History and Philology and a new Institute of
Modern History. National Taiwan University has a 
history faculty, as do the smaller colleges on the 
island. Considerable collections of rare books, 
archeological materials, government archives, and 
works of art were brought from the mainland.
There has been a steady output of documentary
collections, notably on the history of the nineteenth 
and the early twentieth centuries. Some of the 
Academia Sinica’s earlier serials have been re- 
sumed, and the output of historical monographs 
has been considerable. Yet the historical commu- 
nity on Taiwan is small and impoverished; his- 
torians are sensitive to their insularity and the 
garrison-state atmosphere around them. It is not 
surprising that no major works of synthesis and 
interpretation have appeared on Taiwan. 

The present is indeed one of the low points in 
the long history of Chinese historiography. Mean- 
while, scholars in Japan, Europe, and the United 
States, many of them of Chinese birth, are in a
position to push forward in an atmosphere of freedom
toward new understandings of the Chinese past.
ARTHUR F. WRIGHT 


[See also CHINESE SOCIETY and CHINESE POLITICAL
THOUGHT.]

IV
ISLAMIC HISTORIOGRAPHY 


Islamic historiography is the historical literature 
written by adherents of the various branches of 
the religion of Islam. Although much of it is written 
in Arabic, a great deal has appeared in other lan- 
guages employed by Muslims; Persian (beginning 
with the tenth century) and Turkish (from the 
sixteenth century onward) were used intensively 
for the writing of histories. Various minorities 
living under Muslim domination, especially the
Eastern Christian denominations, produced historical
works bearing a close resemblance to their
Islamic counterparts, but these will not be con- 
sidered here. On the other hand, the rapidly grow- 
ing number of historical works written by Muslims 
in our time must not be excluded, even if the old 
forms and techniques have been abandoned, to- 
gether with most of the values that were charac- 
teristic of Islamic historiography from medieval 
times well into the nineteenth century. 

Origins and early beginnings. Pre-Islamic Arabs 
took a keen interest in genealogy and in major 
events affecting tribal politics, but the memory of 
the past was transmitted orally. In the Byzantine 
and Persian territories conquered by the Muslims 
during the first half of the seventh century, his- 
toriographical traditions of long standing were 
alive, if not exactly flourishing. In these areas, per- 
sonal contacts between Muslims and learned non- 
Muslims or converts seem to have set in motion 
a certain minor impetus toward the creation of 
Muslim historical writing. 

The principal motivation behind the subsequent 
tremendous development of this historical writing 
lies in the conception of Islam as a historical re- 
ligion. The Prophet Muhammad (ca. 570-632) 
saw himself as the culmination and fulfillment of 
a historical process that started with the begin- 
ning of the world in time. Through a chain of 
divinely ordained messengers (principally, the
great figures of Judaism and Christianity), of 
whom he was the last, this process was leading 
toward a clearly foreseeable end of the world. More- 
over, Muhammad saw himself as a religiosocial re- 
former fulfilling prophecies and giving directions 
for the future. Thus, he provided the outline on a 
vast historical canvas that was left to be filled in 
and interpreted by the historian. There was another 
aspect of the awareness of history fostered by the 
Prophet. Historical precedent, in all its essential 
manifestations, was very important in the develop- 
ment of Islamic civilization. From the outset, and 
increasingly so with the passing of time, political, 
legal, religious, and scholarly institutions, as well 
as moral ideas and values, were considered as de- 
riving their ultimate authority from the events of 
early Islam and the actions and behavior of the 
early Muslims. The historical truth and significance 
of these events and actions required constant reaffirmation,
confirmation, and re-evaluation. This
sharpened historical consciousness and was re- 
sponsible for a great amount of historical research 
and writing. 

Although there is no doubt as to the motivation 
that made Islamic historiography an inevitability,
the mechanisms through which this took place are
much harder to trace. The early authorities re- 
sponsible for the historical information reported 
in later writings are represented as having trans- 
mitted the material orally. This may be true, but 
it is more likely that there existed a method of 
oral transmission that was supplemented by un- 
published written notes that were the reporters’ 
personal aide-mémoires. In fact, there seems to 
have been no regular procedure for the orderly 
publication of works written in Arabic until the 
end of the seventh century. The subsequent intro- 
duction of paper, at about 750, or the beginning of 
the ‘Abbasid dynasty, made possible a literary ac- 
tivity that in the quantity of production was some- 
thing not seen before in the Mediterranean world. 
However, it would seem that relatively few works 
that could be called histories were written, and 
they almost certainly enjoyed no wide circulation. 
Earlier Muslim historical works have largely been 
lost because of the absence of institutions of publi- 
cation and durable writing materials and also, 
perhaps, because of the dynastic change that made 
works written under the Umayyads (660-750) 
largely unacceptable. ‘Urwah b. az-Zubayr (ca. 
650-711), a member of the Muslim elite, is cred- 
ited with the composition of the book Raids of the 
Prophets. In the generation following ‘Urwah, 
al-Zuhrî (ca. 670-740) is said to have written a 
work on “the genealogy of his people.” He also 
wrote, apparently for his personal use, on the 
length of the reigns of the caliphs. From the work 
of a third early authority, Mûsâ b. ‘Uqbah (d. 758/
759), a brief fragment, not entirely historical in 
character, has come down to us. The earliest large- 
scale work that is preserved, although only in later 
recensions, is the biography of the Prophet (Sirah, 
“Way of Life”) by Ibn Ishaq (ca. 704-767). It deals
with the history of pre-Islamic Arabia and with all 
the details and events of the Prophet’s life. The 
same author is also credited with a history of the 
caliphs. From the evidence now at our disposal, 
we may conclude that by around 700 historical 
research, focused on the life of Muhammad, began 
to serve the political and social needs of the new 
religion. It also seems certain that most of the 
formal elements of later historical writing were to 
some degree already present at that time. 

Forms and contents of historical works. The 
importance of form in literary presentation is 
particularly noticeable in Islamic historiography, 
which has, for better or for worse, always remained 
in bondage to the forms it first developed. Pre- 
Islamic Arab tradition had already stressed the 
concrete “factual” element in history, isolated from
its environment and as much as possible unmodified
by human thought processes. Thus, simple
statements, isolated events, superficial, if colorful,
characterizations, put next to each other and left 
without any explicit elaboration of their inner or 
causal connections, came to determine the basic 
appearance of Muslim history books. Historical 
truth, like religious truth, was considered ensured 
by the unimpeachable character of the succession 
of men through whom a given item of information 
was transmitted, the so-called chain of transmitters 
or isnâd. Even if the historian did not mention 
these chains of authorities for all the individual
items he reported (and many historians did not), 
the concept of each historical fact possessing a 
great, even absolute, degree of autonomy was 
strengthened. The effects have been felt through- 
out all Muslim historical writing, which can be 
characterized as fundamentally episodic, no matter 
how long and detailed and skillfully narrated the 
individual episode. 

Annalistic histories. Larger units for organizing
the historical material were soon needed to
hold together the constantly growing volume of 
data. They were readily found in the proven 
principles of dynastic arrangement according to 
the reigns of successive rulers and in annalistic 
arrangement according to the years of the era. 
The era of the hijrah was introduced about the 
year 638, and Muslim historians were fortunate 
compared to all their predecessors inasmuch as 
they were able, from the outset, to rely upon a 
continuous, unambiguous, and generally accepted 
chronology. The strictly annalistic arrangement 
was thought particularly suitable to historical 
presentation. It was also easily combined with 
other arrangements, such as that according to 
reigns; and it fitted in well with the fragmented 
episodic approach and helped to perpetuate it. 
Systematically practiced, it was rarely interrupted 
even to the extent of the consecutive reporting of 
events that extended over several years. It was
especially useful in that it facilitated the continua- 
tion of standard histories in the form of supple- 
ments often called dhayl (“tail”) or some variant. 
Such continuations might be independent works 
containing only new material, or they might repeat 
the earlier material, shortening the narrative for 
the older period and becoming more and more 
detailed as they approach the actual time of 
writing. Authors then would systematically date 
events according to the month and the day and 
list even trivial news. The oldest preserved, at 
least in part, annalistic histories in Arabic were 
written by Khalifah b. Khayyat in the first half
of the ninth century and by Ya'qûb b. Sufyan in
the second half of the same century. The multivolume
standard work of Muslim annalistic historiography,
which decisively influenced its future
course, was written by at-Tabari (d. 923).

Biographies. Of the areas of study cultivated
by the Muslim historians, the most important was 
biography. This was due not so much to the often 
expressed fact that history is the record of man 
and his actions, but rather to the fact that Muslim 
historiography, in its early beginnings, was con- 
cerned with the story of the life of a great indi- 
vidual, the Prophet Muhammad, and with the cir- 
cumstances surrounding the activities of the early 
Muslims. It then became necessary to scrutinize 
the lives of all those connected in any way with 
Muslim law and religion and to learn the dates 
of their birth and their death, their local affilia- 
tions, their teachers and disciples, their moral 
character, and their works and activities. Depend- 
ing on the importance attached to a given indi- 
vidual, biographies could at times grow into large 
volumes while still remaining restricted, by and 
large, to these topics. The majority of biographies 
were brief and were entered early, in the form of 
obituaries of notable persons, into annalistic his- 
tories. Biographies dealing with representatives of 
certain scholarly categories were collected in 
special works, and biographies of religious scholars 
constituted the main contents of the large, theo- 
logically oriented segment of local historiography. 
In order to facilitate reference, biographies were 
grouped in “classes” (tabaqah, plural tabaqat) 
comprising those who had died in approximately 
the same period. This somewhat clumsy arrange- 
ment served quite well the need of religious 
scholars to judge the genuineness and reliability 
of a chain of transmitters. However, as the num- 
ber of biographees increased, an alphabetical 
arrangement was instituted. Beginning with the 
tenth century, alphabetization thus became the 
preferred method of arrangement in collections of 
biographies, although the tabaqat system continued
to be used. It must be noted that the merchants, 
military men, government officials who were not 
at the same time religious scholars, scholars in the 
nonreligious disciplines, poets, and so on were 
originally not the object of any systematic bio- 
graphical research. Factual information was often 
plentiful but always scattered. Much of it would 
have been lost if some later Muslim authors had 
not put together the scattered information and 
produced excellent biographical collections dealing 
with some of these groups, such as the littérateurs 
treated by Yâqût (d. 1229) in his Irshad al-arib
ilâ ma'rifat al-adib, the physicians collected in the
great work on the history of medicine by Ibn Abi
Usaybi'ah (d. 1270), entitled ‘Uyûn al-anbâ’ fî
tabaqât al-atibbâ’, or the fine choice of biographies
of illustrious men presented by Ibn Khallikan
(d. 1282) in his Wafayât al-a'yân.

General histories. Political history, restricted 
to the administrative and military exploits of rulers 
and statesmen, was the essential raw material for 
most Muslim historical writing. Monographs on 
particular events or periods were written in great 
profusion; in fact, the early ninth-century his- 
torians, most of whose works are known to us 
only by title, wrote copiously on politically signifi- 
cant individual events. Comprehensive world his- 
tories, usually traced from the creation of the world 
or from the coming of Islam to the times of their 
particular authors, also met with success. They
were universal only in the Muslim sense, not ad- 
mitting more than a rather limited amount of 
set data from pre-Islamic times and largely in- 
sensitive to non-Muslim history, even where it 
impinged upon Muslim affairs. The assimilation 
of the Hellenistic heritage, and with it the develop- 
ment of a taste for cultural history, broadened the 
horizons of historiography. In the late ninth 
century, political history combined with intellec- 
tual history to touch on all the noteworthy and 
accessible features of the various civilizations then 
known. This trend produced such outstanding 
works as the History by al-Ya'qûbî and the series of
publications by al-Mas'ûdî (d. 945/946), among
them the preserved Murûj adh-dhahab. It continued
to echo weakly in the works of much later
periods. Interest in the contemporary or near- 
contemporary non-Muslim world remained never- 
theless subdued. While the scholarly curiosity of 
historians welcomed “strange” information, they 
undertook no systematic search for it. Measured 
against the enormous mass of Muslim historical 
writing, references to events that took place outside
the Muslim world are few indeed. Al-Mas'ûdî,
for instance, included a list of the kings of the 
Franks (see Maqbul Ahmad 1960, pp. 7-10). Em- 
bassies from abroad occasionally provided an open- 
ing for referring to events on the international 
scene. Historians of the time of the Crusades were 
well aware of cultural and political differences, but 
in their political and military analysis they did not 
venture beyond the borders of Islam. In central 
Asia, the singular circumstances created by the 
Mongol Empire, with its tentacles stretching far 
outside the old Muslim world, produced the states- 
man and historian Rashid-ad-din Fadlallah (d. 
1318). His General History (Jami at-tawarikh),
written in Persian, is probably the first genuine
universal history of Islam. The local historiography 
of cities and regions was also persistently culti- 
vated. It emphasized political and religious history 
but also included topographical description and 
antiquarian data. As might be expected, data on 
social, economic, and financial history are inciden- 
tal to most historical writing in Islam and, there- 
fore, comparatively scarce. Some annalistic his- 
tories give us a rather intimate view of medieval 
urban life with its crimes, suicides, recurrent infla- 
tion, and other social problems. 

Historians. For a very large and influential part 
of its production, Islamic historiography is indebted 
to scholars trained in the religious studies who 
earned their living by virtue of this training but 
whose literary activities included the writing of his- 
tories. For example, al-Bukhari (d. 870), the author 
of the most authoritative collection of Prophetic 
traditions, wrote several collections of terse biogra- 
phies of religious authorities. He entitled each of 
these works History, thus establishing himself in 
Muslim consciousness as a historian. From the 
eleventh century on, a great many historians were 
religious scholars who held positions in the judici- 
ary, in the civilian branch of the political adminis- 
tration, and in the madrasahs (mosque schools), 
all of them depending on the religious establish- 
ment for their livelihood. 

Court historians. In Islamic countries, wher- 
ever ambitious and powerful rulers were to be 
found, history was the “royal science” par excel- 
lence, which kings, courtiers, and wazirs, as well as 
the tutors of princes, were expected to master. 
Caliphs and sultans often ordered officials to set 
down the history of their reign or dynasty. It may 
have happened even more frequently that historical 
works were written for presentation to the ruler in 
the hope of a reward or preferment. The profes- 
sional court historian became an established fixture 
at certain courts. For example, the courts of the 
later dynasties in Persia and the Ottoman Empire 
provided a particularly favorable atmosphere for 
historical studies. However, it would hardly be 
correct to speak of a clearly defined Muslim court 
historiography. Regardless of the position and the 
motivation of these writers, their works as a rule 
remained individualistic efforts. The personalities 
of their authors can be glimpsed even through the 
curtain of flattery and obsequious verbosity that 
was often extremely dense. 

The term “court historian” would also hardly be 
the appropriate designation for government officials 
who, because of their proximity to the seats of 
power where they were able to witness important
events in the making, wrote historical works. Their
number was not inconsiderable, and to them we 
owe some of the best historical works produced in 
Islam. Late tenth-century historians such as Mis- 
kawayh (d. 1030) and Hilal as-Sabi’ (d. 1036) 
were government officials who not only possessed 
an inside knowledge of political affairs (the family 
of as-Sabi’ had served in government for genera- 
tions) but were also accomplished writers and well 
versed in philosophy and the secular sciences. All of 
this is reflected in the quality of their works and in 
their historical insight. The work of ‘Imad-ad-din 
al-Isfahani (d. 1201) is an outstanding example 
of historical memoirs written by a high official who 
consulted documents and diaries. In particular, his 
Barq ash-Sha'mi deserves high praise as represent- 
ing the apogee of diplomatic historiography in 
Islam. 

Amateur historians. ‘Imad al-Isfahani and 
rulers who wrote historical works and memoirs may 
be called amateur historians. Most of the numerous 
genealogical works were written privately by men 
who frequently were members of the only true Mus- 
lim nobility, the descendants of ‘Ali. In general, 
quite a few historical works were probably pro- 
duced as labors of love, out of reverence for the 
importance of history and in recognition of the 
need for preserving an adequate historical record. 

Professional historians. In exceptional cases, 
it appears that an author might be paid for writing 
historical works on the basis of copies sold through 
booksellers; but in any case, we may safely surmise 
that this was not sufficient to live on. The profes- 
sional historian, in the modern sense, could hardly 
have existed in the medieval Muslim environment. 
History was not included in the curriculum of the 
madrasahs, although lectures on historical topics 
were occasionally given by professors employed and 
paid for teaching other subjects. However, there 
were men who spent their lives producing histor- 
ical works and who came to consider themselves, 
and to be considered by Muslim tradition, as his- 
torians, for example, al-Mas'ûdî and, during the 
Mamluk period in Egypt, when interest in history 
ran high, al-Maqrizi (d. 1442) and many others 
like him. 

Purpose and methodology of Muslim historiog- 
raphy. Muslim historians were in the habit of 
introducing their works with statements concern- 
ing the purpose of historical writing (for a number 
of these statements collected by as-Sakhawi, see 
Rosenthal 1952, pp. 219 ff.). The ideas expressed 
in this connection soon became standardized and 
did not possess any individual ring. This fact itself 
is significant in that it shows the general and unquestioned
acceptance of their validity. These statements
expressed the belief that history is useful; it
teaches by both negative and positive examples, 
that is, it teaches everybody how to handle his own 
affairs in this fleeting world and, most importantly, 
it teaches political leaders how to govern properly. 
Furthermore, history is entertaining; it provides 
amusing, yet thoughtful, relaxation from more ex- 
acting intellectual tasks. And history is instructive 
and edifying as a handmaiden of religion, proving 
the truth of Islam and the correctness of the view 
of the world expounded by it. On the surface, this 
last point may seem to have been raised merely in 
order to provide the greatly needed excuse for the 
waste of time assumed to be involved in any occu- 
pation with secular subjects. In reality, it touches 
on the very meaning of historiography as a part of 
Muslim civilization. Only as an integral component 
of the religiously determined structure of the world 
and society could the study of history be meaning- 
ful in Islam. 

The idea of history as a powerful weapon in 
political and ideological struggles does not seem to 
have been entertained openly or commonly by Mus- 
lim historians. On occasion, they were conscious 
of the fact that their work was used to serve the 
purpose of exalting an individual or of fortifying 
the political aspirations of a ruling house. In some 
instances, minutely detailed modern research has 
been able to uncover concealed political tendencies 
or a purposeful manipulation of the evidence in his- 
torical writings. However, Muslim historians con- 
sidered themselves as the custodians and transmit- 
ters of facts that could not be altered and neither 
required nor admitted of interpretation. The his- 
torian’s proper activities were restricted to report- 
ing, shortening, collecting different recensions of, 
and, perhaps, retelling the information provided by 
the available sources. 

This view largely determined the method of his- 
torical research. The historian’s main task was set- 
ting down what had actually happened, and his 
main problem was ascertaining the truth of his in- 
formation, whether it came to him orally or through 
written sources. Truth was understood, above all, 
as verification of the presumption that an item of 
information was derived from someone in a position
to know. All history being in a sense contemporary
history, personal observation was the real
basis of historical knowledge and the best assur- 
ance of historical truth. For the rest, the elaborate 
system developed by scholars of the science of 
Prophetic traditions, a system for ascertaining the 
genuineness of these traditions, came to be considered
as applicable also to historical research. Of
necessity, written histories were granted evidential
authority. Archival research and the study of in- 
scriptions, coins, and similar historical evidence 
were only sporadically practiced. A full discussion 
of the methodology of medieval Muslim historiog- 
raphy is preserved in a work written in Arabic by 
a Persian scholar, Muhammad b. Ibrahim al-Iji, 
who wrote in 1381-1382 and whose work is thus 
the oldest extensive treatise on the subject known 
so far. On a less theoretical level, comprehensive 
works on Islamic historiography, its methods, its 
problems, and its history were composed in Egypt 
in the following century by al-Kâfiyajî (d. 1474),
who wrote in 1463, and, following him, by as-Sakhawi
(d. 1497), who wrote in 1492.

Philosophy and sociology of history. The Islamic
historians’ own views regarding the meaning of 
history are implied in their works and spelled out 
in the methodological writings just mentioned. 
They were convinced that history was the chosen 
instrument of God for the gradual improvement of 
mankind and for man’s preparation for the final 
reckoning at the inevitable end of the world. The 
coming of Muhammad and Islam was viewed as 
the great turning point of world history, at which, 
for the first time, the purpose of history revealed 
itself clearly and history became a comprehensible 
reality. From then on, control of history's progress 
was within the reach of human beings, if—and 
that remained the great question—they followed 
the comprehensive plan for the good life laid down, 
for both the individual and society, in the religious 
injunctions of Islam. While there was no automatic 
sin-and-retribution mechanism operative in history, 
the rulers at least were judged by their compliance 
or noncompliance, according to the historian’s 
sources of information, with Islamic moral norms. 
There were, however, a few exceptional historians 
who, while not denying the validity of Islamic the- 
ology, tried to understand history as a purely hu- 
man social phenomenon. This approach ran coun- 
ter to the major premises of the Muslim world view 
and was, therefore, always slightly suspect. It was 
hinted at by Miskawayh when, in his large history
entitled The Experiences of the Nations, he denied 
that events caused by the interference of the super- 
natural in history, such as the events connected 
with the life of the Prophet, could provide a useful 
experience for students of history. Eventually, the 
northwest African Ibn Khaldûn (d. 1406), writing
in 1377, constructed a coherent system of the his- 
torical process in purely human terms and devoted 
to its exposition the famous Muqaddimah (“Intro- 
duction”) of his large universal history, Kitab al- 
‘Ibar. He explained human society as depending on 
material and psychological forces, which were de- 
scribed by him in detail; and he defined history in 
terms of a cyclic motion (with a slight, but con- 
tinuous, forward movement) of growth and decay 
within the various forms of human associations. 
[See the biography of IBN KHALDUN.] 

Contemporary Muslim historiography. The old 
forms of historical writing have persisted until re- 
cent times, especially in the more shielded regions 
of the Muslim world. At the time of the first true 
clash of Islam with the modern world, precipitated 
by Napoleon’s Egyptian campaign, it was still pos- 
sible for an important work of the annalistic type 
to be produced by a man of genuine historical per- 
ception, the Egyptian al-Jabarti (d. 1826?). Dur- 
ing the nineteenth century, there were a few Arabic 
translations of then popular Western historical 
works. The study of certain aspects of world history 
not directly affecting the Muslim countries began 
to attract some limited interest, especially on the 
educational level. To this day, it can fairly be said 
that the interest in all non-Islamic history has re- 
mained limited, and nothing of first-rate impor- 
tance appears to have been produced in the field. 
On the other hand, from the beginning of the twen- 
tieth century and increasingly accelerated by the 
political developments that took place in the wake 
of World War II, the meaning of history and the 
practice of historiography as it affects Muslim life 
has become a major concern everywhere in Muslim 
countries. Some have contended, for example, that 
the communal conflict between Muslims and Hin- 
dus preceding the partition of India was largely the 
result of the wrong teaching of history and, there- 
fore, amenable to redirection through the efforts of 
historians (see S. Nadvi, in Philips 1961, p. 493). 
Others have held that Muslim history is unable to 
give any guidance for the solution of present-day 
problems and remains better forgotten and disre- 
garded. 

Popular feeling tends to glorify the study of the 
great Muslim past as an inexhaustible source for 
the building of national morale and the strengthen- 
ing of nationalist aspirations. This use of history 
is fostered by writers devoting their literary talents 
to historical themes, writers such as Muhammad 
Husayn Haykal and Mahmūd ‘Abbās al-‘Aqqād. 
Motion pictures and, to a lesser degree, stage plays 
effectively exploit historical themes (see Landau 
1958, pp. 114 ff., 198 f.). More recently, numerous 
Muslim historians with Western training in schol- 
arship and methodology have begun to publish, in 
their various languages, serious and often impor- 
tant historical works as well as biographical, social, 
and economic studies on past Muslim history. 


Archival studies are getting underway—particu- 
larly in Turkey, where much of the preserved ma- 
terial is concentrated. The publication of medieval 
historical texts is being continued at an accelerated 
pace with the maintenance of, generally speaking, 
satisfactory standards in editing technique. 

The great pre-Islamic past of the Muslim coun- 
tries inspired different political ideologies and 
movements based on historical speculation. These 
had their greatest efflorescence between 1920 and 
1945. At present, the excavation, conservation, and 
study of the archeological remains of the pre-Islam- 
ic and the Islamic past are competently cultivated 
nearly everywhere. 


FRANZ ROSENTHAL 
[See also ISLAM.] 
V 
JAPANESE HISTORIOGRAPHY 

The Japanese historiographical genius has ex- 
tended less to the fashioning of grand systems of 
interpretation than to the assiduous working out 
of domestic history. Of all national histories outside 
the Western world, none has been revealed in such 
precise and abundant detail or in such a variety of 
interpretive forms as that of Japan. The work of 
Japanese historians, particularly in the last cen- 
tury, has been both prolific and comprehensive. 
And while the events of Japanese history have yet 
to find a prominent place in the main body of world 
historiography, the reverse condition, whereby Jap- 
anese history is treated in world context, has been 
carried forward in impressive fashion by the Jap- 
anese. Today every segment of Japanese history is 
served by the professional archivist, the scholarly 
monographer, and the interpretive synthesizer, 
alive to modern currents of historical philosophy 
and methodology. 

History is one of the most popular academic 
fields in Japan today. The standard Japanese Pub- 
lisher’s Annual for 1964 lists 1,249 single titles un- 
der history. Each of Japan’s more than 230 univer- 
sities has its department of history, often in more 
than one faculty. A recent handbook of historical 
research lists 75 libraries and archival repositories 
worthy of general note. The same handbook lists 
56 scholarly serials published by national historical 
associations and 40 by societies of primarily local 
importance. 

While Japan does not have a central national 
archives, the Historiographical Institute of Tokyo 
University serves somewhat in this capacity, and 
in addition, numerous private and government li- 
braries preserve or collect materials on a large 
scale. Printed archival collections are abundantly 
available. The Historiographical Institute alone has 
published over 350 weighty volumes of sources, 
while the standard reprinting of premodern Japa- 
nese histories runs to more than sixty volumes. 
The aspiring Japanese historian has at his com- 
mand the product of four generations of modern 
historical scholarship and a full reference shelf of 
encyclopedias and dictionaries on every major sub- 
ject from bibliography to religion. For stimulus to 
his thinking he may turn to volumes on the mean- 
ing of history by notable Japanese historians or to 
translations of standard works by Croce, Weber, 
Marx, Toynbee, Freud, or Parsons. 
Japanese use of history 


History is more than a popular field well served 
by its specialists; history is, and has been, impor- 
tant to the Japanese people in their persistent search 
for their own identity and for a sense of order in 
the world about them. Japanese historians have 
been eclectic in the theories to which they have sub- 
scribed but remarkably consistent in the objectives 
that they have pursued. Since the eighth century, 
Japanese historians have looked to history to ex- 
plain the political and moral order. This search has 
been served by a variety of world views and historio- 
graphical traditions just as it has had to respond to 
the constantly shifting conditions of Japan’s domes- 
tic politics and world position. The earliest and 
most original view of history adopted by the Japa- 
nese grew out of an age when Japan existed in com- 
parative isolation from China and the dominant 
culture of the continent. It conceived of a world 
with Japan at the center, the proud inheritor of an 
ideal order built around the imperial house and 
protected by the native (Shinto) deities. This world 
view was kept alive into modern times as the 
Japanese clung to various tokens of their imagined 
cultural or racial superiority. But Japanese ethno- 
centrism was seldom free of competition from 
other, more widely based systems, and by the sixth 
century both Confucianism and Buddhism were 
challenging the claims of Shinto. 

By medieval times, while not openly admitting 
the moral superiority of China or India, Japanese 
historians had taken to explaining causation in 
political affairs by reference to Confucian concep- 
tions of proper or improper conduct or to Buddhist 
interpretations of retribution for good and evil. 
Japan’s confrontation with Western civilization 
was historiographically even more traumatic, for 
it not only presented Japan with a multiplicity of 
theories of historical explanation, but also imposed 
upon her a view of the world order that placed 
the West at the center of the human drama and 
made the Judaeo-Christian tradition the driving 
force in human progress. 

Japanese historical writing, reacting to these 
several pressures, has tended to be assimilative in 
concept and methodology. In strictly technical 
terms it has demonstrated a growing sophistica- 
tion in its capacity to deal with basic facts and 
causal phenomena. And this cumulative quality of 
Japanese historiography—its constant increase in 
historical detachment and critical judgment and 
its steady improvement in basic techniques—is 
one of its most impressive over-all features. On the 
other hand, the search for order and status has 
proved continually elusive. The questions asked by 
every Japanese—who are we? what distinguishes 
us as a people and culture? what is our place in 
the world order?—remain a source of uncertainty 
even today; and it is this uncertainty that has kept 
Japanese historiography oscillating between the 
extremes of nationalism and universalism. 

The Japanese have never been ones to accept a 
secondary position in the world gracefully. His- 
toriography in modern Japan has kept pace with 
the national desire to achieve world prominence. 
History and nation, in fact, have been closely 
linked. For history has been drawn into the service 
of the state, either to provide the people with a 
crucial sense of continuity with their traditional 
values or as a means of justifying the revolutionary 
changes endorsed by a government intent upon 
modern reform. Throughout most of the last cen- 
tury, history has seemed inevitably to teach the 
lesson of Japan’s inferiority to the more enlightened 
peoples of the West. Only once did the Japanese 
people appear to have it in their power to turn his- 
tory in their favor so that their private view of the 
world could be extended into a universal vision. 
During the era of continental expansion, the march 
of Japanese armies seemed on the verge of making 
truth out of the propagandist’s claim that the goal 
of world history was to draw mankind under the 
benevolent rule of the Japanese emperor. The dis- 
crediting of that claim gave back to the Japanese 
historian the task of finding a realistic position 
for Japanese history in an objectively conceived 
world environment. The task has not been easy. 


Early historical writing 

Japan’s earliest extant histories, the Kojiki 
(A.D. 712; “Record of Ancient Matters”) and the 
Nihon shoki (A.D. 720; “Chronicle of Japan”), were 
both products of the desire of the Japanese ruling 
house to “clarify the political order” and to pro- 
duce a national chronicle comparable to those that 
added luster to the dynasties of China. The Kojiki 
is a narrative of the Japanese people from the “age 
of the gods” through the establishment of the Ya- 
mato hegemony to the end of the reign of Empress 
Suiko in A.D. 628. The purpose of the work is to 
display the background of the Yamato house and to 
document the dependent status of various noble 
lines. While the sources of the Kojiki are obscure, 
and hence its historiographical conceptions diffi- 
cult to date, it is safe to assume that it exemplifies 
the earliest recorded views of the Japanese people 
regarding themselves and their past. The work is 
notable for its matter-of-fact treatment of super- 
natural events, particularly its uncomplicated han- 
dling of the relationship between human society 
and the deities. The Kojiki contains no generalized 
myths of creation and no culture heroes. Nor does 
it depict the Japanese gods as active agents in the 
lives of mortals once the world of man was set 
down upon the Japanese islands. Rather, the story 
of creation begins with the first male and female 
deities, proceeds through generations of their off- 
spring, and then by genealogical descent enters 
the age of man in Japan. In the human political 
world, power and status derived simply from the 
circumstances of lineal descent from particular 
ancestral deities. The imperial family established 
its claim to sovereignty by virtue of direct descent 
from Amaterasu, the Sun Goddess. 

Because of its limited circulation, the Kojiki had 
little immediate influence on Japanese historians. 
Not until the eighteenth century, when national- 
istic scholars found in its archaic contents the 
ingredients of an idealized society based on “pure 
Japanese” values, did it become the revered source 
of the Japanese view of history. For over a mil- 
lennium, therefore, it was the Nihon shoki that 
had the reputation of being Japan’s first history. 
The Nihon shoki had the advantage of being 
written in Chinese and being more closely based 
on Chinese models than was the Kojiki; it sought, 
in other words, to place Japanese history into a 
world that acknowledged the existence of China 
as the source of a Confucian world view, although 
it sought to maintain Japan's independent and 
competitive position in that world. More histori- 
ographically self-conscious than the writers of the 
Kojiki, the Nihon shoki’s authors attempted to 
assign dates to the amorphous events associated 
with the early generations of the imperial house 
and thus establish a chronology comparable to that 
of China. It was from this effort, using the Chinese 
theory of “great cycles,” that the authors arrived at 
the controversial date 660 B.C. for the ascension 
of Jimmu, “the first emperor,” a date that modern 
historians more properly place in the third or fourth 
century A.D. 

The Nihon shoki and five succeeding official 
histories that chronicled the events of the imperial 
court down to 887 are known together as the 
Rikkokushi (“Six National Histories”). Although 
they constitute an attempt of the Japanese to write 
history in the Chinese official manner, they differ 
considerably from their models. The writer of his- 
tory met in Japan conditions very different from 
those in China, for the Japanese dynasty had al- 
ready established itself “in perpetuity,” and the 
historian could have very little to say about its 
right to rule. As a consequence, the Japanese never 
fully adopted the premises of the Confucian moral 
order which made political power contingent on 
virtue but rather found ways of equating their 
own highly aristocratic social hierarchy to that of 
China by the assumption that the moral right to 
sovereignty followed the line of hereditary succes- 
sion. In fact, as the Japanese imperial line con- 
tinued unbroken for century after century, the 
Japanese began to turn the Confucian order to 
their advantage and to claim superiority in matters 
of government over the Chinese, who were con- 
stantly rebelling against their emperors. 

The writing of official history lost its meaning in 
Japan once the imperial bureaucracy gave way to 
the direct rule of aristocratic houses and the em- 
peror was pushed above politics to serve as a sacred 
legitimizer. By the tenth century the compilation 
of official annals had given way to the writing of 
private histories. The monogatari (narratives) 
and kagami (mirrors) exemplify a style of his- 
tory that was as distinctive a product of the aristo- 
cratic society of the Heian court as the literary 
masterpiece, the “Tale of Genji.” Written in Japa- 
nese, not Chinese, the new histories combined the 
intimacy of single authorship with the immediacy 
of narrative writing. The most famous of these, 
the Okagami (“Great Mirror”), combined a syn- 
thesis of Japanese history up to the eleventh cen- 
tury with an explanation of the rise of the Fujiwara 
family, which then dominated the court. 


The Middle Ages 

By the twelfth century Japanese historical works 
were showing the influence of Buddhist concepts 
of karma and salvation. The immensely popular 
epic history of the war between the Minamoto and 
Taira, Heike monogatari, was written and re- 
cited chiefly for its didactic message. The writing 
of history generally passed into the hands of 
priests, who relied on Buddhist explanations for the 
rise and fall of family or individual fortunes. Their 
works were characteristically suffused with a sense 
of compassion for human suffering, sadness over 
the transience of life, and concern over the immi- 
nence of the age of decay. Works written from the 
point of view of the Buddhist establishment, there- 
fore, looked at Japan from a viewpoint that played 
down worldly political orders in the face of the 
universality of the human condition. Visible power 
and circumstance were often depicted only to re- 
veal the inevitability of decline. Yet even among the 
Buddhists, a voice was raised which claimed Japan 
to be the center of the Buddhist world. Nichiren 
(1222-1282) militantly proclaimed Japan to be 
the land destined to bring the Buddhist faith to 
perfection. 

In terms of domestic history, the rise of the mili- 
tary aristocracy and the creation of the shogunate 
(headquarters of the military hegemony) in 1192 
added new dimensions to the historians’ task of 
interpretation. Historians now had to explain not 
only the virtue of the imperial house but also 
the reason for the decline of the civil aristocracy 
and the ascendancy of military houses. Two note- 
worthy products of the Middle Ages in Japan deal 
with these questions at the same time that they 
show distinct advances in historiographical tech- 
nique. Gukanshō (“Miscellany of Ignorant Views”), 
written by the priest Fujiwara Jien (1155-1225), 
explains the balance between civil and military 
ministers as dependent on the quality of service 
they provided and on the requirements of the times. 
Jien, to some extent, therefore, applied Confucian 
concepts of rulership to the behavior of “imperial 
advisors.” But the Gukanshō is particularly noted 
for being the first example of “purposeful” his- 
torical writing in which the author was able not 
merely to record but to survey, interpret, and ex- 
plain while exercising personal detachment. 

Jinnō shōtōki (“Records of the True Descent 
of the Divine Emperors”) by General Kitabatake 
Chikafusa (1291-1354) was written to inform the 
scion of a displaced branch of the imperial family 
of the true state of political affairs into which he 
was born. Its great importance to later generations 
lay in its emotional revival of the theme of Japan’s 
superiority as a nation because of the unique vir- 
tue of the unbroken imperial line. 


The Tokugawa period 


Japan’s great age of historical writing prior to 
modern times came in the peaceful years of the 
Tokugawa period (1600-1868). Up to this point 
the writing of history had been the province of 
relatively few courtiers and priests. With the cessa- 
tion of civil war in 1600, the Tokugawa house and 
the territorial lords (the daimyo) gave official en- 
couragement to the pursuit of learning, and history 
became a major concern of the entire samurai 
class. The multicentered political order, which in- 
cluded a regenerated imperial court, a shogun with 
unmatched powers, and stable daimyo administra- 
tions in the provinces, provided incentive for the 
compilation of numerous official histories, while 
the cultivation of intellectualism led many a private 
scholar into the fields of Japanese history and 
literature. 

At the same time, profound changes were affect- 
ing the Japanese climate of opinion. The rigorous 
pursuit of Confucian philosophy among the samu- 
rai, not simply as an esoteric pastime, but for 
practical use in political affairs and the betterment 
of education, improved the Japanese scholar's 
mastery of historical technique. More important, 
it rejected Buddhist mysticism for a more ration- 
alistic view of history. In addition, a new interest 
in native Japanese studies (e.g., study of the 
Kojiki), which refocused interest upon the imperial 
house, gave the Japanese the ingredients of a na- 
tionalistic revival in scholarship and the confidence 
to reject in time even the Confucian values that 
had so long dominated their thinking. 

Chief among the historical works of this age 
were certain products of official patronage and 
committee authorship. Heading the list of “house 
histories” was the voluminous Tokugawa jikki 
(“Veritable Annals of the Tokugawa House”), the 
first part of which was completed in 516 chapters 
between 1809 and 1849. The primary effort of 
the Tokugawa house to write a national history 
so as to legitimize its rule was the Honchō tsugan 
(“Comprehensive Mirror of Japan”), completed in 
1670 by the Hayashi family of shogunal Confucian 
advisors. The text was in Chinese, and, as the title 
indicates, the form was modeled after Ssu-ma 
Kuang’s famous Tzu-chih t’ung-chien (1084; “Com- 
prehensive Mirror for Government”). 

Destined to become more influential than the 
Honchō tsugan was the Dai Nihon shi (“History of 
Japan”), compiled under the auspices of the Mito 
branch of the Tokugawa house. It was the first 
(and only) successful attempt of the Japanese to 
write according to the full specifications of the 
Chinese dynastic history style and required the 
efforts of a large compilation bureau from 1657 
to 1906, when the essays were finally complete. 
But the annals had been made public in 1720, 
and from that time the work had acquired a repu- 
tation for its sentiments of loyalty to the imperial 
house. 

Historiographically more important than the 
works of group authorship, however, were the 
writings of two private historians. Arai Hakuseki 
(1657-1725), a scholar-statesman in the Confu- 
cian sense, was a writer of wide versatility. His 
Dokushi yoron (“Commentaries on History”) con- 
tained an entirely original system of periodization 
based on the shifting locus of political power. The 
Koshi tsū (“Survey of Ancient History”), in which 
he called for philological studies to penetrate the 
Japanese classics and stated that the kami (gods) 
were humans and understandable in rational 
terms, exemplified both the high moral sense of 
the Confucianist and the growing rationalism with 
which many Japanese scholars were able to ap- 
proach historical causation. 

Rai Sanyō (1780-1832), less competent as a 
historian, exemplified the new interest in national 
studies. His Nihon gaishi (“Unofficial History of 
Japan”) continued in the tradition of Kitabatake’s 
emperor-centered historical narrative. More ac- 
cessible than the Mito Dai Nihon shi, it was widely 
read and everywhere stirred up sentiments of 
loyalty toward the emperor and pride in the unique- 
ness of Japan’s national structure. 

Such works of compilation and interpretation 
could hardly have been written had it not been for 
the diligent labor of a large body of archivists and 
antiquarians during the Tokugawa period. This 
labor, in fact, laid the foundations of Japan's 
modern archival repositories. The contemporary 
Cabinet Library, for example, is heavily dependent 
upon collections begun under Tokugawa patron- 
age. The program of the Historiographical Institute 
today is also a continuation of the remarkable 
work of the blind bibliographer Hanawa Hokiichi 
(1746-1821) and his son, who scoured the coun- 
try collecting and collating historical texts, which 
they published according to a subject classification 
under the title Gunsho ruijū (“Classified Texts”). 

These highlights of the historiographical story 
of the Tokugawa period constitute but a small 
fraction of the total activity of the age. By the 
eighteenth century, historical studies were no 
longer limited to a narrow court circle or to 
individuals patronized by the central military au- 
thorities. Nearly all of the more than 250 daimyo 
promoted the writing of house histories or the 
collection of local documents. The spread of learn- 
ing among diverse groups (even outside the ruling 
class) encouraged variety and specialization. An- 
tiquarians wrote on ancient usages or court cere- 
monies, philologists studied the meaning of archaic 
Japanese words, and bibliographers began the task 
of examining critically Japan’s heritage in the field 
of letters. Adding impetus to the growth of scholar- 
ship and the diffusion of ideas was the remarkable 
expansion of the printing industry, which put 
much of the output of Tokugawa writers on the 
market and into the hands of other scholars. The 
growth of private and official academies, particu- 
larly in the great cities of Edo, Osaka, Kyoto, and 
Nagasaki, led to a lively interchange of ideas. 
By the middle of the nineteenth century, when 
Japan was abruptly thrust into the company of 
Western powers and her intellectuals were brought 
face to face with the latest products of science and 
scholarship from the West, Japan had at her dis- 
posal a mature tradition of historical scholarship, 
based on sound, though antiquated, principles of 
Chinese methodology and suffused with a growing 
national self-consciousness. It required only the 
touch of Western influence to set in motion a com- 
plex historiographical revolution. On the one hand, 
the Japanese were quick to absorb the methods and 
philosophies of history offered by the West. On 
the other hand, they comprehended immediately 
that the West posed a vital threat to their intel- 
lectual security and their historical importance as 
a nation. In the world views of the West, Japan 
had no recognized place other than among the 
“uncivilized” peoples. 


The Meiji period 

The basic revolution in historiography took 
Japan roughly thirty years, the same period of time 
required to lay the foundation for the modern 
Japanese state. Japan's first task was the fairly 
straightforward one of recapturing in modern sci- 
entific form the facts of Japan’s past. For several 
years the government dominated this work of basic 
compilation. Interest in history ranked high among 
the Meiji leaders, for they sought justification for 
the new regime as well as knowledge about past 
laws and administrative practices to serve as a 
basis for new legislation. In 1869 an Office for 
Collection of Historical Materials and Compilation 
of National History was established by government 
order. Its director, Shigeno Yasutsugu, was im- 
mediately caught in the dilemma of objective 
versus propagandistic compilation. The projected 
national history was eventually abandoned after 
attack from Shintoist scholars, who claimed the 
work gave insufficient support to the emperor. The 
collection of materials, however, has been con- 
tinued to the present. Meanwhile most of the new 
departments of government were busy collecting 
records and compiling documentary histories. The 
most ambitious and currently useful of these tran- 
sitional works is Koji ruien (“Encyclopedia of An- 
cient Matters”), a monumental encyclopedia in 
the Chinese manner compiled by the Department 
of Shrines between 1879 and 1913. 

The conflict over interpretation that divided the 
official historiographers of the early Meiji era re- 
flected the deep problems that Japan faced in the 
realm of historical ideology. Ranged on one side 
were the successors to the Shinto revivalist school 
of the Tokugawa period, who wished to use his- 
tory to stimulate patriotism and to keep alive the 
memory of a noble past with which they could 
proudly confront the rest of the world. For them 
Rai Sanyō’s history remained a model, and between 
1876 and 1884 five supplements to this work were 
printed. Yet just as in the political arena, the 
traditionalists were attacked by the advocates of 
“progress” and intellectual freedom. Private his- 
torians with more flexible and cosmopolitan views 
were at work absorbing new Western concepts and 
rethinking Japan’s historical circumstances. Many 
of them, in fact, saw Japan as half-civilized in 
contrast to the “enlightened” peoples of the West. 
Soon translations of such contemporary laissez- 
faire writers as Mill and Spencer began to circulate 
in Japan, and the broadly interpretative works of 
men like Henry Thomas Buckle and François 
Guizot showed the way to the writing of cultural 
history. Fukuzawa Yukichi’s Bummei ron no gai- 
ryaku (1875; “A Short Account of the Theory of 
Civilization”) and Taguchi Ukichi’s Nihon kaika 
shōshi (1877-1882; “Short History of Japanese 
Civilization”), based on these models, opened a 
new era of Japanese historiography that would 
shift the attention of the historian away from the 
purely political to encompass the intellectual, ar- 
tistic, and religious dimensions of Japanese culture 
and would begin the task of finding a place for 
Japan in a world view made larger by the addition 
of the West. 

These two streams of historiography, official- 
nationalistic and private-international, carried on 
into the 1890s, but not until the government had 
abandoned direct interference in the writing of 
history behind an avowed insistence on objectivity 
and the advocates of patriotic history had turned 
their attention to the more impressionable field 
of elementary education. In the realm of scholar- 
ship it was expected that history would become 
an objective science. And for this purpose a Ger- 
man scholar, Ludwig Riess (1861-1928), had 
been invited to establish a chair of history at Tokyo 
University. Under Riess the German Geschichts- 
wissenschaft school was accepted as the basis for 
historical training at Tokyo. In 1895 the govern- 
ment attached its Historical Bureau to Tokyo Uni- 
versity with the aim of compiling the Dai Nippon 
shiryō (“Historical Materials of Japan”) along the 
lines of the German Monumenta Germaniae his- 
torica. Thus the combination of the University of 
Tokyo professorships and the Historiographical In- 
stitute became the hard core of Japan’s modern 
historical profession, from which emanated the 
dominant academic orthodoxy. 


Beginnings of modern historiography 

Japanese historiography came of age as a mod- 
ern discipline during the forty years after 1890. 
This era was marked by conspicuous success along 
four main lines: (1) the perfection of a modern 
historical methodology, (2) the writing of special- 
ized monographic studies on particular institutions 
and aspects of Japanese civilization, (3) the prepa- 
ration of general historical surveys, and (4) the 
publication of reference works and source mate- 
rials. The outstanding historiographic work of the 
period was undoubtedly Kuroita Katsumi’s Kokushi 
no kenkyū (1908; “Study of Japanese History”). 
Kuroita, of Tokyo University and schooled in 
the German historical-scientific tradition, suc- 
ceeded in formalizing the periodization of Japanese 
history and in supplying a definitive critique of the 
standard sources in the field of political history. 
Simultaneously, Japanese historians began the 
task of dividing their history both horizontally by 
periods and vertically by topics into numerous 
specialties. Monographic studies of political history, 
foreign relations, legal institutions, economic his- 
tory, and the history of art, literature, and religion 
were produced in abundance. By the 1920s the 
ground had been prepared for the appearance of 
new and more satisfactory historical surveys, such 
as Nihon bunkashi (1922; “Cultural History of 
Japan,” 12 volumes) and Sōgō Nihon shi taikei 
(1926; “Synthetic Survey of Japanese History,” 
20 volumes), both works of multiple authorship. 

The mid-1920s stand out as a quiet and pro- 
ductive interlude in Japanese historiography, when 
few fundamental conflicts of interpretation dis- 
turbed the academic calm. Although “academic” 
and “cultural” historians were separated by obvious 
differences in approach or subject matter, they 
shared basic premises about the function and aim 
of history. Both believed essentially in “scientific” 
methodology, and both were concerned with the 
task of “discovering” Japan’s past in all its political 
subtleties and cultural richness. They saw Japan
as having successfully joined the ranks of the 
modern powers, so that Japanese history could be 
viewed as simply another tributary flowing into the 
mainstream of modern progress. The Japanese 
were content to study their past descriptively as a 
subject sufficient to itself and worthy of pride. 

But this atmosphere was not to last long. First 
of all, Japanese historians began of their own 
accord to look beyond their own history to dis-
cover relationships with other histories and to
subject their history to new comparative judg-
ments. Nishida Naojirō’s brilliant analysis of Japa-
nese culture saw in Japan its “Gothic art,” and its
“rise of a commercial spirit.” Tsuda Sōkichi broke
through the taboos that still obscured Japan’s early
history to reveal Japan’s cultural growth as an
unfolding of human qualities shared by “all
people.” Honjō Eijirō, Ono Takeo, and Tsuchiya
Takao began the economic interpretation of Japa- 
nese history. 

The new currents of historical inquiry were 
sufficiently strong to bring into being a number 
of new societies dedicated to specific types of his- 
tory. Up to then the chief association of Japanese 
historians had been the Shigakkai (Historical So- 
ciety of Japan), founded at Tokyo University in 
1889. Among the new associations, the Keizaishi 
Kenkyūkai (Society for the Study of Economic
History) was organized at Kyoto University in 
1929, and the Shakai Keizaishi Gakkai (Social 
and Economic History Society) was organized in 
Tokyo in 1931. In 1933 a group of young histo- 
rians of the Tokyo area, organizing themselves into 
the Rekishigaku Kenkyūkai (Historical Science 
Society), began the move toward “progressive” his-
tory that was to lead increasingly in the Marxian
direction. 

By the late 1920s history had again become a 
pivotal subject for a people who had begun to 
question the course their country was pursuing 
both at home and abroad. Social and political 
problems that followed World War I and were ac-
centuated by the depression now agitated the aca-
demic world. Intellectuals saw a growing discrep- 
ancy between “government” and “people,” between 
the way things were and the way they ought to be. 
The fading of the democratic ideal, not only in 
Japan but in much of Europe, left the Japanese 
open to the competitive ideals of socialism and 
national socialism. 

During the 1930s, as Japan began her expansion 
on the continent and drifted toward war in the 
Pacific, her scholarly world was torn increasingly 
between the ideas of left and right. Marxist his- 
torians rewrote Japanese history as a story of na- 
tional development from primitive to socialist 
society. They hotly debated whether or not the 
Meiji Restoration was a bourgeois revolution and 
criticized the government of Japan as capitalist 
and imperialist. Perhaps the most significant con- 
tribution of the Marxist school in these years was 
the series entitled Nihon shihonshugi hattatsu shi 
kōza (1932; “Essays on the History of the Develop-
ment of Capitalism in Japan”). Marxists eventually
came under heavy attack. After the middle 1930s 
open expression of their views declined, although 
a strong underground commitment to them con- 
tinued among Japanese intellectuals and much of 
the Marxist vocabulary passed into common usage. 

By the mid-1930s the tempo of nationalistic 
writing had accelerated. Incited by government
and public opinion, historians lent their energies 
to propagandistic purposes and the rewriting of 
national history along messianic lines. Although 
the higher levels of scholarship were able to main- 
tain a precarious objectivity, by the time of the 
war Japan’s youth was uniformly being taught a 
brand of history that stressed the old myths of 
uniqueness and invincibility and claimed as the 
goal of history the ultimate conquest of the world 
by the Japanese. 


Postwar historiography 

After disastrous defeat in war, Japan embarked 
upon the slow process of economic and spiritual 
rehabilitation. The very foundations of her history 
had been challenged: much of her heroic past had 
been discredited, and the seemingly successful 
attempt at modernization had ended in failure. 
Where did Japan now stand in the world? And 
what of Japan’s past was now worthy of remem- 
brance? These questions were put to a generation 
of scholars strongly disillusioned by their wartime 
experience. The purging of old-guard scholars 
brought younger men in large numbers into the 
universities, while the freedom of expression that 
the Occupation authorities permitted encouraged a 
vigorous iconoclasm among all intellectuals. Many 
of the new scholars were men of strong Marxist 
conviction whose desire to express themselves had 
been increased by the long years of silence imposed 
upon them during the war. 

Few subjects were as controversial as history 
during the first fifteen years after World War II.
For history lay both at the heart of the revised 
system of “education for democracy” and of the 
Japanese attempt to understand a world divided 
between two great power blocs and two ideologies. 
Battles raged over the rewriting of elementary 
school history textbooks, and lines were drawn be-
tween the Marxist-dominated Rekishigaku Kenkyū-
kai and the still academically oriented Shigakkai.

In the first years after the war, it seemed as 
though Japanese historians had uniformly and pre- 
cipitously abandoned the extreme of nationalist 
history only to take up another extreme of socialist 
history in which nation and emperor were both
totally rejected. The theme of the common man
in his struggle against feudalism, absolutism, fas- 
cism, and capitalism was played with dramatic 
eloquence. 

Yet by the 1950s, as political and economic con- 
ditions began to settle down, historians also began 
to relax their extreme ideological orientation. In 
volume of research and publication, historians of
the 1950s had begun to surpass their prewar out- 
put; new encyclopedias, new survey histories, new 
works of basic research were published, so that 
by the end of the decade the entire literature of 
Japanese history had practically renewed itself. 
New documentary series that penetrated even more 
deeply into the details of government or economic 
activity encouraged research at new levels of re- 
finement and precision. The creation of new uni- 
versities and research centers served to break up 
the prewar factions of historians and increase the 
variety of historical writing. 

In the face of such diversity, any simple charac- 
terization of the Japanese historiographical scene 
today is hardly possible. Most of the vast product 
of the Japanese historian is of a strictly empirical 
nature and is little affected by problems of bias 
or interpretation. But there are underlying issues 
of great consequence. Japanese historians are still 
troubled by questions of fundamental philosophy. 
Marxism continues to provide the most widely ac- 
cepted historical world view. Recent experimenta- 
tion with various social science methods has offered 
certain alternatives to Marxism, although nothing 
like a “complete system.” There are followers of 
Max Weber and of special techniques of statistical 
data gathering or group research, but these are 
chiefly techniques, and the Japanese must still 
grapple with the question of where history is tak- 
ing them and where Japan fits into the ultimate 
scheme of things. Increasingly, however, it does 
appear that the Japanese are gaining the assurance 
to forgo their ideological preconceptions and en- 
gage in a less self-conscious approach to their own 
history. 


JOHN WHITNEY HALL


[Directly related are the entries BUDDHISM and JAPA-
NESE SOCIETY.]
VI
SOUTH AND SOUTHEAST ASIAN HISTORIOGRAPHY 


Historiography in south and southeast Asia, as 
elsewhere, developed in close relation with the 
sources of literacy. As literacy came from several 
sources and at different times, so was history writ- 
ing similarly varied and differentiated. Most impor- 
tant of such sources before the twentieth century, 
however, were the many religions which, for at 
least six centuries, united some parts of the region 
and divided others. Five of the areas representing 
different religions and literate experiences may be 
distinguished for a survey of historiographical tra- 
ditions. 

There has been little in common between popu-
lar Hinduism in India and the Theravada Buddhism
of Thailand, Cambodia, Burma, and Ceylon. These 
two areas again have little in common with a third 
area, the extensive sphere of Islam, with large cen- 
ters in Pakistan and India and in Muslim societies 
throughout Malaysia, Indonesia, and the southern 
Philippines. The fourth and the fifth areas are 
somewhat peripheral to the rest, that of Vietnam, 
where a variety of Chinese religion and culture still 
survives, and that of the largely Christianized so- 
ciety of the Philippines. In each of these areas a 
different attitude toward the idea of history may be 
discerned. 


Traditional historiography 


South Asia. The earliest literate religion intro- 
duced to the preliterate animistic communities of 
the region was that of the Vedas. The religion was 
restricted primarily to India and produced its ear- 
liest annalistic literature in the Puranas. The pu- 
ranic tradition was further extended and other dy- 
nastic and regional annals were compiled, but they 
remained “marked by obscurity, exaggeration, pau- 
city of authentic data and neglect of topography 
and chronology” (Ghoshal 1961, p. 2) down to the 
Muslim invasions of the twelfth century. The only 
exception to this Hindu heritage was a work com- 
pleted in the middle of the twelfth century, the 
Rajatarangini (“Kashmir Chronicle”) by Kalhana. 
Of Kalhana, Majumdar (1961a, p. 21) says that he 
“held that the first requisite of a true historian was 
to keep a detached mind, free from bias and preju- 
dices (and), like a judge, must discard love and 
hatred while recounting the events of the past . . .”;
but Basham (1961a, p. 61) maintains that Kalhana 
was more concerned “to teach moral lessons.” 

The great epics, the Mahābhārata and the 
Rāmāyaņa, had considerable influence and were 
used as sources by an independent historical tradi- 
tion. While they did not lead to the growth of his- 
tory writing, they were remarkable repositories of 
stories which were as real and meaningful to most 
of the peoples of south and southeast Asia as the 
Homeric epics were to the peoples of Europe. For 
centuries, they were the nearest thing to history 
and may be said to have performed the role of 
history among the peoples who transmitted the 
stories.

In addition, the two epics, in conjunction with 
the stories of the Pañcatantra and the Buddhist
Jataka stories, also provided a strong anecdotal and 
narrative tradition for the Buddhist genealogies 
and chronicles of Ceylon and mainland southeast 
Asia. Such Buddhist stories and chronicles ranged 
from simple eulogies to types of hagiography and 
were used for moral and spiritual education in the
monasteries and at the courts. It took several cen- 
turies before the chronicles advanced from recount- 
ing the philosophical progress of Gautama Buddha 
and his disciples to the conscious record of contem- 
porary political and religious events. Eventually, in 
Ceylon, where the vamsa tradition (notably, the 
chronicles known as the Dipavamsa, the Maha-
vamsa, and the Cūlavamsa) produced several court-
sponsored chronicles compiled by learned monks,
a kind of history writing appeared. The works were 
annalistic and anecdotal and mainly written in 
verse. They were also restricted to court use. There 
was no tradition of individual authorship, for each 
chronicle was largely the continuation of a previous 
one and incorporated materials from earlier chron- 
icles. The success of any set of chronicles was de- 
termined more by its literary quality than by its 
historical accuracy. If a new compilation achieved 
high literary standards, it might supersede earlier 
works altogether (Perera 1961; Godakumbura 
1961). 

Indian historiography was greatly enriched after 
the Muslim conquests of northern India at the end 
of the twelfth century. A well-developed tradition 
of history writing was introduced, and for more 
than six centuries a branch of Muslim historiog- 
raphy dominated the south Asian scene. The main 
features of these Muslim writings are common to 
those of Muslim historiography in Persia, west 
Asia, and north Africa. They remained bound to the 
need for orthodox authority and the desire to serve 
God and the Muslim community. They were also 
largely directed to teaching moral and religious 
lessons through descriptions of prophets, caliphs, 
sultans, and other great men of both religion and 
government. Furthermore, they were limited to the 
accounts of the triumphs and disasters of Muslim 
rulers and kingdoms and barely touched on the 
peoples of other faiths in India. 

Despite these limitations, the Muslim writings 
provided a historical picture of India not available 
among any other community in India until recent 
times. For this reason, at least two of the greatest 
Indo-Muslim historians may be singled out to il-
lustrate the range and scope of the tradition. The
first is Ziauddin Barani, whose work, the Tarikh-i 
Feroz-shahi, was completed in 1357. Here, Barani 
achieved a new consciousness of the value of his- 
tory. Not only could history strengthen faith, rea- 
son, and judgment, give comfort, teach patience, 
and distinguish between good and evil, but it was 
also “the necessary foundation of truth” (Hardy 
1960, pp. 22-23). Although his awareness of truth 
was strictly within the framework of Islam, it re-
mains a vital ingredient in distinguishing the best
Muslim histories from the many written. 

This was also true of the Moghul historian Abul 
Fazl (1551-1602), whose history of the reign of 
the emperor Akbar (1556-1605) is regarded as the 
height of Indo-Muslim historiographical achieve-
ments. This work, the Akbar Namah, is not free
from conventional eulogy and stylistic flourish, but 
it represents the result of great steps forward in 
archival collection, document examination, and 
close analytical research. To this day the section 
called Ain-i-Akbari is regarded as the classic study 
of the institutions and workings of an empire at 
the height of its extent and power. 

Although all Indo-Muslim writings were deter-
mined by extraneous non-Indian concepts of his-
tory, they have become part of the Indian (and 
now also Pakistani) historical heritage and may be 
seen now as an integral part of traditional south 
Asian historiography (Hardy 1961; Rashid 1961; 
Elliot 1867-1877).

Southeast Asia. The Mon-Khmer and Cham 
peoples of mainland southeast Asia acquired much 
of the art and architecture of India and left many 
great historical monuments which are still intact 
to attest to the richness of the heritage (Coedès
[1948] 1964, pp. 35-72). But the earlier Hindu- 
Buddhist accretions left little impact on the Mon 
and Cambodian chronicles that came to be written 
later. 

With the spread of Theravada Buddhism after 
the thirteenth century, the Mons in particular com-
piled chronicles (Rajawan, or genealogies of var-
ious kinds) which established the tradition of bring-
ing together dynastic information, anecdotes about
the kings, and various myths and legends which 
gave meaning to each reign (Shorto 1961, p. 64). 
This tradition was strengthened by the Burmese, 
who brought a keener sense of chronology to the 
compilations. Although still essentially a derived 
tradition from Buddhist Ceylon, the later Burmese 
Yazawin (“Chronicles”) of the eighteenth and nine- 
teenth centuries were clearly indigenous writings 
influenced by local animism and Burmese concepts 
of kingship and cosmology. As works compiled by
learned monks, learned brahmans, and learned 
ministers, they provided valuable material for early 
European works on Burmese history (Htoot 1961; 
Ohn 1961; Maha Yaza Win Taw Kyi). 

Similarly, the Thai tradition, also developed by 
learned monks and ministers, was derived from 
Ceylon, probably through the Mon-Khmer-speaking
peoples of the Menam valley. Most of the earlier 
versions of the Thai chronicles were destroyed 
when Ayuthia was sacked by the Burmese in 1767. 
A notable exception was the P’ongsawadan, com-
piled in 1680, which covered the period 1350-1605.
This chronicle form was revived during the late 
eighteenth century, and many chronicles were writ- 
ten not only for Thailand but also for Burma, Cam- 
bodia, and the states of southern Thailand, includ- 
ing the Malay states of Songkhla and Saiburi 
(Kedah). 

The situation was different, however, among the 
Javanese and Malay peoples. Hindu-Buddhist mon-
uments and inscriptions abound, but there also de-
veloped quite independently a native concern with
the past and the use of the past as symbol and 
magic to give power and confirm authority. From 
the epic poem Nagara-Kertagama to the Pararaton
and the Babad Tanah Jawi (fourteenth to seven- 
teenth centuries), the court poets lauded their 
kings, worked out impressive genealogies, and per- 
fected their verse. These were not works of history, 
but they were approaching an autonomous tradi- 
tion in historical awareness. Of particular interest 
is the framework of Hindu gods, Buddhist identifi- 
cations, and indigenous beliefs in the first two 
works and their extension to include Muslim proph- 
ets in the Babad. The sense of continuity is unmis- 
takable. The desire to link legitimacy and sov- 
ereignty to past heroes and their origins is sustained 
through the centuries. Lacking both an accurate 
chronology and a secular concern with kings, min- 
isters, subjects, and enemies, the great lists of 
names of people and places are more like exercises 
in metrics and incantation than history writing. 
But as the tradition developed into the nineteenth 
century, later Babad closer to modern histories were 
produced, notably, the Babad Dipanagara and the 
Sejarah Banten. In addition, there were lesser his- 
torical writings in south Celebes (Sudjatmoko et al. 
1965; Berg 1965; Graaf 1965; Noorduyn 1965; 
Johns 1964).

Far better developed as histories are the writings 
in Malay, especially the Sejarah Melayu and a num- 
ber of works concerning the Johore and Riau Ling- 
ga empires, as well as “rhymed chronicles” like the 
Shar Perang Mengkasar and similar compositions. 
Malay writing is richer than the Javanese in anec- 
dote and more vivid in the description of men and 
places. It still lacks chronology but is more accurate 
about personal relationships. There is less concern 
with magic than with moral values like loyalty and 
sincerity, and, as a whole, Malay histories enter- 
tain as well as teach. Three outstanding examples 
of a nascent social history, the Misa Melayu, the 
Hikayat Abdullah, and the Tuhfat-ul Nafis, ap- 
peared in the eighteenth and nineteenth centuries. 
It is possible to discover a certain amount of out-
side influence in these works, but they clearly show
their continuity with the Sejarah Melayu tradition 
(Bottoms 1959; Amin 1963, pp. 1-46; Teeuw 1964; 
Josselin de Jong 1964). 

Finally, a brief note on the importation of tradi- 
tional Chinese historiographical forms in the 
writing of Annamese (Vietnamese) history. In rul- 
ing the Tongking region of north Vietnam for a 
thousand years, the Chinese determined the nature 
of its historiography. Such traditional works were 
still being written throughout the nineteenth cen- 
tury and even during the early years of the twen- 
tieth century. But, interestingly, these forms are 
not traceable in those parts of the Indochinese pen- 
insula which fell under the influence of Theravada 
Buddhism. Similarly, with the Spanish invasion of 
the Philippines at the end of the sixteenth century, 
a kind of traditional Roman Catholic historiography 
was introduced which matched the Malay chron- 
icles of the Sulu Archipelago. This limited Euro- 
pean clerical tradition remained dominant in the 
Philippines until the latter half of the nineteenth 
century, and vestiges remain even today. 

Features of traditional historiography. It has 
often been noted that the history of south and 
southeast Asia had little unity of theme until the 
coming of modern industrial civilization during the 
last hundred years. The traditional historiography 
of the region seems to confirm this view. Except for 
the common lack of chronology in the historical 
works and the fact that there are so many gaps to 
be filled in the region’s history, there appear to be 
more major differences than similarities in the 
types of writings mentioned above. Some of the 
traditions may have had common origins, but they 
developed as discrete and autonomous traditions in 
Ceylon, Burma, Thailand, Java, and the Malay 
world. Even the Islamic tradition was not homoge- 
neous, and what emerged in northern India was 
quite different from Muslim historical writings in 
Acheh, Java, Malacca, Johore, south Celebes, and 
Sulu. This suggests that there were strong indige- 
nous features in each of the traditions which were 
peculiar to the different peoples. The subject has 
now aroused considerable interest, and it is to be 
expected that more precise analyses of the indige- 
nous cultures will soon appear to help us under- 
stand the differences better. 

Based on present information, the following list 
indicates the common features and major differ- 
ences. 

Common features are that (a) most of the works 
are strong in genealogy but weak in chronology 
and biographical detail; (b) the emphasis is on lit-
erary style, anecdotal material, and the use of his-
tory as moral and religious teaching; (c) where
they are primarily secular, there is a common cen- 
tral interest in kingship, and the emphasis is placed 
on orthodox qualities of loyalty and conformity; 
(d) cosmological and astrological considerations 
tend to exclude causal explanations as well as the 
idea of progress. 

Major differences are that (a) religion cut off 
the Indo-Muslim historians from the Hindu socio-
economic background of Indian history; it cut off
the Thais and Cambodians from the east Asian his- 
toriographical tradition in its Vietnamese form; and 
it cut off the Malayo-Javanese world from the
Thais and Burmese on the one hand and the Fil- 
ipinos on the other; (b) national rivalry influenced, 
for example, Burmese and Thai historical writing 
about each other; (c) language differences in India 
before the use of Persian and in mainland south- 
east Asia after the decline of Pali were complex; 
most of the works were not intelligible outside of 
the country’s boundaries; (d) royal policies about 
historical writings varied considerably; Muslim and 
Malay works were in circulation, while Thai, Bur- 
mese, and Vietnamese works were kept mainly for 
official use. 

Two further points about the nature of the early 
traditional chronicles need to be emphasized. First, 
modern scholarship has begun to show appreciation 
of such traditional writings and has applied the 
techniques of Biblical and Homeric criticism to 
their study. It is now realized that these chronicles 
are best understood in the context of the total cul- 
tural system in which they were compiled. Second, 
whether these works may be called “histories” or 
not, their value as historical documents has now 
been proved. What is required are finer and more 
sensitive techniques for obtaining from them the 
data needed to write the history of south and south- 
east Asia. 


Modern historiography 

The growth of modern historiography in Europe 
coincided with the expansion of European activities 
in Asia. But as European activities were peripheral 
to Asian history between the sixteenth and nine- 
teenth centuries and as traditional European atti- 
tudes toward the nature and use of history changed 
very slowly, there was no impact on the historiog- 
raphy of south and southeast Asia during this pe- 
riod. It was only in the latter half of the nineteenth 
century, when Western science and culture were 
consciously taught and learned, first in south Asia 
and then in parts of southeast Asia, that the region 
became affected by Western historical methods. 
What was being widely introduced, however, were 
the traditional techniques of the late eighteenth
and early nineteenth centuries, for example, the 
philological methods of William Jones and the his- 
torical writings of James Mill, Mountstuart Elphin- 
stone, and Vincent Smith. There was a time lag in 
methodological change, and these traditional tech- 
niques were still regarded in the early part of the 
twentieth century as advanced and valid for imi- 
tation. The developments toward “scientific” his- 
tory and the growth of social sciences in Europe 
and America were hardly noticed until after the 
end of World War II. Application of Western his-
toriographical techniques to the study of India and
Ceylon came first, but the Dutch eventually applied 
them to Indonesia, and the French, although com- 
ing last, were quick to apply them in their thorough 
study of Indochina. 

South Asia. Serious Western scholarship in 
India dates from William Jones's founding of the 
Asiatic Society at Calcutta in 1784. The activities 
of this and similar kinds of societies for Oriental 
research in Bombay, Madras, Mysore, and Ceylon, 
together with the growth of learned societies in 
France and Germany and the creation of university 
chairs in Europe during the nineteenth century, 
laid the foundations of modern south Asian histo- 
riography. 

The most important contributions were initially 
in the field of Sanskrit philology and in the editing 
of Vedic and Buddhist texts, but the study of Indian 
antiquities eventually provided a more scientific 
basis for dealing with the otherwise intractable ma- 
terials on ancient India. The pre-Muslim period was 
the most challenging because there were none of 
the essentials for the historian—no chronology, no 
reliable genealogies, no clear identifications of 
people and places. It was here, too, that the most 
dramatic discoveries and achievements were re- 
corded. Particularly notable was James Tod’s An- 
nals and Antiquities of Rajasthan, published from 
1829 to 1832, and James Prinsep’s epigraphical 
and numismatic studies published in 1858 under 
the title Essays on Indian Antiquities. Their suc- 
cess led to the setting up of an archeological de- 
partment in 1862 under Alexander Cunningham; 
this became in 1902 the famous Indian Archaeolog- 
ical Survey under John Marshall (Ancient India, 
1953). 

In comparison, the traditional European histo- 
rians were far less successful. Mill, Elphinstone, 
and Smith, as well as French and German historians 
of India, were either too ready to show the superior- 
ity of Western rule or too uncritical of the avail- 
able Muslim and non-Muslim material. Their im- 
pact was initially greater on European audiences 
than on Indians. It was not until the establishment
of British-type universities in Calcutta, Bombay, 
Madras (all three in 1857), and elsewhere that 
formal history teaching introduced the European 
writings to young Indian scholars. By that time, 
while historical sciences in Europe had made fur- 
ther progress, Indian scholars like R. G. Bhandarkar 
were learning the techniques and attitudes of the 
earlier period well enough to criticize the works of 
European historians themselves (Philips 1961a; 
Basham 1961b). 

It is only in the twentieth century that south 
Asian historiography began to respond directly and 
strongly to Western methodology. It did so in at 
least two different directions. The first was a more 
intensive appreciation of Western scientific meth- 
ods, especially following the brilliant archeological 
work on the Indus civilization at Mohenjo-daro and 
Harappa. The second was a nationalistic or anti- 
imperialistic approach, which in its extreme forms 
produced obscurantist and revivalist historical writ- 
ings on the one hand and stimulated Marxist and 
other forms of radical historiography on the other 
(Majumdar 1961b). 

Modern historiography may be said to have be- 
gun with the British publication of the Cambridge 
History of India, published in 6 volumes between 
1922 and 1932. This aroused considerable interest 
in India, even among those who resented the domi- 
nance of British contributors. It was recognized, 
however, that in the fields of archeology, epigraphy, 
and numismatics, the great collections produced 
by European scholars were indispensable. So also 
were the collections of documents and the interpre- 
tative works on British activities, the politics of 
the East India Company, and the extension of 
British power in India. But this was not the case 
with works that dealt with Indian religion and cul- 
ture, the Indian response to British rule (for ex- 
ample, the 1857 mutiny and the nationalist move- 
ment), and the social and economic changes in 
India since the beginning of the nineteenth cen- 
tury. It was in these fields that a new generation of 
scholars first challenged the results of European 
scholarship. Notable among this generation were 
R. C. Majumdar, H. C. Raychaudhuri, K. A. A. 
Nilakanta Sastri, and K. M. Panikkar. Most of them
were products of university departments of history, 
whether in Britain or in India, and many were pro- 
fessional teachers of history. They continued the 
tradition of forming learned societies to publish 
scholarly journals and learned to use and appre- 
ciate the great libraries and archival collections 
organized by the states and the government of 
India (see Nilakanta Sastri 1956; Datta 1957).

Since independence, the work of history writing
has continued. Institutions like the Archaeological 
Survey, the Historical Records Commission, and 
the Indian History Congress are particularly active. 
Among the historical publications of academic 
standing, two are noteworthy: the Indian Historical 
Quarterly and the Journal of Indian History. A ma- 
jor enterprise has been the 11-volume series, His- 
tory and Culture of the Indian People, published by 
the Bharatiya Vidya Bhavan under the general edi- 
torship of R. C. Majumdar. Also significant has 
been the Indian History Congress conference on 
Asian history in 1961 and the meeting at Delhi in 
1965 of the International Congress of Orientalists. 
These developments reflect the new Indian histori- 
cal consciousness, which has revealed a consider- 
able acceptance of modern scientific methods of 
historiography. The most important measure of 
modern Indian achievements is that there are today
trained historians to cover almost every period and
type of problem in Indian history. In addition, 
several of the younger historians have begun to 
apply the tools of social science to the study of 
modern history. In this respect, many have turned 
to new academic disciplines developed in the 
United States for their methodological assistance. 

Less outstanding has been the development of 
historiography in Pakistan after 1947. Separation 
from India deprived the new state of the best de- 
veloped research facilities and the best library and 
archival collections in the region. The scholars, 
therefore, had to work under grave disadvantages. 
They continued to revive the traditions of Islamic 
historiography, but the most important achieve- 
ments are the works of those scholars who first 
made their reputations before separation from In- 
dia, for example, A. Yusof Ali, Shafa’at Ahmad 
Khan, and I. H. Qureshi. Although the majority 
of the scholars feel greater kinship with the his- 
torical traditions of Persia and west Asia, they can- 
not deny their Indian heritage completely. Their 
struggles with modern historiography, therefore, 
are part of the progress of history writing in south 
Asia. 

In Ceylon, modern historical attitudes developed 
later, but standards are high, as may be seen in 
the publications of the Ceylon Branch of the Royal 
Asiatic Society and the recent Ceylon Journal of 
Historical and Social Studies. 

Finally, the remarkable fact must be noted that 
the bulk of historical works of significance in south 
Asia have been written in English. There have been 
important works published in Bengali, Urdu, and 
Hindi from the early years of the twentieth century, 
but so far none are widely known outside of regional
schools and universities. It is important to
note they all show the influence of modern his- 
torical works. There is no reason to doubt that in 
time more scholarly studies in the Indian languages 
will reach the high academic standards achieved 
by works in English. 

Southeast Asia. Unlike south Asia, southeast Asia
had no center where modern scholarship
could take root and fan out over the whole
region. The manner in which different parts of 
the region came under European control and the 
limited resources of the small states prevented a 
systematic development of modern historiography. 
For example, the early Portuguese, Spanish, Dutch, 
and English accounts of the region (sixteenth to 
eighteenth centuries) had no influence on south- 
east Asian writings and were really part of the his- 
tory of European historiography. Even the founding 
of the famous Bataviaasch Genootschap van Kunsten
en Wetenschappen (Batavian Society for the
Study of Arts and Sciences) in Jakarta in 1778 and 
the publication of William Marsden’s History of 
Sumatra in 1783 and T. S. Raffles’ History of Java 
in 1817 gave little impetus to the study of history. 
Only in the latter half of the nineteenth century, 
with the revival of the Bataviaasch Genootschap 
and with the founding of the Straits Branch of the 
Royal Asiatic Society in 1878, did serious scholar- 
ship begin in Indonesia and Malaysia. In fact, Euro- 
pean writings of the nineteenth century were pro- 
duced while the indigenous traditions of Babad and 
Sejarah were still alive, and Western scholars often 
depended on native chronicles for their material 
on earlier history. In any case, such European 
works ran parallel to and were external to the local 
efforts at historical writing and did not affect the 
traditional forms and attitudes. 

Similarly, among the mainland countries of 
southeast Asia, the Burmese and the Thais were 
particularly active at compiling their Yazawin and 
P’ongsawadan, even as European amateur scholars 
like Arthur Phayre (History of Burma, 1883) and 
W. A. R. Wood (A History of Siam, 1926) were 
working on their histories and as research journals 
like the Journal of the Burma Research Society and 
the Journal of the Siam Society were being published.
Both British scholars, for example, acknowledged
their great debt to native scholarship. In
Vietnam, traditional Vietnamese historians helped 
early French scholars attached to the École
Française d'Extrême-Orient (founded in 1900),
whose works were to gain fame for the school’s 
Bulletin. Also, the Imperial Archives at Hué pre- 
served their documents in the traditional fashion 
for some years after the French conquest. 
The Philippines was a special case where one
stage of Western historical scholarship displaced 
another. Still, Spanish traditional history continued 
to be written under American rule (after 1898), 
while scholars from the United States studied Phil- 
ippine history from Spanish colonial and mission- 
ary records. The most valuable of its kind was E. H. 
Blair and J. A. Robertson’s 55-volume work The 
Philippine Islands, 1493-1898, published between 
1903 and 1909. 

In the nineteenth and first half of the twentieth 
centuries, there have been three distinct areas of 
modern southeast Asian historiography. First, an- 
cient history, of which the native peoples had 
little or no knowledge, was being recovered by the 
methods of philology, epigraphy, and archeology. 
Second, colonial history, involving European trade, 
wars, treaties, and administration, was the special 
interest of European historians themselves and 
aroused little interest among local scholars. And 
third, the “middle period,” varying from four to ten 
centuries before the nineteenth century, was a pe- 
riod of some indigenous historical writing; modern 
methods can be used to help rearrange, date more 
accurately, and even reinterpret the writings of this 
period (Hall 1961). 

In contrast to what they did in south Asia, the 
British, the Dutch, and the French in southeast 
Asia made no effort to train local historians until 
the years just before World War II. Except among
the Dutch in the twentieth century, history writing 
was primarily an amateur affair, a by-product of 
administration and long residence in the country 
concerned. There were fine scholarly works, in 
particular by the professionals, for example, N. J. 
Krom on early Indonesian history and George 
Coedès on southeast Asian epigraphy before the
coming of Islam. R. O. Winstedt’s textual work for 
Malay history and G. H. Luce’s great collections of 
inscriptions for Burmese history were also out- 
standing. The best work, however, was done in 
early history; no historians of the colonial period 
produced works of comparable stature. Further- 
more, such colonial history as was written could 
better be seen as part of European historiography 
than of southeast Asian historiography. 

There were two countries in the region where 
developments were different—independent Thai- 
land and the Philippines under United States ad- 
ministration. In Thailand, Thai royal patronage 
was given to the Siam Society, which published a 
useful journal; and Chulalongkorn University, 
founded in 1917, taught both traditional and 
modern history. In the Philippines, missionary 
universities like Santo Tomás, founded in 1611,
taught little secular history but began to introduce
modern historical methods into the curriculum by 
the end of the nineteenth century. In addition, 
the Americans founded in 1908 a secular uni- 
versity, the University of the Philippines, which 
taught some modern history from the start, al- 
though little work was done on southeast Asian 
history itself. 

Since the end of World War II, and particularly
following the independence of the Philippines, 
Burma, Indonesia, and Malaya (now Malaysia), 
there have been new advances in the historiography 
of the region. A major landmark was the publica- 
tion in 1955 of D. G. E. Hall’s A History of South- 
east Asia, which has successfully established that 
the whole span of history for all of southeast Asia 
is an intelligible historical unit. Also, a searching 
debate on the nature of European scholarship on 
southeast Asia has followed the provocative studies 
on early Asian trade by J. C. van Leur. As a result, 
southeast Asia was given a separate place in the 
series of conferences held in London in 1956 on 
historical writings on the peoples of Asia (Hall 
1961). This stimulated another collection of papers 
on Indonesian historiography, first sparked by the 
seminar on national history held in Jogjakarta in 
1957 (Singhal 1960; Smail 1961; Benda 1962;
Sudjatmoko et al. 1965; cf. Seminar Sedjarah 
1958). 

The region still faces a long debate about 
whether national history should form the basis 
of the new southeast Asian historiography. The 
struggle for national identity in Burma, Indonesia, 
and Vietnam has reduced historical research to a 
trickle in those countries. Only in the Philippines, 
Malaysia, and Singapore has there been sustained 
interest in modern research, and these countries 
have supported international conferences on his- 
tory to keep this interest alive. A joint effort led to 
the creation of the International Association of 
Historians of Asia, which meets every three or four 
years and shows signs of being an organization 
primarily for southeast Asian historians. 

Features of modern historiography. In the 
two regions of south and southeast Asia, modern 
Indian historiography is clearly the most impressive
both in volume and in quality. Pakistan and
Ceylon, however, have much in common with 
Indonesia, Burma, Malaysia, and the Philippines 
in terms of stage of growth, present and future 
resources, and trends of scholarship. There is a 
chance that Ceylon, the Philippines, Malaysia, and 
Singapore may develop differently because their 
traditional historiography does not stand in the 
way of rapid modernization. What is clear is that 
modern historiography is being confronted by nationalism
and may well be subordinated to national
needs. The example of India is before the others. 
National history takes precedence over scientific 
history, but as long as the means of teaching and 
using modern academic tools and methods remain, 
this situation may soon change. Thailand remains 
the exception. Active historical research is limited, 
and some work is still being done within the 
framework of traditional values, but there is no 
nationalist flavor in the work. Also, a new genera- 
tion is using modern methods with increasing skill 
and confidence. 


Autonomy of historiographic tradition 


History has had a relatively small place in south 
and southeast Asian traditions. Its main functions 
had been to strengthen the authority of kings, to 
teach moral and religious lessons, and possibly 
also to please and amuse. Exact times and places 
and the lives of great and lesser men had never 
been important in themselves. They had to serve 
the purpose of the historian’s audience, which often 
comprised only the kings, priests, and courtiers 
of the day. Each historiographical tradition had 
evolved according to the needs of different audiences
and had been either strong or weak depending
on whether the institutions which produced
the audiences were stable or unstable. As long 
as the institutions had been able to survive, the 
historical tradition which supported them also 
survived. 

In south Asia and in the Philippines, the impact 
of colonial rule was felt over a long period, and 
this had marked consequences on the indigenous 
historiographical traditions. Hence, apart from 
some of the traditions of Muslim historiography, 
no other tradition has survived. In other parts of 
southeast Asia, however, Western influence has 
had only a brief history—in most cases, much less 
than a hundred years, during which colonial 
scholars and policy makers did not try to destroy 
the local traditions. Thus, much of the traditions 
still survive within the new historiography itself. 
This is particularly true where nationalist move- 
ments have effectively revived the historical works 
of the past. 

But historiography is no longer a static discipline 
even in the West, and it is undergoing radical 
changes as new social science disciplines are turn- 
ing to the study of the past. What is important 
is not that historiography in south and southeast 
Asia has still not freed itself completely from tra- 
ditional attitudes and their limitations. It is that 
lines of communication have been established between
Asian and Western academic institutions,
important works have become mutually available,
and new disciplines have been introduced and understood.
The key concepts have already reached
south and southeast Asia: that time and place 
must be accurate, that knowledge about man’s past 
must be secular and humanistic, and that historical 
fact and interpretation must always be tested by 
the best scientific methods. 

WANG GUNGWU


[See also ASIAN SOCIETY, articles on SOUTH ASIA and
SOUTHEAST ASIA. Other relevant material may be
found in BUDDHISM and ISLAM.]
HISTORY


The articles under this heading deal with vary- 
ing conceptions of the nature of history and its 
subject matter, as does the article on THE HISTORY 
OF SCIENCE, listed under SCIENCE. An analysis of 
what historians do when they write history and 
discussions of traditions of history writing in different
parts of the world will be found under HISTORIOGRAPHY.
Major related topics are ARCHEOLOGY; EVOLUTION;
KNOWLEDGE, SOCIOLOGY OF; PERIODIZATION; TIME, article on
SOCIAL ORGANIZATION. Other relevant material appears in
ECONOMIC THOUGHT; POLITICAL THEORY; RELIGION;
SOCIOLOGY, article on THE DEVELOPMENT OF SOCIOLOGICAL
THOUGHT.
I
THE PHILOSOPHY OF HISTORY 


The expression “philosophy of history” has come 
to refer to two quite distinct types of inquiry. 

Traditionally, it has been used to refer to at- 
tempts to provide a comprehensive explanation or 
interpretation of the entire historical process. 
“Philosophies of history” in this sense have been 
characteristically concerned with such questions 
as: “What is the meaning (significance, purpose)
of history?” or “What fundamental laws govern
historical development and change?” Among the 
chief exponents of this type of theory may be num- 
bered Vico, Herder, Hegel, Comte, Marx, Buckle, 
Spengler, and—in our own time—Arnold Toynbee 
and Pitirim Sorokin. Men like these have been in- 
spired by the belief that history presents problems 
beyond those that occupy the attention of ordinary 
practicing historians, whose activities, being largely 
confined to the investigation of particular areas or 
sections of the past, fail to satisfy the demand for 
an intellectually or morally acceptable conception 
of the course of history “as a whole.” By offering 
accounts of the human past that exhibit it as con- 
forming to certain principles of universal validity, 
they have sought to meet this demand; at the same 
time they have often (though not always) claimed 
that their interpretations may enable us to make 
predictions or forecasts concerning the future development
of society.

The grounds upon which such interpretations 
have been based, ranging from empirical consider- 
ations to notions that are frankly religious or meta- 
physical, have been various. Nor have they always 
taken the same form. Marx, for example, portrayed 
history as following a unilinear pattern in the 
direction of a particular foreseeable “goal”; Speng- 
ler and Toynbee have presented it as conforming 
to certain regular and recurrent cycles of change; 
while others, again, have treated it as somehow 
combining both these features. Common to all, 
however, has been the assumption that the his- 
torical process is more than an agglomeration of 
events “senselessly” succeeding one another in 
time: there is an underlying structure or theme 
waiting to be discovered, in terms of which this 
apparently arbitrary sequence can be seen to be 
ultimately meaningful or intelligible. 

Even in the nineteenth century, when such spec- 
ulation was at its peak, there were philosophers 
and historians—Schopenhauer and Burckhardt, for 
instance—who challenged its pretensions; and in 
the twentieth, it has been exposed to a series of 
logical and methodological criticisms which in their 
cumulative impact have proved extremely damag- 
ing. In any case, projects of this kind must be 
sharply distinguished from the type of inquiry that 
will be chiefly considered here, and which is some- 
times referred to as “formal” or “critical” philos- 
ophy of history. Philosophy of history in this sense 
has developed comparatively recently, its rise 
broadly coinciding with the decline of its specu- 
lative counterpart. It has for its subject matter not 
the course of historical events, but rather the 
nature of history conceived as a specific discipline 
and branch of knowledge. In other words, it may
be said to be concerned with such topics as the 
purposes of historical inquiry, the ways in which 
historians describe and classify their material, the 
manner in which they arrive at and substantiate 
explanations and hypotheses, the assumptions and 
principles that underlie their procedures, and the 
relations between history and other forms of in- 
vestigation. Thus, while the problems with which 
it deals are not speculative problems of the sort 
previously mentioned, neither are they problems of 
the type to which practicing historians typically 
address themselves in the course of their work: 
the questions involved arise from reflection upon 
historical thinking and reasoning and are primarily 
of an epistemological or conceptual character. 


The autonomy of history


Philosophical concern with the nature of his- 
torical understanding originated largely as part of 
a general protest against the tendency (prevalent 
among followers of the Enlightenment) to regard 
the natural sciences as representing the paradigm 
of all true knowledge. Even to some of the earlier 
speculative philosophers of history, the view that 
the categories and modes of interpretation em- 
ployed so successfully in the investigation of physi- 
cal nature could be validly extended to human 
studies appeared far from self-evident; in particu- 
lar, the writings of both Vico and Hegel can be 
said to embody an implicit challenge to this opin- 
ion. Yet the belief that there are no differences in 
principle separating history from other disciplines 
and that the historian should strive as far as pos- 
sible to apply to his own field the methods estab- 
lished in other areas of inductive inquiry was a 
persistent one: in the eighteenth century, empiri- 
cists like David Hume saw no reason to question it, 
and in the nineteenth it was to be constantly re- 
affirmed by a host of positivistically minded the- 
orists. And, insofar as it was maintained, there 
appeared to be no grounds for supposing that the 
study of history presented any special problems 
from a philosophical point of view; logically and 
epistemologically it was on a level with any other 
form of empirical science. 

Dilthey and Croce. The close of the last cen- 
tury, however, witnessed the emergence of a 
number of thinkers to whom this comfortable as- 
sumption no longer seemed acceptable and who 
raised awkward questions: among the more influ- 
ential of the writers involved in the new trend were 
Georg Simmel, Heinrich Rickert, and (above all) 
Wilhelm Dilthey (see Hodges 1944) in Germany, 
and the philosopher and historian Benedetto Croce 
(1917) in Italy. To summarize what they said
would be difficult; they were not lucid expositors 
of their ideas, and they incorporated into their 
theories metaphysical conceptions that have lost 
much of their appeal since they wrote. Neverthe- 
less, they succeeded in focusing attention upon 
features of the historian’s activity that had been 
overlooked and ignored by many of their predeces- 
sors. It was pointed out, for instance, that the his- 
torian’s aims are ostensibly very different from 
those characteristic of the natural scientist: his- 
torians are not concerned with the discovery of 
universal laws or theories from which predictions 
can be derived and which can serve as guides to 
action in practical or technical contexts; on the 
contrary, their primary purpose is to determine 
what happened in the past and why. This neces- 
sarily involves a concentration upon the concrete 
particularity of events that are in themselves 
unique and unrepeatable. The abstract categories 
of science (“pseudo-concepts,” as Croce called 
them) are, however, adapted to quite different 
ends; their proper field of application is the sphere 
of the universal and unchanging, and they can 
therefore play no role in history as truly conceived. 
For the historian is not interested in phenomena 
regarded as “specimens” or as instances of general 
truths; as Michael Oakeshott (1933, p. 154) has 
expressed it, “the moment historical facts are re- 
garded as instances of general laws, history is 
dismissed.” 

There is, moreover, a further point of funda- 
mental importance that such critics have wished 
to emphasize. Both Dilthey and Croce underlined 
the distinction, considered by them to be crucial, 
between the respective subject matters of science 
and history. In crude terms, this may be repre- 
sented by the familiar dichotomy of “spirit” and 
“nature”; more specifically, it involves the belief 
that it is impossible to view the activities of his- 
torical agents as mere pieces of observable “be- 
havior,” reducible to (or explicable in terms of) 
purely physical items. It follows that the prin- 
ciples of knowledge and understanding that are 
appropriate here cannot be those presupposed by 
scientific interpretations of the world. For the his- 
torian it is essential that he should be able to 
reconstruct “from within” the reasons, purposes, 
and emotions that motivated the persons with 
whom he is concerned and that found outward 
expression in their deeds. Various notions, such as 
“reliving” and “Einfühlung,” or “empathy,” were 
appealed to in order to characterize this process; 
but, however described, it was posited as a distinctive
feature of historical thinking, sufficient in
itself to mark history off from typically scientific 
modes of inquiry. 

Collingwood. The basic contention was per- 
haps most forcefully and clearly formulated by the 
English philosopher R. G. Collingwood (1946), 
who was in his own work deeply influenced by 
Croce. According to Collingwood, the essential task 
of the historian is to “rethink” or “re-enact” in his 
mind the deliberations of historical agents, thereby 
rendering intelligible the events with which he has 
to deal in a way that finds no parallel in the physi- 
cal sciences. This led him to claim, among other 
things, that the term “cause” has its own meaning 
in the context of historical narrative, not to be con- 
fused with any it may bear elsewhere. Thus, to 
show what caused a given occurrence in history 
is not a matter of subsuming it beneath scientific 
laws or empirical generalizations; rather, it is a 
question of eliciting its “inner side”—that is to say, 
the thoughts and reasons that, once uncovered, 
exhibit what happened as the response of a rational 
being confronted by a situation requiring a prac- 
tical solution. 


The rise of analytic philosophy 


Considerations like the above provided the stim- 
ulus to much modern philosophical analysis of 
history. This has turned very largely on the issue 
of whether, and if so in what way, historical
thought has its own distinctive logic that resists 
interpretation in scientific terms. In general, con- 
troversy has tended to center on two main topics. 
The first concerns the logical character of the 
explanations historians give of particular events 
and developments. The second relates to the episte- 
mological status of historical accounts of the past 
and to the question of whether they possess an 
objective validity comparable to that claimed for 
the results of scientific investigation. 

Historical explanation. A major difficulty that 
tends to beset discussions of historical explanation 
derives from the variety of forms such explanation 
can assume. It is tempting to imagine that there is 
a single model to which all explanations in history 
ultimately conform; to explain an historical occur- 
rence, it may be suggested, is always to exhibit it 
as being in some sense the consequence of certain 
other events or conditions. Yet in practice it is far 
from clear that the narratives historians provide, 
rich as they are in interpretative devices, invariably 
follow this neat pattern. For example, we may be 
told that a particular happening or circumstance 
was of a certain type (e.g., it is described as repre- 
senting a “political revolution” or an “imperialist 
war”); or that it was part of a general trend (“The
struggle was a phase of evolving nationalism.”); 
or that it was significant as indicating changes 
with wider social implications (“The influence of 
women at court was a symptom of dynastic de- 
cline.”). These may all constitute valid ways of 
increasing or illuminating our understanding of 
what occurred; they do not, however, appear to do 
so by providing anything obviously analogous to a 
causal explanation. 

Nevertheless, whatever supplementary methods 
the historian may use to render the past intelligi- 
ble, it may still be urged that causality remains 
the fundamental category of historical understand- 
ing. The crucial problem, therefore, is one of eluci- 
dating the notion of causal connection in history 
(Gardiner 1952; Dray 1957). Has this notion some 
special application in historical contexts, as Col- 
lingwood claimed? Or is it susceptible to an inter- 
pretation which demonstrates that historical ex- 
planations do not, after all, diverge in any essential 
manner from those characteristically employed in 
the natural sciences? 

The theory of “covering laws.” The view that 
no radical differences divide historical from other 
kinds of explanation has found its chief defenders 
among philosophers whose general conception of 
causality largely derives from Hume. Since there 
are no “necessary connections” between matters of 
fact, any claim to the effect that a causal relation 
holds between certain events must contain a covert 
reference to a natural regularity or law. In other 
words, to explain an occurrence is to show that it 
was bound to occur, given the fulfillment of cer- 
tain antecedent or initial conditions, and given the 
existence of some law or laws correlating such 
conditions with events of the type to which the 
explicandum belongs. According to this account, 
the historian, along with any other inquirer into 
causes, cannot avoid appealing to general state- 
ments asserting empirically verified uniformities; 
it is the latter that afford the essential backing or 
warrant that his explanations require. To point 
out that his direct concern is with the particular, 
not the general, is to say something that, though 
true, does not materially affect the issue. There is 
no incompatibility between the claim that the his- 
torian’s object is to explain particular events and 
the claim that, in doing so, he necessarily commits 
himself to the acceptance of certain general truths. 
To accept the second of these contentions is not 
even to deny that there may be an intelligible sense 
in which each historical event is “unique” (though 
here it is worth emphasizing that, if such events 
were unique in the absolute and unrestricted man- 
ner sometimes suggested, it would be impossible 
to say anything about them at all). For all that is
required for explanatory purposes is that the oc- 
currence to be explained should resemble other 
happenings in certain respects or aspects—namely, 
those that permit the application of relevant gen- 
eralizations or laws. From this point of view there 
is no difference in principle between the procedures 
of the historian and the natural scientist; if a 
chemist or an astronomer wishes to explain a par- 
ticular phenomenon falling within his field, a 
similar abstraction is involved. 

On the face of it, this theory seems to have 
much to commend it, agreeably combining concep- 
tual economy with empirical hardheadedness. A 
closer look may, however, reveal difficulties. In the 
first place, the theory appears to assume that all 
causal explanations in history take the form of 
showing that, given certain initial conditions, a 
particular event had to occur. But this is far from 
being universally true: the historian’s object in 
citing causes is frequently the more limited one of 
explaining how a certain historical occurrence was 
possible, not why it was bound to happen; the
causes referred to represent the necessary rather 
than the sufficient conditions of what happened. 
In itself, this hardly constitutes a conclusive objec- 
tion; it might, for instance, be replied that the de- 
termination of necessary conditions also involves 
an implicit reference to laws, and that a more com- 
plex formulation of the proposed analysis, capable 
of accommodating this kind of case, could easily 
be devised. Where the theory is more clearly vul- 
nerable is in its bland assumption that laws of the 
type it postulates lie ready to hand and that it is 
these to which historians refer when they offer 
their explanations. For when attempts are made to 
specify general statements connecting “causes” 
and “effects” in the required fashion, the proposi- 
tions elicited tend to be so vague and indeterminate 
as to make it hard to see how they could conceiv- 
ably perform the explanatory function attributed 
to them. Nor does historical practice appear to bear 
out the suggested interpretation. Thus it may be
argued that a historian, when confronted by the 
task of explaining what caused a specific event, 
such as the French Revolution, does not do so by 
attempting to subsume it beneath putative laws 
concerning revolutions in general; on the contrary, 
he proceeds to an analysis of the particular case, 
showing through detailed inquiry how various con- 
nected sequences of factors combined to give rise 
to the complex phenomenon under examination. 

The theory of “continuous series.” Apprecia- 
tion of such points has led some modern writers to 
oppose to the previous “covering law” conception 
of historical explanation one that has been called
“the model of the continuous series.” In the latter 
view, the historian traces, step by step, the rela- 
tions between earlier and later phases of historical 
change, thereby building up an intelligible narra- 
tive whose various components can be seen to 
stand in “intrinsic” or “natural” connections with 
one another: it is by such careful and particular- 
ized investigation, and not by applying universal 
laws or generalizations, that explanation in history 
characteristically proceeds. But this account, though 
plausible in many ways, still leaves a question un- 
answered. For it may be inquired how we are to 
understand the individual connections stated to 
obtain between the events of which the series is 
composed. To say that they are “intrinsic” or 
“natural” is surely, if anything, to appeal to the 
notion of what generally happens or can normally 
be expected to occur; but is this not to reintroduce 
the conception of empirical uniformities? It would 
appear, in other words, that the essential differ- 
ence between the two interpretative models con- 
sists not in the fact that one relies upon the notion 
of generalizations whereas the other does not, but 
rather in an (admittedly important) disagreement 
concerning the kinds of generalizations that are 
relevant, and the levels of inquiry at which they are 
employed or presupposed. 


The historical point of view 


It is perhaps hard to see how any theory of his- 
torical explanation could wholly dispense with ref- 
erence to general statements at some stage of its 
analysis. What is less clear is whether such gen- 
eral statements have the status and role in history 
which the use of the term “law,” with its predomi- 
nantly scientific associations, implies. There is 
something eccentric in the idea that the construc- 
tion of an historical narrative involves a continu- 
ous resort to generalizations concerning human 
behavior, if by “generalizations” is meant a set of 
inductively established or experimentally confirmed 
propositions that can be precisely listed and formu- 
lated. It is not merely that words like “insight” and 
“judgment,” together with others that are embed- 
ded in the vocabulary of ordinary historical criti- 
cism, would seem to have little application to his- 
tory conceived along such rigorous and tidy lines; 
the picture further suggests an “external” or “spec- 
tatorial” approach to the material that appears to 
obscure a salient feature of much historical writ- 
ing and understanding. For it is arguable that in 
order truly to comprehend the policies or decisions 
of a particular historical figure or the motives or 
ideals that inspired some large-scale historical 
movement (whether political, intellectual, or ar- 
tistic), it is necessary to be able to share imagina- 
tively the point of view of the participants; and 
this in turn requires a firsthand acquaintance with 
what it is, for example, to appreciate a situation 
and plan accordingly or to entertain certain hopes, 
desires, or fears. To speak of historical events as 
being “naturally” related or as forming an “intel- 
ligible” sequence may well be to imply that what 
happened was such as to fulfill our normal expec- 
tations; but it is important to recognize that, in 
human contexts, what we expect is closely tied to 
what we find understandable in the light of our 
own experience as rational purposive agents. It 
was this consideration, above all others, that earlier 
thinkers like Collingwood wished to stress when 
criticizing positivist attempts to assimilate history 
to natural science. Though often expressed in mis- 
leading or exaggerated terms, it is a point that still 
retains considerable force. 

Is objectivity possible? The claim that the his- 
torian stands in an especially intimate relation to 
his subject matter has sometimes been regarded as 
indicating a further significant difference between 
history and the natural sciences. This is the sug- 
gestion that the very nature of the historian’s task 
and situation precludes him from achieving in his 
descriptions and interpretations the kind of objec- 
tivity that characterizes scientific work. It is not 
merely that, as a matter of fact, historians often 
offer widely dissimilar accounts of the same his- 
torical phenomena, even when basing what they 
say on broadly identical sources; it may be argued 
that such striking divergences are necessary and 
inevitable. Thus, suppose it is held—as it was, for 
instance, by Croce—that historical knowledge es- 
sentially involves the “re-creation” of the past by 
each historian within his own mind; it then be- 
comes difficult to see how any historical account 
can fail to be to some extent colored and shaped 
by the individual interests and personality of its 
author—a conclusion tacitly accepted by Croce 
himself when he spoke of all history as “con- 
temporary.” 

Even if such “idealist” theories of historical 
knowledge are rejected, further independent fac- 
tors may be adduced that point in the same gen- 
eral direction. It has been maintained, for example, 
that the fact that the historian is engaged in dis- 
cussing human beings and their activities, using 
everyday language to do so, commits him to intro- 
ducing considerations which would be manifestly 
out of place in scientific contexts: there can be no 
such thing as a purely objective or “value-free” 
historical account, since language that is adapted 
to the description of what people feel, think, and 
do necessarily reflects the element of evaluation 
and appraisal that pervades the whole texture of 
human life and experience. Again, attention may 
be drawn to the manner in which all history is 
necessarily selective. The historian cannot pack 
into his account everything he knows about the 
subject he is studying, nor would he be considered 
a good historian if he tried to do so; the employ- 
ment of judgments of relevance, of relative im- 
portance or triviality, is fundamental to his under- 
taking. But such judgments are founded upon 
assumptions and preconceptions of diverse kinds 
that are inherently disputable and that vary from 
person to person, society to society, age to age. 
What is of significance to an historian belonging 
to one period or milieu may seem unworthy of 
mention to another whose time and background 
are different; religious opinions, political beliefs, 
moral or social ideals, must all, consciously or 
unconsciously, influence such things as the his- 
torian’s presentation of his material, his decisions 
as to what to include or omit, the weight he assigns 
to particular factors, and even his critical assess- 
ment of evidence and sources. 

Subjectivity and historical evaluation. A con- 
clusion frequently drawn from all this, both by 
philosophers and by practicing historians who have 
reflected on their craft, is that history is infected 
by some kind of radical and irremediable “sub- 
jectivity.” Yet the claim in question, together with 
the arguments used to support it, has tended to 
be framed in highly general and abstract terms; 
in consequence, as a number of recent critics have 
pointed out (see, for instance, Carr 1961), several 
significant distinctions are in danger of being over- 
looked. It is a mistake, for instance, to suppose 
that the selection and presentation of material are 
always determined by “subjective” convictions and 
preconceptions of the kind stressed above; they 
may be, and very often are, dictated in a quite in- 
contestable manner by the particular nature of the 
problem with which the historian is concerned. 
Likewise, it is one thing to say that an historian’s 
choice of problem is due to certain personal inter- 
ests or predilections he may have, and another to 
argue that these will necessarily affect his manner 
of solving it; the two have not, however, always 
been clearly separated. Again, judgments of rela- 
tive importance may sometimes be made in the 
light of what has been called the “causal fertility” 
of events; but the question of whether some spe- 
cific occurrence was productive of more far-reach- 
ing changes than another is an empirical matter, 
decidable by investigation; it has nothing essentially 
to do with subjective values or attitudes 
peculiar to the historian. 

Similar possibilities of confusion may arise with 
regard to the suggestion that the historian’s subject 
matter is such as to render evaluation unavoidable. 
No doubt it is true that the purposes and doings 
of historical agents were to a large extent informed 
by the values and principles (moral or otherwise) 
to which they subscribed; but this by itself in no 
way entails that the historian cannot discuss their 
activities without engaging in such evaluation on 
his own account. If, on the other hand, it is the 
historian’s use of common language that is held 
to preclude the possibility of his providing “neutral” 
descriptions, the precise force of this contention 
(assuming it to be correct) is open to doubt; what, 
for example, is to prevent historians from devising 
a reformed terminology to meet the difficulty? 

Remaining problems. Taken together, the above 
and related points may go some way toward reduc- 
ing the prima facie persuasiveness of the claim 
that anything akin to objectivity in the scientific 
sense is unattainable in history. But the issue re- 
mains a curiously intractable one, involving vari- 
ous puzzles and ambiguities into whose complexi- 
ties it is impossible to enter here. Nor does it stand 
alone in this respect. The rich field of the critical 
philosophy of history contains a host of similarly 
disputed problems, with roots extending into many 
adjoining areas of inquiry (Gardiner 1959; see also 
Stern 1956; Meyerhoff 1959; New York University 
. . . 1963). From this point of view their further investigation 
does not only concern the future development 
of the historical studies: it also has an 
obvious and important bearing upon some of the 
fundamental methodological questions that, at the 
present time, confront the neighboring social 
sciences. 

PATRICK GARDINER 


[See also MARXIST SOCIOLOGY; POSITIVISM; SOCIOLOGY, 
article on THE DEVELOPMENT OF SOCIOLOGICAL 
THOUGHT; and the biographies of COMTE; CROCE; 
DILTHEY; HEGEL; HUME; MARX; SIMMEL; SOROKIN; 
SPENGLER; VICO; WEBER, MAX.] 


BIBLIOGRAPHY 

ARON, RAYMOND (1938) 1961 Introduction to the Phi- 
losophy of History: An Essay on the Limits of His- 
torical Objectivity. Boston: Beacon. > First published 
in French. 

BERLIN, ISAIAH 1955 Historical Inevitability. London 
and New York: Oxford Univ. Press. 

CAIRNS, GRACE E. 1962 Philosophies of History: Meet- 
ing of East and West in Cycle-pattern Theories of 
History. New York: Philosophical Library. 

CARR, EDWARD H. (1961) 1962 What Is History? New 
York: Knopf. 


COLLINGWOOD, ROBIN G. 1946 The Idea of History. Ox- 
ford Univ. Press. > A paperback edition was pub- 
lished in 1956. 

CROCE, BENEDETTO (1917) 1960 History: Its Theory 
and Practice. New York: Russell. > First published as 
Teoria e storia della storiografia. 

DANTO, ARTHUR C. 1965 Analytical Philosophy of His- 
tory. Cambridge Univ. Press. 

DRAY, WILLIAM 1957 Laws and Explanation in History. 
Oxford Univ. Press. 

GALLIE, W. B. 1964 Philosophy and the Historical Un- 
derstanding. London: Chatto & Windus. 

GARDINER, PATRICK (1952) 1958 The Nature of His- 
torical Explanation. Oxford Univ. Press. 

GARDINER, PATRICK (editor) 1959 Theories of History: 
Readings From Classical and Contemporary Sources. 
Glencoe, Ill.: Free Press. 

HODGES, HERBERT A. 1944 Wilhelm Dilthey: An Intro- 
duction. London: Routledge. > Contains extracts 
from some of Dilthey’s principal writings. 

KAHLER, ERICH 1964 Meaning of History. New York: 
Braziller. 

LÖWITH, KARL (1949) 1950 Meaning in History: The 
Theological Implications of the Philosophy of History. 
Cambridge Univ. Press. > A paperback edition was 
published in 1957 by Phoenix. 

MANDELBAUM, MAURICE H. 1938 The Problem of His- 
torical Knowledge: An Answer to Relativism. New 
York: Liveright. 

MEYERHOFF, HANS (editor) 1959 The Philosophy of 
History in Our Time: An Anthology. Garden City, 
N.Y.: Doubleday. 

NEW YORK UNIVERSITY, INSTITUTE OF PHILOSOPHY, 5th, 
1962 1963 Philosophy and History: A Symposium. 
Edited by Sidney Hook. New York Univ. Press. 

OAKESHOTT, MICHAEL 1933 Experience and Its Modes. 
Cambridge Univ. Press. 

POPPER, KARL R. 1957 The Poverty of Historicism. Bos- 
ton: Beacon. 

STERN, FRITZ R. (editor) 1956 The Varieties of History: 
From Voltaire to the Present. New York: Meridian. 

WALSH, WILLIAM H. (1951) 1958 An Introduction to 
Philosophy of History. London: Hutchinson. > A pa- 
perback edition was published in 1960 by Harper as 
The Philosophy of History. 

WHITE, MORTON G. 1965 Foundations of Historical 
Knowledge. New York: Harper. 

WINCH, PETER 1958 The Idea of a Social Science and 
Its Relation to Philosophy. London: Routledge; New 
York: Humanities. 


II 
HISTORY AND THE SOCIAL SCIENCES 


To social scientists, all history is social history, 
whether historians classify it as social history, 
political history, economic history, religious his- 
tory, or history of some other kind. It cannot be 
said, therefore, that there is a distinct category of 
historical study which is devoted specifically to the 
past as the social scientist would deal with it. 
Rather, a new method of studying history of all 
kinds is emerging which is intended to satisfy the 
criteria of the social sciences and which provides 
or will provide evidence to illuminate the task of 
the sociologist, the anthropologist, the social psy- 
chologist, and so on. The historian working in this 
way makes use of the theories, categories, and 
techniques of the social scientist whose work he is 
trying to parallel. The social scientist, when he 
turns to accounts of the past for evidence, attempts 
to master the outlook and methods of the historian. 
The purpose of the present article is to examine 
in a summary way the principles which are 
beginning to govern these activities of historians 
and social scientists. It must not be supposed, how- 
ever, that all history is now to be written or ever 
will be written with the scientific study of society 
as the end in view. The writing of history is a much 
more general activity than the systematic study of 
social relationships. Accounts of the past seem to 
have been composed in some form or other in every 
society. In literate societies with a high degree of 
cultivation, these accounts are rewritten every cen- 
tury or every generation, in some cases every few 
years, in conflicting versions. 

The writing of history is undertaken for many 
purposes, which themselves would be a legitimate 
object of a social scientific investigation. These 
purposes can only be hinted at here. Reconstruc- 
tions are worked out and interpretations are built 
up in order to reconcile a national society (or a 
group of any kind) with its past and with the way 
in which its present differs from that past. They 
are needed to make intelligible to every new gen- 
eration its ordained place in time. They are needed 
to justify religious beliefs and practices, to provide 
political rationalizations, to enrich aesthetic and in- 
tellectual experience, and merely to satisfy curios- 
ity. Even the simple keeper of the annals of his 
people or his church does something in all these 
directions and also does something to ensure, as 
d'Alembert said in the great Encyclopédie, that the 
achievements of the past shall not be lost to the 
men of the future. 

The successive authors of the Anglo-Saxon 
Chronicle could not possibly have conceived of a 
science of society or imagined that the work which 
they produced could give rise to such an enterprise, 
let alone that it might provide an alternative to it. 
But since their time and down to our day, historians 
have advanced all these propositions about the re- 
lationship between history and the social sciences. 
It has been claimed that there is a distinctive his- 
torical method which provides its own account of 
how society works or, perhaps, its own unique at- 
tempt to do all that can be done, in view of the fact 
that the workings of society can never be more 
than partially established. According to this view, 
narration and description are the proper methods 
to be used (for an argument of this kind, see Collingwood 
1946). Since no social situation, no past 
event, can ever be described in full in all its chang- 
ing aspects, selection of the typical, after as ex- 
haustive a study as possible, has to govern the 
undertaking. Some claim that the principles of 
such selection can be scientific principles, but 
others deny this, on the grounds that the selection 
can only be intuitive. Thus, the study of history has 
been seen as the point of origin of the social sci- 
ences; or as the rationale of an alternative type of 
special explanation; or as all that can be advanced, 
since social science is a chimera. A very different 
claim has also been put forward. The domain of 
the historian has been defined as all the social 
scientific evidence coming from the past. In this 
way, the phrases “historical science” and “the his- 
torical sciences” have come into being. 

All the various claims about the relationship be- 
tween history and the social sciences raise logical, 
conceptual, and philosophical issues, some of them 
of great intricacy [see HISTORY, article on THE PHI- 
LOSOPHY OF HISTORY]. In the present article re- 
course will be had to an archaic or even obsolete use 
of the word “history,” as an aid to understanding his- 
tory’s present relationship with the social sciences. 
History will be conceived of here as bearing the 
very wide meaning it had in the phrase “natural 
history.” Some biological scientists still call them- 
selves naturalists. 

Nowadays “natural history” is a way of referring 
to biology, botany, zoology, and geology, with a dis- 
tinct implication of their being pursued rather un- 
systematically by amateurs, as a diversion. Before 
and during the scientific revolution, however, “nat- 
ural history” meant all that could be known about 
nature simply by description, as contrasted with 
“natural philosophy,” which meant that part of 
nature which could be understood on principle, 
scientifically, and which was acquired by the sys- 
tematic use of certain techniques of observation. 
If we substitute the word “societal” for “natural,” 
the new phrase “societal history” can be used in 
contrast with the phrase “social science,” in rather 
the same way “natural history” was used in con- 
trast to “natural philosophy” or “natural science.” 
Such a usage recognizes that history is legitimately 
pursued for many purposes other than the comple- 
menting of the social sciences. It avoids the diffi- 
culties, already described, which attach to the 
phrase “social history.” It emphasizes that social 
information which does not yet belong to the ana- 
lytically formulated and technically cultivated social 
sciences (and which may never belong there) can 
nevertheless be apprehended, if not understood, in 
the historical, narrative, descriptive, intuitive mode. 


Societal history, or the “history” of ordinary usage 
taken in its very widest sense, stands to the social 
sciences as natural history stands or once stood to 
the natural sciences. Once this roundabout defini- 
tion is laid down, it becomes clear that although 
every form of historical study necessarily belongs 
to societal history whether or not it would usually 
be described as social, no form of historical study 
necessarily belongs to the social sciences. Particu- 
lar types of historical inquiry may be said to be 
part of the social sciences, nevertheless, if certain 
conditions, discussed later in this article, are sat- 
isfied. We shall, in fact, distinguish those historical 
studies intended to advance the social sciences as 
a particular area of societal history and give them 
a collective name, “deliberative societal history.” 

The limitations usually placed upon the simple 
word “history” will be disregarded in a further way. 
Archeological evidence is sometimes excluded from 
history, but it will be included here as part of socie- 
tal history, as will the material gained by anthro- 
pologists and sociologists in direct observation and 
oral communication. But although societal history 
is a descriptive undertaking, dealing with a very 
wide range of sources and with indefinitely exten- 
sible information, it has to satisfy the strictly chron- 
ological criterion rather more exactly than does 
natural history. It can deal only with those facts 
which belong to past time and which are to be 
understood in one-way temporal succession. Even 
this limitation becomes tenuous in the case of evi- 
dence assembled for the current situation by social 
scientists, which must all belong to past time, even 
if it is only the very recent past. But in practice 
the very near past, as represented by the most 
recently available social survey data, and so on, is 
excluded from societal history. 

Since their subjects are related in this way, the 
social scientist and the societal historian are by no 
means always distinct individuals, any more than 
are the natural historian and the natural scientist; 
each individual is more the one than the other— 
more the sociologist, for example, than the social- 
structure historian. Some important sociologists 
(e.g., Max Weber and T. H. Marshall) wrote spe- 
cifically historical works, and most technical works 
of the social sciences contain some discussion of a 
historical character. This is true even of economic 
theory (see, for example, John Maynard Keynes 
1930), while the studies of anthropologists and so- 
ciologists sometimes have to present a considerable 
amount of descriptive and narrative history of the 
conventional kind. Only theoretical statistical works 
are likely to be entirely wanting in historical con- 
tent. There are, moreover, as is well known, books 
written as history which are rightly regarded as 
classics of the social sciences (Tocqueville’s works 
are good examples). 


Types of societal history 


It is possible to delineate five types of historical 
undertakings, together with their particular func- 
tions in the study of society. They are listed here in 
descending order of significance to the social 
sciences, and, of course, they overlap to a certain 
extent. 

Social science works. The first type of historical 
writing of significance to the social scientist belongs 
to the literature of the social sciences themselves, 
since it consists of parts of works written by social 
scientists. Every such work, as has been said, con- 
tains narrative-descriptive components, and these 
belong to societal history. They are of varying 
length and importance. There are the short histori- 
cal descriptions and arguments which are found 
in such books as Gunnar Myrdal’s An American 
Dilemma (1944) and the very long historical 
citations and discussions which go to make up most 
of the text of Wittfogel’s Oriental Despotism 
(1957). Neither this latter book nor any other 
book of the same type belongs wholly with socie- 
tal history, since its object is to illuminate a partic- 
ular institution in its significance for all societies 
at all times. 

Social structural studies. The second type of 
historical writing of significance to the social 
scientist is social structural history. It consists 
of complete works deliberately undertaken by 
scholars calling themselves historians, rather than 
social scientists, that provide comparative historical 
examples which can be used alongside the compar- 
ative geographical examples of anthropologists. 
Such studies may take various forms: they may be 
comprehensive surveys of particular societies at 
certain points in the past, or they may contain 
records of social change over a particular period. 
But they will always tend to embrace whole national 
societies or whole cultural areas, rather than deal- 
ing in specified institutions alone. Although they 
belong with the established tradition of social his- 
tory, they are undertaken, as far as possible, in con- 
formity with two principles not usually made ex- 
plicit in traditional works. One is that the evidence 
shall be assembled and analyzed in accordance 
with the methods and techniques of all the social 
sciences. The other is that conclusions shall be 
presented in a form which can be used in social 
analysis generally. 

An example of an experimental work of this 
kind, using the method of comparison rather than 
of narrative and dealing with the whole social 
structure rather than with specific institutions, is 
Laslett’s The World We Have Lost (1965), which 
covers English society before and after industrial- 
ization and attempts to satisfy the principles just 
cited. If it is compared with Trevelyan’s English 
Social History (1942), the difference between 
social structural history and traditional social his- 
tory is brought out. A summary of the principles of 
this emerging form of historical writing is at- 
tempted below. 

Areas of social activity. The third type of his- 
torical writing of significance to the social scientist 
consists of studies of the past of some isolable area 
of social activity. The distinctions between these 
special historical areas consist, to a large extent, 
simply of their varying subject matter. But a more 
interesting and important principle of difference is 
beginning to appear. Part of the definition of a rec- 
ognized social science is that it should possess its 
own particular body of theory and technique, al- 
though there are differences between various social 
sciences in this respect. Historical writing within 
the area of each social science must attempt to 
make use of its theory and its technique, and in one 
conspicuous case, that of economic history, it has 
certainly begun to do so. Only one other isolable 
historical area shows signs of a similar evolution, 
and that is demographic history, using the theory 
and technique of demography. These are the two 
social sciences which lend themselves most easily 
to quantification and mathematical analysis; but 
it need not be supposed that the less effective tools 
and devices at the disposal of other social scientists 
are quite without their effect on historical studies. 
The history of religion and the history of educa- 
tion, for example, can in principle make use of 
religious and educational psychology and sociology. 

The theory and technique of sociology and psy- 
chology can in principle be adapted so as to apply 
to the history of literature and the fine arts, to the 
history of social and political thinking, and to the 
history of mathematics and science. All these his- 
torical studies may also come to be illuminated by 
such techniques as content analysis and the other 
expedients used by sociologists and psychologists 
to reach an understanding of beliefs, attitudes, 
opinions, and ideologies in the contemporary 
world. (For an example of these expedients, see 
Lane 1962; and for a highly speculative attempt 
at psychological and intellectual analysis in past 
time, see Erikson 1958.) A further historical area 
may soon show signs of independence, making use, 
where it can, of the theory and techniques of po- 
litical sociology. This new subject might tentatively 
be called the history of political systems, commu- 
nication, and participation. Although such a subject 
will inevitably have to grow out of voting studies 
undertaken historically, there are already signs that 
even in societies without democratic procedures, 
past and present political behavior can be fruitfully 
studied (see, e.g., Vincent 1966). The distinc- 
tion between this nascent subject and political his- 
tory of the conventional kind will concern us below. 

Nevertheless, these studies are difficult to classi- 
fy, as can be seen from the example of the history 
of technology, which certainly belongs as much to 
economic history as it does to the history of mathe- 
matics and science and which may well have its 
part to play in some other areas as well. These 
studies, by treating their data and presenting their 
conclusions in a manner appropriate to their sub- 
ject, will be of direct use to the social scientist, 
providing him with comparative historical examples 
in specialized spheres. Social structural history 
would ideally represent an amalgamation of all 
possible studies of this kind, as well as a general 
framework within which each of them might be 
pursued. 

It is unlikely that many books of this type would 
actually deal with any one specialized area as a 
whole, although it is possible that works, especially 
multivolume collaborative studies, will continue to 
appear with titles such as “A History of Social and 
Political Thought.” Most of the narrower studies 
can have two distinct objectives. The first objective 
is the social analysis of certain features of histor- 
ically distant situations, with the issues of the 
social sciences generally in view. An example of 
this kind of study is Smelser’s Social Change in 
the Industrial Revolution (1959). Although a his- 
torical monograph, applying theory to a particular 
topic, the Lancashire cotton factories of the early 
nineteenth century, it is also intended as a contri- 
bution to general social theory. The second objec- 
tive is to illuminate past events and to revise inter- 
pretations of them, with only incidental reference 
to social analysis of present institutions and atti- 
tudes. It is noticeable that the studies carried out 
so far by the econometric economic historians tend 
to belong to this second type. 

Documentary and preparatory works. The fourth 
category of historical writing of significance to the 
social sciences consists of documentary and pre- 
paratory works. Studies of this kind have consider- 
able value, especially in the present, developing 
stage of many of the subjects already described. 
For example, the discovery and preparation of data 
for the description of social structure from listings 
of inhabitants, including familial structure and 
kinship systems, has been of the first importance 
for social structural history. The editing and print- 
ing of records such as licenses to marry issued by 
episcopal courts, regular series of documents drawn 
up in connection with poor relief, and parochial 
registers containing detailed recordings of baptisms, 
marriages, and burials all come under this head- 
ing and are of great value to demographic as well 
as social structural history (Wrigley 1966). 

Traditional historical studies. The fifth cate- 
gory of historical writing of significance to the so- 
cial scientist is the residual one and consists of all 
other historical works, of whatever kind. 

Works of traditional history are less likely to be 
of direct importance to the social scientist than any 
of those listed in the first four categories. But this 
does not mean that works which have been or are 
now being produced in accordance with traditional 
historical conventions are irrelevant. According to 
the definition adopted here, they belong to societal 
history and could have been mentioned as contain- 
ing documentary and preparatory material. Many 
of them are, in fact, of great value in the hands 
of percipient and conscientious social inquirers. 
Moreover, the realistic and critical historian is often 
in a position to illuminate the use made of histori- 
cal evidence by social scientists in a peculiarly ef- 
fective way. He may even, on entirely historical 
criteria, demonstrate the inefficacy of explanations 
used both by his fellow historians and by social 
scientists. An example of this is the devastating 
work recently done by Hexter (1961) on the-rise- 
of-a-class hypotheses. Even if a historical work is 
composed without any intention of recording or 
illuminating social change, it may nevertheless do 
so or it may be shown to have done so by a later 
critic. 

Deliberative societal history 

There clearly is a distinction between historical 
works intended by their authors as contributions 
to social analysis and those written in indifference 
to such an aim. Works of the first type constitute 
the category of deliberative societal history, in con- 
trast with traditional historical studies. Although 
some of the characteristics of deliberative societal 
history have been sketched above, the criteria 
which mark historical work of this kind are not 
yet clear, distinct, or universally recognized. Never- 
theless, it is already obvious that most traditional 
historical studies could not conform to the required 
conditions. 

In the first place, many historians would be 
unwilling to recognize deliberative societal history 
as a description of their work. Some would reject
the whole project of social and political science.
Their methods belong to the category of tacit 
knowledge, not only in the understanding of past 
events but also in the selection of what will 
interest, inform, or even elevate their readers. 
Analysis of society and of social situations is by no 
means absent, even from writing of this kind, but 
the emphasis is upon description and narration, 
and the task is regarded as being entirely literary, 
with, perhaps, philosophical overtones. History, 
after all, as well as being the companion to the 
social sciences, is one of the traditional arts, with 
its own individual muse. Historians who insist that 
any human experience is unique are understand- 
ably skeptical of attempts at formulating general 
rules for the study of society and social change.

But even if and when it is not done in conscious 
divergence from the social sciences, most tradi- 
tional historical study is still accomplished in 
ignorance of them. It is undertaken without knowl- 
edge of the relevant theories, concepts, and tech- 
niques. This is the second reason so much writing 
of this character is of problematic value to the 
social scientist. 

The interests of the general public, rather than 
of the academic world or of the social scientists,
give rise to the third reason that the relationship 
between traditional historical studies and the social 
sciences is so indirect and arbitrary. The demand 
that history shall be “interesting,” that it shall tell 
a relevant, informative story with a moral or a 
message, has effects on the academic historian as 
well as on the writers of textbooks, the biographers, 
and the journalists. Tradition has tended to estab- 
lish in historical studies specific requirements 
which have been even more limiting. It is expected 
that a national society will ordinarily be the unit 
of historical investigation and of historical narra- 
tive. The chronological divisions usually have to be 
those of the conventional political landmarks, and 
the events, sentiments, and attitudes to be ex- 
plained have for the most part to be political, 
chosen for the importance they have for the 
reader’s sense of citizenship. 

These influences continue even when the sub- 
ject matter is no longer of the traditional politi- 
cal character. Hence, many specialized studies still 
bear titles such as “The History of American 
Science in the Colonial Period” or “The History of 
Japanese Education Under the Tokugawa.” It is 
true that the growing importance of the social 
sciences in recent years is beginning to remove 
some of these limitations and that the domination 
of politics, the state, and the values of citizenship 
is much less than it used to be. It is also true that
economic history shows signs of transcending these
limitations altogether, although it still seems to 
select topics as much for their polemical, and 
often political, interest as for their economic sig- 
nificance. Quite apart from traditional historical 
studies, then, none of the subjects we have classed 
as deliberative societal history can as yet be said 
to be under the controlling influence of the social 
sciences. 

Quantitative history. Of all recent develop- 
ments, it is clear that econometric economic his- 
tory stands in sharpest contrast to traditional 
historical pursuits. Its rapid development in recent 
years makes it necessary to consider the question 
of the use of quantification in historical studies 
and its effect on their relationship with the social 
sciences. Douglass C. North sets out the charac- 
teristics of econometric economic history, but even 
among the economic historians there are some who 
do not consider it history at all or, at most, call it, 
in Fritz Redlich’s term, quasi history. [See HISTORY, 
article on ECONOMIC HISTORY.]

The insistence on giving a numerical value to 
everything in a historical situation which is rele- 
vant to the problem in hand is one source of the 
criticism. This has given the title “cleometrics” to 
the new pursuit and brings it directly into line with 
trends in the social sciences. Although this pursuit 
has been severely criticized, I do not believe that 
its numerative characteristic makes the subject any 
the less historical. 

Even if the numerical equivalents seem quite 
unreal to the common-sense observer and intro- 
duce a host of uncertainties into the issues which 
have to be judged, it has to be remembered that 
all historical judgments are beset with uncertain- 
ties of the same logical kind. Attempting to reckon 
the percentage of the gross national product of 
the United States made up by the railroads in 1850 
is the same kind of undertaking as estimating the 
amount of influence which the growth of Christi- 
anity had on the decline of imperial Rome. Nor 
does it appear that the counterfactual type of argu- 
ment used in cleometrics is necessarily a contra- 
diction of the historical outlook and method. 
Estimating what might have happened if what did 
happen had not happened is characteristic of much 
historical argument of a conventional kind, and 
all that cleometrics does is to argue confessedly 
in this way, openly estimating the risk of error. 

The truly significant point about cleometrics, 
for traditional historical studies, is that it attempts 
to do economic analysis on noncontemporaneous 
subjects. That is why this type of economic history 
belongs to the social sciences to an extent that no
other historical study yet does. It has ceased to be
mere societal history and has become social 
science. Only the recent advances in the theory of 
economic growth have made such work possible, 
and it is clear that the more fragmentary the quan- 
tifiable evidence, the more sophisticated the neces- 
sary theory is likely to be. For most of the other 
social sciences having equivalents among types of 
deliberative societal history, no such advanced 
theory exists or seems likely to be developed, except 
perhaps in demography. It must be remembered, 
also, that vast areas of information from the eco- 
nomic past cannot be dealt with by cleometrics 
and will never become susceptible to such methods. 

It seems unlikely, therefore, that the rise of 
cleometrics is a portent of the future course of the 
relationship between traditional historical studies 
and the social sciences. In the foreseeable future 
only very few historians will be able to call them- 
selves social scientists. Not many more social scien- 
tists will, perhaps, be able to apply their theories 
and practice their techniques on chronologically 
distant materials. The most important change will 
come at a more modest level: much more historical 
writing will surely come under the heading of “de- 
liberative societal history.” The elaboration of so- 
cial structural history will be an important means 
by which this change will take place. 


Social structural history 


Two of the characteristics of social structural 
history have also been made plain: first, it should 
assemble its evidence and carry out its analysis in 
accordance with the methods and techniques of 
the social sciences, and second, it should present 
its conclusions in a form which can be used in 
social analysis generally. It has also been suggested 
that social structural history may dispense with the 
narrative method, which has in the past been an 
almost universal feature of historical writing. 

Since social description is to be done in wholes, 
narrative would in any case be a peculiarly difficult 
form for history written in this way. But although 
works of this kind will probably tend to consist of 
contrasts between a “before” and an “after,” the 
dates selected will represent the median years of 
particular generations, rather than exact moments.
The choice of the generations to be described will 
itself be a matter of importance and, like every- 
thing else, will have to be made with the interests 
of the social sciences generally in mind, as well as 
being made in accordance with historical criteria. 

The social structural historian should begin his 
descriptions where the anthropologists and the sociologists
do, that is, with the size, structure, and
functions of the family in the society in question.
Then will come the kinship system, then the other 
relationships, the geographical, the economic, the 
religious, the intellectual, that go to make up the 
community. “Community” here is understood in a 
plural sense, for the local, the tribal, and the re- 
gional associations and finally the national com- 
munity, if indeed all these existed, are all to be 
included. Only then, and this is in sharp contrast 
with the practice of the traditional historian, will 
he concern himself with political institutions and 
the state itself. 

It is clear that historical writing of this charac- 
ter will be faced with a particularly acute form of 
the problem which affects all historiography, that 
of summarizing and abbreviating sufficiently to 
make its account intelligible. Traditional history 
has a traditional expedient for this purpose: 
the choosing of significant instances, which are 
presented as typical. Since the social structural 
historian is still writing societal history, rather than 
social science, he will of course be at liberty to use 
the same method. But he must be expected to have 
a much clearer notion of the social theory or the- 
ories which make his chosen instances significant, 
and he will recognize that total description is a 
chimera, that all he can ever put on paper is a 
model of the society he is dealing with. The func- 
tion of theories and the usefulness of models in 
pursuits of this kind is to order the data, to select 
and insist upon the regularities, to simplify dras- 
tically by approximation. 

In the course of time, then, the social structural 
historian may find himself having to handle whole 
sets of theories and several different types of 
models. By this time the theories and models used 
by economists and demographers should be famil- 
iar to him. But his first concern will be to con- 
struct a model of his society in the much simpler 
sense just referred to; perhaps “miniature” would 
be a better term. He must, at all points in his de- 
scriptions, analyses, and comparisons, have the 
whole of the society in mind as well as a simplified 
notion of its over-all workings. He may find it use- 
ful to think of two separate models, or miniatures, 
one static and one dynamic, fitting into each other 
in the way any theory appropriate to a system in 
the process of change fits into a theory of that sys- 
tem as it is when at rest. He should at all times be 
aware of the ways in which his chosen models 
misrepresent the realities he is striving to deal with, 
and his duty will be to try to improve them. He 
should be conscious of the haphazard nature of the 
theories he has to use and aware of the areas 
in which, like his companions and predecessors
among the traditional historians, he has to rely on
intuition and guesswork. He may or may not find 
general theories of social action, like those of Tal- 
cott Parsons, useful to him in his difficult and chal- 
lenging task. The essential point is that he should 
recognize that his is a theoretical, as well as an 
empirically descriptive, undertaking. 


Social structural history and the other forms of 
deliberative societal history which have been de- 
fined here are not propounded as alternatives to 
traditional historical writing which should, and 
ultimately will, replace it. The writing of history 
has many functions other than providing social 
scientists with comparative instances and with a 
continuum in which they can do their work. His- 
tory will continue to be written for all the many 
purposes for which it has always been written, re- 
gardless of what the social sciences do and how 
they may develop. All that has been attempted here 
is a sketch of the somewhat complex relationship 
between history as traditionally written and the 
social sciences as they are now pursued. Let it be 
stressed again that the historian and the social 
scientist have often been the same person, and this 
will probably increasingly be the case in the future. 
But in the writing of social structural history, the 
differences between the two roles become especially 
clear. Complicated and difficult as composing his- 
tory of this kind may be, it presents a challenge 
which neither the historian nor the social scientist 
can any longer afford to ignore. 

PETER LASLETT

III
ETHNOHISTORY 


Although it has appeared sporadically since the 
early twentieth century, the term “ethnohistory” 
was first used systematically in the 1940s by some
North American cultural anthropologists, arche- 
ologists, and historians, to describe their writings 
and research on the history of the aborigines of 
the New World. In more recent years “ethnohis- 
tory” has come to mean the historical study of any 
non-European peoples. Utilizing documentary, oral, 
and archeological sources and the conceptual 
framework and insights of cultural and social 
anthropology, these studies attempt to reconstruct 
the history of indigenous peoples before and after 
European contact. 

Ethnohistorians combine their “historical” sources 
with ethnographic field work among the present- 
day members of the societies whose past they aim 
to reconstruct. Their goal is to present “rounded” 
history, which will take into account the social and 
cultural systems of indigenous peoples; thus, ethno- 
historians of North America have paid particular 
attention to the location and migration of Indian 
tribes, changing cultural adaptions to environment, 
demographic history, the exact nature of the rela- 
tions of particular tribes with Europeans, and the 
effects which activities such as the fur trade and 
warfare have had on American Indians (“Sym- 
posium on the Concept of Ethnohistory” 1961). 

Ethnohistory has led mainly to studies of particular
cultural units, equivalent to the field anthropologists’
ethnographic accounts. There has been
little effort to build a body of generalizations, either 
through comparison or through the development 
of concepts or categories of sequences which would 
make interregional comparison possible. The char- 
acteristic approaches and problems of ethnohistory 
derive from the nature of the indigenous societies 
being studied, the period, type, and duration of 
European domination, the kinds of documentation 
available, and the theoretical orientation of the 
anthropologists who have studied the region. 

Ethnohistory differs from the work of conven- 
tional colonial historians in several respects. The 
ethnohistorian usually has firsthand field experi- 
ence in the area; this experience increases his 
knowledge of the indigenous society and how it 
actually functions or functioned. Thus, his interpre- 
tation of documentary evidence is deepened. The 
ethnohistorian tends to think in systemic, func- 
tional terms rather than in terms only of accident 
and particulars. He tries to use his general knowl- 
edge of social and cultural organization and con- 
structs his units in terms of such concepts as 
“segmentary lineage-based societies,” “peasant so- 
cieties,” and “patrimonial societies.” The ethnohis- 
torian tries to perceive historical events from the 
position of the aborigine rather than that of the 
European administrator, even when he is using 
the administrator’s documents. He is more inter- 
ested in the impact of colonial policy and practice 
than in the genesis of these policies in the metro- 
politan society. 


History of the approach 


One of the major sources of the field of anthro- 
pology was a concern with the history of man in 
general, the comparative study of societies and 
institutions, and the reconstructions of the history 
of particular societies. Voltaire, Gustav Klemm, Sir
Henry Maine, J. F. McLennan, J. J. Bachofen, 
N. D. Fustel de Coulanges, L. H. Morgan, and E. B. 
Tylor drew heavily on historical materials to estab- 
lish a comparative science of society and culture. 
These early anthropologists utilized information 
about classical civilizations, Hindu India, Euro- 
pean barbarians, medieval European institutions, 
and missionary and traveler accounts of primitive 
societies. In their broad-ranging and speculative 
reconstructions of the history of man, they dis- 
covered and labeled some of the basic features of 
primitive and peasant societies. [See the biographies 
of BACHOFEN; FUSTEL DE COULANGES; McLENNAN; 
MAINE; MORGAN, L. H.; TYLOR; VOLTAIRE.]

Subsequently, the broad schemes of “evolutionary
history” put forward by these early anthropologists
were rejected; however, they did illustrate how
documentary material, illuminated by comparative 
theory, may be used to understand particular se- 
quences of social and cultural change. 

At the beginning of the twentieth century the 
diffusionists, e.g., Ratzel and Graebner, and then 
the distributionists, e.g., Wissler, Kroeber, and 
Lowie, denied the possibility of the use of direct 
historical methods to reconstruct the history of 
aboriginal societies. Kroeber believed that for the 
study of “poor dateless primitives . . . we do not 
possess even one document written before our day” 
([1901-1951] 1952, p. 65). Lowie, in his attack
on Swanton and Dixon’s use of oral traditions and 
travelers’ accounts in their history of North Ameri- 
can Indian migrations (Swanton & Dixon 1914), 
utterly denied “that primitive man is endowed with 
historical sense or perspective” (Lowie [1917] 1960, 
p. 206). Lowie believed that the anthropologist’s 
“historical problems can be solved only by the ob- 
jective methods of comparative ethnology, arche- 
ology, linguistics and physical anthropology” (ibid., 
p. 210). 

The American distributionist, or “historical,” 
school was based on the attempt to discover items 
of culture and society from the “memory culture” 
of surviving elderly members of American Indian 
tribes. These social and cultural items or traits— 
items of material culture and linguistic data—were 
plotted geographically, in attempts to infer histori- 
cal or chronological relationships between tribes. 
The distributionists were not concerned in any de- 
tailed fashion with the history of particular tribes. 
Typical of this approach, Sapir’s “Time Perspective 
in Aboriginal American Culture: A Study in Meth- 
od,” published in 1916, devoted only two pages out 
of 87 to the use of documents and indigenous oral 
traditions. Dependence on distribution studies of 
individual traits or complexes (e.g., the Sun Dance, 
particular tales and myths) and lack of systematic 
use of documents and oral histories weakened the 
work of the American historical school, and they 
tended to produce timeless descriptions of phenom- 
ena on an areal basis or descriptive synchronic 
accounts of particular memory cultures. 

In England in the 1920s Malinowski and Rad- 
cliffe-Brown also rejected historically oriented re- 
search in anthropology. Both argued that docu- 
ments for the study of primitive society were 
unavailable. Radcliffe-Brown contended that the 
nature of social anthropology and history were anti- 
thetical: social anthropologists, as distinguished 
from ethnologists, were to be concerned with the de- 
velopment of generalizations about the structure of 
society derived from the comparative study of primitive
societies, without reference to their history.
Synchronic, or cross-sectional, studies of societies 
were carefully distinguished from diachronic stud- 
ies, or studies of societies as they changed through 
time; the latter could lead only to explanations of 
uniqueness. Until the 1950s most British social 
anthropologists kept to Radcliffe-Brown’s strictures 
and avoided diachronic studies. [See the biographies 
of MALINOWSKI; RADCLIFFE-BROWN; SAPIR.] 

British and American anthropologists continued 
to study social and cultural change without refer- 
ence to historical materials, even when, as in the 
case of Lucy Mair’s study of the Baganda (1934) 
or Monica Hunter Wilson's of the Pondo (1936), 
documentary sources were readily available. Gluckman’s
study of the Zulu political system (1940)
and Nadel’s Black Byzantium (1942) did use his- 
torical materials to develop a model of political 
structures before European incursions. However, 
these studies are not histories but analytical ab- 
stractions from historical sources to illuminate 
structural principles. The one outstanding excep- 
tion during this period is E. E. Evans-Pritchard’s 
study of the bedouin of Cyrenaica (1949). In this 
study Evans-Pritchard analyzed the process by 
which a lineage-based society developed centralized 
political roles and institutions. The Sanusi, an order
of Muslim religious leaders, had moved into Cyre- 
naica in the early nineteenth century and provided 
needed religious and trade functions in the society. 
Geographically and structurally they located their 
religious centers at boundaries of existing lineage 
and tribal territories. Largely through the pressure 
of Turkish and, subsequently, Italian administra- 
tors who tried to rule the bedouin, the heads of the 
religious order, as the only visible leaders, were 
impelled into society-wide political roles. Evans- 
Pritchard used available colonial records and re- 
ports, published narratives, oral traditions, and the 
memories of participants in the events which make 
up the historical narrative. The Sanusi of Cyrenaica
is based on Evans-Pritchard’s understanding of the 
operation of an acephalous lineage-based political 
system, and it is this which gives him the structural 
principles on which he organized his historical 
narrative and provides a model for the study of the 
process of internal structural change in such a 
society under the impact of foreign rule.

In the United States in the period from 1910 to 
1930 a few anthropologists, notably John R. Swan- 
ton, in his studies of the Indians of the American 
southeast (1922; 1946), and Frank G. Speck, 
working on the tribes of the northeastern United 
States (1928), used direct historical methods to 
reconstruct the tribal pasts. For this task they drew
on their field work among the remnants of the
tribes of the areas and made intensive use of a 
wide range of documentary materials. [See the biography
of SWANTON.]

Fittingly, the clearest early examples of system- 
atic ethnohistorical work are found in a volume of 
studies dedicated to Swanton and published by the 
Smithsonian Institution in 1940. William Fenton
used seventeenth-century and eighteenth-century 
documents to trace location and movement of Iro- 
quois bands (1940); William Duncan Strong dem- 
onstrated that documentary materials could be 
used with archeological data to provide a contin- 
uous record from present into the past of particular 
sites (1940); Julian Steward’s study of Great Basin 
societies combined ecology, history, archeology, and
ethnography and yielded insight into structural and 
cultural processes (1940). These three studies indi- 
cated the ethnohistoric approach that was to be- 
come formalized in the 1950s. [See the biography 
of STRONG; ECOLOGY, especially the article on CULTURAL
ECOLOGY.]

The accumulation of ethnographic data made it 
clear that early assumptions about the stability of 
cultures and societies before European contact were 
false. Anthropologists began to recognize that in- 
stead of a precontact situation of stagnation in 
aboriginal societies, changes of three types had 
occurred. First, there were small-scale cyclical 
changes, exemplified by the growth and fission of 
extended families and lineages. There were also
larger cycles of political and cultural expansion as 
lineages within tribes came to dominate similar 
units; however, many societies could not develop 
institutions to contain reassertion of independence 
of such units, so that large-scale tribal organiza- 
tions would develop for a time under one or another 
section of a tribe, only to break apart into smaller 
units again. The third type of change involved 
large-scale tribal migrations, leading to greatly 
changed political, social, and ceremonial orders. 

In addition to these internal processes of change, 
ethnohistorians have demonstrated the indirect 
effects of outsiders—Europeans and Arabs, for 
example—on indigenous societies and cultures even 
before the period of European domination. The 
slave trade in both east and west Africa, the trans- 
Sahara trade in west Africa, and the ivory trade 
in east and central Africa led to major political 
changes in African societies. The fur trade in North 
America led to major intertribal warfare, the devel- 
opment of ideas of property, and the emergence of 
a stratified social system based on differential possession
of or access to furs. The introduction of
the horse to the Great Plains of North America
changed the way of life of many tribes who then
bordered the region. In each case the culture and 
society that anthropologists assumed were static 
and stable and from which one could measure or 
describe change were in themselves changing be- 
cause of outside influences (Ewers 1955; Leacock 
1954; Jones 1963; Dike 1956). 

The passage of the Indian Claims Act in 1946 by 
Congress led to a marked rise in ethnohistorical re- 
search in the United States. Under the provisions 
of this act Indian tribes could sue the federal gov- 
ernment for recompense for lands taken from them 
after the Indians had signed treaties protecting 
their rights. Anthropologists were employed as ex- 
perts by both Indian tribes and the government, 
to establish location, extent, and nature of aborigi- 
nal control over various territories and the exact 
nature of treaty obligations. This drew the attention 
of many ethnographers, who previously had paid 
little heed to the extensive archival resources of 
the federal government and the various states in 
their study of the American Indian. Ethnohistory, 
the principal journal in the field, was founded in 
1954 partly to provide an outlet for materials and 
interest developed by the Indian claims cases. 

The expansion of field-work opportunities in 
Latin America and Asia and the emergence of 
many states from colonial rule has been a tremen- 
dous stimulus to ethnohistorical work since the end 
of World War II. In many of these areas there are
long literary traditions and a wealth of documen- 
tary material. In Latin America, for example, cer- 
tain areas have been covered in historical sources 
for a four-hundred-year period (for a brief review 
of the literature, see Adams 1962; Armillas 1960; 
Gibson 1955). In east and southeast Asia there
have been important ethnohistorical studies of 
kinship and clan structure (Freedman 1958; R. J.
Smith 1962), land tenure (T. C. Smith 1959), the
recruitment and training of indigenous bureaucra- 
cies (Ho 1962; Marsh 1961; Silberman 1964), 
urban social history and mobility (R. J. Smith 
1963), immigrant communities (Skinner 1957), 
and indigenous political systems (Gullick 1958). 
Ethnohistorical studies of south Asia and the Mid- 
dle East are beginning to appear (Cohn 1962a; 
1962b; Polk 1963). 

In European studies there has been a long tradi- 
tion in the study of classical, medieval, and early 
modern society, enlightened by sociological and 
anthropological method and concepts. Most of this 
work has been carried out by social, economic, and 
legal historians rather than by anthropologists 
themselves. The ethnohistorical study of classical
society has attracted considerable attention (Kluckhohn
1961). M. I. Finley, on the basis of the
Odyssey, has written an essay on the culture and 
social structure of the Greeks of the heroic era; in 
this he consciously used the ideas of Malinowski, 
Mauss, and Radcliffe-Brown (Finley 1954). E. R. 
Dodds, in his analysis of Greek literature, has 
drawn on some of the concepts of psychoanalytically
oriented anthropology (1951). Marc Bloch’s
great works on feudal society (1939-1940) and the 
rural structure of medieval France (1931) illus- 
trate the possibilities of an ethnohistory of medie- 
val Europe. 

The writing of British social history from the 
time of Maitland (1897) and Vinogradoff (1905) 
has been marked by the conscious and unconscious 
use of social anthropology. Modern subjects that 
have received sophisticated ethnohistorical treat- 
ment include the blood feud of the Franks (Wal- 
lace-Hadrill 1959), Anglo-Saxon kinship (Lancas- 
ter 1958), and marriage systems of the early 
modern period (Stone 1965, pp. 589-671; Habak- 
kuk 1950). Although social anthropologists have 
done considerable field work in European peasant 
societies, few examples of systematic and careful 
ethnohistorical work have appeared. An exception 
is the work of Lawrence Wylie, a student of French 
literature and civilization, who on the basis of field 
work among French peasants has been able to 
show the usefulness of oral traditions and documents
for studying the changing value systems of a
peasant village (Wylie 1965). 

In areas without long written traditions, careful 
and important ethnohistorical work has begun. The 
Journal of African History, founded in 1960, dem- 
onstrates the utilization of official records, recorded 
African traditions, and Arabic and Coptic mate- 
rials. The institutional history of the Maori from 
the eighteenth century is being written (Vayda 
1961; Biggs 1960). The Journal of Pacific History 
was recently established as an outlet for the grow- 
ing ethnohistorical research on the Pacific area. 


Sources and methods 

Written documents. In his use of written docu- 
ments, the ethnohistorian initially has the same 
problem and uses the same techniques as conventional
historians. The ethnohistorian who has been
trained as an anthropologist and has carried out 
field work is often highly frustrated when he has 
to depend on documents. The research problems of 
the ethnohistorian usually pertain to local history 
or “subhistorical” problems. He is not concerned 
with major, well-documented events, which a polit- 
ical historian deals with; very often he wants to 
know the minutiae of the past, e.g., the kinship
connections of shadowy historical figures in an indigenous
society, the movement and location of
particular lineages at particular times, the sym- 
bolic meanings of a coronation ceremony in an 
African kingdom, the population of an American 
Indian group in the seventeenth century. 

Often it is difficult to identify adequately the in- 
dividuals and groups the ethnohistorian is con- 
cerned with. As an anthropologist, he expects to 
build inductively, from disparate bits of informa- 
tion, a picture of a functioning system; but he can- 
not generate his own data by asking people ques- 
tions and observing their behavior in the context 
of living experiences. The documents he deals with
are rarely written by the people whose social struc- 
ture or culture he wants to study but are accounts 
by observers, naive and biased, who often only half 
understood what they were recording. If he uses 
administrative records, not only must he know, as 
a good historian, who wrote the minutes or the 
statements of decisions taken and why they wrote 
them; he must place the data in a broader context 
of administrative policy. Certain official records,
such as tax records, land surveys, and documents 
from actual legal proceedings, as distinguished 
from decisions and policies, frequently yield the 
best data. These materials are less finely filtered 
through the cultural screen of the administrators. 
The ethnohistorian must constantly try to under- 
stand the categories of the administrator and out- 
side observer, as well as the indigenous classifi- 
cation systems. The interpretation of official or 
nonofficial documents, of policy statements and 
other primary data, requires an understanding of 
the culture and society of colonial administrators. 
This in itself is difficult, for the gross features of 
the metropolitan society may be misleading. The 
ethnohistorian has to know what particular group 
in the society the administrator came from, whether 
his values, education, and social and political phi- 
losophy differed from the rest of the society, and 
if so, how. The ethnohistorian must understand 
the structure of the colonial administration and 
know the partisanship of the men responsible for the 
documents he studies. He must understand the re- 
lationships between the decision makers in the 
metropolitan center, the administrators in the colo- 
nial center, and the men in the field. He has to 
know how administrators collected data and infor- 
mation and whom they dealt with and employed 
from the indigenous society. He must perceive 
which notions developed about the indigenous so- 
ciety were mistaken, how they affected decisions 
and observations, and how decisions which may 
have been based on such misinformation affected
the indigenous society. The ethnohistorian’s task
is to use conventional historical methods but to ask 
different questions and to keep in mind his concern 
with the indigenous society (Curtin 1964). 

There are, for almost every region, extensive 
published collections of source material; for exam- 
ple, Thwaites’s 73-volume series (Jesuits Letters 
From Missions 1896-1901) for North America, the 
collections of Theal (1883) and Brasio (1952- 
1960) for southeast Africa, and parliamentary 
papers of Great Britain for India and Africa. The
main resources, however, are to be found in the 
national and regional archives and local adminis- 
trative and record offices of the area being studied. 

In areas such as Uganda, the emirates of north- 
ern Nigeria, and the Malay States, where Euro- 
peans ruled indirectly and tried to maintain the 
indigenous political system, documents were pro- 
duced by members of the society themselves. Polit- 
ical and social development can be traced through 
the eyes of some of the indigenous peoples. 

Oral traditions. In recent years, particularly in 
the study of the history of African societies, the 
ethnohistorian and the anthropologist with a historical
interest have demonstrated convincingly
how oral tradition can be recorded, collated,
checked, and utilized for historical purposes (Abraham
1961; Vansina 1961; M. G. Smith 1961). Oral
traditions cover a wide variety of subject matter 
and can be found in a variety of forms. Societies 
with centralized political institutions and conquest 
states have often produced well-developed oral his- 
tories, and frequently there are specialists whose 
concern it is to memorize and transmit these tradi- 
tions. In the use of this form of oral tradition, great 
caution is obviously necessary, since the history re- 
flects as much about present social and political 
structure as it does about the past and is constantly 
being changed to account for changing situations 
(Barnes 1951; Cunnison 1951). 

Oral history reflects the social units within the 
society; villages and lineages will have accounts of 
their past, which perform the specific function of 
relating groups to each other and which validate 
or correct local claims and support relationships. 
The ethnohistorian is often confronted with an 
extraordinary multiplicity of conflicting accounts 
of the past, even from the same village (Cohn 
1961). Tribal segments, royal lineages, and courts 
may have well-preserved histories, which function 
as charters to justify contemporary social structure. 

As Vansina (1961) demonstrates, historical nar- 
ratives are not the only aspect of oral tradition that 
can be recorded, collated, and utilized; sacred 
formulas, names, poetry, genealogies, folk tales,
myths, and legal precedents are useful to the ethnohistorian.
In the interpretation of oral tradition,
stress must first be put on the cultural context in 
which the tradition is found. Vansina defines oral 
tradition as “testimonies of the past which are 
deliberately transmitted from mouth to mouth.” As 
he does in the case of written documents, the researcher
must always ask what function the tradition
performs in the contemporary society. Even
material which is demonstrably false can be of 
great value, as it might incidentally contain histori- 
cal facts. 

Where outsiders have been recording indigenous 
oral traditions for a long time (as, for example, 
among the Maori), the relationship between the 
oral tradition and contemporary political structure 
can be used to understand not only the past re- 
ferred to in the tradition but the actual political 
situation existing at the time the tradition was re- 
corded. 

Field work. Field work is essential to the work 
of the ethnohistorian. The basic anthropological 
orientation, which differentiates the ethnohistorian 
from the conventional colonial historian, is devel- 
oped through the experience of systematic field ob- 
servation and the collection of data from living 
people; the aim of field work is to present a descrip- 
tion and analysis of a functioning social system. 

Field work, then, is a major part of the ethnohis- 
torian’s training; through field work he develops a 
sensitivity to the structure of a society that is diffi- 
cult to achieve from study of documentary evidence 
alone. Ideas relating to historical relationships and 
processes may actually be tested in the field, where 
aspects of the society and culture are still in operation.
[See FIELD WORK.]


Ethnohistory and anthropology 


Thus far diachronic studies have not yielded 
theoretical formulations. While synchronic studies
are useful in enabling the ethnohistorian to infer 
social processes from documentary evidence, the 
contribution that diachronic studies will make to 
theory building or even to the development of 
descriptive generalizations regarding society and 
culture is harder to demonstrate. Even in the most 
rigid synchronic ethnographic study, the ethnog- 
rapher must deal with the dimension of time. At 
the very least, he is dealing with three generations 
and with individuals whose lives have covered a 
sixty-year period. Invariably the field ethnographer 
asks questions about the past; he must confront the 
question of norms and changing norms, accidental 
social arrangements, and enduring aspects of the 
social structure.
Through historical study the anthropologist may
identify changes within the system which are the 
result of flux, accident, or cyclical sequences and 
those which are due to structural realignments. 
Nadel and others have argued that to know the 
direction of social structural change, one needs 
time depth (Nadel 1957, chapter 6; Lévi-Strauss 
1949). Thus, for example, careful statistical study 
frequently shows that there are in some, if not all, 
societies degrees of latitude or freedom in an indi- 
vidual’s choice of residence, whether it be virilocal 
or uxorilocal, and these choices may be related to 
other variables. Synchronic studies may account 
for these relationships, but if we want to account 
for change, then historical methods for studying 
a society—primitive, peasant, or industrial—are 
the prerequisite for the development of adequate 
theories (Evans-Pritchard 1961; M. G. Smith 1962; 
Thomas 1963). 


Historians and anthropologists 


The development of nineteenth-century “scientific
history”—the study of the past divorced from
the values and passions of the historian’s times, 
the idea that historical facts could be determined 
and, if chronologically ordered, would speak for 
themselves—has led, with some notable exceptions,
to the historian’s consciously eschewing
concepts and generalizations that might guide and 
illuminate his description and analysis of the past. 
In the twentieth century, though, historians have 
become increasingly aware that they do use and 
have to use generalizations if they are to do more 
than edit texts. H. Stuart Hughes (1960, pp. 25- 
26) has pointed to at least four levels at which 
historians generalize. First, they generalize seman- 
tically; by using words, such as “nation,” “revolu- 
tion,” “development,” “trend,” and “social class,” 
historians implicitly abstract, generalize, and com- 
pare. Second, “conclusions” in the form of a group- 
ing of statements about a man, a period, or a move- 
ment are generalizations. Third, schematizations, 
inherent in such ideas as “urbanization” and “indus- 
trialization,” by which bits and pieces of historical 
study are organized in terms of process or structure, 
are generalizations and are close to the kinds of 
generalizations social scientists make. Finally, there 
are the broad, all-inclusive systematizations of his- 
tory or metahistory, associated with the work of 
men like Spengler and Toynbee. It is at the third
level, that is, in the conscious use of concepts re- 
garding process and structure in society and cul- 
ture, that the social scientist and the historian can 
best interrelate their study. If the characteristic 
activity of the historian is the study of the past and
if his organizing principle is a time sequence, then 
he must borrow organizing principles from other 
disciplines, both the humanities and the social sci- 
ences. In major subfields of history, this process of 
borrowing is explicit, for example, in economic 
history, where the concepts and methods of eco- 
nomics are consciously used to provide the concep- 
tual framework. Intellectual and social history have 
borrowed from psychology, sociology, and anthropology.


In the last thirty years there have been several 
efforts to utilize the anthropologist’s approach in 
the study of history; the anthropology which has 
proven most congenial to historians is cultural 
anthropology. The concept of culture as an all- 
embracing idea covering behavior and values of a 
particular people at a particular time fits well with 
historians’ predilections. Hughes puts it well when 
he says: “. . . the approach of the cultural anthro- 
pologist so closely resembles that of the historian 
as frequently to seem identical with it. Like the
historical scholar, the student of exotic cultures 
adopts a highly permissive attitude towards his 
data; he is perfectly happy in the realm of impre- 
cision and of ‘intuitive’ procedures; and he tries to 
grapple with what he regards as the central prob- 
lems of the societies with which he is concerned” 
(Hughes 1960, p. 34; see also Ware 1940; Social Sci- 
ence Research Council 1954; Gottschalk 1963). 
Books such as Ruth Benedict’s Patterns of Culture
(1934) and attempts on the part of anthropologists 
to do national character studies are taken by his- 
torians as models (for example, see Potter 1954) be- 
cause the approach, rather than the techniques, 
methods, and concepts, of the cultural anthropolo- 
gist is useful to historians. With notable exceptions, 
such as Marc Bloch, historians have not been eager 
to combine field work with historical research to find 
still extant in societies traces of previous industrial 
and agricultural techniques or surviving forms of 
social organization [see the biography of BLOCH].

It is, however, in the study of the preindustrial 
and modernizing societies of today and of the his- 
torical societies that characterized the whole world 
before the beginning of the nineteenth century that 
the anthropologist and the historian would appear 
to need each other. 


BERNARD S. COHN


[See also ARCHEOLOGY; ETHNOLOGY; HISTORIOGRAPHY.]
IV
CULTURE HISTORY 


Culture history is the subdivision of general 
history that is concerned with the historical de- 
velopment of nonliterate peoples, present and past. 
It is almost always practiced by cultural anthro- 
pologists, if we include under this designation such 
specialists as archeologists and anthropological lin- 
guists. This definition implies that there is no real 
difference in principle between the history of the 
professional historian and the culture historian. 
Sometimes an attempt is made to distinguish be- 
tween the two by contrasting the use of written 
documentary sources as the chief or only kind of 
evidence admitted by the historian proper with 
the variety of other, more conjectural methods 
used by the student of nonliterate cultures. This 
point of view has occasionally been taken either 
by historians who wished to resist extension of 
the field of history through these methods or by 
“schools” of anthropologists, such as the earlier 
social functionalists, who admitted the value of 
“genuine” history based on documentary evidence, 
while rejecting the “conjectural” culture history of 
historically minded anthropologists. 

It is clear, however, that such a distinction can- 
not be maintained on principle. As noted in stand- 
ard handbooks of historiography, the task of his- 
torical investigation involves the use of all types 
of evidence regarding the past. Thus E. Bernheim 
([1905] 1926, p. 62) states that all peoples can 
be subject to historical investigation and that the 
principles of historical investigation are every- 
where the same but that differences in the nature 
of the evidence require specialized knowledge and 
training. Thus, “. . . it is in the interest of a scien- 
tific division of labor to assign the history of primi- 
tives and prehistoric peoples to the ethnologist and 
archeologist.” Besides, even where documentary 
evidence exists in abundance, as it does from the 
ancient classical world, nondocumentary tech- 
niques, such as archeology and comparative lin- 
guistics, have made essential contributions and are 
used by professional historians. On the other hand, 
anthropologists have come to realize that they are 
by no means limited to nondocumentary methods. 
For many parts of the world there are frequently 
historical documents that may shed valuable light 
on cultural changes undergone during the centuries 
between first contact with the West and profes- 
sional anthropological field study. In some in- 
stances, there may even be indigenous historical 
records neglected by the historians because the 
area is outside of the normal purview of their
interests. This was the case, for example, with the
Islamized peoples of the Sudan. 

The aim, then, of culture history is in no essential
respect different from that of conventional history,
particularly when the latter is viewed in its
most general aspect as not merely political history 
but as history of all aspects of culture. It may be 
added that for his primary goal, the understanding 
of cultural development, the culture historian will 
need certain noncultural data, such as environ- 
mental changes, human racial differentiation as 
the result of isolating mechanisms paralleling eth- 
nic differentiation, and inferences concerning 
ancient demographic factors. The difference between
culture history and conventional history is
then one of degree rather than kind. Since he 
necessarily relies to a greater extent on nondocu- 
mentary sources, the culture historian will be con- 
cerned with groups and not individual actors, and 
the time scale will often be relative rather than 
absolute. However, with the development of radio- 
active and other methods of absolute dating, even 
this latter difference tends to be effaced. 


General methodology 


All historical investigations proceed by infer- 
ences, often very complex, from evidence existing 
in the present. The relation between the evidence 
itself and the fact of which it is a trace is of two 
main logical types: cause and effect, as when an 
artifact is taken as evidence for the human activity 
that produces it, or symbol and referent, as in 
verbal accounts (whether written or oral) in which 
the evidence is a description of the fact. Traces 
differ, as will be seen later, in still other respects. 
Particularly in culture history, where documentary 
evidence is usually minimal or lacking, the gen- 
eral strategy of the historical enterprise is based 
on the circumstance that the same event may leave 
multiple traces, each of which provides independ- 
ent evidence for the fact. 

For example, if at some time in the past one 
people has borrowed the cultivation of a food plant 
from another, it will have taken over the genetic 
varieties utilized by the donors and the same 
or similar methods of cultivation. They may have 
borrowed the word for the plant itself or other 
terminology connected with its cultivation. These 
aspects are independent, in that some might have 
been present in the original event without the 
others and since their present outcomes are distinct,
e.g., the genetic plant varieties now utilized,
the observable agricultural methods, and one or 
more words in the present language. Each of these 
evidences may be said to belong to a different 
system, because for its interpretation we must put
it into a context of different facts. Thus the data 
concerning a particular genetic variety of plant 
are significant in the light of the totality of vari- 
eties, their geographical distribution, and the re- 
constructed genetic history of the species itself in 
relation to the wild ancestral form. The agri- 
cultural methods are part of an ethnologic dis- 
tribution. The linguistic terminology is part of a 
language and must be evaluated in terms of ap- 
pearance or nonappearance in related languages 
deriving from a linguistic classification itself based 
on linguistic evidence. 

Each of these traces, then, is interpreted in 
terms of the system to which it belongs. Although 
the details of method differ for each system, they 
all have in common the important characteristics 
of being comparative and involving assumptions 
regarding diachronic processes. 

It has sometimes been felt that certain types of 
historical inferences involve a comparative method, 
for example, those based on language or ethno- 
logical trait distributions, while archeological arti- 
facts or documents give direct testimony concern- 
ing the facts for which they are evidence. There is 
at best, again, merely a difference of degree of 
complexity and not of kind. An archeological im- 
plement must be compared with other implements 
with regard to form, function, place, and time be- 
fore it can be assigned any historical meaning. 
This is true for written documents, as we are 
reminded emphatically by the historians Langlois 
and Seignobos, who state that a document “in 
respect of which we necessarily are in total igno- 
rance of the author, the place, and the date is good 
for nothing” ([1898] 1925, p. 87). 

Another fundamental set of considerations in- 
volved in the construction of such interpretive 
systems refers to process. By a process is meant 
a class of similar changes. To draw an example 
from textual analysis, if the same word appears 
twice within a few lines of a manuscript that is 
being copied, a scribe, in looking back through 
the manuscript, will sometimes mistake the second 
occurrence for the first and so eliminate the inter- 
vening material. Such an error is called haplology 
and may be called a process. Since manuscripts of 
all periods, places, and languages are subject to 
haplological change, like other such processes it 
may be considered as a class in abstraction from 
its specific temporal and spatial loci. If two manu- 
scripts share the same haplologies along with other 
specific changes, one may conclude that they have 
both been copied from the same version and thus 
do not furnish independent evidence regarding the 
original text. By such reasoning, manuscripts may
be arranged in a genealogy and their comparative 
study can lead to the reconstruction of the lost 
original. In reasoning by means of process, such 
factors as the frequency with which instances are 
likely to occur, whether two identical instances will 
tend to occur independently, and the length of time 
required for their occurrence are all among the 
fundamental considerations. For example, another 
process in manuscript transmission is the inter- 
polation of marginal explanatory glosses into the 
body of the text. Obviously, it is more likely that 
a particular haplological error will occur twice 
independently than that an interpolation involving 
precisely the same words will do so.
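
The reasoning from shared copying errors to a common source can be made concrete with a small sketch. It is only an illustration under stated assumptions: the manuscript names, the error labels, and the threshold `min_shared` are all hypothetical and do not come from any actual textual tradition. Manuscripts that share at least `min_shared` errors are grouped together and treated as a single witness to the original; the threshold is set above one because, as noted above, a single haplological error may well arise twice independently.

```python
from itertools import combinations

# Hypothetical data: each manuscript maps to the set of copying errors
# (for example, haplologies) observed in it. All labels are invented.
manuscripts = {
    "A": {"h1", "h2", "gloss1"},
    "B": {"h1", "h2"},
    "C": {"h3"},
    "D": set(),
}


def shared_errors(mss):
    """Return the errors shared by each pair of manuscripts."""
    return {
        (a, b): mss[a] & mss[b]
        for a, b in combinations(sorted(mss), 2)
    }


def group_by_common_source(mss, min_shared=2):
    """Group manuscripts that share at least `min_shared` errors.

    Each group is assumed to descend from one copied version and so
    counts as a single independent witness to the original text.
    """
    groups = {name: {name} for name in mss}
    for (a, b), common in shared_errors(mss).items():
        if len(common) >= min_shared:
            merged = groups[a] | groups[b]
            # Point every member of the merged group at the same set.
            for name in merged:
                groups[name] = merged
    unique = {frozenset(g) for g in groups.values()}
    return sorted(sorted(g) for g in unique)


print(group_by_common_source(manuscripts))
# prints [['A', 'B'], ['C'], ['D']]
```

Here A and B share two errors and are grouped as copies of one version, while C and D remain independent witnesses. A fuller treatment would weight each kind of error by how likely it is to recur independently rather than use a single threshold.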

Human activities are not the only processes 
relevant to cultural-historical reconstruction. For 
example, the patination due to the weathering of 
artifacts is a process of change and allows us to 
draw very approximate conclusions regarding age. 
The point here is that the historical conclusions 
to which we are led by particular existing evidence 
are dependent on our assumptions about the processes
of change it has undergone since the time it
came into existence. 


Specific methods 


A number of the independent methods men- 
tioned earlier may now be considered in greater 
detail. 

Verbal evidence. All verbal evidence has as its 
source linguistically formulated descriptions by ob- 
servers of the original event or events. Whether 
this primary source is oral or written in its first 
form makes little or no difference; it is subject to 
the same possibilities of error through observer 
bias, inaccuracy, or prevarication. The differences 
between written and oral sources stem from the 
mode of subsequent transmission. The advantage 
of writing is that, because of its semipermanence, 
it will go through fewer reproductions and will be 
less changed in the course of such reproductions. 
Since it will thus be closer to the original report, 
it will be easier to reconstruct the exact verbal 
form of the report. The form in which the his- 
torian encounters the report is not in itself deci- 
sive. Thus, literary sources often contain accounts 
written down at some time from oral tradition, 
so that the report has been transmitted orally dur- 
ing the earlier part of its career and in writing 
later on. The opposite also occurs when literary 
formulations become the subject of folkloristic 
transmission. 

The critical use of written documents, the chief 
source of the historian’s history, falls under the
methodology of history proper and is thus only
briefly discussed in the present connection. It is 
relevant, however, to point out that the culture 
historian’s written documentation is most frequent- 
ly that of the outside observer, such as the explorer 
or missionary, rather than the participant. It is 
therefore subject to errors based rather on the 
outsider’s inability to comprehend the cultural 
frame of reference of the actors than on bias. 
Therefore, the inaccuracies are characteristically 
of a different kind from those of the internally 
placed participant. Thus, contrary to the latter,
the outside observer will not tend to conceal mili- 
tary defeats or the historical illegitimacy of the 
power exercised by a ruling dynasty. 

Anthropologists take as their point of departure 
the notion of primitive peoples as peoples without 
written history; but beginning about 1950 it be- 
came apparent that the extent and the value of 
both external and, in certain cases (e.g., west
Africa), internal documentary sources had been 
seriously underestimated. The Indian land claims 
cases in the United States also led to much docu- 
mentary research into land occupation and use 
patterns of the aboriginal period. Such interests 
led to the development of ethnohistory as a subdiscipline
of anthropology [see HISTORY, article on
ETHNOHISTORY]. 

The other chief source of verbal reports is oral 
tradition, which includes not only orally transmit- 
ted narrative history but other kinds of spoken 
material containing historical information, e.g.,
proverbs and epic poetry. This source is perhaps 
the most controversial. Thus, G. P. Murdock (1959) 
discounts it as altogether unreliable, while J. Van- 
sina (1961) makes it the very keystone for his 
reconstruction of the history of a number of Afri- 
can peoples. As Vansina has pointed out, oral 
tradition must be used critically, and, indeed, it 
requires a methodology very similar in principle 
to that required for the study of documentary 
sources. More perhaps than any other source, it 
has been employed uncritically in the past. Oral 
traditions have been published without indication 
of the individual, place, or date from which they 
derive, of facts, if any, regarding the manner of 
their earlier transmission, and without variant ver- 
sions. As for manuscripts, it is possible to develop 
a genealogy of lines of independent transmission 
and reconstruct the archetype or original version, 
a method similar in basic respects to that devel- 
oped by the Finnish school of folklorists for oral 
literature in general. The time depth and chrono- 
logical precision of oral traditions are necessarily 
limited, but within these limitations they can give
important and reliable information when treated
critically. 

Archeology. Among the remaining research 
methods, which have in common the reasoning 
from trace as effect to historical cause, archeology 
is to be distinguished from the rest in that it deals 
with material objects as evidences of cultural ac- 
tivities of the period in which they were produced 
rather than with existing cultural phenomena 
viewed as developments from, and hence evidences 
for, earlier cultural traits. Thus, subsequent modi- 
fications of form, if any, are normally the result 
of natural forces independent of man. The strength 
of archeology is the reliability and concreteness 
of its evidence and the definiteness of its spatial 
attribution. Its necessary limitations stem from the 
fact that it is confined to material culture and de- 
ductions that can be made therefrom. The set of 
artifacts found at a particular site and stratigraphic 
level, sometimes called a component, is taken as 
the material expression of the life of a local com- 
munity. Often very similar assemblages are found 
over a continuous area with indications that they 
all date from roughly the same chronological pe- 
riod. Such a unit, often called a phase, may be 
conjectured to represent some sociocultural unit, 
such as a tribe. The interpretation of archeological 
evidences regarding a phase has both cultural and 
social aspects. From settlement patterns, density 
of remains, the functions of the artifacts them- 
selves, and evidences regarding contemporary cli- 
mate, fauna, and flora, the attempt is made to 
reconstruct the basic technologic and demographic 
patterns with whatever further, usually less cer- 
tain, inference can be made regarding other as- 
pects of culture, such as social structure or religion. 
There has also been an attempt to identify and 
determine the geographic boundaries of ethnic 
groups. Contemporary evidence shows that such 
conclusions are subject to a considerable margin 
of error, since, on occasion, ethnic groups with 
highly similar material cultures may differ funda- 
mentally in language and other cultural aspects 
and constitute politically independent groupings. 

The second fundamental aim of archeology is to 
reconstruct the time-space relationships of the 
sociocultural entities inferred from material re- 
mains. The basic problem is, of course, chrono- 
logical rather than spatial. Relative dating methods 
include the stratificational (when in the same site 
more recent material is superimposed on more 
ancient), estimates of length of occupation from 
the nature of the deposits, inferences regarding 
the rough contemporaneity of sites with similar 
material, cross-dating from traded objects whose 
date and provenience are known from documentary
sources, and geochronology. Where other methods 
fail, the evolutionary assumption that simpler types 
precede more developed has been utilized. Such 
conclusions are most plausible where we are deal- 
ing with mechanical inventions that presuppose 
other less complex devices that enter into them 
or where a more efficient device requires the de- 
velopment of some specific and recondite skill, 
e.g., smelting metal as compared with the utiliza- 
tion of stone. In recent years the development of 
methods of radioactive dating, such as carbon-14, 
has revolutionized archeology by providing absolute 
dates. 

Beyond the placing of archeological units in 
space or time, there are inferences regarding the 
historical relationship of particular cultural traits, 
complexes, or cultures as a whole. With the pre- 
requisite of space-time continuity established or 
reasonably to be conjectured, cultural similarities 
are interpreted as resulting from such historical 
processes as geographic migration of a people or 
by diffusion, in which the traits are borrowed 
through contact with a neighboring people. Such 
integral spread or adoption of cultural features is 
often called genetic. Sometimes, however, the connection
is not genetic, although historical. For example,
trade objects may be distributed along
recognizable trade routes, which indicate cultural 
connections, although the objects themselves are 
not actually produced in all of the sites in which 
they are found. There are of course cultural re- 
semblances that are nonhistorical in origin and 
are the result of independent parallel develop- 
ments. What is to be assumed a similarity will, 
of course, depend on the definitional criteria 
adopted. Under more general criteria things will 
be considered similar that are rejected under nar- 
rower criteria. The disputed cases are character- 
istically those that combine generality of criteria 
with absence of well-proved space-time continuity, 
e.g., Egyptian and Mayan pyramids. 

A further major contribution of archeology has 
been to furnish materials from the distant past 
that complement the documentary history of more 
recent periods and permit speculation about the 
long-term “evolutionary” trends of cultural develop- 
ment. Thus, archeology provides support for the- 
ories regarding the evolution of technology and 
systems of economic subsistence. [See DOMESTICATION;
HUNTING AND GATHERING, article on NEW
WORLD PREHISTORIC SOCIETIES; URBAN REVOLUTION.]

Trait distribution. Another basic method for 
reconstructing history that employs cultural mate- 
rials is the study of the geographical distribution 
of cultural traits, which reads historical depth into 
spatially arranged data. In the broadest sense, 
comparative linguistics is but one example of this 
approach, but since it is the least controversial, 
has the most explicitly developed method, and con- 
tributes most largely and reliably of all cultural 
distributional methods, it will be discussed first. 

Comparative linguistics. We may consider that 
every language is a cultural subsystem, that such 
subsystems are distributed over geographical space, 
and that each meaningful item in a language is 
a cultural trait that involves form (“sound”) and 
function (“meaning”). The first step in compara- 
tive linguistics is the classification of language into 
mutually exclusive families, each consisting of 
related languages. A family of languages is a set 
of distinct languages presumed to have arisen from 
a single earlier language (the so-called protolan- 
guage) through a course of differential changes. 
In the initial period of such changes, when the 
differences are still small and mutual intelligibility 
still obtains, localized variants are called dialects. 
Dialects, as they diverge more and more in the 
course of time, cease to be mutually intelligible 
and rank as separate languages. The languages 
resulting from such an earlier process are said to 
have a common origin and form a family of re- 
lated languages. This process may occur a number 
of times successively and still give recoverable 
results. Thus, Proto-Indo-European developed dia- 
lects that became the ancestral languages of the 
various branches of Indo-European, e.g., Celtic, 
Germanic, Slavic, Indo-Aryan, Italic. Italic, like 
the others, in turn split into separate languages, 
e.g., Latin, Oscan, Umbrian, Venetic. Of those, 
only Latin survived, and it in turn has developed 
into the modern Romance languages. 

The comparative method reconstructs this course 
of events by classifications, such as the one just 
briefly sketched. Through the observation and eval- 
uation of resemblances involving sound and mean- 
ing and, further, through the regularities inherent 
in processes of linguistic change, most conspicu- 
ously phonetic change, the further step is taken 
of reconstructing as far as possible, and often in 
considerable detail, the phonetic system, grammar, 
and vocabulary of the ancestral language. Only 
exceptionally, as in the case of Latin as ancestral 
to the Romance languages, is there independent 
written evidence regarding this language. 

Thus every family of languages at whatever level 
of classification implies an ancestral language that 
is capable of at least partial reconstruction. Such 
an ancestral language implies a community of peo- 
ple as its users, a degree of cultural homogeneity, 
such as is normally found among speakers of the 
same language at the present time and for past 
documented history, and a placement within geo- 
graphical and chronological limits. It is clear that 
the determination of spatial-temporal location of 
a sociocultural unit speaking a language whose 
features have been largely reconstructed and his- 
torically related to later or contemporary speech 
communities is in itself an important cultural-his- 
torical datum. 

Nonlinguistic inference. The reconstructed lin- 
guistic facts are themselves cultural-historical facts, 
but what is of wider interest to the culture histo- 
rian are the nonlinguistic cultural inferences that 
flow from such linguistic facts, as, for example, 
words that show the probable acquaintance of the 
speakers with certain technological items or reli- 
gious concepts. Such items of protovocabulary are 
reconstructed word-forms, continued in a sufficient 
number of later instances to allow us to infer their 
approximate phonetic shape and meaning and to 
assign them to the ancestral language. It is a fur- 
ther advantage of the comparative linguistic method 
that it almost always allows us to distinguish be- 
tween resemblances among languages that result 
from continuation of an actual item in modified 
form (cognates) and resemblances among lan- 
guages, whether related or not, that result from 
the borrowing of words from one language by 
another where the speakers have been in contact. 
It is also often possible through purely linguistic 
methods to arrive at conclusions regarding the 
direction of borrowing. 

Accordingly, there are three chief types of in- 
ference regarding nonlinguistic cultural phenom- 
ena that can be derived from the comparative 
study of language: those drawn from facts con- 
cerning the classification and distribution of lan- 
guages, those based on protovocabulary, and those 
based on interlinguistic contacts. From the detailed 
classification and subclassification of the members 
of a linguistic family, combined with their present 
geographical distribution or, where available, from 
the evidence of documentary history, their past 
distribution, it is possible to draw probabilistic 
conclusions regarding the area occupied by the 
ancestral speech community. From this will also 
follow certain hypotheses regarding subsequent mi- 
grational spread resulting in the distributions 
found in later historical periods. The fundamental 
assumption made is that every genetic branch of 
a linguistic stock, regardless of its present popula- 
tion size or geographical extent, provides, by its 
location, equal and independent evidence regarding 
the original center of linguistic distribution. The 
procedure implicit in this assumption may be called 
the “center of gravity” method. The best possible 
guess is the average of positions of each genetic 
branch. The center of each of these branches that 
enters into such a calculation may itself require 
calculation in terms of its subbranches, if any. 
Thus, if we had no written records to show whether 
English had originated in the British Isles, North 
America, South Africa, or Australia, the classifica- 
tion of English as a Germanic language within 
Indo-European, the fact that its closest relative 
within Germanic is Frisian, spoken by a small 
fishing population on the Dutch and German 
coasts, and the distribution of other subbranches 
of Germanic in Germany and Scandinavia would 
point to England as the immediate point of disper- 
sal and to the continent of Europe as the location 
of ultimate origin. In fact, considering the level 
of classification represented by dialect variation, 
since the deepest and most fundamental dialect 
divisions exist in the British Isles, one can assume 
that this is the center of dispersal and that the 
rest of English distribution results from the 
relatively recent spread of certain older dialects 
from this center. An important independent check 
involves an application of the protoword method. 
Part of the reconstructed vocabulary of the proto- 
language may reflect the geographical environment 
of the original area of settlement but must be 
interpreted in the light of paleoclimatic and pale- 
ontological knowledge. This method requires con- 
siderable caution in its application because of the 
possibility of parallel semantic changes and be- 
cause it is often necessary to argue from the 
negative standpoint of the absence of a given ter- 
minology. Both points can be illustrated for a 
hypothetical example of a language family in which 
it is impossible to reconstruct an original word for 
“ocean,” thus leading to the conclusion that the 
protocommunity lived inland. It may either be the 
case that there was such a word but it has inde- 
pendently been replaced by different terms in each 
linguistic branch or that it survived but transferred 
its meaning to “lake” several times independently 
through movements inland, so that the meaning 
of the term has been incorrectly reconstructed. The 
possibility of the reconstruction of a whole set of 
semantically related terms obviously strengthens 
such a case greatly. 
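
The "center of gravity" procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: locations are treated as points on a flat map, the coordinates are invented, and the names `Branch` and `center_of_gravity` are hypothetical. What it shows is the central assumption of the method, namely that each genetic branch counts once, whatever its size, and that the center of a branch is itself computed from its subbranches.

```python
from typing import List, Tuple, Union

# A location is an (x, y) pair on a flat map; the units are arbitrary (assumption).
Point = Tuple[float, float]
# A branch is either a single location or a list of its subbranches.
Branch = Union[Point, List["Branch"]]


def center_of_gravity(branch: Branch) -> Point:
    """Return the unweighted mean of the centers of the immediate subbranches."""
    if isinstance(branch, tuple):
        # A leaf: the branch is represented by its own location.
        return branch
    if not branch:
        raise ValueError("a branch must contain at least one subbranch")
    # Each subbranch contributes one equal vote, regardless of how many
    # languages or speakers it contains.
    centers = [center_of_gravity(sub) for sub in branch]
    x = sum(c[0] for c in centers) / len(centers)
    y = sum(c[1] for c in centers) / len(centers)
    return (x, y)


# Invented coordinates for a hypothetical family with two primary branches.
family = [
    (0.0, 0.0),                # branch A: a single language
    [(4.0, 2.0), (8.0, 6.0)],  # branch B: two widely separated subbranches
]
print(center_of_gravity(family))  # (3.0, 2.0)
```

Averaging all three locations directly would instead give a point pulled toward branch B, which is exactly the bias toward the more widely spread branch that the equal-weight assumption is meant to avoid.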

For reconstructing the time as against the place 
of the ancestral speech community, the only 
method of absolute chronology that has been pro- 
posed is that of glottochronology (see Hymes 
1960). The method is based on the assumption 
that every language has a basic vocabulary that 
is composed of certain elements, such as pronouns, 
low numerals, and parts of the body, and that this 
basic vocabulary has a relatively low and constant 
rate of replacement by new forms, whether by 
internal changes or by borrowing. The rate of re- 
placement is estimated from test cases involving 
historical documentation with a known chronology, 
e.g., Latin to French. If we assume random and 
independent loss for related languages at the same 
rate as for the test cases, then from the proportion 
of cognates in the list for any pair of languages 
the date at which the ancestral language began to 
diverge can be estimated. The estimate in current 
use is that in one millennium 14 per cent of the 
100-word list is lost. This method has been widely 
applied but has also suffered severe criticism, both 
regarding the empirical results obtained in the test 
cases and the mathematical assumptions. It is, 
however, quite possible that when subjected to 
necessary revisions, it will give useful results. 
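
The arithmetic implied by these assumptions can be made explicit. The sketch below is not taken from the text: it uses the formulation usual in the lexicostatistic literature, in which two related languages each retain a fraction r of the list per millennium, independently, so that the expected proportion of shared cognates after t millennia is r to the power 2t. The retention value of 0.86 follows from the 14 per cent loss figure given above; the function name `divergence_time` is hypothetical.

```python
import math

# Retention per millennium, i.e. 1 - 0.14, from the loss figure quoted in the text.
RETENTION_PER_MILLENNIUM = 0.86


def divergence_time(shared_cognates: float,
                    retention: float = RETENTION_PER_MILLENNIUM) -> float:
    """Estimate the millennia elapsed since two languages began to diverge.

    Assumes random, independent loss in both languages at the same constant
    rate, so that shared_cognates = retention ** (2 * t).
    """
    if not 0.0 < shared_cognates <= 1.0:
        raise ValueError("shared_cognates must lie in the interval (0, 1]")
    if not 0.0 < retention < 1.0:
        raise ValueError("retention must lie strictly between 0 and 1")
    return math.log(shared_cognates) / (2.0 * math.log(retention))


# Two languages sharing 0.86 * 0.86 of the list come out at about one millennium.
print(round(divergence_time(0.86 ** 2), 6))
```

Because the estimate depends on the logarithm of the retention rate, small errors in that rate translate into large errors in the date, which is one reason the criticisms mentioned above bear so heavily on the test cases.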

The protoword method also permits inference 
from reconstructed vocabulary regarding the cul- 
ture of the ancestral speech community. Thus the 
essentially village, neolithic nature of the Proto- 
Indo-European culture is shown by the existence 
of reconstructed terms for a number of domesti- 
cated plants and animals, the words for “plow,” 
“village,” etc. Other reconstructible parts of the 
ancestral vocabulary of Indo-European include the 
kinship terminology and the names of certain 
divinities. 

Culture contact. The remaining major source 
of cultural-historical information based on linguistic 
data is the study of linguistic-contact phenomena. 
The most important data are furnished by loan 
words because they frequently have specific cul- 
tural content and because the direction of borrow- 
ing can be determined in favorable cases. One type 
of linguistic-contact study is that which concen- 
trates on the contact of one language with another 
over an extended period. Such an investigation 
may be considered the linguistic analogue of ac- 
culturation studies. It is often possible to distinguish 
different periods of contact based on the “stratig- 
raphy” of the changes undergone in the borrowing 
language. An over-all study of this kind will also 
show the specific aspects of culture in which bor- 
rowings are most numerous and fundamental and 
thus provide important evidence concerning the 
nature of the culture contact. 

Instead of considering the language communi- 
ties and the nature of their linguistic contacts as 
the primary interest, we may focus our attention 
on a specific cultural item. For example, we may 
examine the linguistic evidence in its bearing on 
the details of the spread of tobacco. Since a cul- 
tural item may be borrowed without the word being 
borrowed and because the direction of borrowing 
cannot in every case be discovered, linguistic evi- 
dence will not usually provide a complete history 
of diffusion, but it will furnish many important 
detailed hypotheses. 

The detection of borrowed words may sometimes 
show that speech communities not now in contact 
must have been so in the past. Sometimes the 
contact must have been with an earlier protocom- 
munity. Thus, Finnish has a number of words bor- 
rowed from very early Germanic that approximate 
reconstructed Proto-Germanic forms. 

In addition to borrowings, where languages have 
been in intimate contact with a large bilingual 
population over a considerable period, there will 
be a tendency to convergence in the sound system 
and grammatical structure. Thus, a number of 
Balkan languages of diverse branches of Indo- 
European share such features as a postposed 
definite article (Rumanian, Albanian, Bulgarian), 
a future formed from an auxiliary “to wish” 
(Rumanian, Albanian, Serbo-Croatian, Bulgarian, 
Greek), and other details. These are not borrow- 
ings, since, for example, the verb “to wish” is in 
each case the indigenous word. Thus, areas of 
mutual linguistic influences can be determined that 
parallel the notion of culture area in cultural an- 
thropology. 

The independent nonlinguistic methods involve 
the mapping of the distribution of cultural traits. 
The main conclusions drawn are that highly de- 
tailed traits, e.g., specified art motifs, if found 
in a restricted geographical area, have a common 
historic origin. The place of origin and process of 
spread are difficult to recover on purely distribu- 
tional evidence. One widely accepted principle of 
inference is that a trait is older in an area in which 
it is more elaborated and more integrated in the 
cultures in which it is found or exists in a greater 
variety of forms, since such developments require 
time. Another is known as the age-area hypoth- 
esis: other things being equal, a more widely dis- 
tributed trait is older, since such spread requires 
time. 

Reconstructing social systems. One class of 
methods using nonlinguistic cultural data involves 
an extension of the comparative linguistic method. 
The attempt can be made to reconstruct aspects of 
the culture of the ancestral speech community by 
a comparison of nonlinguistic traits of the speak- 
ers of the languages. This method has met with 
limited success in the case of comparative Indo- 
European mythology. Just as the names of divini- 
ties may be reconstructed by linguistic compari- 
sons, so the plots of myths involving the divinities 
may be compared in order to reconstitute their 
original forms. Such attempts encounter the dif- 
ficulty that for nonlinguistic aspects of culture, 
there is no systematic way to differentiate between 
resemblances resulting from diffusion and those 
stemming from common origin. The method de- 
veloped by Murdock (1949) belongs here. Since 
only certain changes of type are regarded as pos- 
sible and since social structure is presumed to be, 
like language, relatively impervious to external dif- 
fusional influences, the comparison of social struc- 
tures of linguistically related peoples leads to the 
reconstruction of the type of social structure of the 
ancestral population and its subsequent changes. 
Unlike language, where there are thousands of 
independent vocabulary elements, there are rela- 
tively few types of social structure; therefore, the 
same type of social structure is not probative of 
historical connection between two peoples. Lin- 
guistic comparison is thus a method for recon- 
structing the social structure of peoples known to 
be related on other grounds and not primarily a 
method of discovering historical relationships not 
otherwise known. 

Biological history. The study of certain non- 
cultural phenomena may be coordinated with that 
of culture history. Thus the genetic history of hu- 
man populations is clearly relevant to culture his- 
tory. The isolating mechanisms that produce partly 
or fully discrete breeding populations are in gen- 
eral congruent with those producing cultural and 
linguistic isolation. For example, the linguistic dis- 
tinction of Eskimo-Aleut from the remaining 
language groups of the indigenous Americas paral- 
lels a physical distinction and is the common result 
of the same isolating factors. There is thus the 
possibility of mutual corroboration for historical 
inferences in both areas. For example, the genetic 
distinctness of the African and Oceanic Negro, 
which now seems assured on genetic grounds, is in 
agreement with the linguistic evidence, which is 
also negative on the same point. 

Domestication. A further important noncultural 
source of cultural-historical conclusions is the study 
of domesticated plants and animals. Given a ge- 
netic classification of species and varieties or races 
and their relationship to wild forms, the basic prin- 
ciple is one parallel to the center-of-gravity method 
discussed above in relation to linguistic classifica- 
tion. Thus, the center of origin should be in the 
same area as the wild forms, and the earlier and 
more basic genetic differentiations of the domesti- 
cated forms should have taken place at earlier 
centers of cultivation. Here again, the history of 
plant and animal domestication is in itself impor- 
tant as culture history and provides further inde- 
pendent evidence regarding contacts of people. 


The potentialities of the methods outlined here 
have been only very partially realized. The reasons 
for this are both theoretical and technical. Since 
they were applied on a grand scale but based on 
a limited range of evidence and an unsophisticated 
methodology by the cultural-historical schools of 
the early twentieth century, in the reaction that 
followed, interest was focused on structural-func- 
tional problems to the relative neglect of culture 
history. Moreover, practical difficulties are raised 
by the wide variety of methods required that can- 
not easily, if at all, be controlled by a single spe- 
cialist. The most noteworthy attempt thus far is 
that of Murdock’s study of Africa (1959), which 
utilizes evidence from archeology, linguistic classi- 
fication, social structure, and plant genetics. But 
even this study does not take into account many 
further lines of evidence, such as loan words and 
the distribution of art styles. 


JOSEPH H. GREENBERG 


[Directly related are the entries ARCHEOLOGY; DIFFU- 
SION; ETHNOLOGY; FOLKLORE; LINGUISTICS, articles 
on HISTORICAL LINGUISTICS and THE SPEECH COM- 
MUNITY. Other relevant material may be found in 
HISTORIOGRAPHY, article on AFRICAN HISTORIOG- 
RAPHY; and in the biographies of GRAEBNER; KOP- 
PERS; NORDENSKIÖLD; RATZEL; SCHMIDT.] 


BIBLIOGRAPHY 

BERNHEIM, ERNST (1905) 1926 Einleitung in die Ge- 
schichtswissenschaft. 4th ed. Berlin: Gruyter. -> Trans- 
lation in text provided by Joseph Greenberg. 

GREENBERG, JOSEPH H. 1953 Historical Linguistics and 
Unwritten Languages. Pages 265-286 in A. L. Kroeber 
(editor), Anthropology Today. Univ. of Chicago Press. 

HYMES, DELL H. 1960 Lexicostatistics So Far. Current 
Anthropology 1:3-44. 

LANGLOIS, CHARLES V.; and SEIGNOBOS, CHARLES (1898) 
1925 Introduction to the Study of History. London: 
Duckworth. -> First published in French. 

MURDOCK, GEORGE P. 1949 Social Structure. New York: 
Macmillan. -> A paperback edition was published in 
1965 by the Free Press. 

MURDOCK, GEORGE P. 1959 Africa: Its Peoples and Their 
Culture History. New York: McGraw-Hill. 

SAPIR, EDWARD (1916) 1949 Time Perspective in Ab- 
original American Culture: A Study in Method. Pages 
389-462 in Edward Sapir, Selected Writings . . . in 
Language, Culture and Personality. Edited by David 
Mandelbaum. Berkeley: Univ. of California Press. 

VANSINA, JAN (1961) 1964 The Oral Tradition: A Study 
in Historical Methodology. Chicago: Aldine. -> First 
published in French. 

WILLEY, GORDON R.; and PHILLIPS, PHILIP 1958 Method 
and Theory in American Archaeology. Univ. of Chi- 
cago Press. 


V 
SOCIAL HISTORY 


Ideally considered, social history is the study of 
the structure and process of human action and 
interaction as they have occurred in sociocultural 
contexts in the recorded past. In practice, however, 
it has seldom been conceived in such analytical 
terms and has by no means always been envisaged 
in such comprehensive ones. Its investigators, in 
fact, have been content for the most part to chron- 
icle, recount, and describe. They have been sharply 
divided, moreover, as to the proper scope of social 
history. Some have held that it embraces the whole 
range of life and culture in societies that have 
existed in historical time. Others have insisted that 
its concern is most properly confined to a residuum 
left by the abstraction of the polity, the economy, 
and large areas of culture, such as religious beliefs 
and technology. Still others have restricted it more 
narrowly to a heterogeneous lot of domestic and 
communal institutions, customs, attitudes, and arti- 
facts. Numerous minor variations of these basic 
definitions also have been propounded. This diver- 
sity of views as to the scope of social history has 
engendered much confusion concerning the nature 
of the discipline. 

The sources of social history are virtually omni- 
farious. They include such diverse materials as 
official reports, legal documents, newspapers, pam- 
phlets, art objects, graffiti, literary works, and arti- 
facts. One important category of materials is per- 
sonal papers, such as letters, diaries, and journals, 
which reveal in depth and detail highly intimate 
areas of human experience. But there are no mate- 
rials that social history can claim as peculiarly its 
own—another cause of confusion regarding its 
nature. 

Origins 

Ever since Herodotus reported the folkways of 
the Scythians and Tacitus described the institutions 
of the Germanic tribes, historians have written 
accounts that are identifiable as social history of 
one variety or another. Until the eighteenth century 
such accounts invariably appeared as insignificant 
fragments embedded in general works. Then, how- 
ever, as acute concern with the institutions of the 
past was created by the growing desire to place the 
study of man and society on a solid empirical basis, 
social history emerged as a distinctive genre. 

Although Justus Möser is traditionally credited 
with having been the progenitor of the genre and 
his Osnabrückische Geschichte (1768) is often ac- 
claimed as its first intensive treatment of the com- 
monality of a region, Voltaire was really the pri- 
mary agent of its emergence. In The Age of Louis 
XIV (1751) he treats French society as a totality. 
Essaying to present a comprehensive view, he ex- 
amines numerous facets of its life and culture, 
such as its wars, finances, administration, science, 
literature, art, customs, and religion. Moreover, he 
attempts to identify the ethos that animated the 
whole. Essentially the same approach is employed 
in his Essai sur l'histoire générale et sur les moeurs 
et esprit (1756), a series of disparate and uncon- 
nected discourses in which he surveys “the genius, 
manners, and customs of the principal nations” 
that flourished between the time of Charlemagne 
and the era of Louis XIV. Implicit in both works is 
the assumption that since mankind effected the 
transition from the “barbarism” of the Middle Ages 
to the “enlightenment” of the eighteenth century, 
the historical process advances by stages. 


“Kulturgeschichte” as social history 


The pioneer works by Voltaire inaugurated the 
development of Kulturgeschichte, which, although 
nominally preoccupied with the description and 
dissection of cultural patterns, is vitally concerned 
with social types and institutions. Basic among the 
postulates of Kulturgeschichte are the notions that 
each society, although characterized by multiform- 
ity of life and thought, possesses an essential unity, 
that it is pervaded by a peculiar ethos, and that it 
inevitably passes, like an organism, through a series 
of developmental phases. These postulates were 
strongly supported in the eighteenth century by 
major ideological currents. One was that congeries 
of ideas conventionally denominated romanticism, 
which, as in Johann Gottfried von Herder’s Outlines 
of a Philosophy of the History of Man (1784-1791), 
attributed to each people certain basic psychologi- 
cal peculiarities that, through the operation of a 
characteristic spirit, produce a unique set of social 
and cultural forms. Another was the concept of 
progress, which, as in A. R. J. Turgot’s On the 
Progress of the Human Mind (1750) and Condor- 
cet’s Sketch for a Historical Picture of the Progress 
of the Human Mind (1795), implied that history 
is a cumulative process and that each of its stages 
is a necessary antecedent of the next. These cur- 
rents of thought were fused in the idealism of 
Fichte, Schelling, and Hegel, which during the 
early nineteenth century provided similar support. 
As conceived by idealism, history is a plan whose 
inner logic obliges it to unfold step by step, each 
step representing a progression, each being an 
epoch endowed with a distinctive character that 
penetrates every detail of life. 

Buttressed by the influence of this concept, Kul- 
turgeschichte developed rapidly. Sometimes it was 
less dynamic and comprehensive in practice than 
in theory, epochal and thematic works being no 
rarity among those that bore its stamp. As a rule, 
however, it traced societal development over a pro- 
tracted temporal span. An early, embryonic ex- 
ample of the genre is Henry Hallam’s View of the 
State of Europe During the Middle Ages (1818), 
which includes surveys of social institutions, lit- 
erature, education, and commerce. More fully en- 
dowed with its essential attributes are François 
Guizot’s history of European civilization (1828) 
and Jules Michelet’s universal history (1831), 
which dilate on ideology and values. With the pub- 
lication of Jacob Burckhardt’s The Age of Con- 
stantine the Great (1853), a study of the decline 
of the Roman Empire, the genre reached maturity; 
with the appearance of his Civilization of the Ren- 
aissance in Italy (1860), a study of the birth of 
modernity, it achieved its full fruition. 

Burckhardt’s treatment of the Italian Renais- 
sance is in many respects an excellent example of 
the kulturgeschichtliche method. Concerned with 
the development of the cities of northern Italy be- 
tween the fourteenth and sixteenth centuries, he 
centers his attention on the cultural configuration 
that evolved there, tracing the various mutations 
that it underwent as medieval elements were grad- 
ually replaced by modern. Its genesis he attributes 
to the prevailing illegitimacy of political power. 
He identifies its ethos as an ineluctable individual- 
ism that manifested itself in distinctive patterns of 
vision, thought, and norms. Closely interwoven with 
his delineation of these cultural patterns is a de- 
tailed survey of the concomitant social order. He 
explains how the dominant patriciate arose from 
a fusion of nobility and burghers; he describes the 
origins, growth, and characteristics of other new 
groups, such as humanists and artists. His exami- 
nation of behavioral patterns illuminates a wide 
variety of interrelated matters, like the frequency 
of violence, the prevalence of corruption, the liberty 
accorded women, and the intensity of competition. 
Similarly enlightening is his depiction of signifi- 
cant social types, such as the despot, the polyhe- 
drous man, and the perfect courtier. In sum, as 
he charts the phases of a changing society, he 
effectively portrays its full complexity. 


There are other works that, although of lesser 
stature, are equally representative of Kulturge- 
schichte in its heyday. One of them is Gustav Frey- 
tag’s enormously popular Bilder aus der deutschen 
Vergangenheit (1859-1867), a detailed and inti- 
mate survey of the social and cultural life of the 
German people from the earliest times to the mid- 
nineteenth century. Another is Wilhelm Riehl’s 
well-received Culturstudien aus drei Jahrhunderten 
(1859), a miscellany of essays on sociocultural 
themes. A later example of distinction is Karl 
Lamprecht’s Deutsche Geschichte (1891-1909), 
which traces the course of social and cultural 
change through a sequence of six periods of Ger- 
man history. 

Lamprecht represented a positivistic variety of 
Kulturgeschichte, which, inspired by the progress 
of natural science, especially the elaboration of the 
evolutionary hypothesis and the application of the 
concept of natural selection, rested on the premise 
that a veritable historical science could be con- 
structed by diligent scholarship, i.e., that the laws 
governing the historical process could be ascer- 
tained. The means of realizing this possibility, it 
assumed, was the application of the methodology 
of the social sciences to the whole course of human 
history. It thus bore a strong affinity to historical 
sociology, which from Auguste Comte through Her- 
bert Spencer to Franklin Giddings sought to dis- 
criminate the principles and successive phases of 
sociocultural evolution. One of the earliest expo- 
nents of this position was Henry T. Buckle, whose 
History of Civilization in England (1857-1861) 
exerted considerable influence, especially in the 
United States and Russia. The most important of 
its later advocates included Lamprecht himself and 
his disciples Kurt Breysig, who, in Der Stufen-Bau 
und die Gesetze der Welt-Geschichte (1905), pre- 
sumed to formulate 24 laws of history, and James 
Harvey Robinson, whose New History (1912) 
pointed the way to the contrivance of similar fabri- 
cations. These were the last paladins of Kultur- 
geschichte; the vitality of the genre did not long 
survive them. 


Unschematic social history 

Simultaneously with the development of Kultur- 
geschichte, which had begun with Voltaire, there 
also evolved an unschematic species of social his- 
tory. Characterized by a virtual absence of concern 
with sociocultural morphology and dynamics, its 
objective is essentially the depiction of life in so- 
ciety. It varies widely as to the range of life that 
it considers, sometimes presenting a societal con- 
spectus, as in Sir Albert Richardson’s survey of 
England under the Georges (1931) and Sir Arthur 
Bryant's essays on England during the reign of 
Charles II (1935), and sometimes confining its 
inquiry to a narrow sector, as in Warwick Wroth’s 
London Pleasure Gardens of the Eighteenth Cen- 
tury (1896) and Robert J. Allen’s Clubs of Augus- 
tan London (1933). Social history likewise varies 
greatly in seriousness of purpose and therefore in 
scholarly level, as is evident in the considerable 
qualitative difference between such works as F. 
Karl Biedermann’s sober and solid study of Ger- 
many in the eighteenth century (1854-1880) and 
Max von Boehn’s light and airy sketch of England 
during the same period (1920). Moreover, it varies 
markedly in degree and character of interpretation: 
some studies, like Arthur W. Calhoun’s history of 
the family in the United States (1917-1919), are 
almost wholly devoid of tendentiousness; others, 
like Ulrich B. Phillips’ classic work on slavery in 
the American South (1918), argue, or at least 
suggest, a cogent thesis. Diversity, in short, is one 
of the principal attributes of social history. 

Typical of the genre in essentials is Ludwig 
Friedländer’s Roman Life and Manners Under the 
Early Empire (1862-1871), which treats Roman 
society from the reign of Augustus to the reign of 
Commodus. Beginning with a panoramic view of 
the imperial city, its sights and sounds, Fried- 
länder proceeds to survey the court, the various 
social classes, the diurnal routine, the position of 
women, the experiences of the traveler, the means 
of communication, the spectacles, and the prolifer- 
ation of luxury. Although he also considers art, re- 
ligion, and philosophy, his primary objective is the 
recreation of past life, its vivid and realistic por- 
trayal. Those of its aspects that constitute the 
cardinal features of his tableau are a rather arbi- 
trary selection. They are dealt with in a succession 
of virtually independent essays whose order follows 
no particular principle. The essays are almost en- 
tirely descriptive; when here and there Friedlander 
interjects interpretive comments, they are frag- 
mentary, belletristic, and simple: homiletic judg- 
ments, comparisons between Roman and later 
times, general observations on human progress. 
Another essential characteristic of the essays is 
their complete disregard of social change and de- 
velopment; they treat two centuries of history as 
a static unit. 

A number of influences contributed to the rise 
of this unschematic type of social history. Despite 
a long historiographical ancestry composed of rudi- 
mentary prototypes, such as the account of the 
plague at Athens given by Thucydides, it received 
its first really significant impetus when the example 
of Voltaire sanctioned the historical investigation
of every aspect of society. Something of the early 
effect of that example is apparent in the latitude 
of Arnold Heeren’s Ideen über die Politik, den Ver-
kehr, und den Handel der alten Welt (1793-1812),
which examines agriculture, trade, finances, and 
manufactures as well as law, constitutional sys- 
tems, and politics. This new breadth of scope was 
soon further extended by the inclusion of the ordi- 
nary people of the past among the subjects of his- 
torical inquiry. Long before the influence of Marx 
produced a school that made those strata its pri- 
mary focus, they were extensively treated in studies 
like John Wade's History of the Middle and Work- 
ing Classes (1833) and James A. St. John’s History 
of the Manners and Customs of Ancient Greece 
(1842). Their inclusion in the historian’s purview 
had a twofold origin, deriving in part from romanti- 
cism, which placed strong emphasis on the folk in 
all its aspects, in part from the social problems 
created by industrialization and from the resultant 
quest for panaceas. 

No less important than this expansion of scope 
were the effects of the influence subsequently 
exerted by other disciplines. As legal and economic 
history developed into distinctive fields of inquiry, 
scholars like Sir Henry Maine, Sir Paul Vinogradoff, 
Sir Frederic Seebohm, and Gustav Schmoller 
adopted a new approach. Thoroughly convinced of 
the sterility of studying legal and economic institu- 
tions in vacuo, they investigated them in relation 
to the rest of the social structure. The adoption of 
this practice gradually produced an intensified 
awareness of the social aspect of all institutions. 
Moreover, it created a strong tendency to consider 
social and economic history as closely conjoined 
and complementary fields, as in such works as 
Henri Pirenne’s Medieval Cities: Their Origins and 
the Revival of Trade (1925) and Henri Sée’s Es- 
quisse d'une histoire économique et sociale de la 
France (1929). 

But the two disciplines were not conceived to be 
of equal consequence. As a result of the protracted 
and pervasive vogue enjoyed by the concept of eco- 
nomic determinism, which found its fullest expres- 
sion in Marxian historiography, social history was 
regarded as ancillary to economic history. Only 
toward the middle of the present century did the 
conjunction of social and economic history tend to 
dissolve, as advances in theory and technique gave 
the latter a new preoccupation with problems of its
own.


The Bloch-Febvre movement


While the conjunction of social and economic 
history was at its closest, a movement was launched
that sought to disrupt it. Led by Marc Bloch and
Lucien Febvre, the movement represented a con- 
vergence of the traditions of Kulturgeschichte and 
unschematic social history. Much of the inspiration 
that actuated it derived from Emile Durkheim; 
some may also have come from Max Weber and 
Ernst Troeltsch, whose practice belied their belief 
that history and sociology are immiscible (see 
Troeltsch 1922; McGrew 1958). 

The aim of the movement was ambitious. Im- 
bued with the conviction that the comprehension of 
sociocultural contexts demands they be studied as 
totalities, it aspired to convert social into societal 
history. It envisioned such history as a reconstruc- 
tion of past epochs that would include their entire 
physical, ideational, and normative milieus and 
that would be at once more “scientific” and more 
“human” than the treatment they have usually 
received, an ideal that bears a strong resemblance 
to the objective of much recent cultural anthropol- 
ogy. Illustrative of the success with which that 
ideal could be translated into reality is Marc Bloch’s 
own masterpiece, Feudal Society (1939-1940). The 
principal means that the movement prescribed for 
the achievement of such success was the creation 
of a coherent synthesis out of data drawn from 
sociology, psychology, economics, and geography. 
But it also ordained that the data to be accorded 
the most serious consideration were survivals from 
an earlier time, a variety of evidence whose value 
was first fully appreciated by Giovanni Battista 
Vico (1725) and a century later first fully used by 
Wilhelm Riehl. Assuming that such survivals, 
whether archeological, cartographical, linguistic, or 
folkloric, were much more reliable than documen- 
tary material, it held that they could provide the 
basal insight necessary to recreate the past. To give 
expression to this approach, Bloch and Febvre 
founded in 1929 the Annales d'histoire économique 
et sociale, which has both perpetuated and diffused 
the influence of the movement. That influence, 
which remains strongest in France, where it is at 
present represented by such scholars as Charles 
Morazé and Robert Mandrou, has contributed heav- 
ily over the decades to the weakening of the tradi- 
tional position enjoyed by political history. 


The dominance of political history 


Almost from the inception of both Kulturge- 
schichte and unschematic social history, their prac- 
titioners were obliged to strive against the domi- 
nance of political history. The completeness of that 
dominance in the nineteenth century is strikingly 
reflected in Edward A. Freeman’s terse definition 
of history in general as “past politics” (1886). Its 
vestiges in the mid-twentieth century may be seen
in Geoffrey R. Elton’s insistence (1956) that polit-
ical history provides the best possible framework 
for the marshaling of historical data. 

As long as this dominance subsisted unimpaired, 
one of its major manifestations was a strong re- 
sistance to any broadening of the scope of historical 
studies and to any deviation from Leopold von 
Ranke’s prescription of their proper concerns, which 
he held to be factual accretion, not generalization; 
narration, not analysis; individuals, not groups; 
notables, not nonentities. This resistance affected 
the development of social history in significant 
ways. Beginning with August Böckh’s seminal Die
Staatshaushaltung der Athener (1817), which treats 
both state and society with about equal thorough- 
ness, it was not unusual for research undertaken 
in that discipline to appear as a volume, either 
monographic or comprehensive. But the amount of 
space allocated to the fruits of such research in 
works of general narrative history was extremely 
small. An outstanding case in point is the cele- 
brated third chapter of Thomas B. Macaulay's 
History of England (1849-1861), which serves as 
a diminutive mise-en-scène for a massive political 
survey. Similarly, only five of the 21 chapters that 
comprise William E. H. Lecky’s great work on 
eighteenth-century England (1878-1890) are con- 
cerned with nonpolitical institutions. An even 
smaller proportion, 7 out of 55 chapters, exists in 
G. M. Trevelyan’s history of England during the 
reign of Queen Anne (1930-1934), although the 
author is known primarily as a social historian. 
This engrossment of general narrative history by 
politics constituted a major impediment that the 
advocates of social history had to overcome in order 
to advance their discipline. Not until the early 
twentieth century did they achieve a substantial 
measure of success. The essence of that success 
was a marked expansion of the scope of general 
history. Very frequently, however, the new breadth 
amounted to no more than a compartmentalized 
presentation of diverse aspects of society, with 
political matters still receiving the largest share of 
attention. Even much later, when the space allotted 
to such matters had finally been greatly reduced, 
there were relatively few attempts to achieve the 
sort of highly fused synthesis advocated by scholars 
like Marc Bloch and Henri Berr, whose views may 
be gleaned from Berr’s prefatory article to his 
Revue de synthèse historique (“Sur notre pro-
gramme” 1900).

Underlying this disparity in the attention ac- 
corded political and social history was the differ- 
ence in the prestige enjoyed by the two disciplines. 
Political history was highly esteemed. Regarded as 
edifying as well as informative, it was taken seri-
ously; viewed as a custodian of national tradition
as well as an inculcator of patriotism, it was con- 
sidered valuable. In contrast, social history com- 
manded much less respect. Whether Kulturge- 
schichte or the unschematic variety, it was thought 
of primarily as entertainment that might invoke 
nostalgia, gratify curiosity, and generate fantasies. 
As the new disciplines of sociology and anthropol- 
ogy developed, their insistence that society was 
more important, more fundamental than the state 
contributed somewhat to the reduction of this dif- 
ferential in prestige. But it survived for a pro- 
tracted period. Moreover, it created an antagonism 
between the two disciplines that erupted most 
forcibly in the years 1888 to 1891, when Eberhard 
Gothein engaged in heated controversy with Die-
trich Schäfer over whether society or the state ought
to be the subject of historical research (Gothein 
1889; Goldfriedrich 1902). The antagonism has 
long been extinct; the difference in prestige, al- 
though much diminished, continues. 


Current trends 


Dissatisfaction. Although at present there is 
little concern regarding the relative prestige of 
social history, there is much disquiet at its unsatis- 
factory state. This disquiet is occasioned by its 
amorphous, invertebrate character, which derives 
very largely from the absence of a corpus of theory 
capable of providing concepts and hypotheses; its 
indeterminate confines, whose nature arises from 
the persistent disagreement over scope and from 
the lack of a peculiar type of source material; 
its insufficiently rigorous discipline, which, tolerant 
of impressionistic portrayal, imprecise assessment, 
and ill-supported assertion, stems from its strong 
humanistic heritage; and its penchant for descrip- 
tion and eschewal of analysis, which proceed from 
the Rankean prescript. 

The hostility to sociology. These defects could 
be remedied to a large extent through an extensive 
and systematic application of sociological concepts 
and techniques. But for the most part historians 
are unwilling to adopt this course, which would 
effectively transmute social into sociological his- 
tory. Their unwillingness originates in a deeply 
rooted hostility to sociology that has existed for a 
century, except among such deviates as the ex- 
ponents of positivistic Kulturgeschichte and the 
epigoni of Marc Bloch and Lucien Febvre. That 
hostility arose out of profound apprehension and 
prejudice. Historians feared sociology’s synoptic 
pretensions and its putative design to reduce their 
discipline to the secondary role of fact collection. 
Moreover, they abhorred its techniques and aspira- 
tions as pseudoscientific. Finally, they disdained 
its achievement as negligible and its subordination
of empirical research to the construction of grandi- 
ose abstractions as only the philosophy of history 
in a new guise. The transformation that sociology 
has undergone since about 1940, when it entered 
its modern age, has almost entirely dispelled their 
apprehension and extinguished their disdain; but 
the strong humanistic bias that inspired their aver- 
sion for its concerns, techniques, and objectives 
continues to foster hostility. 

The hostility of sociologists. This hostility has 
had its counterpart among sociologists. Resentful 
of the ancient heritage, academic respectability, 
superior prestige, and large-scale pretensions of 
history, they long entertained a strong animus 
against it. Two pretensions in particular contri- 
buted to this animus, because they seemed to be 
in serious conflict with the claims of sociology. One 
was history's pre-emption of all recorded behavior 
as its proper sphere; the other was the pedagogic 
role that it assumed as grand interpreter of human 
experience. But if history was disliked as a rival 
discipline, it was also looked upon as an inferior 
one. Sociologists regarded its data as of dubious 
validity; they considered its concern with discrete 
facts as obsessive and its refusal to seek uniformi- 
ties as unscientific. As sociology’s claims to intel- 
lectual and institutional legitimacy gained greater 
recognition, all these attitudes tended to soften;
but the residues are substantial. 

To explain the origins and persistence of such 
hostility on both sides, historians and sociologists 
alike have emphasized that a fundamental antith- 
esis exists between their disciplines. Drawing their 
principal argument from the neoidealism formu-
lated by Wilhelm Dilthey and Heinrich Rickert
toward the end of the nineteenth century, both 
sides have maintained that history is idiographic, 
hence concerned with the unique, the singular, the 
individual, which makes descriptive treatment in- 
evitable; whereas sociology is nomothetic, hence 
concerned with the recurrent, the repetitive, the 
regular, which makes generalization and abstrac- 
tion possible. They have likewise contended that 
history's perspective is diachronic, since it con- 
siders data in temporal sequence, while sociology 
is synchronic, since it considers data without refer- 
ence to time. Then, too, they have argued that the 
techniques of historical research are particularly 
suitable for investigation of the past, whereas the 
techniques peculiar to sociology are applicable only 
to contemporary phenomena. 


The rapprochement of history and sociology 


On both sides, however, there has long been a 
spirited minority, which, denying these contentions
and repudiating the hostility that underlies them,
has resolutely advocated close communion between 
history and sociology. From time to time during the 
past half century, its efforts to promote such com- 
munion have produced intensive discussion, punc- 
tuated by exhortatory and programmatic pronun- 
ciamentos. The most recent efforts have received 
powerful support from the growing sentiment in 
favor of interdisciplinary exchange in general, 
which has exalted the mutual advantages to be 
derived from close cooperation between history 
and all the social sciences. They have likewise re- 
ceived considerable support from sociologists who 
have lately become uneasy over the ahistorical 
orientation of their discipline. That orientation, 
which was deliberately adopted when sociology was 
striving for intellectual autonomy and creating a 
unified system of theory, has been emphatically 
rejected by the empiricist C. Wright Mills and his 
followers, who have held a knowledge of the past 
to be valuable, if not essential, for an understand- 
ing of contemporary society and its problems. Much 
more important is the increasing number of pro- 
ponents of systematic theory who, captained by 
scholars like Robert K. Merton and Bernard Barber, 
have not only strongly recommended the data of 
history and the course of social change as worthy 
of investigation but have also energetically en- 
couraged a rapprochement between historians and 
sociologists. 

Although such a rapprochement remains no more 
than a possibility, some progress has been made in 
that direction. As certain relatively simple socio- 
logical concepts have gained currency among schol- 
ars in general, they have been employed half con- 
sciously by historians. Then, too, some scholars 
in areas like political, intellectual, and religious 
history have toyed with such concepts, permitting 
them to serve as stimuli to the imaginative faculty 
in research as well as in synthesization. Again, still 
others have adopted them deliberately and applied 
them directly, albeit in an irregular, immethodical 
manner. As a result of these practices, recent his- 
toriography includes a substantial segment of par- 
tially sociologized work. Representative of such 
work are the writings of Georges Lefebvre, Albert 
Soboul, and Pierre Goubert. 

Sociological history. Few social historians have 
gone further than this piecemeal, ad hoc use of 
sociological theory. Those few, however, who have 
employed theory both extensively and systemati-
cally have produced genuine sociological history.
Their work is well exemplified by Elinor G. Barber's 
The Bourgeoisie in Eighteenth Century France 
(1955), which forcefully demonstrates the full po- 
tentialities of the approach. Utilizing concepts 
drawn from structural-functional theory, this pio-
neer study skillfully analyzes the changing position
of the middle strata of French society during the 
decades prior to the revolution of 1789. At the 
outset, it identifies the prevailing stratificational 
system as a composite in which caste elements 
were predominant and open class elements were 
secondary; accordingly, social mobility was given 
only limited approval. Then, having carefully ex- 
amined the composition and internal differentiation 
of the bourgeoisie, it demonstrates the harassing 
strains experienced by the socially mobile class, 
which sought to reconcile a partial abandonment 
of traditional Catholic values with a partial adop- 
tion of modern secular ones, and an acceptance of 
a hierarchical class structure with a determination 
to rise. These conflicts, it likewise shows, produced 
in many of the bourgeoisie a strong ambivalence 
concerning the choice of an appropriate style of 
life—guilt, uneasiness, and apprehension attaching 
them to the traditional pattern of their class, ambi- 
tion driving them to adopt the pattern of the nobil- 
ity, whose ranks they sought to enter. Finally, after 
examining the channels whereby they might ascend 
to noble status, the study analyzes their plight 
when such mobility became increasingly difficult 
to achieve, showing how their frustration intensi- 
fied the strains created by their conflicting values 
and how in consequence they were impelled to re- 
ject the whole class structure. Thus Elinor Barber, 
by considering familiar data within a new frame- 
work, achieves a superbly articulated interpretation 
of unusual depth and subtlety and provides an en- 
tirely fresh perspective. 

Some sociologists, animated by their recently 
found interest in historical data, have likewise 
undertaken studies of this type. The possibility of 
their eventually pre-empting the area of social his- 
tory is raised anew by such works as Robert N. 
Bellah’s Tokugawa Religion: The Values of Pre- 
industrial Japan (1957), Neil J. Smelser’s Social 
Change in the Industrial Revolution (1959), Sey- 
mour M. Lipset’s The First New Nation: The United 
States in Historical and Comparative Perspective 
(1963), and Charles Tilly's The Vendée (1964). 
It is this challenge from the outside no less than 
the current defects within that makes it imperative
for social historians to put themselves under the 
tutelage of sociologists in order to transform their 
discipline. 

J. JEAN HECHT 


[Other relevant material may be found in the biog-
raphies of BLOCH; CONDORCET; FEBVRE; LAMPRECHT;
MACAULAY; MILLS; PIRENNE; ROBINSON; SÉE; TUR- 
GOT; VOLTAIRE.] 
VI
INTELLECTUAL HISTORY 


The term “intellectual history” is fairly well 
established in the United States, though the Ameri- 
can Historical Association's Guide to Historical 
Literature (1961) uses it sparingly, preferring 
such rubrics as “cultural history” or “social ideas.” 
There are in common use in the Western world, 
however, many other terms: history of ideas, 
Geistesgeschichte, Ideengeschichte, histoire de la 
pensée, and various others. In its widest sense, 
intellectual history may be said to have as its sub- 
ject matter whatever record is left of the activities 
of the human mind. Its most important and most 
available materials are the products of philoso- 
phers, artists, writers, scientists, recorded in their 
works and in the special histories of specific disci- 
plines—philosophy, literature, religion, the sci-
ences, the arts. But intellectual history is not
merely a summary or even a synthesis of such 
materials; it commonly also attempts to trace and 
understand the dissemination of the work of cul-
tural leaders—their “ideas”—in a given society;
and it also seeks to understand the relation be- 
tween such ideas on one hand and, on the other, 
“drives,” “interests,” and nonintellectual factors 
generally, in individual and in social psychology. 
At its narrowest, intellectual history attempts to 
tell who produced what intellectual or cultural 
achievement when and how; at its broadest, it can 
come close to being a kind of retrospective soci- 
ology of knowledge, even a retrospective general 
sociology. 

Yet intellectual history is not to be understood 
as a kind of master history. It takes the products
of the human intellect as its source materials; it 
does not in itself exhaust the possible play of the 
historian’s own intellect on all the diverse materials 
left by the past. All historical writing, of course, 
requires from the historian at least minimal atten- 
tion to the record of man thinking. Especially in 
modern works in such fields as social and economic 
history, awareness of the role of ideas is increas- 
ing. Usually, however, intellectual history can, if 
only roughly, be delimited by its major concern 
with the written or spoken word, and even, to use 
a term still somewhat suspect among historians, 
with “theory.” There remains then the difficulty of 
clearly distinguishing between intellectual history 
and, for example, the history of philosophy, the 
history of literature, the history of science, and of 
other branches of culture. It is not quite enough to 
say that intellectual history is the all-inclusive his-
tory of all these. Sometimes an intellectual his-
torian, like Preserved Smith in the two volumes of
his History of Modern Culture (vol. 1, Origins of
Modern Culture, 1543-1687; vol. 2, The Enlighten-
ment, 1687-1776; 1930-1934), left unfinished at
his death, does attempt such a difficult all-inclusive 
task, and some of Smith’s topical headings—“The 
Propaganda of the Enlightenment,” “Persecution 
and Tolerance,” “Humanitarianism,” “Morals and 
Manners”—show that he did not limit himself to 
the discussion of clusters of ideas and their affilia- 
tion. But generally the historian of philosophy, for 
instance, is primarily if not exclusively concerned 
with explaining to philosophers or to students of 
philosophy the ideas of other philosophers. He may 
indeed criticize these ideas, that is, evaluate, praise, 
blame; he may, though he need not, attempt to 
find some explanation of these ideas in a given 
philosopher's personal history and in his total en- 
vironment; but he may also treat ideas as breeding 
ideas in a vacuum—or in a “mind.” The intellectual
historian, concerned, as he very often is, with the 
same set of philosophical ideas, must also do some
of what the historian of philosophy does; but his 
main concern must be with what happens to these 
ideas among ordinary educated people and even 
among ordinary uneducated people. 

Perhaps the point can be made more clearly from 
the history of science. It is possible to write a history 
of science, and a very scholarly one, in which the 
aim of the historian is to record discoveries, in- 
ventions, theories; place them in chronological 
sequence; and even explain their dissemination 
among scientists. Such, in fact, was the whole 
work of the distinguished historian of science 
George Sarton. An intellectual historian concerned
at all with natural science would certainly have to
master much of the foregoing; but he would also
have to ask himself what happened to these scien- 
tific theories when they passed into circulation 
among the many. You could write a good history 
of what the work of Darwin has meant to the sci- 
ence of biology in its present state without a word 
about what is commonly called “social Darwinism”; 
but you could not write a good intellectual history 
of the nineteenth century without very serious 
attention to social Darwinism. The difference be- 
tween what the work of Freud means to practicing 
psychoanalysts, and indeed to psychiatrists who 
are not orthodox Freudians, and what Freud's 
work has meant to novelists, playwrights, painters, 
essayists, and the general public is very great in- 
deed. The intellectual historian will have to deal 
with all these last, even though for the professional 
psychoanalyst this Vulgar-Freudismus is a shock- 
ing perversion of the master's true meaning. Much 
of this is well treated in Erik Erikson’s Insight and 
Responsibility (1964). 

This difference, then, between concentrating on, 
placing emphasis on what ideas mean to the expert, 
the professional in a given field, and what they 
mean to the many to whom they somehow do filter 
down is the basic distinction between the historian 
of a special intellectual discipline and the historian 
of ideas. 

Some component of intellectual history is to be 
found in historical writing as far back as the 
Greeks. Herodotus, when he discussed the religious 
beliefs of the Egyptians, and Thucydides, when he 
contrasted the national character of the Athenians 
and the Spartans, were both writing intellectual 
history. The vein of philosophical history that 
attempts to discern what Alfred Kroeber called 
“configurations of culture growth” was never quite 
pinched out even in the Middle Ages, as witness
the sequence Augustine—Orosius—Otto of Freising, 
and has widened greatly in our own time. Machia- 
velli's Discorsi has as its major theme an attempt 
to explain the influence of the religious beliefs of 
the Romans on their political achievements. With 
the Enlightenment of the eighteenth century, in- 
tellectual history, still unnamed, assumed a promi- 
nent place in historiography, if only as “philosophy 
teaching by example.” 

The actual designation “intellectual history,” or 
history of ideas or of thought, and its general ac- 
ceptance as a form of historical writing date from 
the late nineteenth century and the organization 
of professional academic history. In the United 
States the term was made popular by James Harvey 
Robinson, whose Mind in the Making (1921) was 
a best-selling sketch of Western intellectual his- 
tory based largely on his famous Columbia Uni- 
versity course on “the history of the intellectual 
classes.” In Germany Dilthey was in many ways a 
precursor of modern intellectual historians, and 
Max Weber, although formally listed as a sociolo- 
gist, set the mold for much work in the field. In- 
deed, his Protestant Ethic and the Spirit of Capi- 
talism may be taken as a most representative 
piece of intellectual history. Among the works of 
professional historians, Friedrich Meinecke’s Die 
Entstehung des Historismus and the Austrian 
Friedrich Heer’s monumental Europäische Geistes- 
geschichte are evidence that the field is solidly 
established in Germany and Austria. 

Professional historians in both France and Great 
Britain have been more reluctant to write intel- 
lectual history, at least under that name. In France 
historians interested in synthesis, such as Henri 
Berr and Lucien Febvre, have certainly made con- 
tributions to the field; and literary scholars like 
Paul Hazard, whose European Mind, 1680-1715 is 
now standard, have written in the mainstream of 
intellectual history. British historians like J. B. 
Bury, R. H. Tawney, and Christopher Dawson have
paid full attention to the intellectual element in 
history, but intellectual history has been written 
primarily by literary men like Leslie Stephen, 
whose History of English Thought in the Eight- 
eenth Century is a classic. The philosophy of his- 
tory is represented in Great Britain by H. T. Buckle 
(History of Civilization in England), R. G.
Collingwood (The Idea of History), and Arnold J.
Toynbee, whose 12-volume Study of History, as 
summarized in two volumes by D. C. Somervell, 
has had a very wide audience, especially in the 
United States. Croce set the pattern for a whole 
generation of historians in Italy. Croce must be
listed first as a philosopher, but almost all his
writing as a historian shows the hand of the in- 
tellectual historian. Among his many writings in 
the field, we may note La Spagna nella vita italiana 
durante la Rinascenza and Storia della età barocca 
in Italia. 

Among the first few generations of professional 
historians throughout the Western world, there 
was considerable resistance to the formal field of 
intellectual history as not “scientific” enough. As 
compared with the concreteness of the materials 
of institutional, economic, and conventional social 
history, the materials of intellectual history seemed 
vague and difficult to pin down as part of “real” 
life. Moreover, what the intellectual historian prob- 
ably has to call—for want of a better term—the 
“spirit of the age” or the “climate of opinion” of the 
late nineteenth century, at least among scholars, 
tended to minimize, if not to deny, the driving 
force of “ideas,” particularly philosophical ideas or 
ideals, in human life. 

Although traces of this resistance to intellectual 
history still exist, the subject has now attained 
academic respectability; indeed, in the United 
States it has become fashionable. It is proving to 
be one of the most effective bridges between his- 
torians and the practitioners of the social sciences, 
groups still rather definitely separated by mutual 
distrust in most Western countries. The problems 
that the intellectual historian must face, while 
often essentially philosophical, are increasingly 
like those confronting the social scientist. The in- 
tellectual historian is bound to try to be a thinker 
rather than a storyteller. Indeed, he hardly has a 
story to tell. Passage from general sociology and
the sociology of knowledge to intellectual history 
has become both easy and frequent; it is equally 
easy—some would say too easy—to go from depth 
psychology to historical writing. It seems likely that 
many historians for a long time to come will 
proudly and a bit defensively call themselves hu- 
manists and scorn the social sciences; the intel- 
lectual historian, however, cannot really practice 
his craft if he shares this scorn. 


Types of intellectual history 


No rigorous classification of the kinds of intel- 
lectual history is possible. We shall here attempt a 
rough classification into three types, with the neces- 
sary warning that any given work may display 
some touches of all three. 

First, there is intellectual history that tries to 
establish the “facts” about who wrote what when, 
in what form it was published, as well as similar
facts about what was produced in cultural media
other than words, particularly if these other media 
served for “propaganda.” A good example of this 
kind of intellectual history is afforded by the work 
of Charles H. Haskins, notably the articles col- 
lected as The Renaissance of the Twelfth Century 
(1927) and Studies in Medieval Culture (1929). 
Haskins was an impeccably trained medievalist of 
the old school who did much work in institutional 
history. What interested him in intellectual history 
was chiefly the ways in which Greco-Roman works 
—the actual manuscripts—survived, were copied, 
indeed often came back into Western culture via 
translations into Arabic and thence into medieval 
Latin. He was certainly not indifferent to the con- 
tent of the manuscripts he so carefully studied— 
in fact his very use of the term “Renaissance” in 
his title involves another level of historical gen- 
eralization, another kind of intellectual history. 
Still, his main task was to establish by research in 
the original sources a straightforward account in 
the Rankean tradition of wie es eigentlich gewesen. 

Spadework of a similar kind is, of course, always 
essential. For the modern intellectual historian 
there are even many problems of “fact” that must 
be cleared up before he can go on to other prob- 
lems—problems such as who wrote certain pseu- 
donymous or anonymous works, problems of 
clandestine publication, authenticity of memoirs, 
and the like. In this classification too belongs the 
effort to establish the facts (sometimes capable of 
being put statistically) of the dissemination in 
specific circumstances of certain works and even 
of the ideas contained in them. Ancillary to this 
effort is work close to demography, such as the 
study of the degree of literacy in a given popula- 
tion. In actual practice many of these investiga- 
tions can hardly be separated from studies of 
problems in the sociology of knowledge, from 
simple literary ones of “influence of A on B” to 
more sophisticated attempts to analyze the rela- 
tions between words and deeds. But the establish- 
ment of the facts is essential spadework in intel- 
lectual as in any other kind of historical writing. 

Second, there is the more difficult kind of intel- 
lectual history, also concerned rather more with 
establishing than with evaluating or synthesizing 
facts, to which American usage in particular tends 
to apply the term “history of ideas.” We are here 
concerned with what can be called the cartography 
of ideas or (perhaps) semantics. The school formed 
around Arthur O. Lovejoy at Johns Hopkins affords 
a good example of this approach. Lovejoy identified 
as a “cluster of ideas” such complex and usually 
very common terms as “nature,” “reason,” “ro- 
mantic.” His major task was that of analyzing the 
constituent elements of these clusters of ideas. 
His Great Chain of Being (1936), for example, 
traces in Western culture the history of one such 
cluster of ideas, that of a hierarchy of interrelated 
living beings from barely sentient ones to the high- 
est and best developed. Although other historians 
have not always matched the scholarly subtlety of 
Lovejoy and his colleagues—who have found some 
sixty shades of meaning for the word “nature”— 
this sort of analysis is an essential part of intel- 
lectual history. It can be applied to the work of a 
given thinker: what did Hobbes, Locke, Rousseau 
understand, “really mean,” by “social contract,” or 
just what did Nietzsche intend the “superman” to 
be and do? It can also be applied to distinguish 
between different uses, different emotional effects, 
of certain words or phrases in given times and 
places. For this a very neat example is the differ- 
ence between the effect of the word “federalist” 
when used in France and in the United States in 
1793—or, indeed, when used in New England and 
in Virginia at the same time. 

Both these first two kinds of intellectual history 
are essential to the third, which is the central con- 
cern of intellectual history in our time. Its task 
may be put with undue simplicity as the study of 
the relation between what men say and what they 
do. “Do” has its obvious complexities; but “say” 
too is mere shorthand for all that goes on in the 
cerebral cortex, and unless modern psychology is 
wholly on the wrong track, in less noble parts of 
the human central nervous system. A very good if 
rather worn example of this kind of intellectual 
history is afforded by the old debate over the influ- 
ence of the Enlightenment on the French Revolu- 
tion (see Church 1964). One extreme position in 
this debate, taken for instance by Felix Rocquain 
in L'esprit révolutionnaire avant la révolution, is 
that hard, specific grievances were all-important 
in producing the revolution and that the work of 
the philosophes was of little, if any, importance. 
Rocquain’s position has significance for later in- 
tellectual historians as a probable reflection of 
Marxist ideas and is certainly a reflection of the 
distrust felt by French political radicals in the 
1870s for any form of “idealism.” The opposite ex- 
treme position is commonly taken by conservatives 
who dislike the French Revolution and who sub- 
scribe to one form or another of the conspiracy 
theory of history. Freemasons, philosophes, Illumi- 
nati, Jacobins, are variously singled out as the 
fanatics of the Enlightenment responsible for 
everything that happened during the revolution. 
Taine’s famous metaphor in his Origines de la 
France contemporaine is typical: if you see a 
healthy man take up a full glass, drain it, and then 
fall down foaming at the mouth in convulsions, 
you know there was a poison in the glass; the man 
was the Jacobin, the glass contained the ideas of 
the philosophes. In between these extremes the 
debate, which is by no means ended, shows many 
variants of interpretation of the nature and extent 
of the effect of the Enlightenment on what really 
happened. 

The intellectual historian who attempts to judge 
the nature of the effects of an idea or cluster of 
ideas on human events is confronted with the old 
problem of value judgments. It is all very well to 
use, as we have used above, the metaphor of intel- 
lectual history as the “cartography of ideas.” But 
the map maker does not judge, evaluate, criticize, 
or even comment on the actual piece of the terres- 
trial globe he is mapping; only the traveler using 
the map as guide may indeed feel and say “This 
is lovely” or “This is ugly” as he looks at the land- 
scape. The historian, strive though he may to be 
like the map maker, can hardly avoid being like 
the traveler. 

To use once more the field of the Enlightenment 
as an example, intellectual historians have not 
only varied greatly in their interpretations of what 
certain eighteenth-century political thinkers meant, 
but in whether what they meant was productive of 
good or bad. Rousseau’s Social Contract is a good 
focusing point for this problem. One school, repre- 
sented by Jacob T. Talmon, in his Rise of Totali- 
tarian Democracy, finds that Rousseau himself 
meant the work to set up a sovereign power whose 
will was absolute and that the effect of the work 
on political activists like Robespierre and Babeuf 
was certainly to justify their arbitrary “democratic 
totalitarianism.” An opposite point of view is rep- 
resented by the distinguished German philosopher 
Ernst Cassirer, whose Question of Jean-Jacques 
Rousseau, translated with an illuminating intro- 
duction by Peter Gay, holds that Rousseau meant 
the general will to represent a kind of idealized 
moral imperative and that its real influence has 
indeed been to promote the democratic and indi- 
vidualistic freedoms. 


Types of studies 


It must be admitted that into whatever sub- 
classifications intellectual history is broken down, 
it is a sprawling field, with a very great range of 
subject matter and treatment. It can be concerned 
with tracing over a long period a recurring theme 
of man’s thinking. Such “thread” accounts are 
Lovejoy’s above-mentioned Great Chain of Being 
or J. B. Bury’s Idea of Progress. It can adopt the 
approach of comparative history, which tries to dis- 
cern common as well as unique elements in ideas 
and attitudes expressed at different times and in 
different places. An interesting and highly contro- 
versial instance of this approach is the work in 
American intellectual history of Richard Hof- 
stadter, who finds common elements, such as belief 
in the “conspiracy theory” of politics, in groups 
commonly held to be quite unlike—Populists, Pro- 
gressives, and McCarthyites. At least as contro- 
versial, if not of such timely interest, is Carl 
Becker’s Heavenly City of the Eighteenth-Century 
Philosophers, in which, as the title implies, the 
author finds significant intellectual attitudes com- 
mon to medieval Augustinians and the philosophes 
of the Enlightenment. 

Histories of utopias—both the writings usually 
so classified in histories of literature and the actual 
experiments with group living in communities 
under the influence of utopian thinking—are ob- 
viously subjects for the intellectual historian. Al- 
though there is an enormous body of historical and 
literary writing about utopias, the subject has not 
had a first-rate general treatment in all its phases, 
a difficult and challenging task. Many wide topics 
in intellectual history, however, have had such 
treatment. This is especially true of the history of 
Christianity and, indeed, of the history of religion 
generally. While the historian of formal philosophy 
can always limit himself to the analysis of ideas 
in and for themselves, a process which is not intel- 
lectual history, the historian of religion can hardly 
avoid taking up the relation between religious be- 
liefs and the behavior of human beings, as well as 
the history of institutions founded on these beliefs. 
Much in this field that can be classified as intel- 
lectual history has been the work of sociologists, 
particularly in Germany. Although Harnack is com- 
monly listed as a historian, Troeltsch and Weber 
are listed as sociologists. 

An important problem in intellectual history, 
that of the cultural generation, has been neglected 
by historians; it has had its best specific treatment 
in a short article by Karl Mannheim, “The Problem 
of Generations” (1928). There are interesting re- 
flections on the problem in Ortega y Gasset, Man 
and Crisis (1933, pp. 30-85 in 1959 edition), a 
translation of his En torno a Galileo. 

It would be hard to deny an element of intel- 
lectual history to all the interpretative “leads,” the 
suggestive “ideas” in the folk sense of that word, 
that have enriched historical writing. There are re- 
cent examples, such as the Pirenne thesis that the 
real break between late Roman culture and that 
of the Middle Ages was not caused by the Germanic 
invasions of the fifth and sixth centuries but by 
the Arab invasions several centuries later, followed 
by the Viking incursions; Marc Bloch’s interpreta- 
tion of French agrarian history in Les caractères 
originaux de l’histoire rurale française; the con- 
trasting conceptions advanced by Tawney and 
Trevor-Roper on the composition and the role of 
the “gentry” in early modern England (Tawney’s 
Religion and the Rise of Capitalism is an English 
variation of the work of Max Weber); Braudel’s 
use of leads from human geography in his study 
of the Mediterranean; Meinecke’s clear definition 
of one kind of historicism; the very familiar Turner 
thesis of the role of the frontier in American his- 
tory; and so on down to theses involving apparently 
very minor concrete details, such as that brought 
out by Lynn White on the wide-reaching effects of 
the early medieval invention of the horse collar or 
that advanced by Walter Webb on the changes in 
the American great plains made possible by the 
availability of inexpensive barbed wire. 


The task of the intellectual historian is difficult. 
He must try to get source materials for the opinions 
and attitudes of at least a sampling of those strata 
in a given society that he thinks are touched by 
the ideas he discusses. With the invention of print- 
ing and especially with the development of the 
mass media and, in our own day, opinion polling, 
he gets almost too much material. For earlier 
periods he has to scrape together what he can, 
much helped by what several generations of 
workers have accumulated under such headings as 
social history, history of morals, Sittengeschichte, 
and the like. The very great body of what is com- 
monly called “literature” from the Egyptians, 
Greeks, ancient Chinese, and others down to the 
present is, of course, a mine of information on the 
opinions and attitudes of fictional men and women 
who were not “intellectuals” and who may often 
have been “typical” of their culture. 

Finally, intellectual history has necessarily close 
relations with some of the social sciences, notably 
sociology and what is usually called cultural an- 
thropology. The practitioners of cultural anthro- 
pology are now venturing increasingly into the 
study of developed societies with abundant re- 
corded history. For these, and for some of the other 
social sciences, intellectual history—and, indeed, 
other kinds of history—can supplement observa- 
tion and experimentation by providing materials 
essential for the understanding of development 
through time, materials comparable in no mere 
figure of speech to those that paleontology and his- 
torical geology contribute to the earth sciences. 


CRANE BRINTON 


[See also SCIENCE, article on THE HISTORY OF SCIENCE; 
and the biographies of CROCE; DILTHEY; FEBVRE; 
MANNHEIM; MARX; MEINECKE; ROBINSON; SARTON.] 

VII 
ECONOMIC HISTORY 

Economic history is broadly concerned with the 
performance of economies in the past. The issues 
that are relevant to an economic historian range as 
widely as an interest in the growth, stagnation, or 
decline of economies; the well-being of individual 
groups in the economy during the course of eco- 
nomic change; and the interrelationship between 
economic organization and performance. This last 
issue necessarily focuses on the institutional struc- 
ture of the society. As a result, economic history 
frequently spills over into the allied fields of social 
and political history. However, the major issues of 
economic history fall into two rather broad cate- 
gories—(1) the over-all growth of the economy 
over time and the determinants of that growth (or 
stagnation or decline) and (2) the distribution of 
income within that economy in the course of its 
growth or decline. The latter concern covers the 
whole range of issues of the well-being of diverse 
groups in the society during the course of economic 
change in the past. 

The distinguishing feature of economic history 
as compared with the discipline of economics itself 
is its paramount concern with problems of the past 
rather than of the present. It is distinguishable 
from general historical inquiry not only by its spe- 
cialized concern with economic aspects of past so- 
cieties but also by its appeal to a systematic body 
of theory as a source of generalization and by the 
equally systematic use of quantitative methods of 
organizing evidence. 

Recent changes in the discipline. The use of the 
above characteristics to separate economic history 
from general historical inquiry reflects a distinct 
change in the discipline in recent times. In the 
Encyclopaedia of the Social Sciences, John Clap- 
ham could describe the discipline as “a branch of 
general institutional history, a study of the eco- 
nomic aspects of the social institutions of the past” 
(1931, p. 327). In his article, Clapham made clear 
that economic theory had a minor role in economic 
history and “the relationship of economic history 
to social history is much closer” (1931, p. 329). 
Essentially, the field was a branch of historical 
inquiry employing the methods of the historian. 

In the intervening years, the discipline has grad- 
ually adopted more of the methodology of the social 
sciences. While this revolution in economic history 
is far from over, and while today much if not most 
economic history is still written by scholars trained 
as historians, the direction of change is unmistak- 
able. The pioneering studies in this transformation 
include Clapham’s emphasis on measurement 
(1926-1938); Eli Heckscher’s plea for an increas- 
ing use of economic theory in economic history 
(Lane & Riemersma 1953); and the studies of the 
International Committee on Price History (Cole & 
Crandall 1964), including William Beveridge’s 
work on England (1939), Earl Hamilton’s on 
Spain (1934), N. W. Posthumus’ on Holland 
(1943), and Arthur Cole’s on the United States 
(1938). Occasional efforts to systematically apply 
principles of economic analysis to problems in eco- 
nomic history were highlighted by T. S. Ashton’s 
work on the English industrial revolution (1948) 
and Walt W. Rostow’s effort to analyze the British 
economy of the nineteenth century (1948). 

In the period since World War II, and particu- 
larly since 1950, the professionally trained econo- 
mist has led the new development of the field. In 
some instances, such as the work of the Entre- 
preneurial Center at Harvard, research was inspired 
by Joseph Schumpeter’s emphasis on the creative 
role of the entrepreneur in economic development 
(1939). The direction of research under Arthur 
Cole’s leadership of the center was an early (and 
perhaps premature) attempt to synthesize the 
social sciences in order to establish a more com- 
prehensive theoretical framework for economic 
historians (Harvard University . . . 1949). [See 
HISTORY, article on BUSINESS HISTORY.] 

Three developments have furnished the major 
stimuli to the redirection of economic history. First 
there has been the growing interest of economists 
in the study of economic growth. Since World War II 
economists have devoted a major share of their 
attention to attempting to understand the sources 
of economic development and to account for the 
widely divergent patterns of economic growth be- 
tween the high-income countries of the Western 
world and the low-income countries of the under- 
developed parts of the world. The study of eco- 
nomic development has led economists to isolate 
important elements and determinants of economic 
development, even though they do not fit as yet 
into an over-all general theory. These elements in- 
clude the problems of adapting and modifying 
technology from one economy to another, having 
different factor endowments; the importance of in- 
vestment in human capital; and the study of the 
development of efficient factor and product mar- 
kets. The research of the economist on the sources 
of increased productivity, which underlies eco- 
nomic development, has revolutionary implications 
for re-examining the way in which economic his- 
torians have accounted for development in the 
past. [See AGRICULTURE, article on PRODUCTIVITY 
AND TECHNOLOGY; ECONOMIC GROWTH; PRODUC- 
TIVITY.] 

The second source of change has been the grow- 
ing interest of economists in the more precise 
testing of their hypotheses. The development of op- 
erational propositions which can be tested has be- 
come a major concern of economists and is being 
extended to problems in economic history. This 
involves the sophisticated use of statistical tech- 
niques and methods as well, of course, as the care- 
ful use of economic theory. [See ECONOMETRICS.] 

The third source of change has been the growing 
volume of quantitative information about the past. 
This concern with measurement of the perform- 
ance of economies in the past has been largely the 
work of economists—or, in recent years, of eco- 
nomic historians trained in economics. The de- 
velopment of national income accounting has 
contributed significantly to the measurement of the 
performance of economies in the past. Simon Kuz- 
nets has played a pioneering role in both the devel- 
opment of national income accounts and their 
systematic application to the measurement of the 
past performance of economies (1956-1963). The 
pioneering studies in price history have been sup- 
plemented in many Western countries. The quanti- 
tative studies of the National Bureau of Economic 
Research in the United States have been paralleled 
in many other countries. The result is an enormous 
increase in quantitative information, which pro- 
vides the economic historian with empirical data 
that he may put to systematic use in testing his 
propositions. 

Taken together, these three developments have 
led to a growing reorientation of economic history 
toward the employment of scientific methodology 
and the systematic use of quantitative measure- 
ment. With these essential tools, the economic 
historian may be able to provide far greater under- 
standing of the past than he has heretofore done. 
Therefore this article essentially offers a methodo- 
logical prescription for the present and the future 
instead of surveying the past literature of the field. 
While the illustrations are drawn from American 
economic history, the underlying principles they 
illustrate are universally applicable. 

Explanation in economic history. The primary 
objective of the economic historian is explanation. 
He seeks to understand the way economies have 
operated or the way the welfare of people in the 
society has been affected by economic phenomena. 
In this respect, explanation in economic history 
does not differ significantly from scientific expla- 
nation in the natural and physical sciences. It not 
only involves the careful unearthing of facts and 
evidence about the phenomenon to be explained 
but also requires the application of generalizations 
to reduce the shapeless mass of evidential infor- 
mation to an orderly explanation. Therefore, ex- 
planation in economic history, as in the sciences, 
involves the statement of the essential background 
conditions—that is, singular statements of facts 
which provide the setting for the particular pattern 
of events to be explained—followed by the applica- 
tion of general principles which will provide the 
explanation. 

The economic historian, then, is concerned with 
determining the extent to which his explanation 
fits the empirical evidence he is able to obtain. 
Ideally, this empirical evidence is quantitative in 
nature; it may be, however, only a number of quali- 
tative statements to which he assigns particular 
weights. To the extent that the empirical evidence 
runs contrary to his generalizations, the economic 
historian should re-explore the background condi- 
tions which he has assumed or modify and develop 
new generalizations which will be more consistent 
with the available empirical evidence. This process 
of give-and-take between the development of gen- 
eralizations, the specification of background condi- 
tions, and the testing of the generalizations against 
systematic empirical evidence is the way by which 
the economic historian attempts to provide the ex- 
planation of historical phenomena. 

The body of theory that the economic historian 
employs is that of economics. This theory rests 
upon a number of basic axioms and postulates from 
which are derived subsidiary propositions that ex- 
press the general form of the functions used in 
constructing models. These models represent broad 
generalizations of economic behavior. [See ECO- 
NOMICS.] This body of economic theory has emerged 
in the past two centuries in the course of a con- 
tinual interchange between the development of 
generalizations and their testing against empirical 
evidence. While economic theory provides certain 
basic models of economic behavior, application of 
these models to given historical situations requires 
the specification of the particular functional forms, 
parameters, or changes in parameters which may 
not be known to the economic historian. Therefore, 
the model that he constructs is one in which these 
forms and shifts of functions must be discovered 
and specified. The empirical verification of histori- 
cal models requires the testing of the functional 
relationships implicitly or explicitly embodied in 
the explanation in order to see whether the param- 
eters of these equations are consistent with avail- 
able data. 

For example, hypotheses that attribute the fail- 
ure of the southern United States to industrialize 
before the Civil War to the small size of south- 
ern markets for manufactured goods rest upon 
assumed shapes of the supply functions of manu- 
facturing industries at that time and can be tested, 
at least in part, by standard statistical procedures. 
Likewise, the contention that the discontent of 
farmers in western Massachusetts in the post-Revo- 
lutionary War period was due to the severe burden 
imposed upon them by a whiskey tax can be tested 
by measuring the elasticity of the demand for 
whiskey at that time. This information is needed to 
determine the incidence of the tax. 
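
The link between elasticity and incidence in the whiskey-tax example can be made concrete with a short sketch. It uses the standard partial-equilibrium approximation for a small per-unit tax, in which the less price-responsive side of the market bears the larger share. The function name and the elasticity values are hypothetical and serve only as illustration; they are not estimates for the period.

```python
def tax_incidence(demand_elasticity: float, supply_elasticity: float) -> tuple[float, float]:
    """Approximate split of a small per-unit tax between buyers and sellers.

    Standard textbook approximation: each side's share is proportional
    to the OTHER side's elasticity, so the less responsive side pays more.
    """
    ed = abs(demand_elasticity)  # demand elasticities are negative by convention
    es = abs(supply_elasticity)
    if ed + es == 0:
        raise ValueError("at least one elasticity must be nonzero")
    buyer_share = es / (ed + es)
    seller_share = ed / (ed + es)
    return buyer_share, seller_share


# Hypothetical elasticities, chosen only to show the mechanics.
buyers, sellers = tax_incidence(demand_elasticity=-0.5, supply_elasticity=1.5)
print(f"buyers bear {buyers:.0%} of the tax, sellers bear {sellers:.0%}")
```

Under these assumed values most of the tax falls on buyers, so a claim that producers carried a severe burden would be weakened; with a highly elastic demand the conclusion would reverse.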

Causal explanations implicitly involve counter- 
factual propositions. That is, they imply that “had 
conditions been different” the causal sequence in- 
ferred in the proposition would not have taken 
place. A statement that the industrial revolution in 
Britain was induced by the expansion of population 
implies the counterfactual proposition that in the 
absence of this population increase the industrial 
revolution would not have occurred. A statement 
that the monopoly practices of the “robber barons” 
at the end of the nineteenth century significantly 
lowered the income of workers and farmers im- 
plies the counterfactual statement that in the ab- 
sence of monopoly profits the incomes of farmers 
and workers would have been significantly higher. 

The testing of hypotheses. The testing of ex- 
planations in economic history can take several 
forms. These include examination of (1) the em- 
pirical validity of the background conditions; (2) 
the logical consistency of the model; (3) the em- 
pirical validity of functions that relate the back- 
ground conditions to the conclusions; (4) the 
empirical validity of the conclusions. The most 
appropriate point at which to test a given explana- 
tion depends on the issues under consideration and 
the availability of data. Thus, an explanation of the 
causes of farmer discontent in the late nineteenth 
century in the United States which asserts that the 
source of this discontent was the more rapid fall 
of farm prices than of prices of other goods can be 
refuted by empirical data showing that farm prices 
fell no more rapidly than other prices (North 
1966). A hypothesis which maintains that slavery 
would have fallen under its own weight without a 
civil war rests upon the economic viability of slav- 
ery as an institution. It can be refuted by a sub- 
sidiary hypothesis if, under testing, the latter shows 
that slavery was a profitable institution (Meyer & 
Conrad 1958). It should be noted that a confirma- 
tion of the profitability of slavery will refute the 
earlier hypothesis; but if slavery should prove to 
have been unprofitable, it still would not prove the 
institution was nonviable, since Southerners may 
have wanted to buy and hold slaves for noneco- 
nomic reasons. 
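
The first test described above, that of the farm-price hypothesis, amounts to comparing the movement of two price indexes. A minimal sketch follows, assuming index numbers for farm prices and for other prices at the start and end of a period. The index values and the tolerance are hypothetical and are not taken from the studies cited.

```python
def relative_price_change(farm_start: float, farm_end: float,
                          other_start: float, other_end: float) -> float:
    """Percentage change in farm prices relative to other prices."""
    start_ratio = farm_start / other_start
    end_ratio = farm_end / other_end
    return (end_ratio / start_ratio - 1.0) * 100.0


def hypothesis_refuted(change_pct: float, tolerance_pct: float = 1.0) -> bool:
    """True if farm prices did not fall noticeably faster than other prices.

    tolerance_pct is an arbitrary allowance for measurement error.
    """
    return change_pct >= -tolerance_pct


# Hypothetical index numbers: both series fall by the same proportion.
change = relative_price_change(farm_start=100.0, farm_end=70.0,
                               other_start=100.0, other_end=70.0)
print(change, hypothesis_refuted(change))
```

A relative change near zero contradicts the explanation; a large negative value would leave it standing, though not thereby proved.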

The economic historian may also be able to test 
a given counterfactual proposition. Thus, an argu- 
ment that the railroad was indispensable for U.S. 
economic development in the nineteenth century 
could be refuted by testing the counterfactual prop- 
osition on which it rests—that is, by showing that 
the cost of moving goods by the best alternative 
form of transportation would not have been sub- 
stantially higher than the cost of moving goods by 
railroad. Such a test involves the determination of 
the supply function of rail and nonrail transport 
services (Fogel 1964). Similarly, the statement 
that the robber barons significantly lowered the 
income of farmers and workers by monopoly prac- 
tices could be refuted if it were shown that the 
total amount of monopoly profit at that time, if 
redistributed among farmers and workers, would 
not have significantly raised their incomes (North 
1966)—and under the further assumption that the 
misallocation of resources under monopoly condi- 
tions would not have significantly affected this. 
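
Both counterfactual tests in this paragraph reduce to simple arithmetic once the underlying quantities are known. The sketch below is illustrative only: the function names and all figures are hypothetical, traffic volume is held fixed between the two modes of transport, and, as the text notes for the monopoly case, any misallocation of resources is ignored.

```python
def social_saving(ton_miles: float, rail_cost: float, alternative_cost: float) -> float:
    """Extra cost of carrying the same traffic by the best alternative mode.

    Costs are per ton-mile; traffic volume is assumed unchanged.
    """
    return ton_miles * (alternative_cost - rail_cost)


def share_of_income(amount: float, national_income: float) -> float:
    """Express an amount as a fraction of national income."""
    return amount / national_income


def redistribution_gain_share(monopoly_profit: float, recipients: float,
                              average_income: float) -> float:
    """Proportional rise in income if monopoly profit were shared equally."""
    return (monopoly_profit / recipients) / average_income


# Hypothetical magnitudes in arbitrary units.
saving = social_saving(ton_miles=1000.0, rail_cost=1.0, alternative_cost=1.5)
print(share_of_income(saving, national_income=10000.0))
print(redistribution_gain_share(monopoly_profit=1000.0, recipients=100.0,
                                average_income=500.0))
```

If the first ratio is small, the claim of indispensability is weakened; if the second is small, so is the claim that monopoly practices significantly lowered incomes.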

Methodological techniques illustrated. An ex- 
tended illustration can illuminate the whole proc- 
ess of research and testing and the problems in- 
volved. The hypothesis that British imperial policy 
significantly lowered the income of the American 
colonists in the period 1763-1775 will serve as the 
example. This hypothesis implies as a counterfac- 
tual statement that in the absence of specific Brit- 
ish policies the income of the colonies would have 
been significantly higher. The information needed 
to obtain a precise answer to this question is the 
actual income of the colonists between 1763 and 
1775 as against the hypothetical income the colo- 
nists would have received during this period in the 
absence of the British policies (i.e., as an independ- 
ent country outside British regulation and protec- 
tion). Since the actual income of the colonists is 
not known, the problem cannot be attacked directly, 
but it might be resolved indirectly by measuring 
the net difference in income that would have oc- 
curred had the specific policies been eliminated. 

The researcher requires first an intimate knowl- 
edge of the structure and characteristics of the 
colonial economy and the specific aspects of the 
Navigation Acts and other British imperial policies 
that impinged upon the colonial economy. It should 
be noted that even the process of selection of facts 
as a part of the background conditions involves 
theorizing, since it is impossible to separate out 
relevant and important facts from irrelevant ones 
without a theory concerning the way an economy 
operates. Thus, when the economic historian dis- 
cards as relatively unimportant the British restric- 
tions on colonial manufacturing, he does so first 
because economic theory tells him that any econ- 
omy characterized by a scarcity of labor and capital 
relative to land—the situation of the American 
colonies—will not typically engage in manufactur- 
ing because its costs will be higher than those of 
competing regions. He is furthermore supported in 
his initial assumption through examination of data 
on the economy of the United States after inde- 
pendence, which shows that manufacturing did not 
loom large even when the ex-colonists were free to 
engage in it. On the other hand, he will be im- 
pressed by the fact that tobacco and rice were enu- 
merated (i.e., had to be shipped to England), and 
the requirement that imports move through Britain 
probably had a significant effect upon colonial in- 
come. 

While the general shape of the relevant demand 
curves would stem from basic economic axioms 
(i.e., that the demand curves would be negatively 
sloped), it now becomes necessary to obtain a spe- 
cific measure of the elasticity of demand and sup- 
ply for tobacco in order to assess exactly the ex- 
tent of the burden involved. By getting data on 
the spread between Virginia and Amsterdam prices 
of tobacco before and again after the revolution, 
the economic historian is able to show how the 
price spread narrowed. However, in order to be able 
to find out how much more tobacco would have 
been bought at the lower price that would have 
prevailed without British restriction, he must know 
the elasticity of demand. Since the data needed to 
compute that elasticity are not available, the his- 
torian may have recourse to other studies of com- 
modities which appear to have similar characteris- 
tics and which give him the assumed elasticity of 
demand. He must also know how much more to- 
bacco would have been supplied had a higher price 
prevailed (i.e., he must know the elasticity of sup- 
ply). Here he may get a proxy by seeing how much 
the tobacco supply expanded after the revolution 
in response to a higher price. Alternatively, he 
might look at the conditions of supply in Virginia 
and Maryland to see to what extent the supply of 
inputs of land, labor, and capital was capable of 
providing more tobacco and whether at the same 
cost or at rising costs. 
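
The demand-side step of this reasoning can be illustrated with a minimal sketch. It assumes a constant-elasticity demand curve passing through the observed price and quantity, which is a simplifying assumption and not something the historical evidence establishes. The function name and every figure below are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def counterfactual_quantity(observed_quantity, observed_price,
                            counterfactual_price, elasticity):
    """Quantity demanded at another price on a constant-elasticity curve.

    The curve is assumed to pass through the observed (price, quantity)
    point; `elasticity` is the assumed price elasticity of demand and is
    negative for a downward-sloping curve.
    """
    if observed_price <= 0 or counterfactual_price <= 0:
        raise ValueError("prices must be positive")
    price_ratio = counterfactual_price / observed_price
    return observed_quantity * price_ratio ** elasticity


# Hypothetical figures, not historical data.
observed_price = 10.0        # price paid by buyers under the restrictions
counterfactual_price = 8.0   # lower price assumed to prevail without them
observed_quantity = 100.0    # quantity actually bought

# The true elasticity is unknown, so a range of assumed values is tried.
for elasticity in (-0.5, -1.0, -2.0):
    quantity = counterfactual_quantity(
        observed_quantity, observed_price, counterfactual_price, elasticity
    )
    extra = quantity - observed_quantity
    print(f"assumed elasticity {elasticity}: extra quantity bought {extra:.1f}")
```

Running the calculation over several assumed elasticities shows how strongly the estimated burden depends on that one borrowed parameter, which is why the choice of comparable commodities matters.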

The economic historian must examine all other
aspects of the colonial economy on which British 
policy impinged. The intimate knowledge of those 
facts of the period which theory attests to be rele- 
vant will lead him not only to assess individual 
burdens but also to recognize that there were bene- 
fits to being a part of the British imperial system 
which will have to be calculated in similar fashion. 
Thus, he will have to measure the extent to which 
income from shipping was increased by inclusion 
within the British imperial system. Similarly, he 
will have to measure the extent to which the colo- 
nists would have had to underwrite their own de- 
fense in the absence of British protection. He may 
find that a proxy for these counterfactual condi- 
tions can be found in the period from 1785 to 1793, 
after the colonies became independent. That is, he 
is assuming that the period 1785-1793 approxi- 
mates in significant aspects the way the economy 
would have operated had it been free and inde- 
pendent in the years 1763-1775. 

This illustration not only provides a capsule in- 
dication of the necessary and essential methods 
by which the economic historian may do meaning- 
ful research, but it also provides some indication 
of the problems and difficulties involved in his 
task. Has he really specified and taken into account 
all of the indispensable conditions? Was he correct 
in ignoring the Stamp Act as not imposing signifi- 
cant burdens upon the colonists? Are his assumed 
elasticities of supply and demand the correct ones, 
or can additional evidence be garnered which 
would indicate that they were different from those 
he specified? Were the conditions between 1785 
and 1793 really a proxy for those from 1763 to 
1775, thereby enabling him properly to employ this 
period as a proxy to the counterfactual situation? 
In fact, what he is doing is comparing a condition 
that existed—that is, income of the colonists be- 
tween 1763 and 1775—with the hypothetical model 
of what would have occurred in the absence of 
British policies. This hypothetical model of a sit- 
uation that did not in fact exist is essentially a 
general equilibrium model, and therefore it is essen- 
tial to his argument and his conclusions that the 
repercussion effects of those things which he does 
not take into account or which he argues are of 
small magnitude are in fact immaterial and would 
not significantly alter the conclusions he reaches. 

Advances in economic theory will lead to reap- 
praisals of the economic past. Like the traditional 
historian, the present-day economic historian will 
frequently be guided by his ideological preconcep- 
tions in making a choice of issues to be examined; 
but his testing of the resultant hypotheses should 
be neutral with respect to current ideological biases
and should result in a continuous narrowing of the 
range of disagreement and an increasing under- 
standing of the past. 

It can be seen from the above illustration that 
the limitations of inquiry in economic history are 
those imposed by the limitations of existing theory 
and of available evidence. 

The uses and limitations of theory. While eco- 
nomic theory provides generalizations that can be 
applied to a broad range of issues in economic his- 
tory, particularly to those dealing with the welfare 
of groups at particular times in the past, there is 
no general theory of economic growth to which 
the economic historian can turn in exploring this 
major aspect of economic history. While research 
in the study of economic growth of the past twenty 
or thirty years casts doubt upon many of the im- 
plicit or explicit hypotheses of economic historians, 
there is still no over-all framework which the eco- 
nomic historian can neatly apply. So, in this field, 
as in so many other aspects of economic history, 
the scholar must essentially develop his own theo-
retical framework. Similarly, where the economic
historian wishes to explore the theoretical border- 
lines between economic and social history, he must 
rely on the other social science disciplines or de- 
velop a framework of his own to explore these rela- 
tionships. There is no reason, of course, why the 
economic historian should be limited to received 
theory in economics. He is free to develop and apply 
theory of his own. However, caution in such an 
endeavor is obviously essential. The likelihood that 
the economic historian who is untrained in the 
principles of economics can derive theoretical prop- 
ositions of any significance is very slim indeed. 
There is as wide a gap between common-sense ob- 
servations in economics and economic generaliza- 
tions as there is between common-sense observa- 
tions in the physical sciences and the general laws 
of those physical sciences. We would not expect a 
layman to be able to derive from simple observa- 
tion of physical properties the general laws of 
physics; nor can observations of economic phe- 
nomena lead an untrained economic historian to 
develop valid generalizations with respect to eco-
nomic theory. Economic theory has evolved in the
give-and-take between the development of general- 
izations and their testing over a long enough pe- 
riod of time, and it cannot and should not be 
ignored in the course of analysis. The economic 
historian trained in economic theory will be well 
aware of the pitfalls inherent in economic analysis. 
Therefore, if he wishes to develop his own theoret- 
ical framework, he will take careful account of 
the work that has gone on before and the degree 
to which previous generalizations are supported by 
available evidence. 

The limitations of empirical evidence. Limita- 
tions of empirical evidence pose equally serious 
problems for the economic historian. He is faced 
with discrete, nonrepetitive, past performance: the 
artifacts and evidence that remain are his mate-
rial. Therefore, it is essential that he attempt sys-
tematically to develop evidence about the past from
this fragmentary information. As indicated above, 
that which comes closest to providing him with 
accurate tests is quantitative evidence which pre- 
cisely defines and isolates the particular phenomena 
that he wants to measure. Quantitative information 
in such ideal form seldom exists, and the economic 
historian is forced to make use of the more frag- 
mentary evidence which has typically survived 
from the past. Making the most of the evidence 
requires a knowledge of statistical theory so that 
he can use to the best effect whatever data are 
available. 

The illustration used above indicates some of 
the ways the economic historian can employ bits 
and pieces of quantitative information when the 
ideal data are not available. In the above case, 
neither the actual national income of the colonists 
between 1763 and 1775 nor the hypothetical in- 
come in the absence of British policies is known. 
But the net difference in income can be derived 
even without the absolute figures. A complete series 
of prices and quantities for tobacco, rice, and other 
affected commodities is also not available, but 
proxies for these can be obtained that provide rea- 
sonably good measures for the unknowns with re- 
spect to the shape of the relevant demand and sup- 
ply curves. 
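
The point that the net difference can be derived without the absolute figures may be sketched as follows. The item names and magnitudes are hypothetical placeholders, not estimates from the record, and the two helper functions are introduced here only for illustration. The sketch assumes that each burden and benefit can be expressed in the same income units.

```python
def net_burden(burdens, benefits):
    """Estimated burdens minus estimated benefits, in common income units."""
    return sum(burdens.values()) - sum(benefits.values())


def income_gap(base_income, burdens, benefits):
    """Counterfactual income minus actual income for an assumed income level.

    Actual income is taken as `base_income`. The counterfactual removes the
    burdens (raising income) and the benefits (lowering income).
    """
    actual = base_income
    counterfactual = base_income + sum(burdens.values()) - sum(benefits.values())
    return counterfactual - actual


# Hypothetical magnitudes in arbitrary units, not historical estimates.
burdens = {"tobacco": 40, "rice": 10, "imports routed through Britain": 15}
benefits = {"shipping income": 20, "defense": 30}

# Whatever level of income is assumed, the gap is the same, so the unknown
# absolute figures drop out of the comparison.
for assumed_income in (1_000, 5_000, 20_000):
    assert income_gap(assumed_income, burdens, benefits) == net_burden(burdens, benefits)

print(net_burden(burdens, benefits))
```

Because the unknown level cancels, the research effort can be concentrated on the individual burdens and benefits, each of which can be approached with its own proxy.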

The further back in time the economic historian 
explores, the more inadequate the data are likely 
to be. However, the quantitative information may 
be far more abundant than economic historians 
have heretofore believed, since hypotheses dictate 
the search for data, and it is only in recent years 
that self-conscious employment of theory has char- 
acterized research in the field. The quantitative in- 
formation available about the past has usually not 
been mined because its relevance has not been ap- 
preciated. In the absence of quantitative data, the 
economic historian is forced to fall back on the use 
of qualitative description embodied in other kinds 
of information; but it should be noted here that he 
does not thereby escape the essential rules of sta- 
tistical inference. That is, it remains imperative 
that he demand that the qualitative information 
meet the same rules of statistical sampling and 
representativeness required in the use of quanti- 
tative knowledge. In this respect, Clapham’s warn- 
ing in the article on economic history in the 
Encyclopaedia of the Social Sciences is still timely. 


Every economic historian should, however, have ac- 
quired what might be called the statistical sense, the 
habit of asking in relation to any institution, policy, 
group or movement the question: how large? how 
long? how often? how representative? The require- 
ment seems obvious; but a good deal of the older 
politico-institutional economic history was less useful 
than it might have been through neglect of it. (1931, 
p. 328) 


The writing of economic history. While ex- 
planation of the economic past is the ultimate ob- 
jective of the economic historian, and an aware- 
ness of the principles of scientific method is an 
essential requirement in the pursuit of this objec- 
tive, characterization of the discipline solely in 
these terms would give a distorted picture of the 
field. The traditional craft of the historian—the 
careful unearthing of evidence and the assessment 
of its reliability—is fully as important in economic 
history as it is in general history. The present-day 
scholar has inherited a rich store of descriptive 
material and data about the economic past which 
has been mined and assayed largely by historians. 
The present-day analytically oriented economic 
historian who ignores this treasure-trove from the 
past does so at the risk that he will be unaware of 
essential background conditions when he constructs 
his model. It is incumbent upon him not only to be 
thoroughly versed in the traditional literature in 
the field but also to be possessed of that fine sense 
of the detective, which has always been the trade- 
mark of the good historian. 

Finally, the economic historian is attempting to 
provide a systematic and integrated explanation of 
the economic past, and this inevitably involves 
something more than the development and testing 
of a single hypothesis. It is a relatively easy task 
for any well-trained economic historian to test (and 
typically in recent research to destroy) a specific 
explanation about the past, but it is well for him 
to remember that the ultimate objective he seeks 
is far more difficult—to construct a unified expla- 
nation of the economic past. This involves the de- 
velopment of a set of consistent hypotheses together 
with the essential background conditions, both 
woven together in the fabric of a narrative. Such 
a story not only possesses the characteristics of 
good narrative but also makes clear the essential 
background conditions and states clearly the hy- 
potheses involved, so that it is consistent with the 
principles of scientific explanation and so that its 
several parts can be tested by other economic his-
torians.
DOUGLASS C. NORTH


[Directly relevant are the biographies of CLAPHAM; 
HECKSCHER; SCHUMPETER. For discussion of the 
approaches of some other economic historians, see 
the biographies of ASHLEY; BÜCHER; CUNNINGHAM; 
EHRENBERG; HAMMOND, J. L. AND BARBARA; LEVAS- 
SEUR; PIRENNE; POLANYI; ROGERS; SÉE; TAWNEY; 
UNWIN; USHER; WEBB, SIDNEY AND BEATRICE;
WEBER, MAX.]

VIII
BUSINESS HISTORY 


Although business history in the broadest sense 
encompasses all the activities of businessmen in 
the past, the academic discipline, as developed by 
historians, has certain distinguishing characteris- 
tics. To date it has been primarily concerned with 
the written record of decision making by individ- 
uals seeking private profit through production of 
goods and services. 

Business history in practice rests basically on 
one assumption and one derivative from it. The 
key assumption is that man enjoys a measure of 
freedom of will and, accordingly, that his individ- 
ual decisions affect the course of historical events. 
From that is derived the conviction that human 
decisions, made with an eye to producing profit, 
have significantly shaped the stream of economic 
and social change over a long period of time and 
in many parts of the world. 

In business history, change is regarded as con- 
tinuous, interacting in character, variable in rate, 
and open-ended—but always initiated by man. 
Through a complex of interrelated decision-making 
processes, businessmen are seen to have contrib- 
uted, together with other individuals and groups in 
society, to the generation of changes in their en- 
vironment, both internal and external to their own 
institutions. 

Certain methodological characteristics of busi- 
ness history follow from the foregoing assumptions 
and ideas. It emphasizes microeconomic elements 
in the past more than its parent discipline, eco- 
nomic history, and concentrates more on the proc- 
ess of change and the generation of change. To 
date, business historians have employed impersonal 
analysis of economic performance in the past less 
than those who have utilized the techniques of 
economists to evaluate trends in quantitative terms 
or to “fill in” gaps in historical knowledge. In fact, 
the tools of sociology, anthropology, and psychol- 
ogy are frequently as relevant to the questions busi- 
ness historians ask of data as is economic analysis, 
particularly with reference to motivation of men 
and their relation to the society in which they live. 
Concentration on businessmen as decision makers 
and as builders or destroyers of institutions, as well 
as on the ideas and accumulated knowledge affect- 
ing the place, timing, and conduct of business ac- 
tivities, has, of course, also differentiated business 
history from economic history. 

The tools employed by the professional business 
historian depend on his objective and on the ap- 
proach to the businessman that he selects. Each 
major approach involves a different context for 
viewing businessmen and business of the past. 
Concern with the businessman as an individual in 
society is the approach of the biographer of a busi- 
nessman. Viewed in the framework of one or more 
organizations for the production of profit, with all 
that this implies for policy formulation and imple- 
mentation, the businessman is the province of com- 
pany histories and industry studies. Some students 
have chosen the functional areas of business devel- 
opment, such as finance, production, and market- 
ing, as the subjects of historical research. Others 
have emphasized the implements and institutions 
of business. Still other scholars have concentrated 
their research on the interaction between business- 
men and their environment in terms of its influ- 
ence on developments both inside and outside the 
world of business. 

These varied approaches indicate that business 
history embraces many diverse areas of study, ac- 
commodates many interests, and calls on numerous 
disciplines. It therefore attracts researchers from 
many areas of history, economics, and other schol-
arly disciplines, as well as those whose qualifica-
tions are based on other credentials. In the
latter category are amateur historians with busi- 
ness experience and publicists employed to write 
business history for nonacademic purposes. Al- 
though a number of these contributions meet good 
scholarly standards and many contribute useful in- 
formation, most significant works in business his- 
tory are produced by scholars trained in one or 
more of the traditional disciplines. By no means 
all of this latter group, however, would claim to be 
professional business historians. In many instances 
their forays into the field are one-time expeditions, 
using the subject matter of business history to ex- 
plore an area related to their primary interests. 

Professional historians both benefit and suffer 
from the amorphous nature of their field. They 
have benefited most from outsiders’ contributions 
to such areas as the theory of the firm and have 
suffered most from the popular identification of 
their field with histories of firms written by ama- 
teurs without scholarly standards. In part, this has 
been the price of progress, for business history has 
matured slowly as an academic discipline and un- 
til recently has concentrated on the study of busi- 
ness administration through the medium of com- 
pany histories. 

Since professional business historians have been 
located chiefly in schools of business administra- 
tion, it is not surprising that the variety of their 
approaches to research and teaching has been 
governed in large measure by the applications that 
could be made of their work in such institutions. 
As the curricula of schools of business have broad- 
ened from functional specialties to such larger pre- 
occupations as the responsibilities of businessmen 
in society and the challenge of undeveloped and 
underdeveloped areas to private enterprise, new 
applications of business history have been found. 
Some of these coincide with objectives of historians 
concerned with all phases of social and economic
change. In many business history courses, histories 
of firms have been supplemented by industry his-
tories, by studies of the leadership styles and per-
sonal qualities of businessmen in the past, by anal-
yses of government-business relations viewed in
broad perspective, and by increasing concern with 
the historical problems and lessons of economic 
growth. Thus the field of business history has been 
in the process of change since it was first recog- 
nized as an academic discipline. 


Initial development by Gras at Harvard 


Business history began as an area of academic 
research and teaching at the Graduate School of 
Business Administration of Harvard University. 
Dean Wallace B. Donham believed that scholarly 
histories of “specific situations as they came to bus- 
inessmen in their communities in the past” were 
essential in order that those situations might be 
compared “understandably” with “current condi- 
tions” (Redlich 1962, pp. 61-62). Donham had in 
mind the use of business history for training pros- 
pective business administrators, the utilization of 
the case method, and the comparison of past tech- 
niques, decisions, and their implementation with 
those of the present. 

To initiate such teaching, research, and writing, 
in 1927 Donham chose N. S. B. Gras, a scholar 
already manifesting an interest in the role of bus- 
iness and businessmen in history. In his classes at 
the University of Minnesota he had encouraged 
students to do research in this field and had em- 
barked on assessing business activities in several 
areas. He was trained in economic theory as well 
as economic history, and he had read widely in 
the literature of sociology and social theory. 

Given his training and experience, Gras at first 
visualized business history quite broadly. From his 
reading in social, political, and economic history 
he gathered data on the environments in which 
businessmen had operated, as well as information 
on the creation and evolution of business instru- 
ments and institutions. Gras accepted the concept 
of capitalism and made the study of the evolution 
of capitalism one of his major concerns. But neither 
in the publications of economic theorists and eco- 
nomic historians nor in the works of Marx, Engels, 
Sombart, and others did he find a satisfactory ex- 
planation of the changing character of capitalism 
over time. While recognizing that environmental 
factors influenced businessmen to some degree, he 
rejected economic determinism; he remained con- 
vinced that men have enjoyed some freedom of 
will and that, by choosing courses of action from 
a range of alternatives, they have changed the
course of history. 

Accordingly, to understand the process of change 
in the business sector of society, Gras believed 
scholars must study and analyze the decisions of 
men reflected in the policies and practices of firms, 
the basic units of business systems. In the studies 
of Richard Ehrenberg (1902-1905) he had ex- 
amples of how meaningful histories of firms could 
be written. From such biographies of firms, written 
and to be written, Gras hoped to be able to learn 
how capitalism had evolved as a system. 

With these ideas in mind, Gras embarked on an 
ambitious program for the development of business 
history at the Harvard Business School. With the 
aid of associates, notably Henrietta M. Larson, he 
started developing cases for the course he taught. 
That offering embraced discussion and analysis of 
activities of businessmen from the European Mid- 
dle Ages to the twentieth-century United States; 
lectures on background materials alternated with 
class discussion of both specific and general situa- 
tions. At the same time, to encourage publication 
of the current results of research, Edwin F. Gay, 
then a professor at Harvard, as editor, and Gras as 
managing editor, began issuing the Journal of Eco- 
nomic and Business History (JEBH) in November 
1928. Some members of the group also undertook 
book-length biographies of businessmen and firms. 

Soon after the beginning of this program, eco- 
nomic and other factors induced Gras to modify 
his approach to the new area of study. The finan- 
cial crisis of 1929 and ensuing depression brought 
a drastic diminution in supporting funds. The 
JEBH ceased publication in 1932, a year after Gay 
and Gras disagreed on editorial policies and the 
latter became sole editor; not until 1938 did the 
business history group at the Harvard Business 
School assume responsibility for issuing the Bulletin 
of the Business Historical Society, known as Busi- 
ness History Review (BHR), a quarterly since 1954, 
and it was a much less ambitious periodical than 
the earlier one. Meanwhile, money to underwrite 
research in depth could be found only in limited 
amounts, often restricted to study of the families 
and firms providing the financial support. Simul- 
taneously, the use of the case method in the course, 
as well as the lack of published information on the 
decision-making process and the policies of busi- 
nessmen, fortified Gras’s tendency to concentrate 
on biographies of businessmen and of firms. 

In spite of serious difficulties during the 1930s, 
Gras and his group established the contours of the 
new field. They, as well as others, published their 
research findings in both the JEBH and the Bulle- 
tin. In the Harvard Studies in Business History, 
under the editorship of Gras, appeared books deal- 
ing with merchants, an investment banking firm, 
a commercial bank, and an advertising agency. In 
1939 Gras and Larson published the teaching ma- 
terials assembled for the course—Casebook in 
American Business History—and Gras issued his 
preliminary synthesis of business history in Busi- 
ness and Capitalism. 

In little more than ten years Gras had modified 
his concepts and had put a particular stamp on 
business history. Instead of realizing his early ex- 
pectation of studying and writing the history of 
business within a broad political, social, and eco- 
nomic framework, he gradually came to visualize 
the field more narrowly. To him, business history 
became “primarily the study of the administration 
of business units in the past,” administration being 
in two parts—policy formulation and management 
or execution of policy (Gras & Larson 1939, p. 3). 
From such statements, as well as the subject mat- 
ter of the books published, many observers adopted 
the idea, unfortunately not yet fully abandoned, 
that to Gras business history was “company his- 
tory” and nothing more. 

Actually, Business and Capitalism indicated that 
Gras was seeking, through business history, an 
explanation of the changes in the character of pri- 
vate capitalism over a thousand years. Influenced 
by Karl Bücher and others, Gras related the evolu-
tion of stages in capitalism to changes in business
systems, identifying each stage with dominant busi- 
ness types and groups—petty, mercantile, indus- 
trial (specializing), financial, and national. When 
he started writing the book, he did not think all the 
stages mentioned actually fitted the history of pri- 
vate enterprise in every national economy, but after 
its publication the idea became more than a tenta- 
tive hypothesis in his mind. To supplement Busi- 
ness and Capitalism, he began writing a multivol- 
ume history of industries in the United States, a 
task never completed. However, Larson published 
her Guide to Business History in 1948, and the 
Business History Foundation, Inc., an organization 
chartered in New York by Gras and Larson in 1947 
to forward research and writing on the history of 
business (chiefly on that of large-scale enterprise), 
is still active. 

Subsequent broadening of approaches 

Starting with the Casebook and Business and 
Capitalism as the bases of courses in other insti- 
tutions, historians soon began to utilize a variety 
of approaches to the history of business. Some ac- 
cepted Gras’s ideas generally, but increasingly used 
his stage theory for comparative purposes and not 
as a rigid framework for analyzing the history 
of businessmen and business institutions. Other 
scholars reacted more critically, explicitly and im- 
plicitly, to Grasian thought. Some regarded his 
stage theory as outmoded; they considered it too 
rigid and not sufficiently effective as an analytical 
tool. Others were convinced that his definition of 
the subject was too narrow; they thought the his- 
tory of business and businessmen was more than 
the history of business administration and that 
Gras gave too little attention to the motivation and 
environment of businessmen. Many disagreed with 
the broad generalizations, especially those on the 
period since 1870, which Gras made on the basis of 
research by his group and by predecessors among 
economic and social historians. Still others sought 
a less positivistic and more theoretical base for 
analyzing the behavior and achievements of busi- 
nessmen in history. Almost all were more interested 
in analyzing the activities of businessmen or in 
seeking explanations of changes in business than 
in the history of capitalism as such. 

Entrepreneurial studies. One group of scholars,
the most influential, sought understanding of 
change in the history of business through the con- 
cept of entrepreneurship. Arthur H. Cole reviewed 
the historical changes in that concept in his pres- 
idential address to the Economic History Associa- 
tion (1946). A year later, in a paper presented to 
the same body, Joseph A. Schumpeter built onto 
the idea of innovation that of the creative entre- 
preneur as the main force in generating change 
in business (1947). Jointly the two initiated the 
Research Center in Entrepreneurial History at 
Harvard in 1948. Led by Cole, and ably supported 
by Thomas C. Cochran, Leland H. Jenks, Fritz Red- 
lich, and several others, the group brought together 
numerous scholars to enunciate their ideas and to 
discuss historical tools and techniques as well as 
concepts. Over a ten-year period the center helped 
to train a number of young historians and pub- 
lished many of the products of discussion and re- 
search in Explorations in Entrepreneurial History. 

Cole has provided his personal interpretation of
the significant findings of the center (1959). First, 
he assessed the relationship of the entrepreneur to 
his organization, to the process of social condition- 
ing, and to elements important “for the proper 
functioning of an entrepreneurial flow.” Then he 
turned to analysis of five categories of “entrepre- 
neurial realities” drawn from historical data: the 
social order, underdeveloped areas, technological 
change, business organization, and the state. Each 
of these had three subsegments of illustrative mate-
rial which not only summarized existing substan-
tive knowledge about types of entrepreneurial
activity but also presented other historians of busi- 
ness with examples which could be utilized in class, 
tested by research, and added to as the study of 
businessmen and their institutions continued. 

Even before Cole’s synthesis appeared, both older 
scholars and a new generation of historians of busi- 
ness began to fuse the center's products with those 
of the Grasian group. Some welcomed the center's 
analysis of business history in a broad sociological 
as well as economic context. Many noted with ap- 
proval that Cole defined entrepreneurship as “the 
purposeful activity (including an integrated se- 
quence of decisions) of an individual or group of 
associated individuals, undertaken to initiate, main- 
tain or aggrandize a profit-oriented business unit 
for the production or distribution of goods and 
services” (1959, p. 7). That definition, business 
historians generally thought, embraced both policy 
formulation, which they now visualized as closely 
related to Schumpeterian creative entrepreneurship, 
and management (implementation of policy), 
which was seemingly included in an “integrated 
sequence of decisions.” 

In addition to concepts, ideas, and information 
adduced by the Gras and Cole groups, business 
historians have turned to other disciplines for tools 
and techniques. From writers on economic growth 
(Hirschman 1958) and on the theory of the firm 
(Boulding & Spivey 1960), as well as from anthro- 
pologists, sociologists, and social psychologists 
(Hagen 1962), scholars interested in the history 
of business have borrowed and tested concepts, 
theories, and research methods. They have been 
particularly interested in any study dealing with 
motivation of the businessman and his “need of 
achievement” (McClelland 1961). 

In the 1950s and 1960s research continues to 
reflect the mixture of approaches noted early in this 
article. Some scholars focus on a businessman as 
an individual in society (see, for example, Nevins 
1953). There have been numerous semipopular 
histories of firms (see, for example, Lief 1958; 
Blochman 1958). In some instances histories of 
firms written by insiders have been distinguished 
by thorough research and a comprehensive analysis 
(see, for example, Beaton 1957). 

Studies of firms and industries. Since 1950 
academicians have added significantly to the list 
of firms that have been studied in detail. Several 
large firms have submitted to detailed scholarly 
portraits. In large measure these works exhibit the 
increased sophistication arising from the fusion of 
ideas noted earlier. The most distinguished study 
of the much-examined late medieval period deals
with the rise and decline of the Medici Bank (de 
Roover 1963). In their histories of large firms some 
students have focused their attention on the re- 
sponse and adjustment of businessmen to changing 
environment (Cochran 1948). One outstanding 
study is devoted to grand strategy on a global scale 
(Wilson 1954). Noteworthy have been the analyses 
of policy and its implementation on the part of 
firms in the petroleum industry, in particular those 
sponsored by the Business History Foundation 
(History of Standard Oil..., 1955-1956; Larson
& Porter 1959). Recent histories of American rail- 
road companies tend to present them broadly as 
common carriers operating for a profit within an 
environment experiencing rapid change (Overton 
1965). Among studies of financial institutions, sev- 
eral life insurance companies have received detailed 
attention (Williamson & Smalley 1957). A mid- 
western historian has provided the best portrayal 
of a public utility (Miller 1957). 

The history of small business has attracted fewer 
scholars than has that of large-scale enterprise. 
Only rarely has a small firm been appraised in de- 
tail (for one example, see Marburg 1956). More 
attention has been given to generalizations based 
on a study of a significant sample of firms within 
an industry, such as that of metal fabricators in 
New England (Soltow 1965). 

Few historians have attempted histories of en- 
tire industries in recent years. Among the few ex- 
amples are those on rubber (Woodruff 1958) and 
brewing (Mathias 1959), both within the British 
economy, and, outstanding in scope and economic 
analysis, that on the American petroleum industry 
—the authors having at hand at the inception of 
the research numerous histories of firms as well as 
special studies (Williamson & Daum 1959-1963). 

Recent trends. A growing number of business 
historians have centered their interest on what has 
been labeled “business in history.” They seek to un- 
derstand the interaction between businessmen and 
business institutions on the one hand and pertinent, 
influential segments of the environment on the 
other. They are concerned with analyzing both the
process and the results of the process. Most prom- 
inent in this area have been studies of government— 
business relationships, well exemplified by many 
articles found in the BHR (and elsewhere). For example,
one author has focused on the roles of business
institutions and values in relation to evolving
political institutions in Africa, Asia, and Latin 
America (Robinson 1962). Another has evaluated 
the relationship of business to the emergence of
the Nazi dictatorship (Schweitzer 1964). A third
has assessed the connection between a defensive 
national economic policy, coupled with a desire for 
political unity, and Canadian regulation of business 
(Aitken 1964). A similar but more specific ap- 
proach to the history of government—business rela- 
tions is exemplified in a study of the evolution of 
American petroleum pipelines and related public 
policy (Johnson 1956).

Comparative studies in business history have 
become more common in recent years, providing 
new and meaningful interpretations for business 
historians. Most noteworthy among studies dealing 
with early industrialization is the pioneer mono- 
graph on managerial techniques of British entre- 
preneurs (Pollard 1965). One author has analyzed 
causation and cycles in centralization and decentralization
in administration of large-scale enterprise
(Chandler 1956; 1959; 1962; 1965). Another
has studied evolution of systematic methods for 
coordinating production flows (Litterer 1961a; 
1961b; 1963). Still another can be cited for his 
evaluation of evolving financial reporting by Amer- 
ican corporations within a changing environment 
(Hawkins 1963). One of the most significant books 
analyzes the attitudes and ideas of a broad group 
of railroad executives (Cochran 1953). 

Most of these recent articles and books manifest 
more refined analysis than characterized the writ- 
ten history of business in earlier years, but concep- 
tualization of thought in the field has grown slowly 
and theoretical works have been few. Fritz Redlich 
has written more in this area than has anyone else, 
and his essays dealing with the entrepreneur have 
now been collected (1964). His contributions range 
from an analysis of the “daimonic” in business his- 
tory to categorizations of entrepreneurial types. To 
date, only one author has attempted to advance a 
theory of the growth of the firm (Penrose 1959). 

No widely accepted synthesis of business history 
has yet been achieved, even for the United States, 
the country in which literature on the subject is 
most voluminous. To be sure, some narrations of 
national business achievement have been produced 
(Chamberlain 1963; Walker 1949), and some periods
in the history of business have been objectively
appraised (Cochran 1957).

In spite of vastly increased study of the history 
of business and of business in history, scholars still 
evince a marked lack of interest in numerous im- 
portant topics. Business failures, labor-manage- 
ment relations, small business, and other likely 
subjects are receiving little, if any, attention. Moreover,
although the preliminary moves mentioned
above have been made, attempts to conceptualize
and to theorize on the history of business have been 
few. 

Formal courses in business history have been de- 
veloped slowly but steadily. By the mid-1960s more 
than three-score universities, colleges, and schools 
of business in the United States listed courses in 
the field, but the focus on topics involving business 
history was their single common denominator; 
content and emphasis varied widely. Meanwhile, 
academicians in other countries, notably in the 
United Kingdom, Holland, West Germany, Aus- 
tralia, and Japan, have inaugurated research and/ 
or teaching programs in business history, bringing 
the world total of institutions sponsoring such ef- 
forts to more than one hundred. Participants in 
such activities find continuous additions to their 
body of information in the Business History Review, 
Tradition (published in West Germany since 1956), 
Business Archives and History (published in Aus- 
tralia since 1956), Business History (published in 
the United Kingdom since 1958), and pertinent 
articles in many other periodicals. 

The broadening of horizons noted in this article 
has destroyed the exclusiveness of the original 
small pioneering band of professional historians 
and to a degree has outmoded the original frame- 
work that once gave their study of the subject great 
unity. The results of this change are still far from 
clear, and the need for a new synthesis has become 
increasingly apparent. Nevertheless, the governing 
assumptions about the nature of economic change 
and the significance of the businessman in it still 
hold. The increasingly varied approaches and ap- 
plications of business history are in the tradition 
of the evolutionary development that has characterized
the field since it was first recognized as an
academic discipline.
Ralph W. Hidy


[See also ENTREPRENEURSHIP; and the biographies of
GRAS; SCHUMPETER.]
Harald Hjärne (1848-1922), Swedish historian
and political writer, came from a family of the 
nobility. He studied at Uppsala University, where he 
became assistant professor of history in 1872 and 
professor of history in 1889. His historical works 
fall into three groups. Those written between 1872 
and 1876 deal mainly with old Germanic and medi- 
eval law and social conditions; those of the 1880s 
deal with the history of Russia and connections 
between Russia and Sweden; and those from 1890 
on deal with the Reformation and the seventeenth 
century in Sweden. His numerous political essays 
and articles exerted an important influence on con- 
servative and right-wing liberal thinking, especially 
during the 1890s. Hjärne was a member of the 
second chamber of the Riksdag from 1903 to 1908 
and a member of the first chamber from 1912 
to 1918. 

When Hjärne first emerged as a political writer, 
around 1875, Swedish politics was characterized 
by economic controversies about defense and taxa- 
tion on the one hand and by a blurring of ideologi- 
cal positions on the other. The old conservative 
view of a society based on the estates had dis- 
appeared since the abolition of the four-estates 
Riksdag in 1865. The conservatives then tended to 
merge with the successful right-wing liberals, who 
in turn were becoming more conservative, intent 
on maintaining their conquests. This synthesis of 
conservatism and right-wing liberalism charac- 
terized most political argument and also colored 
the philosophical doctrine prevalent at the uni- 
versities—the political idealism of Christoph Jacob 
Boström. 

Hjärne cast off Boström’s philosophy, with its 
idealization of the four-estate Riksdag and its 
liberal tendency to assert the rights of the individual
and private interests. He also moved away
from the national liberalism of his family, replac- 
ing this by a conservative philosophy of his own. 
According to this political philosophy, the func- 
tions of the state are purely juridical; on this 
point Hjärne agreed with classical liberalism and 
Boström’s ideas, but private interests have no place 
in his system. The only object of the rights granted 
by the king, as representative of the state, to his
subjects is to enable them to fulfill their obligations
toward the state. The greater the personal
contribution of the individual, the greater the 
rights to which he is entitled. Representation 
should be based on these principles and thus be 
regarded as a means of public administration, not 
as a tool for private interests. Hjärne himself main- 
tained that his philosophy stemmed entirely from 
ancient Swedish law and the Swedish constitution 
under the Vasa kings (1875; 1876). Actually, 
however, all the essentials came from the German 
historian and right-wing liberal politician Rudolf 
von Gneist, whose works on English constitutional 
history Hjärne encountered early in the 1870s 
(Gneist 1857-1860). Gneist saw local self-govern- 
ment as the main way of fulfilling personal obli- 
gations toward the state, but Hjärne considered 
military service to be the most important form of 
such obligations. The great stress on duty in his 
philosophy reflects his contempt for the kind of 
haggling about defense and taxation carried on by 
the economic group interests in the Riksdag. 

In the 1880s there arose a partly new, more 
extreme form of conservatism. Strongly national- 
istic, protectionist, and decidedly antidemocratic 
in outlook, its adherents opposed the growing demands
by Norway for equality and greater independence
within the Union. At first, Hjärne’s work
reflected this outlook, but he rejected it after 1890, 
when its supporters threatened to become too 
powerful. It was then that he launched his own 
conservative reform program, “Defense and Re- 
forms.” In accordance with his early ideas about 
the connection between duties and rights he de- 
manded the franchise for all citizens who had 
completed a year’s military service, regardless of 
their income and property. He repudiated the Con- 
servative party and its aggressive Union policy, 
calling himself a right-wing liberal. He retained 
his basic conservative antiparliamentary philoso- 
phy, however. 

Hjärne’s historical works helped to foster the
patriotic trend in the cultural life of the 1890s, 
and at times he himself succumbed to the nation- 
alist ethos. At the same time he sharply criticized 
the principles of nationalism and race prejudice 
and supported the stand of England and the United 
States against Germany (1903; 1908; 1932-1940). 

Troubled by the growing threat of war and pre- 
sumably also by his own lack of success in the 
Riksdag, around 1908 Hjärne began to develop a
strongly antidemocratic outlook, maintaining that 
defense and foreign policy should take precedence 
over domestic reforms. As in the 1880s, he again 
considered Germany to be Sweden’s potential ally 
against the hereditary enemy, Russia. In this belief
he anticipated the reactionary trend that once more 
dominated conservative thought after the liberals 
came to power in 1911. 

Unlike some other conservative thinkers, Hjärne
avoided identifying himself with dogmatic poli- 
cies and fatalistic conceptions about inevitable 
historical laws, and he was able to accept with 
equanimity the defeat of Germany and the victory 
of democracy. Primarily through his program “De- 
fense and Reforms” he was a precursor of the 
moderate line, which was to inspire most conserva- 
tive political activity during the following decades. 
Thomas Hobbes (1588-1679), English philosopher
and political theorist, was one of the originators
of the new mathematico-mechanical view
of the world established in the seventeenth century 
through the interaction of philosophic reflection 
and natural science. While Descartes was the most 
influential thinker in the movement which gave 
rise to modern philosophy, Hobbes was equally rep- 
resentative of it and scarcely less important. Dur- 
ing the 1630s Hobbes arrived, independently of 
Descartes, at the formulation of the mechanical 
conception of nature; he had also reached, inde- 
pendently not only of Descartes but also of Galileo, 
the notion of the subjectivity of sensible qualities 
such as light and color (Brandt 1921). Like 
Descartes, Hobbes rejected scholasticism as a philosophy
of sterile disputation and held that mathematical
reasoning must provide the model of
philosophic method. His thought differed from 
Cartesianism, however, in its thoroughgoing mate- 
rialism. In Hobbes’s view, bodies are the sole exist- 
ents, and the sole cause of phenomena lies in the 
diversity of corporeal motions. Even more than cor- 
poreality, motion is the governing idea in Hobbes’s 
materialist metaphysics, and he employed it with 
impressive ingenuity as the universal explanatory 
principle in his mechanistic account of the world. 

Hobbes also made an important contribution to 
the establishment of psychology as a field of em- 
pirical inquiry. His interest in psychology derived 
from his preoccupation with sensation and imagery. 
Realizing that mental occurrences—i.e., the phe- 
nomena of experience itself—require explanation, 
he provided one which treated them as material 
movements in the brain. Thus, sense, according to 
him, “is but an apparition unto us of that motion, 
agitation, or alteration, which the object worketh 
in the brain, or spirits, or some internal substance 
in the head” ([1640] 1928, p. 4). He left unclear 
whether imaging is identical with certain minute 
motions in the brain or a concomitant of them. His 
analysis of mental phenomena has a reductionist 
tendency, in that it loses sight of what is dis- 
tinctively psychological in its description of phys- 
ical or physiological causes. His account did, 
nevertheless, adumbrate the possibility of a sci- 
entific psychology in which subjects such as sensa- 
tion, dreaming, and imagery could be investigated 
by the same methods used in the sciences of nature. 

Hobbes’s most enduring intellectual achievement 
is his theory of politics. The first expression of
his political interests, however, was historical and 
classical, rather than philosophical. The son of a 
Wiltshire clergyman, he began his classical educa- 
tion as a child and was sent to Oxford at 14. Fol- 
lowing his departure from the university with the 
B.A. degree in 1608, he became tutor to the son of
William Cavendish, Lord Hardwick, who was later 
created earl of Devonshire. This was the commence- 
ment of a lifelong connection with the Cavendish 
family, to whose friendship and patronage he owed 
the support that permitted him to pursue his ca- 
reer as philosopher. In 1629, after many years 
spent mainly, it would seem, in the study of classi- 
cal literature, he published his first book, a transla- 
tion of Thucydides’ History. Hobbes conceived 
history's principal business to be the instruction of 
a governing class in political prudence. For this 
purpose he thought Thucydides’ masterpiece unexcelled.
It is highly suggestive that of all the ancient
historians he should have preferred the writer
who is the most naturalistic in his inquiry into the 
causes of events as well as the most profound 
analyst of the derangement of political life, and 
even of the meaning of words, which results from 
civil strife. 

When the Thucydides translation appeared, 
Hobbes had already turned his attention perma- 
nently to philosophy. Until the age of 40 he knew 
nothing of geometry; but about 1629, according to 
the curious story related by Aubrey, his contempo- 
rary biographer, he came by chance on a copy of 
Euclid’s Elements. From the study of Euclid he 
perceived the demonstrative certainty attainable 
through geometrical reasoning (Aubrey 1898). He 
held thenceforth that truth is a matter of the laying 
down of clear definitions and the correct deduction 
of all their consequences. 

In the 1630s Hobbes visited the Continent sev- 
eral times, where he formed some important in- 
tellectual friendships and became au courant with 
the most recent philosophical and scientific de- 
velopments. His earliest surviving philosophical 
composition (A Short Tract on First Principles),
a work framed on strictly deductive lines, dates
from this period. 

This was the time also when the revolution 
against the government of Charles I was brewing.
The beginning of rebellion in Scotland and Eng- 
land at the close of the 1630s evidently stimulated 
Hobbes to develop his political doctrines. There- 
after, wrote Aubrey, “for ten yeares together his 
thoughts were much, or almost altogether, un- 
hinged from the Mathematiques” and intent chiefly 
on the philosophy of politics (Aubrey [1898] 1957,
p. 151). The result was the composition of three
works that contain the substance of his political 
theory: The Elements of Law (1640), De cive 
(1642a), and Leviathan (1651). In addition, 
Hobbes included observations of great importance 
in philosophical method and the nature of political 
philosophy in Elementorum philosophiae sectio 
prima: De corpore (1655). He also wrote two 
minor works expressive of his political views, both 
published posthumously: Behemoth (1679), an 
account of the English Civil War, and A Dialogue
. . . of the Common Laws of England (1681a).

Hobbes’s approach to political theory reflects the 
spirit of the contemporary scientific movement. In 
his view, the state forms one department in a tri- 
partite division of philosophy, the whole of which 
centers on body. According to this classification, 
philosophy deals first with body under its simplest 
and most general aspects; next, with man as a 
natural body of a particular kind; finally, with the
commonwealth as a type of artificial body contrived
by reason. 

Hobbes claimed to be the founder of politics as 
a science and boldly compared himself with such 
great inaugurators as Copernicus, Galileo, and 
Harvey. The meaning of this self-confident asser- 
tion is connected with his conception of method. 
The mode of proceeding in philosophy, he held, is 
exclusively by way of cause and effect; hence, to 
reason philosophically means either to demon- 
strate the effects of a phenomenon from its known 
causes or to demonstrate its causes from its known 
effects. He conceived of cause not as a necessary 
connection between occurrences verified experimentally
but, in a purely intellectualistic-deductive
sense, as a hypothetical explanation attained by 
correct reasoning and consistent with ordinary ex- 
perience. In the study of natural phenomena, he 
believed man can attain only to a knowledge of 
possible causes; by contrast, in the study of the 
commonwealth, an artificial body contrived by hu- 
man reason, man can establish causes with cer- 
tainty. Indeed, he went so far as to compare civil 
philosophy with geometry in this respect. Just as 
geometry is demonstrable because men themselves 
define its figures, so “civil philosophy is demon- 
strable, because we make the commonwealth our- 
selves” ([1656] 1962, p. 184). More specifically, 
the definitions of just and unjust, law, covenant, 
etc., on which the political order rests, derive from 
human invention and agreement. To found politics 
as a science would therefore mean to propound 
definitions from which the generation of the com- 
monwealth and the rules necessary to its being 
are strictly deducible. This is what Hobbes thought 
he had accomplished. 

Hobbes began his political theory with an anal- 
ysis of human nature. He portrayed man as a 
creature of incessant activity who can find no rest 
in any final end. Within this stream of activity man 
pursues specific ends. What man desires, he calls 
good; what causes him fear or aversion, he calls 
evil. Thus, good and evil possess diverse meanings 
according to men’s purposes. In so describing how 
men form their notions of good and evil, Hobbes 
was not endorsing a relativistic conception of moral 
judgment; indeed, it was a cornerstone of his po- 
litical theory that men must concur in certain 
common definitions if they are to achieve what 
they all evidently want, namely, self-preservation, 
the condition of any activity whatsoever. Nor did 
Hobbes blame man for or accuse him of being the 
self-centered creature he is. He held that political 
philosophy must take human nature as its datum if 
it is to show how peace and community can become
possible to creatures who necessarily refer
all things to their own single selves. Hobbes’s poli- 
tics are therefore linked to his psychology. The 
task he assigned reason is not to conquer or ex- 
tinguish passion—an impossibility anyway—but to 
instruct it. Reason will teach passion what it must 
refrain from and what rules it must accept in order 
to attain its ends of self-preservation and, beyond 
that, of well-being and the commodities of civi- 
lized life. 

It was when Hobbes shifted from man in general 
and in the abstract to the state of nature, where 
allowance must be made for the coexistence of 
many men, that he confronted the political prob- 
lem proper, namely, how to secure peace and order 
among a multitude. Whether a state of nature ever 
existed historically was of little importance to 
Hobbes. For him it represented the hypothetical 
alternative to the commonwealth and sovereignty 
and was therefore a condition in which men in 
the pursuit of their diverse ends are subject to no 
power other than that which they can casually 
impose on one another. Hobbes intended to think 
away civil society in order to picture life in the 
absence of a coercive political order. What he dis- 
played was a state of endless and oppressive inse- 
curity, a war of all against all, where nothing is 
anyone’s with certainty, in which the notions of 
just and unjust can have no place, and where each 
literally has a right to everything. 

Amid this very anarchy Hobbes discerned the 
basis of natural right in man’s desire to live. As 
all men seek their own good, they also naturally 
shun death, the chief of evils. Reason accordingly 
dictates, said Hobbes, that men should attempt 
by all means to preserve themselves. Whatever is 
done according to reason is “done justly, and with 
right” ([1642a] 1949, pp. 8-9). Natural right was 
considered by Hobbes as antecedent to the political 
order: it is a moral claim logically derived from 
the premise of all human activity and passion, 
the wish to live. Thus, when reason teaches that 
to secure themselves men must renounce the 
liberty of the state of nature, the route from 
anarchy to the commonwealth and civilization has 
been pointed out. Moreover, only reason can 
demonstrate which means do in fact conduce to 
that which all men want, and such means alone 
will be real goods. They will be the norms that 
men as rational beings must maintain to make 
their right effective. 

The precepts of reason that lead to self-preserva- 
tion Hobbes called the law of nature. He listed a 
number of these precepts, of which the most im- 
portant is that men should seek peace, and as a 
further deduction from this, that they should relinquish
their right to all things, each being content
“. . . with so much liberty against other men,
as he would allow other men against himselfe” 
([1651] 1958, p. 100). 

Yet in what sense are these precepts laws? 
Hobbes answered that they are such only if con- 
sidered as the commands of God. Otherwise they 
are not properly laws, since they are merely ad- 
visive and not externally obligatory. A correct 
definition would be “Conclusions or Theoremes 
concerning what conduceth to . . . conservation . . .”
(ibid., pp. 122-123). Hence, to Hobbes the law 
of nature appears to signify little more than the 
means or conditions which the calculating intel- 
lect demonstrates to be necessary to preservation. 
This constituted a radical departure from the tradi- 
tional view. In traditional political philosophy, the 
law of nature was held to be genuine law and to 
prescribe ethical rules originating in the divine 
order and binding upon earthly communities. 
Hobbes, however, was strongly skeptical of this 
conception and took a different line. He refused 
to derive the dicta of natural law from the super- 
natural realm, on the ground that neither God nor 
revelation can be a subject of knowledge and both 
are thus beyond the scope of philosophy. He held 
also that the dicta of natural law are vacuous until 
positive law defines their meaning. Accordingly, 
while natural law forbids theft, murder, and adul- 
tery, it remains for positive law to determine what 
actions are to be called by these names. Hobbes 
strove to remove the ambiguities in the concept 
of natural law in two ways: by denying that 
natural law is properly law at all and by deriving 
its precepts autonomously from the rationally in- 
structed striving of men for self-preservation. In 
effect, therefore, he rejected the divine order as 
the basis of moral valuation. For him, values are 
neither divinely guaranteed nor cosmically under- 
written; they are made by man. 

The fundamental dictate of the law of nature, 
then, is that men should seek peace. This entails 
their renunciation of the right to all things, which 
is theirs in the state of nature. Yet while reason 
always obliges men to do this, the conditions have 
to be established in which they can do so with 
security. Agreement among themselves to lay down
their right is insufficient unless there is also a 
power to ensure that they adhere to this rational 
resolution. Thence arises, as a further inference, 
the commonwealth in which a single will repre- 
sents the wills of all and possesses the right to 
coerce those who violate the agreement to which 
reason compels them. Therefore the ultimate cause
of the commonwealth, according to Hobbes, is the
foresight of men who perceive that civil society is 
the sole means of self-preservation and a contented 
life. Proceeding from this view, Hobbes rejected 
the Aristotelian doctrine that man is a political 
animal. He did not deny that man is naturally 
sociable; but commonwealths, he insisted, are not 
mere meetings, but true unions to whose con- 
trivance the making of compacts is necessary. 

Hobbes explained with elaborate care how the 
consent of men to government is embodied in con- 
tracts or covenants. Like the state of nature, cove- 
nant for Hobbes is a necessary hypothesis: it is 
required to show that the commonwealth is un- 
thinkable except as something to which men 
consent for the sake of life and civility. It also has 
the further function of justifying the coercion and 
punishment imposed by the state. If these are to 
be more than mere acts of power, they must be 
traced to a covenant whereby man himself, as a 
rational being, becomes the author of the punish- 
ment his transgressions incur in civil society. 

In every commonwealth, Hobbes said, there 
must be a sovereign power to enforce the covenant 
to peace that men have made: “Covenants, with- 
out the Sword, are but Words, and of no strength 
to secure a man at all” ([1651] 1958, p. 128). So 
sovereignty is necessitated by the same sequence 
of deductions that accounts for civil society. The
attributes of the sovereign are the same in any 
form of government—democracy, aristocracy, or
monarchy—and are very comprehensive. The sov- 
ereign power is not removable or punishable by its 
subjects; it is the only judge of what conduces to 
peace and therefore of what doctrines and opinions 
may be taught; its will alone makes law and de- 
termines the rules of property, of good and evil, 
of lawful and unlawful actions; it rewards and 
punishes, commands the armed forces, decides on 
war and peace. (Hobbes, of course, recognized that 
in actual fact sovereigns neither exercise all these 
powers nor do subjects allow that they have the 
right to do so. This to him, however, was mere 
illogic: if the sovereign is to keep peace and order, 
subjects ought to acknowledge all its attributes as 
inescapable.)

With this rigorous reasoning Hobbes brought 
the modern doctrine of sovereignty into the world. 
That doctrine already had a long history when 
Hobbes wrote: an approach to it appeared both in 
the work of the sixteenth-century French thinker 
Jean Bodin and, much earlier, in the writings of 
the medieval canonists and publicists who upheld 
the most extreme claims of the papal monarchy. 
But Hobbes’s conception was novel in that he allowed
no legal limitation on the sovereign power.
This was the consequence of his view of law, which 
was defined for him not by any moral content but 
solely by its character as the sovereign’s command. 
The sovereign cannot be limited by positive law, 
because the origin of positive law is the sovereign’s 
will; nor can it be limited by the law of nature, 
because the law of nature is, properly speaking, 
not a law. 

Yet, since the right of nature—man’s claim to 
self-preservation—is the root from which Hobbes’s 
political doctrine grew, this right results in sub- 
stantial qualifications upon the absolutist character 
of his thought. Although, for instance, obedience 
is a duty in civil society, Hobbes pointed out that 
the subject retains his liberty to do whatever he 
cannot be conceived to have renounced or trans- 
ferred by any covenant. This liberty is, in fact, con- 
siderable. The subject is not obliged to obey a 
command to kill or wound himself and may also 
refuse to endanger his life by service in war; again, 
while rebellion is not justifiable, subjects have the 
right, once it is begun, to continue in their resist- 
ance in order to preserve themselves. Most impor- 
tant of all, Hobbes exempted the subject from the 
obligation to obey a command that “. . . frustrates 
the End for which the Soveraignty [sic] was ordained
. . .” ([1651] 1958, p. 167). He thus left
to subjects both a vital right of private judgment 
and a moral vantage point vis-à-vis the sovereign.

Hobbes also stressed that the sovereign has 
duties toward its subjects. It is obliged, he stated, 
to make the safety and well-being of the people 
the rule of its actions. While this is a moral rather 
than a legal obligation, it is nonetheless real, 
founded as it is on men’s basic purpose in consent- 
ing to submit to the commonwealth and sover- 
eignty. A sovereign which acts otherwise “will act 
against the reasons of peace, that is to say, against 
the law of nature” ([1642a] 1962, p. 167). And 
safety, Hobbes added, means not only bare preser- 
vation but happiness and living delightfully, so far 
as these are possible. His advice to sovereigns re- 
specting taxation, equal justice, and other matters 
of government is unexceptionable in its care for 
the subject’s interests. Nor did he fail to point out 
that the sovereign’s disregard of its duty will lead 
to rebellion as a natural consequence. 

It is evident, then, that alongside the absolutist 
element in Hobbes’s thought is a strong tendency 
toward liberalism. This appeared not only in his 
account of the subject’s rights and the sovereign’s 
duties but also in his progressive theory of punish- 
ment, which anticipated Beccaria and Bentham, 
and in his dislike of religious intolerance and cleri- 
cal pretensions. Hobbes’s liberalism is intrinsic 
to his conception of natural right, the dominant 
theme of his politics. Hence there results the para- 
doxical fact that he established his absolutism on 
liberal presuppositions. For him the state is not 
an object of awe and reverence. No sacred mys- 
tique veils Leviathan, the “mortal God,” created by 
human association, to which men owe their peace 
and defense. The commonwealth is the work of 
men; its utility is its sole justification. It makes 
men moral and educates them for civility, but does 
so by their own consent and to advance their own 
purposes. If Hobbes defended absolutism, it was 
because he assumed absolutism to be in the gen- 
eral interest. Without this premise, there would 
be no great difficulty in constructing a liberal sys- 
tem out of the materials provided by his own po- 
litical philosophy. 


Hobbes was one of the most famous thinkers 
of his time. His writings were widely read, and 
although he formed no school, he probably exer- 
cised a greater and more varied influence upon 
English political theory than did any contem- 
porary. Of Continental philosophers, Pufendorf 
and Spinoza, to mention only the foremost, were 
strongly affected by his ideas. A royalist in sym- 
pathy, he emigrated in 1640 and spent the next 
11 years in France. He returned to England after 
the defeat and execution of Charles I, justifying 
his return with the argument that a subject's obli- 
gation ceases when the sovereign can no longer 
protect him ([1651] 1958, see “Review and Con- 
clusion”). Despite this conduct, Hobbes retained 
the favor and friendship of Charles II after the 
Stuart restoration of 1660. 

Hobbes’s materialism, determinism, and skepti- 
cal temper brought upon him a host of attackers. 
Clerical opponents accused him of heresy and 
atheism, and in 1683 the University of Oxford con- 
demned a number of his works to the flames. Some 
of the controversies he waged, such as that with 
Bishop Bramhall over free will, belong to the great 
intellectual debates of the age. Even upon his 
critics Hobbes exerted a powerful influence. He 
obliged them to lay aside theological and moral 
conceptions, to meet him on his own ground of 
strict and severe reasoning, and to deal with issues 
on their intellectual merits. His remarkable prose 
style, perfectly expressive of the hard, confident, 
and probing character of his mind, also contributed 
not a little to the effect he had upon friend and 
foe alike. By virtue both of his positive doctrines 
and of the scope and rigor of his philosophical 
inquiries, Hobbes was one of the foremost agents 
in the dissemination of the rationalism that altered 
the moral and mental climate of Europe in the 
course of the seventeenth century. 

PEREZ ZAGORIN 


[See also CONSENSUS; CONSTITUTIONS AND CONSTITU- 
TIONALISM; NATURAL LAW; POWER; SOCIAL CON- 
TRACT; SOVEREIGNTY; UTILITARIANISM; and the 
biographies of AUSTIN; BACON; BODIN; DESCARTES; 
DURKHEIM; HARRINGTON; HEGEL; MACHIAVELLI; 
MANDEVILLE; VICO.] 


HOBHOUSE, L. T. 


Leonard Trelawny Hobhouse (1864-1929), Brit- 
ish sociologist and philosopher, was educated at 
Oxford. He was appointed fellow of Merton College 
in 1887 and fellow of Corpus Christi in 1894. Early 
in his career he showed an active interest in social 
and political movements: he was secretary of the 
Free Trade Union from 1903 to 1905 and published 
several political works, notably The Labour Move- 
ment (1893), Democracy and Reaction (1904), 
and Liberalism (1911a). From 1897 to 1902 he 
served on the staff of the Manchester Guardian, 
and from 1905 to 1907 he was political editor of 
the Tribune in London. 

The main lines of the social philosophy that he 
was to develop in his later studies can be clearly 
discerned in these early political writings. This 
philosophy is based on principles differing funda- 
mentally both from laissez-faire liberalism and 
from the bureaucratic, or as Hobhouse called it, 
“official” socialism of the Fabians. It may best be 
described as social liberalism, and in working it out 
he presented a cogent and still vitally important 
analysis of the relations between individual free- 
dom and responsibility and state control. 

In 1907 Hobhouse was appointed the first Martin 
White professor in sociology at the University of 
London and thereafter devoted himself mainly to 
his scientific and philosophical pursuits, but he 
never abandoned his earlier passionate interest in 
the deeper issues of politics, internal and foreign, 
as can be seen in his important contributions to 
the Manchester Guardian during World War I and 
in his work as chairman of trade boards and as 
arbitrator in various disputes. 

The key to Hobhouse’s work is to be found in his 
conception of development. He was profoundly dis- 
satisfied both with Spencer's account of evolution 
as dependent on the struggle for existence and the 
survival of the fittest and with the idealist philos- 
ophers’ metaphysical theory of an absolute mind 
reaching self-consciousness in the historical proc- 
ess. In his view, a theory of development has to be 
solidly based on empirical facts. In his early work 
on logic and epistemology, The Theory of Knowl- 
edge (1896), he had subjected the dominant ideal- 
ist view of the nature of thought to a detailed 
critical analysis and worked out a realistic theory 
based on a synthesis of empiricism and rationalism. 
Subsequently he turned to the study of the growth 
of the mind in the animal world and of the transi- 
tion to the mind of man. The results were embodied 
in Mind in Evolution (1901). 

The publication of Morals in Evolution (1906) 
marked a new epoch in the development of soci- 
ology. Hobhouse revealed a grasp of anthropological 
data, of the history of law, morals, and religion, 
and of scientific and philosophical thought and laid 
the foundations of an impressive sociological sys- 
tem. A later work incorporated research on pre- 
literate societies carried out in collaboration with 
G. C. Wheeler and the present writer and was 
published in 1915 under the title The Material Cul- 
ture and Social Institutions of the Simpler Peoples. 

From 1918 to 1922 Hobhouse published three 
works devoted to political and ethical theory, and 
in 1924 there appeared his Social Development, in 
which he presented a restatement of the whole of 
his sociological work. In this book he gave a fresh 
analysis of the nature of social development and 
of the factors making for progress, arrest, or decay. 
His treatment avoids both the mistakes inherent in 
the kind of crude evolutionism that has brought the 
theory of development into disrepute and those that 
result from the one-sided emphasis on economic 
factors implicit in many forms of historical mate- 
rialism. Only the general lines of this methodology 
are indicated here: 

(1) He worked out a social morphology, that is, 
a classification of types of society and of the forms 
of social institutions, based on a wide survey of the 
data of anthropology and history. He was well 
aware of the difficulties inherent in the comparative 
method but sought to meet them by penetrating 
beneath outward resemblances to basic functions. 

(2) He attempted to correlate the various as- 
pects of social change with their contribution to 
the general advance of the community, as estimated 
by certain criteria deduced from the general nature 
of development. They are extension in the scale of 
organization, growing efficiency in control and di- 
rection, and increasing freedom and cooperation in 
the satisfaction of mutual needs. It should be noted 
that these criteria are, at this stage of the inquiry, 
to be taken as ethically neutral: scale and efficiency 
are clearly so, and even though freedom and mutu- 
ality might appear to be ethically tinged, they are 
defined by Hobhouse in a manner that does not 
involve any particular ethical theory. Thus, a soci- 
ety is said to be free internally if its component 
parts can operate with the least loss of energy, 
without interfering with or obstructing each other. 
Mutuality depends on the extent to which the parts 
not only do not obstruct each other but work to- 
gether so as to maintain the system as a whole. 

(3) He put forward a hypothesis that there is a 
broad correlation between mental development and 
social development as estimated in the light of the 
above criteria. To establish this correlation he 
undertook an elaborate analysis of mental develop- 
ment in (a) the growth of scientific thought, (b) 
the control of the forces of nature, including hu- 
man nature, (c) the ethicoreligious sphere, and 
(d) more briefly, the history of art. It is not sug- 
gested that the movements in these various spheres 
are parallel or that their rate of advance is similar 
but that, making due allowance for conditions in 
different periods of time and different parts of the 
world, they may be taken as a general indication of 
the growth of the human mind. 

(4) Hobhouse then tried to show that there is 
indeed a broad correlation between mental advance 
and the development of societies. Thus, in the 
earlier phases of thought and belief, societies grow 
in scale and efficiency and in control and direction, 
although at the cost of mutuality and freedom. Gov- 
ernments tend to be authoritarian in character, and 
social differentiation rests on subordination. In rel- 
atively higher phases this growth in scale and effi- 
ciency continues to increase as does subordination. 
But with the beginning of critical and systematic 
thought free forms of government arise, and, in 
theory at any rate, there emerge ideals of a unitary 
spiritual order. In the ancient world the embodi- 
ment of these ideals is still very restricted. In more 
recent times, in the stage of “experiential recon- 
struction,” when conceptual analysis is combined 
with empirical verification, there is not only an 
increase in scale and efficiency but also the begin- 
ning of a concrete embodiment of the elements of 
mutuality and freedom on a world scale; and at- 
tempts are made at a genuine synthesis of personal 
and political freedom with moral universalism. A 
survey of history thus suggests that there has been, 
on the whole, correlated growth of mind and civili- 
zation. 

(5) Hobhouse next argued that the process of 
development, so far examined by him independ- 
ently of any ethical commitment, may be shown to 
be generally in the direction required by his theory 
of the rational good. This good consists in the har- 
monious fulfillment of human potentialities, and it 
is clear that it can be achieved only by the willing 
cooperation of all mankind. In theory, therefore, 
ethical and social development have a common 
end. But although they tend to converge, they do 
not in fact coincide. For this there are many rea- 
sons. Each different society develops, if not wholly 
independently, yet in its own way and in accord 
with its own peculiarities and distinguishable in- 
ternal sources of change. Furthermore, they ad- 
vance at different rates. As a result, inequalities in 
economic and political power arise that offer oppor- 
tunities for exploitation and domination and have 
been among the root causes of war. Again, develop- 
ment in one direction may bring retrogression in 
another. Thus, the extension of the area of organ- 
ization may reduce the chances of conflict but may 
make it more destructive when it does occur. The 
parts may develop at the expense of the whole or 
the changes that they undergo may not be duly 
balanced, with resulting structural strains and pos- 
sible collapse. These causes of arrest or decay can 
easily be illustrated from the history of communi- 
ties and are readily discerned in the contemporary 
world. It is in this connection that Hobhouse’s 
analysis may be fruitfully extended in dealing with 
the problems of our own day. Progress is, in any 
case, not automatic or unilinear but depends on 
human thought and will: on the rationality of the 
mind and its capacity for forming an intelligible 
conception of a good in which all men can share 
and for securing an effective will directed to this 
good. Humanity has not yet reached the stage of 
self-direction; but reviewing the state of the world 
in the 1920s, Hobhouse felt justified in concluding, 
despite serious misgivings, that it contained many 
essentials of such self-direction and that these were 
sufficient to indicate the direction of future de- 
velopment. 

Hobhouse further strengthened his argument by 
an examination of the conditions affecting social 
change: environmental, biological, psychological, 
and distinctively sociological. The great value of his 
contributions to sociology does not depend solely on 
the strength of his main hypothesis; it also lies in 
the light he threw on various phases in the history 
of knowledge, religion, and morals and their rela- 
tion to social change, in the careful way in which 
he distinguished judgments of fact and of value, 
and in the skill with which he eventually brought 
facts and values together in a comprehensive syn- 
thesis. 

Hobhouse will be remembered not only as a con- 
tributor to sociology and to comparative and social 
psychology but also as a philosopher of great dis- 
tinction. The fullest exposition of his philosophical 
theories is to be found in his Development and 
Purpose (1913), which he himself regarded as 
completing a scheme that had occupied him for 26 
years and that had been carried through successive 
stages in his previous major works. In this book he 
showed that development proceeds by the liberation 
of elements originally indifferent to or in conflict 
with one another and by the building up of struc- 
tures of varying degrees of plasticity and coherence. 
The power behind development is mind, essentially 
a correlating activity, manifested in all orderly 
structures but most clearly in living organisms, in 
which mechanical and teleological factors are inter- 
woven. Mind is, in this view, not coextensive with 
reality but is the principle of orderly growth within 
it. It is limited by the material it works upon, and 
its purposes themselves undergo development. His 
fundamental principle, which he entitled “condi- 
tional teleology,” is examined both from the point 
of view of the logical requirements of explanation 
in dealing with systems undergoing change and 
from the point of view of its value as an instrument 
of investigation, especially in the fields of biology 
and sociology. 

MORRIS GINSBERG 


[Directly related are the entries EVOLUTION, article on 
CULTURAL EVOLUTION; LIBERALISM; SOCIAL DARWIN- 
ISM; SOCIAL INSTITUTIONS. Other relevant material 
may be found in CONSTITUTIONAL LAW, article on 
CIVIL LIBERTIES; TRIBAL SOCIETY; WAR, article on 
PRIMITIVE WARFARE; and in the biographies of 
BOSANQUET; OGBURN; TOYNBEE.] 


John Atkinson Hobson (1858-1940) was a 
British economist. In his appealing autobiography, 
Confessions of an Economic Heretic (1938, p. 15), 
he described himself as “born and bred in the 
middle stratum of the middle class of a middle- 
sized industrial town of the Midlands.” The town 
in question was Derby, and the family derived its 
income from the Derbyshire Advertiser, a Liberal 
newspaper. The family’s resources sufficed to as- 
sure Hobson an adequate private income all his 
life and the chance to pursue a heterodox career 
free of financial strain. At Oxford he specialized 
in classics. He also made a close study of Marx, 
but Capital repelled him and he dismissed the 
dialectical method as frivolous. As a child he 
broke with formal religion because it failed “to 
satisfy my elementary sense of reason and of 
justice in the doctrines of the atonement and of 
everlasting punishment for unrepentant sinners” 
(1938, p. 20). 

The intellectual influences that impinged on 
Hobson included Mill’s On Liberty and Utilitarian- 
ism and Herbert Spencer's Study of Sociology. John 
Ruskin’s message impressed him while he was 
teaching classics in two successive public schools. 
Even though Hobson recognized the speculative 
cast of much of Ruskin’s thought, he was attracted 
by Ruskin’s criticism of the conventional econo- 
mists’ tendency to identify economic value with the 
monetary costs of producing marketable commodi- 
ties. Ruskin’s slogan “there is no wealth but life” 
animated all of Hobson’s writings, especially the 
two volumes that best express his humanistic 
conception of economics, The Industrial System 
(1909) and Work and Wealth (1914). Thorstein 
Veblen’s criticisms of capitalist waste and capitalist 
standards of consumption also attracted Hobson. 
With American institutional economists of the 
period he shared a basic distrust of marginal eco- 
nomics and a disposition to interpret all aspects of 
economic activity within the broader framework 
of ethics and sociology. 

Hobson’s two enduring contributions to eco- 
nomic thought are connected with the doctrine of 
oversaving and the theory of imperialism. The 
possibility, even the probability, of oversaving is 
the central lesson of his first book, The Physiology 
of Industry (Mummery & Hobson 1889), which 
he wrote in collaboration with the businessman 
and mountain climber A. F. Mummery. Hobson 
directed this first attack at a central proposition 
of conventional economics—that a society can 
never save too large a proportion of its income— 
asserting that excess saving brings about the 
construction of unneeded capital equipment whose 
products lack consumer markets. In later works 
Hobson strengthened his argument with explicit 
emphasis upon the maldistribution of income, 
which causes the wealthy to save too large a share 
of their income and thus generates unemployment 
and economic depression. Hobson's conception of 
oversaving differs from contemporary theories in 
its assumption that saving is actually and neces- 
sarily converted into fixed investment. The Keynes- 
ian version maintains that an effort to save too 
much diminishes total spending rather than fixing 
itself in surplus machines. 

Hobson's second major contribution is contained 
in Imperialism (1902). This was an extension of 
his oversaving theory, but its impetus and a great 
deal of its historical detail derived from Hobson’s 
experience in South Africa as a correspondent of 
the Manchester Guardian just prior to the Boer 
War. He explained the revived drive for empire as 
the result of a search by English capitalists for 
savings outlets. Lenin borrowed freely from Hobson’s 
doctrine. 

Although trenchant as a critic, Hobson was less 
successful as an inventor of concepts and pro- 
grams. He never put into acceptable theoretical 
style his human conception of value or his per- 
ception that the money costs of production inade- 
quately measure the human pains of economic 
output. Nor did he find a completely satisfactory 
political vehicle for his ideas. He began as a Liberal 
and then became a somewhat reluctant member 
of the Labour party. Never accepting class war as 
the proper route to a better society, always seeking 
to enlist portions of the middle class under his 
banner, Hobson was also skeptical of nationaliza- 
tion as the single remedy for a defective society. 
Although he was perfectly willing to nationalize 
industries which suffered from monopoly, ineffi- 
ciency, or excessive routinization of their opera- 
tions, he attached at least equal importance to less 
concentration within private industry, progressive 
taxation of income and wealth, and the social 
policies now embodied in the welfare state. Like 
R. H. Tawney and G. D. H. Cole, he emphasized 
the quality of consumption and the satisfactions 
of the worker as elements of a humane society. 
He took pride in what he called “‘the jumble’ of 
politics, ethics, art, religion, which constitutes the 
setting of my thought” (1938, p. 164). 

Orthodox economists admired Hobson little more 
than he admired them. F. Y. Edgeworth, in an 
influential review of The Physiology of Industry, 
denied Hobson and Mummery the “undoubted spec- 
ulative genius” or the “extraordinarily wide learn- 
ing” that alone might justify such paradoxical con- 
clusions. As a direct consequence of the book, 
Hobson lost two extension lectureships and won a 
reputation for heresy that placed him on the fringes 
of the academic world for the rest of his career. As 
late as 1954, Schumpeter summed him up in this 
way: “In economics he was self-taught in a wilful 
way that made him both able to see aspects that 
trained economists refused to see and unable to see 
others that trained economists took for granted” 
(1954, p. 832, footnote). 

Hobson felt himself vindicated, however, when 
Keynes in The General Theory of Employment, 
Interest and Money (1936) termed the publication 
of The Physiology of Industry “an epoch in eco- 
nomic thought” and credited Hobson with antici- 
pating a good half of Keynes’s own theory of under- 
employment equilibrium. As an anticipator of both 
Lenin and Keynes, Hobson seems to have a secure 
place in the history of economic doctrine. 


ROBERT LEKACHMAN 


[For the historical context of Hobson's work, see the 
biographies of MILL; SPENCER; VEBLEN; for discus- 
sion of the subsequent development of Hobson's 
ideas, see CONSUMPTION FUNCTION; IMPERIALISM; 
and the biographies of KEYNES, JOHN MAYNARD; 
LENIN.] 


WORKS BY HOBSON 

(1889) 1956 MUMMERY, ALBERT F.; and HOBSON, 
JOHN A. The Physiology of Industry: Being an Ex- 
posure of Certain Fallacies in Existing Theories of 
Economics. New York: Kelley. 

(1894) 1949 The Evolution of Modern Capitalism: A 
Study of Machine Production. Rev. ed. New York: 
Macmillan. 

(1902) 1948 Imperialism: A Study. 3d ed. London: Allen 
& Unwin. 

(1909) 1910 The Industrial System: An Inquiry Into 
Earned and Unearned Income. New ed., rev. London 
and New York: Longmans. 

(1914) 1933 Work and Wealth: A Human Valuation. 
Rev. ed. London: Allen & Unwin. 

1938 Confessions of an Economic Heretic. New York: 
Macmillan. 


SUPPLEMENTARY BIBLIOGRAPHY 


BRAILSFORD, HENRY N. 1948 The Life-work of J. A. 
Hobson. London: Oxford Univ. Press. 

HOMAN, PAUL T. 1928 Contemporary Economic Thought. 
New York: Harper. → See especially pages 281-374 
on “John A. Hobson.” 

HUTCHISON, TERENCE W. (1953) 1962 A Review of 
Economic Doctrines: 1870-1929. 2d ed. Oxford: Clar- 
endon. → See especially pages 118-129 on “J. A. 
Hobson.” 

KEYNES, JOHN MAYNARD 1936 The General Theory of 
Employment, Interest and Money. London: Macmillan. 
→ A paperback edition was published in 1965 by 
Harcourt. 

SCHUMPETER, JOSEPH A. (1954) 1960 History of Eco- 
nomic Analysis. Edited by E. B. Schumpeter. New 
York: Oxford Univ. Press. 


HOLMES, OLIVER WENDELL 


The career of Oliver Wendell Holmes, Jr., as a 
judge spanned half a century. Yet quite apart from 
this long and distinguished service on the highest 
courts of his state and nation, his pervasive influ- 
ence as historian and philosopher of the law is 
bound to assure him a permanent place in Ameri- 
can jurisprudence. It is his 30 notable years as an 
associate justice of the United States Supreme 
Court, however, that no doubt constitute his special 
claim to importance and fame as a jurist. In so 
many ways a true product of New England’s intel- 
lectual aristocracy, in time he came to capture the 
popular imagination of the whole country. 

His father was Dr. Oliver Wendell Holmes, 
whose work as poet, essayist, and physician was 
in itself a significant chapter in the flowering of 
America’s cultural and scientific maturity. The 
future judge was born in Boston on March 8, 1841, 
and died in Washington, D.C., on March 6, 1935. 
In July 1861, shortly after graduating from Har- 
vard College, he enlisted with the Twentieth Massa- 
chusetts Volunteers and was mustered out three 
years later with the rank of captain. Holmes was 
wounded three times—at Ball’s Bluff, at Antietam, 
and at Marye’s Hill, Fredericksburg. Both his phi- 
losophy and his rhetoric were destined to reflect his 
Civil War experience. That experience may help 
explain “the inner resolution he felt bound to 
achieve . . . between skepticism and faith,” as 
Paul A. Freund has phrased it (see the foreword 
in Frankfurter 1938). 

Once out of uniform, Holmes resolved his doubts 
concerning the choice of a life’s calling—philos- 
ophy versus law—by enrolling at the Harvard Law 
School, where he studied between 1864 and 1866. 
“A man may live greatly in the law as well as else- 
where,” he was to say many years later in “The 
Profession of the Law” (1962, p. 29). 

It is significant that almost from the start of his 
law practice, scholarly interests claimed Holmes’s 
chief attention. As Mark DeWolfe Howe has noted, 
“the young lawyer's affiliations of mind and sym- 
pathy had early shown themselves to be more with 
the Brahmins than with the merchants of New 
England” (Howe 1957-1963, vol. 2, p. 2). He 
served as editor of the American Law Review from 
1870 to 1873 and edited the twelfth edition of 
James Kent’s Commentaries on American Law. But 
it was the publication of his book The Common 
Law in 1881 that doubtless established his reputa- 
tion as a legal scholar, leading in the following year 
to his appointment as Weld professor of law at Har- 
vard and shortly thereafter to his selection as an 
associate justice of the Massachusetts Supreme 
Judicial Court. The book is permeated by all of the 
major strains of thought that have come to be iden- 
tified with Holmes’s outlook and method both as 
scholar and judge: his deep sense of history, his 
rejection of the rigidities of legal logic, his aversion 
to the confusion between law and morals, and his 
awareness of the psychological roots of judicial de- 
cisions. Some of the sentences of the opening para- 
graph have become famous in themselves: 


The life of the law has not been logic: it has been ex- 
perience. The felt necessities of the time, the prevalent 
moral and political theories, intuitions of public policy, 
avowed or unconscious, even the prejudices which 
judges share with their fellow-men, have had a good 
deal more to do than the syllogism in determining the 
rules by which men should be governed. The law em- 
bodies the story of a nation’s development through 
many centuries, and it cannot be dealt with as if it 
contained only the axioms and corollaries of a book 
of mathematics. ([1881] 1963, p. 7) 


Holmes served on the Massachusetts bench for 
20 years, 1882 to 1902, the last three as chief 
justice. Most of his opinions dealt with the tradi- 
tional concerns of common law litigation—torts, 
contracts, and crimes—but there were cases in 
which he had the opportunity to discuss broader 
issues of public policy and constitutional law. When 
that happened he usually gave expression to ideas 
that can also be found in his legal essays; they 
foreshadowed the dominant themes of his philos- 
ophy as a Supreme Court justice. He insisted that 
the judiciary has but a limited role to play in the 
process of government and that the judge must 
allow ample scope to the provisions of constitu- 
tions and the discretion of legislators. 

It was Holmes’s dissenting opinions in labor 
cases, especially those in which he vindicated the 
rights to peaceful picketing and the closed shop, 
that made some circles think of him as a radical. 
Yet these seemingly pro-labor opinions were the 
result of his basically conservative slant—what he 
called “the battle of trade” or the “free struggle for 
life.” If the pursuit of economic advantage was 
leading businessmen to combine, he saw no reason 
for keeping workers from cooperating with each 
other in promoting their interests. 

Judging from the correspondence between Henry 
Cabot Lodge and Theodore Roosevelt, it may have 
been Holmes’s views on the labor problem that in- 
duced Roosevelt to name him to the Supreme Court 
in 1902. Ironically, when Holmes, much to Roose- 
velt’s displeasure, dissented only a year later from 
the Court's opinion upholding the dissolution of 
the Northern Securities Company, he invoked the 
same general economic beliefs implicit in his Mas- 
sachusetts labor opinions. 

Although the number of occasions on which 
Holmes differed from Supreme Court majorities 
was comparatively small, he came to be regarded 
as the “great dissenter” on the Court. It was largely 
these dissenting opinions that made the public 
think of him as a liberal judge, not realizing that 
in private he was opposed to many of the reforms 
that, as a judge, he was seeking to save (Biddle 
1961, p. 68). 

Holmes’s reputation as a liberal no doubt origi- 
nated with his celebrated 1905 dissent in Lochner v. 
New York—an opinion which Roscoe Pound char- 
acterized as the “best exposition” of sociological 
jurisprudence. In eloquent and increasingly sharp 
language, the justice continued to protest against 
the tendency of many of his colleagues to annul 
labor and social welfare laws because they disap- 
proved of the policies these laws embodied. He 
furnished one of the best clues to the reason why 
he often dissented on such questions in a case in 
which a majority of the Supreme Court had struck 
down an Arizona law forbidding the use of injunc- 
tions in labor disputes. “There is nothing I more 
deprecate,” Holmes confessed in 1921, “than the 
use of the Fourteenth Amendment beyond the ab- 
solute compulsion of its words to prevent the mak- 
ing of social experiments that an important part 
of the community desires, in the insulated cham- 
bers afforded by the several States, even though the 
experiments may seem futile or even noxious to 
me and to those whose judgment I most respect” 
(Truax v. Corrigan, 257 U.S. 312, 344). 

There is still another reason for the popular 
image of Holmes as a liberal dissenter. In a num- 
ber of free speech cases he voted against the gov- 
ernment and spoke out in defense of the individual’s 
right of free expression. But he had also delivered 
the Court's opinions in several cases upholding con- 
victions under the Espionage Act of 1917, includ- 
ing the conviction of Eugene V. Debs, the militant 
head of the Socialist party. It was in the first of 
these wartime cases—Schenck v. United States— 
that he set forth his celebrated formula of “clear 
and present danger.” “The question in every case,” 
he wrote, “is whether the words used are used in 
such circumstances and are of such a nature as to 
create a clear and present danger that they will 
bring about the substantive evils that Congress has 
a right to prevent” (249 U.S. 47, 52, 1919). 

When Holmes became convinced—possibly 
through the influence of his frequent companion 
in dissent, Justice Louis D. Brandeis—that the ob- 
jectionable speech or publication was not likely to 
bring about “a clear and present danger,” he did 
not hesitate to register his disagreement. His 
opinions in Abrams v. United States and Gitlow v. 
New York (268 U.S. 652, 1925) are perhaps most 
notable. In the Abrams dissent, Holmes said: 


But when men have realized that time has upset many 
fighting faiths, they may come to believe even more 
than they believe the very foundations of their own 
conduct that the ultimate good desired is better reached 
by free trade in ideas—that the best test of truth is the 
power of the thought to get itself accepted in the com- 
petition of the market, and that truth is the only 
ground upon which their wishes safely can be carried
out. (Abrams v. United States, 250 U.S. 616, 630,
1919)


What Justice Frankfurter once said about these 
dissents is a good measure of Holmes’s lasting con- 
tribution as a constitutional judge: “. . . some of 
his weightiest utterances are dissenting opinions 
—but they are dissents that record prophecy and 
shape history” (Mr. Justice Holmes [1916-1930] 
1931, p. 116). 

SAMUEL J. KONEFSKY 


[For the historical context of Holmes’s work, see the 
biographies of BRANDEIS; CARDOZO; POUND.]
HOLT, EDWIN B.


Edwin Bissell Holt (1873-1946) was an Ameri- 
can psychologist-philosopher, an erudite scholar, 
a brilliant and graceful writer, a man of strong 
likes and dislikes in his opinions and social rela- 
tions, and a thinker who had more influence than 
prestige, being more often followed than quoted. 

Philosophically Holt was a radical empiricist, 
building enthusiastically on the foundation laid by 
his admired teacher William James. By 1908, when 
Holt had completed his Concept of Consciousness 
(although he did not publish it until 1914), he had 
accepted James's belief that the unit of nature is 
not an atom or a sensation, but a relation, and 
that consciousness is all those relations in which 
the nervous system makes possible responses of 
the living organism to particular objects in its envi- 
ronment. To such general views James gave the 
name “radical empiricism,” and in 1910 Holt and 
five other philosophers laid down a program for 
what they called the New Realism; this became the 
title of a book that they published together in 1912. 
In it Holt’s essay on illusion and error made his 
views clear. In 1915 he published The Freudian 
Wish (1915a), which became his most influential 
book, in part because it continued the fight against 
consciousness as Cartesian unextended substance, 
but chiefly because it fitted into the positivistic and 
physicalistic trend in psychology, which led from 
earlier materialistic and objective psychology to 
the behaviorism that was founded by John B. 
Watson in 1913. Holt gave behaviorism a philo- 
sophical sophistication which Watson was incapa- 
ble of providing. 

Holt’s insight into the nature of consciousness 
connected him not only with behaviorism but also 
with the emergence of dynamic psychology. Holt 
took from Freud the word wish and employed it 
as the unit of his realistic monistic psychology, the 
relation of response that explains mind. To express 
this dynamic relation Holt in 1931 adopted the 
term drive. Dynamic psychology has now become 
the psychology of human nature, and its achieve- 
ments, which are many, represent the success of 
the protest against the dualism of German intro- 
spective psychology, which initiated the experi- 
mental movement. Holt played a significant part 
in this development. 

About 1920 Holt began to write Animal Drive 
and the Learning Process, intending it as a revision 
of William James’s classic Principles of Psychology, 
but that task proved impossible even for Holt’s 
brilliance. Instead, more than a decade later Ani- 
mal Drive came out as a psychophysiological text 
that explicated the importance of radical empiricism
for psychology.

Holt was born in Winchester, Massachusetts, 
went to school in Winchester, entered Amherst 
College for a year in 1892, and then transferred to 
Harvard, where he received the A.B. magna cum 
laude in 1896. After a year at Harvard graduate 
school, he volunteered for the army in the Spanish- 
American War, coming back to academic life with 
a virile and picturesque mode of speech that char- 
acterized him ever after. There followed a year 
abroad at the University of Freiburg, an A.M. from
Columbia University, and then a Ph.D. from Harvard
in 1901. At Harvard, William James and
Hugo Münsterberg stimulated him most; the latter
made Holt first an instructor and then assistant 
professor, a position Holt held until his resignation 
in 1918. While Münsterberg lived—he died in
1916—Holt was his adjutant in the laboratory, 
helping the graduate students with the research 
that Münsterberg put them on, maintaining the
apparatus in apple-pie order, and leading an exper- 
imentalist’s life, not a philosopher's. However, he 
did not publish much experimental work of his 
own. On coming into a considerable inheritance 
in 1918 he decided to resign from Harvard, stick- 
ing to his decision in spite of the offer of a full 
professorship and the directorship of the psycho- 
logical laboratory. 

For the next eight years Holt lived in many 
places, reading, discovering that it was too late to 
revise James's Principles, starting to write the book 
that eventually became Animal Drive, and finally 
publishing it in 1931. Meanwhile, he had gone to 
Princeton in 1926 to spend ten years as visiting 
professor, where he taught a much-liked semester- 
course on social psychology and worked on a second
volume of Animal Drive. He retired, however,
without finishing the work. 


EDWIN G. BORING


[For the historical context of Holt’s work, see the biog- 
raphies of JAMES and MÜNSTERBERG. For discussion 
of the subsequent development of his ideas, see 
Drives and Motivation.]
HOMELESSNESS


Homelessness is a condition of detachment from 
society characterized by the absence or attenuation 
of the affiliative bonds that link settled persons to 
a network of interconnected social structures. 

Homelessness takes many forms, depending on 
the type of detachment involved and the local cir- 
cumstances. Homeless families and homeless men 
appear, so far as can be determined, in all large- 
scale societies. Homeless women and children are 
relatively rare. Their appearance denotes wide- 
spread disorder and instability, such as follow 
famines and civil wars. 

Degrees of homelessness. Homeless families fall 
into three general types: permanent wanderers, 
such as gypsies and carnival performers; wander- 
ers with a fixed base, especially migratory farm 
workers; and refugees, for whom homelessness is 
an accident and not a way of life. Homeless men 
fall into a variety of categories: single men in itin- 
erant occupations, such as peddlers, tinkers, and 
sailors; migratory laborers; vagrants and beggars; 
religious mendicants; outlaws and other fugitives; 
and hoboes and derelicts. 

These types suggest that homelessness is a mat- 
ter of degree, ranging from temporary to perma- 
nent and from the loss of a single affiliation to the 
absence of all affiliations. In general, homelessness 
is less acute for families than for individuals. The 
homeless family may have no place in any com- 
munity, but its members carry a web of roles and 
obligations with them wherever they go. Some, like 
the gypsies, develop something like a portable com- 
munity independent of the surrounding social 
structure. Others, like the millions of families displaced
toward the end of World War II, are rendered
homeless without their consent and resume
normal affiliations at the first opportunity.


There is even more variation in degree of home- 
lessness among men without families. Some, like 
the itinerant worker, may preserve multiple affilia- 
tions: occupational, religious, political, and so on. 
They are only as homeless as their mobility forces 
them to be. Others, like religious mendicants, elect 
one overriding identification that supplants all of 
their former roles. 

At the extreme point of the scale, the modern 
skid row man demonstrates the possibility of nearly 
total detachment from society. The present discus- 
sion is focused on this end of the homelessness 
scale and omits any consideration of the minor 
degrees of homelessness represented by Bohemians, 
“beatniks,” troubadours, residents in alien societies, 
and other adventurers whose partial detachment 
from role obligations is arranged deliberately and 
with substitute goals in view. 

Homelessness as a syndrome. The common char- 
acteristics of homeless populations display a dis- 
tinct and unmistakable pattern that is roughly ap- 
plicable to any homeless population though fully 
descriptive of none. Homeless persons are poor, 
anomic, inert, and nonresponsible. They command 
no resources, enjoy no esteem, and assume no bur- 
den of reciprocal obligations. Social action, in the 
usual sense, is almost impossible for them. Lacking 
organizational statuses and roles, their sphere of 
activity extends no further than the provision of 
personal necessities on a meager scale. Their de- 
cisions have no implications for others. Only the 
simplest forms of concerted action are open to 
them, since homelessness is incompatible with sus- 
tained involvement in a complex division of labor. 

The observable behavior of the homeless person 
consists largely of activities that furnish subsist- 
ence or enjoyment without incurring responsibili- 
ties: mendicancy, petty crime, scavenging, casual 
conversation, and an incurious attention at spec- 
tacles. This quality of social inertness renders him 
both innocuous and helpless. He is unlikely to en- 
gage in major crime or political movements or to 
protest his own condition forcefully. A certain 
apathy regarding self-preservation often develops 
in addition to the collective helplessness. The home- 
less in great cities and in refugee camps stand and 
watch their companions assaulted by strangers 
without offering to interfere and without taking any 
measures to protect themselves. The combined 
effect of poor nutrition, exposure of all kinds, neg- 
lect of injuries and illness, and insensitivity to 
emergencies leads to very high rates of morbidity 
and mortality among homeless persons as com- 
pared to the settled population around them. 

The homeless versus the settled. For reasons
not entirely understood, the presence of homeless 
persons often arouses a degree of hostility in the
settled population that seems entirely dispropor- 
tionate. Ancient ballads preserve the image of the 
gypsy as rapist and kidnapper. Being homeless or 
vagrant became a felony in England in the four- 
teenth century and a capital crime under the 
Tudors; it is still treated as a criminal offense in 
most American and European cities. These external 
attitudes and the punitive measures to which they 
lead further separate the homeless man from his 
settled neighbor. Homeless populations often oc- 
cupy an underworld with a special argot, secret 
signs, and a conventional refusal to communicate 
with outsiders. Such manifestations may lead ro- 
mantic observers to imagine an entire clandestine 
social system, such as the “kingdom of beggars” or 
the “hobo republic,” but on closer view, these fan- 
tasies disappear and reveal the homeless as an 
aggregate of unrelated individuals who share a 
common situation and some elements of a common 
culture. 

A theory of homelessness. The Germanic term 
“home,” with its special connotations of warmth, 
safety, and emotional dependence, has no exact 
equivalent in other linguistic systems. Aside from 
its familiar overtones, it expresses the idea of a 
fixed place of residence shared with a limited num- 
ber of other persons. In current usage, home does 
not imply a family (unrelated persons can “make 
a home” together), a household (a rest home may 
be much larger), more than moderate fixity (in any 
given year, more than one in five U.S. families 
moves to a new home), or exclusivity (a special 
body of tax law covers taxpayers with two or more 
homes). Living outside a family, with no perma- 
nent address, does not make a priest or a soldier 
homeless, but the man who occupies the same 
lodging on skid row for forty uninterrupted years 
is properly considered homeless. The essence of 
the concept goes beyond residential arrangements. 
Homelessness is best visualized as a relationship 
to society at large and best understood by examin- 
ing the difference between the settled and the 
homeless. 

A settled, active adult in any society participates 
simultaneously in several major types of organiza- 
tion. This participation anchors him in the social 
structure by linkages with near and distant kin, 
neighbors, compatriots, superiors and subordinates, 
co-workers, departed ancestors, sacred and super- 
natural personages, persons with the same special 
interests, and even those who like to play the same 
games. Each of these memberships involves duties 
and privileges, restraints and rights, status and 
roles. Each requires sustained interaction with particular
other persons who enter into the subject's
life history and become part of the audience for
his socially meaningful acts. 

The pattern of these affiliations varies in detail 
from one time and place to another, but the main 
outlines remain unchanged. In modernized coun- 
tries of the twentieth century, the individual's at- 
tachment to society seems to be mediated through 
eight major types of organization: family, school, 
community, state, production group, occupational 
union, church, and recreational associations. Taken 
together, these memberships define who the settled 
person is, how he schedules his days and his years, 
and the benefits he may claim from, and the duties 
he owes to, the social order. Above all, they identify 
the others who are influenced by him and define 
the extent of that influence. 

The fully homeless man is unaffiliated in all 
sectors. He is not an active member of any organ- 
ization and therefore has no enforceable responsi- 
bilities toward fellow members, no audience of 
persons influenced by his actions, no claim on 
social rewards beyond whatever minimum has been 
set to avoid the scandal of his starvation, and no 
duties to fulfill except those imposed as a condition 
of remaining in the territory. Clearly, this condition 
will be uncomfortable and sometimes dangerous. 
On the other hand, it may be attractive to those 
who reject the cultural goals together with the 
means of achieving them. 

Homelessness as an ideal. The age of modern- 
ization is singularly unsympathetic to the positive 
values of homelessness. However, in other times 
and places it has been regarded as a desirable, or 
at least admirable, way of life. 

The exaltation of homelessness as an ideal is a 
central, recurrent theme of Christianity, reflecting 
such scriptural teachings as “lay not up for your- 
self treasures on earth.” Poverty, detachment from 
previous affiliations, and the abandonment of 
worldly aspirations are the first principles of mo- 
nasticism. The Christian ethos of vagrancy was 
most fully developed by St. Francis of Assisi and 
the several orders of wandering, mendicant friars 
who followed the Franciscan model. 

A parallel theme runs through Muslim history. 
The “St. Francis of Islam” is Mulai ‘Abd al-Qadir 
al-Jilani (1078-1166), the patron of the needy and 
suffering. From the Muslim viewpoint, the beggar 
performs a kind of social service by enabling others 
to attain merit by almsgiving: “As long as Islam 
flourishes,” says one writer, “there will be beggars, 
for there will always be devout people who will 
give away all they have, and so become beggars 
themselves” (Edwards 1912, p. 931). 

In addition to friars, hermits, and holy beggars,
Christianity and Islam recognize and honor the 
pilgrim, who becomes a homeless wanderer tem- 
porarily, partly for the spiritual benefit of visiting 
the sacred shrine and partly to withdraw from set- 
tled society. In the Middle Ages pilgrims played a 
vital part in integrating both the Christian and 
Muslim worlds. The pilgrim routes to Mecca and 
Medina, to Rome and Santiago de Compostela, are 
as crowded today as ever, but the modern pilgrim 
is more often a religious tourist than a holy vagrant. 

The ethos of vagrancy is even more conspicuous 
in Buddhism. Buddha and his disciples were mem- 
bers of an already ancient order of wandering alms- 
men, and Buddha charged his disciples to “go forth 
and wander about for the good of the Many.” The 
bhikshu-sangha was (and is) a community of 
homeless men entered by “going forth from a house 
to a houseless state.” A bhikshu is an almsman. 
“He is differentiated from an ordinary beggar by 
the sacramental character of his begging. His beg- 
gary is not just a means of subsistence, but an out- 
ward token that he has renounced the world and 
all its goods and thrown himself for bare living on 
the chances of public charity” (Dutt 1962, p. 36). 

Similar themes exalting homelessness and at- 
taching a positive value to mendicancy may be dis- 
cerned in Hinduism, the religion of Zoroaster, and 
a number of Hellenistic cults. Indeed, approval and 
abhorrence of religious mendicancy may occur to- 
gether. One appeal of the world religions may lie 
precisely in their ability to combine doctrinal ele- 
ments supporting the performance of status obli- 
gations with others that allow the believer to with- 
draw from the world without penalty. 


Modern forms of homelessness 


Although the traditional types of homelessness 
may still be observed on the contemporary scene, 
they are overshadowed by new forms that arise 
directly from the dominant processes of the modern 
era: industrialization, urbanization, decolonization, 
and the redistribution of political power. The three 
types to be considered here are refugees, migratory 
workers, and skid row men. 

Refugees. Involuntary migration has occurred 
throughout historic times, but the displacements 
of the twentieth century have been unprecedented 
in scale, scope, and duration. Refugees are persons
who migrate to escape actual or threatened persecution
because of their race, religion, political convictions,
or for any reasons related to war (Proudfoot
1957, pp. 53, 446). A broader definition of
refugees includes any persons compelled to aban- 
don their homes because of events for which they 
cannot be held responsible (Vernant 1953, p. 3). 


The involuntary migration of refugees contrasts 
sharply with voluntary migration in search of eco- 
nomic opportunities. Voluntary migrants are most- 
ly young adults usually assured of a welcome in 
the host country and of continued ties with the 
country of origin. Involuntary migration often 
affects entire communities, including people of all 
ages and conditions, uprooted without their con- 
sent and set down in a place selected at random, 
where their numbers increase their helplessness. 

The wars and revolutions of the twentieth cen- 
tury have created both temporary and permanent 
refugees. The temporary refugee may flee from his 
home in the face of an invading army to return at 
the first opportunity. The permanent refugee leaves 
a country where his enemies have come to power 
with no real hope of eventual return. Sometimes 
he is resettled after an interval of hardship in a 
new community among people of his own per- 
suasion. But if no such place is available, he may 
linger indefinitely in refugee camps, living on a 
dole. One observer speaks of Heimatlosenausländer
as “ultimate refugees” and describes their loss of 
affiliations: 


Here you meet men whose environment has ceased 
to succour them. In their former normal life they had 
the solace of their homes in times of weariness or per- 
plexity, perhaps even in times of unemployment. They 
had the consolation of their church in times of spiritual 
need and of family sadness. They had their trade union 
and political association in times when work and poli- 
tics preoccupied them. They had their village inn and 
folk festivals on high days and holidays. All these props 
and cushions in life’s pilgrimage are taken from 
them.... (Rees 1960, p. 37) 


The large-scale refugee movements of the twentieth
century began with the Balkan wars of 1912-1914.
World War I displaced several million civilians,
most of whom were subsequently relocated.
Almost every major disturbance of the interwar
period generated a refugee problem, and World
War II displaced between 20 and 50 million persons;
the exact number is unknown and cannot
be accurately estimated. In the postwar period, the 
displacement of populations continued unabated in 
Europe, Asia, Africa, the Near East, the islands of 
the Pacific, the Mediterranean, and the Caribbean. 
Not without reason has this been called the “cen- 
tury of the homeless man.” Most refugees are 
ultimately resettled, but not without great cost in 
suffering and dislocation. Meanwhile, the making 
of refugees continues at a rate that reflects the 
peculiar and fanatical character of twentieth- 
century nationalism. 
Migratory farm laborers. Large-scale agriculture
often requires large amounts of hand labor for
brief seasonal periods. This situation frequently 
occurs during the transition from a plantation or 
family farm system with a fixed labor force to a 
fully mechanized system of cultivation. The transition
may extend over several decades, during which
the local labor force is incapable of planting and 
harvesting a crop without outside help. Arranging 
the appearance of the required number of workers 
at the right time and place is a perennial problem 
for the planters. Protecting the workers against ex- 
ploitation and the natural hazards of migration is 
a major problem for whatever authorities assume 
the responsibility. 

Seasonal migratory labor is required for such 
diverse products as coffee, cocoa, rubber, cotton, 
citrus fruits, wheat, apples, potatoes, and wild rice. 
The seasonal movement of migratory workers is 
significant throughout the world, often transmit- 
ting a modernizing influence from advanced to 
backward areas. The preliterate tribesmen brought 
to the coastal banana plantations of Central Ameri- 
ca, the Italian peasants cultivating sugar beets in 
France, the tribal Africans who migrate to cocoa 
plantations of Ghana, and the Mexican braceros 
picking peaches in Oregon all acquire new ideas 
and attitudes to carry back with them. 

Movements of this kind are often internal as 
well as international. In the United States in the 
middle 1960s, for example, Mexicans were admitted 
annually on a temporary basis under a farm labor 
program jointly administered by the two countries, 
and sizable numbers were drawn under other pro- 
grams from Canada, the British West Indies, Japan, 
and Puerto Rico. But the stream of migrants is also 
fed by many internal groups: Indians, mountain 
whites, Spanish Americans, and southern Negroes. 
The annual cycle begins in the South and moves 
steadily northward with the advancing season; 
some workers follow the wheat harvest all the way 
from southern Texas to Manitoba. 

Migratory laborers vary in degree of homeless- 
ness. John Steinbeck’s novel, The Grapes of Wrath, 
painted an unforgettable picture of migrant fami- 
lies displaced from their ancestral farms in Okla- 
homa during the depression and cut loose to drift 
helplessly from one temporary job to another. Even 
in times of prosperity, the United States has an 
estimated one million transients who have no fixed 
base (Shotwell 1961, p. 34); and there are many 
others who have home bases to which they return 
between seasons or when forced back by un- 
employment. 

A migrant work force may be composed of single 
men, entire families, or both. Their living and working
conditions range from adequate to wretched.
Like other homeless persons, migratory workers are 
prevented from forming or retaining normal affilia- 
tions and are subject to disproportionate hostility 
from their settled neighbors. 


Migrancy engenders community resentment, puts in 
peril such practical aspects of normal family living as 
regular schooling for children, housing that is sanitary 
and convenient and conducive to wholesome family
relationships, voting privileges, stable income, health 
and welfare services available to residents. Migrancy 
reduces to zero the chance to develop the feeling of 
belonging to a stable community. (Shotwell 1961, 
pp. 36-37) 


The farm laborer occupies a place at the very 
bottom of the socioeconomic ladder, doing the 
most work for the lowest wages with less security 
and opportunity than any other segment of the 
population. 

Solidary organizations among migrant laborers 
are rare. Attempts to unionize them, when not vio- 
lently suppressed, have usually failed. However, a 
loose form of crew organization often develops, 
whereby a group of individuals or families is repre- 
sented by a leader who negotiates with employers 
and arranges transportation and lodging. 

The impact of this form of homelessness on in- 
dividuals is mitigated by its temporary character. 
The available evidence suggests that most migra- 
tory workers return to a settled life at the first 
opportunity, often in an area they have discovered 
as migrants. 

The skid row man. In proportion to their num- 
bers, skid row men in the United States and 
comparable homeless men in western Europe have 
received disproportionate attention from both scien- 
tific and literary observers. In a study using 1950 
census data, Bogue (1963) identified skid row 
areas in 41 American cities and estimated their 
combined homeless population at under 100,000. 
The individual settlements are correspondingly 
small. Careful enumerations in the New York 
Bowery, one of the most conspicuous skid row 
areas in the world, disclosed less than seven thou- 
sand homeless men in 1964 and 1965 (Baker 
1965, p. 7). 

The name “skid row” (or “skid road”) appar- 
ently derives from the log skidways on which felled 
timber was transported in the forests of the Pacific 
northwest. It came to be applied to the Seattle area 
where lumbermen wintered and then to similar 
enclaves in other cities where single homeless men 
were housed in dormitories and lodging houses, sur- 
rounded by facilities that served their special needs. 


Most of these enclaves, in their present locations, 
can be traced back about a century. In recent dec- 
ades, their populations have been declining, al- 
though irregularly; and the migratory workers 
identified as the principal inhabitants of “hobohemia”
after World War I (Anderson 1923) have
given way to the practically immobile pensioners, 
alcoholics, and disadvantaged workers who make 
up the bulk of the population today. 

The interest aroused by skid row may be ex- 
plained partly by its unique institutions, partly by 
its high visibility, partly by the resistance of the 
skid row alcoholic to ameliorative programs, and 
partly by a life style that deviates dramatically 
from the success-oriented, family-centered ethos of 
the surrounding society. The visible facilities are 
cubicle lodging houses, cheap restaurants and bars, 
missions, barber colleges, secondhand stores and 
pawnshops, newspaper reading rooms, sidewalk 
gatherings, all-night movies and burlesque shows, 
tattoo parlors, “jungles” near railroad yards, and 
the “revolving door” of police court and workhouse. 
In several major cities the original area has been 
cleared in the course of urban redevelopment, but 
substitute skid rows have invariably appeared. 

Ethnic and occupational characteristics vary 
somewhat from one skid row to another, but the 
population is almost exclusively male. Their average 
age is much higher than that of the general popu- 
lation; a large proportion are past the usual age of 
retirement (Bogue 1963, pp. 8-10). About a third 
of the men are heavy drinkers, frequently arrested 
for drunkenness and apparently addicted to alco- 
hol, although there is some doubt about their being 
true alcoholics. The remainder are divided between 
social drinkers, whose need for liquor is apparently 
controllable, and moderate drinkers or abstainers 
(Bogue 1963, pp. 90-93). Most report themselves 
unmarried and without active family ties. Their 
histories show less than average involvement in 
family relationships from early childhood on (Pitt- 
man & Gordon 1958, pp. 10, 125-138). Their or- 
ganizational affiliations have often been slight 
throughout adult life, reflecting a permanent state 
of social detachment. The typical skid row man has 
a background of intermittent unskilled or semi- 
skilled employment, with minimal involvement in 
political, religious, and recreational organizations 
(Bahr & Langfur 1966). 

Morbidity and mortality rates are startlingly 
high. The risks of contagious disease, infection, 
exposure, injury, and malnutrition are greater on 
skid row than in any other urban environment 
(Bogue 1963, pp. 199-230). Most men live close 
to the margin of bare subsistence on sporadic earnings,
pensions, relief payments, begging, or some
combination of these. A few depend on savings or 
assistance from relatives; a few enjoy regular in- 
comes and remain on skid row by preference 
(Bogue 1963, pp. 13-17; 46-77). 

In addition to the core population, the skid row 
area provides temporary accommodation for migra- 
tory workers and work seekers, eccentrics, and 
fugitives of various kinds. Some of these men are 
not strictly homeless, and some homeless men 
have no contact with skid row. However, the pre- 
dominant type is unmistakable. 

The principal current trends in the status of 
skid row inhabitants are decreased mobility and a 
decline of relative income. There is, however, a 
rising provision of public assistance for them, and 
experimental programs are being undertaken in the 
rehabilitation of alcoholics. 


THEODORE CAPLOW, HOWARD M. BAHR,
AND DAVID STERNBERG 


[Directly related is the entry REFUGEES. Other relevant 
material may be found in DRINKING AND ALCOHOL- 
ISM, article on SOCIAL ASPECTS; MIGRATION.] 
HOMEOSTASIS


The concept of homeostasis is widely used, in 
physiology and psychology, to identify what seems 
to be a general attribute of living organisms: the 
tendency to maintain and restore certain steady 
states or conditions of the organism. An obvious 
example is that of body temperature, which in the 
human tends to fluctuate only in a narrow range 
about the value 98.6° F. When the temperature 
rises above the normal range, corrective reflexes 
(perspiration, reduced metabolism, etc.) go into 
action to restore the steady state. Persistent devia- 
tion may initiate other actions (moving into the 
shade, plunging into water, etc.). If body temperature
drops, other corrective actions are observed.

Many bodily steady states follow this pattern. 
Blood glucose level, blood pH, and osmotic pres- 
sure are examples. The key concepts are: an 
observable steady state that persists over time with 
minor changes; thresholds above and below this
normal range; a sensory input that reports changes 
in the steady state; and effector mechanisms for 
restoring the steady state. 

When a deviation goes beyond either the upper 
or the lower threshold, energy is mobilized to re- 
store the steady state to its optimal value. Physiol- 
ogists have been concerned mainly with the 
reflexes triggered by such deviations, but psychol- 
ogists have emphasized those homeostatic actions 
that are seen in learned behavior. Man will exert 
considerable energy to protect optimal states. He 
may take restorative action (building a fire when 
cold) or forestalling action (moving south before 
winter arrives). The simple reflex level and the 
complex learned response to homeostatic disturb- 
ance are often labeled differently: Stagner and 
Karwoski (1952) called the former “static homeo- 
stasis” and the latter “dynamic homeostasis”; Cofer 
and Appley (1964) used the terms “physiological 
homeostasis” and “behavioral homeostasis.” 

The biochemical and reflex defenses function 
adequately to protect some constancies; if blood 
osmotic pressure drops too low or vitamin concen- 
tration is too high, kidney mechanisms correct the 
situation. In other cases, the reflex machinery may
fail, and learned behavior is activated. There are 
probably two thresholds in the system, one for 
reflexive response and another (further from optimum)
that initiates voluntary action.

Damaging the biochemical mechanism forces 
increased reliance on the learned systems of de- 
fense. Thyroidectomized rats build nests to pro- 
tect against heat loss; parathyroidectomy or adren- 
alectomy leads to increased drinking of solutions 
containing calcium or sodium. It seems reasonable, 
therefore, to consider the cerebral cortex as the 
highest level of a homeostatic protective mecha- 
nism. 

Heterostasis. The concept of heterostasis must 
be introduced into any systematic discussion of 
homeostasis. Action to restore one steady state may 
disrupt another steady state (perspiration to cool 
the body upsets the osmotic equilibrium in the 
blood, triggering thirst). Heterostasis points to a
hierarchy of steady states; deviation of an “impor- 
tant” value will initiate action that disrupts one 
lower in the scale, but the reverse is not true. 
Thirst dominates over hunger, and the need for 
oxygen is prepotent over either. Deviations from
optimum dominate behavior as a function of their 
relevance to organismic survival. Oxygen lack can 
be tolerated for only a few minutes, water lack for 
a day or two, food lack for many days. 


Sensory controls. Homeostasis depends upon a 
negative feedback loop; disturbance of a steady 
state initiates sensory cues that trigger responses
tending to restore the previous state; and as the 
sensory input indicates that the optimal value is 
being approached, the intensity of corrective action 
diminishes. Each steady state must have built into 
its control system, in neural or biochemical form, 
some representation of the tolerable range of val- 
ues and upper and lower thresholds. However, all 
such monitoring systems probably report to a cen- 
tral mechanism, giving rise to something like Mor- 
gan’s (1957) “central motive state,” a state of 
tension that elicits dynamic homeostatic responses. 
Tension is thus an important intervening variable 
in homeostatic theory. 
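
The loop described above can be sketched in a few lines of Python. This is an illustrative toy model only, not a physiological one: the function names, the band widths (0.5 and 2.0 degrees), and the proportional gain are assumptions introduced here. Only the 98.6° F. optimum and the idea of a nearer reflexive threshold and a farther voluntary threshold are taken from the text.

```python
def corrective_action(value, optimum, reflex_band, voluntary_band, gain=0.5):
    """One cycle of a negative feedback loop.

    Returns (correction, mode). The band widths and gain are hypothetical
    parameters chosen for illustration.
    """
    deviation = value - optimum
    size = abs(deviation)
    if size <= reflex_band:
        # Inside the tolerable range: no corrective action is triggered.
        return 0.0, "none"
    # A larger deviation crosses the second threshold and calls on
    # voluntary (learned) action rather than reflex alone.
    mode = "voluntary" if size > voluntary_band else "reflex"
    # The correction opposes the deviation and is proportional to it,
    # so its intensity diminishes as the optimum is approached.
    return -gain * deviation, mode


def simulate(start, optimum=98.6, reflex_band=0.5, voluntary_band=2.0, steps=20):
    """Apply the loop repeatedly and record (value, mode) at each step."""
    value = start
    history = []
    for _ in range(steps):
        correction, mode = corrective_action(
            value, optimum, reflex_band, voluntary_band
        )
        history.append((value, mode))
        value += correction
    return history


if __name__ == "__main__":
    for value, mode in simulate(102.6, steps=6):
        print(f"{value:7.2f}  {mode}")
```

In this sketch a large disturbance first engages the "voluntary" mode, then the "reflex" mode, and finally no action at all once the value is back inside the tolerable range.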

Some types of stimuli have inherent equilibrium- 
disturbing or equilibrium-restoring properties. 
Sweet tastes not only signal reduction of hunger 
for infants; they also seem to have general tension- 
reducing value. Pain disturbs equilibrium and 
raises tension. Soft, warm contact may be a uni- 
versal tension reducer for mammals (cf. Harlow 
1958). This need not imply a one-to-one corre- 
spondence between pleasure or “subjective utility” 
and homeostatic value in a goal object. For exam- 
ple, the organism may make mistakes (saccharin 
is sought by the organism because it is misper- 
ceived as equilibrium restoring); and learning may 
cause the organism to value objects that are purely 
symbolic and not in themselves effective either to 
disturb or to restore steady states [see LEARNING, 
article on REINFORCEMENT]. 

Imprinting. Imprinting may be a determiner 
of cue value. When a gosling first hatches, it seems 
to fixate on the first moving object and endow 
this with positive valence; the gosling will follow 
this object and behave as if a steady state were 
disturbed when the object disappears [see IM- 
PRINTING]. 

Adaptation level. Adaptation level modifies the 
sensory controls in homeostatic cycles. As Cannon 
(1932) pointed out, the “baseline” (herein referred 
to as optimal value) may shift as a result of ex- 
perience. Infants learn to adapt to a two-hour, 
three-hour, or four-hour feeding schedule and become
“hungry” at the proper time. If subjected to
radio and television noises, they show distress at 
first but later may seem to enjoy the sound level 
(Chodorkoff 1960). Europeans find room temperatures
comfortable that Americans experience as
chilly and disturbing. 

Effector processes. The effector mechanisms set 
off by homeostatic disturbances vary from simple
biochemical buffering of the blood stream, through
hormonal modifications and autonomic nervous 
system reflex changes, to complex and elaborate 
learned responses. As Bernard (1859) noted, 
homeostasis in its simplest form involves protec- 
tion of constancies of body fluids. Life escaped 
from the sea when techniques evolved for carrying 
the sea (as cell environment) within the organism. 
He wrote that internal constancies are basic to 
the free life—free from the limitations of an 
oceanic environment, free to live on land and in 
the air. 

The more complex activities described above 
expand this conception markedly. First, the organ- 
ism may remove itself from disturbing stimuli 
(avoid heat, seek shade). Second, the organism 
may restore the constancy of the external environ- 
ment (air conditioning, stores of food). There 
seems to be general agreement that these two kinds 
of responses, while remote from biochemical se- 
cretion or mechanical reflexes, qualify as homeo- 
static in nature. Third, the organism may seek to 
protect (or restore) symbolic conditions associated 
with homeostatic success—bank accounts, institu- 
tional systems, and the like. There is less agree- 
ment that these phenomena belong under the 
homeostatic rubric, yet it is difficult to find a logical 
rule by which to exclude them. 

Evaluation of homeostatic theory. Critics have 
objected to homeostatic theory as being too con- 
servative, as implying that motivation is conceived 
solely as operating to restore pre-existing condi- 
tions. In a very narrow sense this criticism is true: 
unless the essential steady states are restored to 
their normal range, the organism dies. (It is also 
true that most people are conservative unless de- 
prived.) In a broader sense, homeostatic theory 
says that energy is mobilized to take action that 
will restore and protect these steady states, but 
that the action may be novel and inventive. Fire, 
clothing, and other inventions serve homeostatic 
uses. The individual, frustrated by inadequate 
habits, may acquire new ones which will reduce 
tensions. 

The assertion that all energy mobilizations de- 
rive from disturbances of steady states does en- 
counter problems. The phenomena of curiosity, 
manipulative motivation, and sensory deprivation, 
as well as the sex drive, offer difficulties. Some 
psychologists have preferred to treat homeostasis 
as one segment of the topic of motivation. Those 
who prefer to press for an inclusive view, in which 
all motivation obeys homeostatic principles, resort 
to assumptions such as efficient organismic functioning
“demanding” a certain level of sensory
stimulation, just as it “demands” a certain level of 
glucose or vitamin B. “Sensory deprivation” may 
be motivating because the central nervous system 
has a normal activity level that is disrupted by 
either an excess or a deficit of external stimulation 
[see PERCEPTION, article on PERCEPTUAL DEPRIVA- 
TION; STIMULATION DRIVES; see also Hebb 1955; 
Miller 1960].

Aspiration level studies can also be incorporated 
into this schema. Motivation, in the sense of effort 
directed toward a specific goal, obeys aspiration 
principles. An incentive below present attainment 
levels is ignored; one that is too far above present 
achievement is perceived as impossible. Effective 
incentives, therefore, must fall within the “normal 
range” as a function of adaptation level. Repeated 
successes lead to a shift upward, failures to a shift 
downward, in aspiration level. Prior experience 
determines what will be anticipated [see ACHIEVE- 
MENT MOTIVATION]. 

Homeostatic conception of personality. Person- 
alities can be described by the hierarchy of steady 
states that are defended, by the cues that disturb 
these states, and by the techniques utilized to pro- 
tect and restore equilibriums (Stagner 1961). 

Preferred steady states. Some individuals value 
physical comfort; others, a quiet family life; others, 
a buzz of activity. Variations in adaptation or aspi- 
ration level may be involved. Objects that symbo- 
lize internal equilibrium, such as the mother, a 
house, or a bank account, become values protected 
in themselves. The self-image, the individual's 
percept of himself, becomes an object of value. 
Threats to the self-image (failure, social ridicule, 
loss of status) provoke blood pressure and glandu- 
lar changes as well as vigorous action to restore 
the integrity of the ego. 

Sources of disturbance. Some persons manifest 
phobias and other disturbances that are obviously 
not realistic. Neurotic aggressions and anxieties fit 
into this category. Others are disturbed only by 
realistic threats to physical constancies or to ac- 
quired values. 

Methods of restoring equilibriums. The indi- 
vidual may rely chiefly upon overt muscular action 
to protect and restore valued constancies, or he 
may utilize symbols and fantasy. The whole range 
of coping and defensive reactions indicates indi- 
vidual differences in ways of maintaining steady 
states. 

Personality defense mechanisms. Sigmund Freud 
has often been cited as a forerunner of homeo- 
static theory, although he never discussed negative 
feedback loops (Fletcher 1942; Walker 1956). His
conception of repression, projection, dreams, ra- 
tionalization, and other mechanisms as devices for 
protecting the individual against threatened dis- 
turbance, or devices for restoring equilibrium, 
places him in this category [see DEFENSE MECHAN- 
ISMS]. 

When anxiety and tension are aroused, infantile 
ways of restoring equilibrium may be reinstated. 
Eating, drinking, and sexual contact are primitive 
devices for reducing tension. Unfortunately, if the 
steady state that has been disturbed involves a 
need for other substances, these defenses have only 
temporary effect; the tension returns promptly. 
Neurotic behavior, in homeostatic terms, is be- 
havior that diminishes tension without correcting 
the basic imbalance (Menninger et al. 1963). 

One must note that some psychologists take 
issue vigorously with attempts to conceptualize 
personality in homeostatic terms. Allport (1955) 
asserts that man can be understood more by what 
he is striving for in the future than by what he is 
seeking to restore of the past. 

Homeostasis as a social phenomenon. Cannon 
(1932) noted that the concept of homeostasis 
seemed relevant also to social processes. One such 
implication derives from the process of mutual 
assistance (Dempsey 1951). There is some evidence
(Chodorkoff 1960) that the mother-infant
relationship is a true dyadic system, in which each 
organism can be a cue disturbing the other, and 
in which the behavior of one may be equilibrium 
restoring for the other. Most convincing is the 
fact that social conditions can trigger the same 
reflexive mechanisms involved in physiological 
homeostasis (cf. Denenberg et al. 1964; Hoagland 
1964). Hoagland, for example, reported that rats 
reared in overcrowded cages develop glandular dis- 
orders and die at earlier ages than less crowded 
littermates. These animals show distinct evidences 
of tension and behavioral disturbance. Hoagland 
also commented that equilibrium develops at a given 
population density. Below this point, reproduction 
is usually accelerated; but above the equilibrium 
value, infertility and early death reduce the popu- 
lation. Thus he is disposed to favor the view that 
homeostatic functions are true group processes, 
not merely manifestations of individual disturb- 
ance. Allee et al. (1949) have pointed out that 
ecological balance often shows all the properties 
ascribed to homeostatic equilibrium. 

To apply the concept of homeostasis to even 
more complex social phenomena, such as the market
economy, one must identify the steady state,
the factors disrupting this state, the cues indicating
disturbance, and the restorative mechanisms.
Equilibrium theory in economics seems to meet 
these requirements. In a specific economic market, 
for example, clothing, one finds a tendency for the 
price to stabilize. An increase in consumer demand 
will upset the existing steady state; information 
spreads by way of efforts to purchase; producers 
in the clothing industry scent possibilities for profit 
and increase output accordingly. Thus there is a 
feedback loop analogous to that observed in indi- 
viduals. 
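
The market feedback loop can be illustrated with a minimal Python sketch. Everything numeric here is an assumption made for the example: the linear demand and supply curves, their slopes, and the adjustment rate are hypothetical, and price adjustment in proportion to excess demand is only one simple way of modeling the process the text describes.

```python
def excess_demand(price, demand_level, demand_slope=1.0, supply_slope=2.0):
    """Demand minus supply at a given price (hypothetical linear curves)."""
    demand = demand_level - demand_slope * price
    supply = supply_slope * price
    return demand - supply


def settle_price(price, demand_level, rate=0.1, steps=200):
    """Negative feedback: unmet demand pushes the price up, which draws out
    more supply and trims demand until the disturbance dies away."""
    for _ in range(steps):
        price += rate * excess_demand(price, demand_level)
    return price


if __name__ == "__main__":
    steady = settle_price(5.0, demand_level=30.0)
    print(round(steady, 3))
    # An increase in consumer demand upsets the steady state;
    # the same loop then finds a new, higher stable price.
    print(round(settle_price(steady, demand_level=36.0), 3))
```

With these assumed curves the price settles where demand equals supply, and a rise in demand moves the system to a new steady state rather than leaving it permanently disturbed.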

Failures of homeostasis. One criticism of ho- 
meostatic theory is that humans often do things 
that disrupt, rather than restore, equilibrium. To 
some extent this is merely a function of errors in 
informational input or in guiding action. A young 
lion who charges his prey from too great a dis- 
tance is not thereby proved to be disobeying ho- 
meostatic principles; he simply has not learned 
effective responses. 

In some cases homeostasis operates inefficiently 
at the individual level because of inadequate sen- 
sory controls. Man is not endowed with receptors 
sensitive to gamma radiation, so, without warning, 
he may die of fall-out. If the species survives, 
mutation may provide such sense organs. 

Group phenomena also show catastrophic effects 
of poor information input. War between nations 
has frequently come about because the leaders of 
one nation did not accurately perceive the situation 
(for example, in 1939 Hitler thought England 
would not fight over Poland). Disputes between
a union and management can be settled more 
amicably if information flows freely between the 
two (Muench 1960). 

These illustrations do not prove that the homeostatic
model explains group action; no such logical
proof exists at this time. Nevertheless, the homeo- 
static model has been a fruitful one in pointing 
to areas for research and in suggesting genotypic 
unities that cut across phenotypic discontinuities, 
just as the theory of gravitation cut across events 
from atoms to galaxies. Consequently, homeostasis 
has been a healthy influence in psychology and in 
the other social sciences.

ROSS STAGNER


[Directly related are the entries DRIVES; MOTIVATION.
Other relevant material may be found in CYBER- 
NETICS; STRESS; and in the biography of CANNON.] 


BIBLIOGRAPHY 
An extensive and scholarly discussion of homeostatic 
theories may be found in Cofer & Appley 1964. The homeostatic
conception of personality is supported in Stagner
1961, and the opposing view is well stated in Allport 1955.
Applications to social groups are proposed in Cannon 1932 
and 1941 and to industry in Stagner 1956. 


ALLEE, W. C. et al. 1949 Principles of Animal Ecology. 
Philadelphia: Saunders. 

ALLPORT, GORDON W. 1955 Becoming: Basic Considera- 
tions for a Psychology of Personality. New Haven: 
Yale Univ. Press. 

BERNARD, CLAUDE 1859 Leçons sur les propriétés phy- 
siologiques et les altérations pathologiques des liquides 
de l'organisme. 2 vols. Paris: Baillière. 

CANNON, WALTER B. (1932) 1963 The Wisdom of the 
Body. Rev. & enl. ed. New York: Norton. 

CANNON, WALTER B. 1941 The Body Physiologic and 
the Body Politic. Science 93:1-10. 

CHODORKOFF, JOAN R. 1960 Infant Development as a
Function of Mother-Child Interaction. Ph.D. disserta- 
tion, Wayne State Univ. 

COFER, C. N.; and APPLEY, M. H. 1964 Homeostatic
Concepts and Motivation. Pages 302-365 in C. N. 
Cofer and M. H. Appley, Motivation: Theory and Re- 
search. New York: Wiley. 

DEMPSEY, EDWARD W. 1951 Homeostasis. Pages 209-
235 in S. S. Stevens (editor), Handbook of Experi- 
mental Psychology. New York: Wiley. 

DENENBERG, V. H.; HUDGENS, G. A.; and ZARROW, M. X.
1964 Mice Reared With Rats: Modification of Be- 
havior by Early Experience With Another Species. 
Science 143:380-381. 

FLETCHER, JOHN M. 1942 Homeostasis as an Explana-
tory Principle in Psychology. Psychological Review 
49: 80-87. 

HARLOW, HARRY F. 1958 The Nature of Love. American
Psychologist 13:673-685. 

HEBB, DONALD O. 1955 Drives and the C.N.S. (Concep-
tual Nervous System). Psychological Review 62:243- 
254. 

HOAGLAND, HUDSON 1964 Cybernetics of Population
Control. Bulletin of the Atomic Scientists 20, no. 2: 
2-6. 

MASLOW, A. H. 1937 The Influence of Familiarization
on Preference. Journal of Experimental Psychology 
21:162-180. 

MENNINGER, KARL; MAYMAN, MARTIN; and PRUYSER, PAUL 
1963 The Vital Balance: The Life Processes in Men- 
tal Health and Illness. New York: Viking. 

MILLER, JAMES G. 1960 Information Input Overload
and Psychopathology. American Journal of Psychiatry 
116:695-704. 

MORGAN, CLIFFORD T. 1957 Physiological Mechanisms
of Motivation. Volume 5, pages 1-35 in Nebraska 
Symposium on Motivation. Edited by Marshall R. 
Jones. Lincoln: Univ. of Nebraska Press. 

MUENCH, GEORGE A. 1960 A Clinical Psychologist’s
Treatment of Labor-Management Conflicts. Personnel
Psychology 13:165-172.

SOUTHWICK, CHARLES H. 1964 Peromyscus Leucopus:
An Interesting Subject for Studies of Socially Induced
Stress Responses. Science 143:55-56.

STAGNER, ROSS 1956 Psychology of Industrial Conflict.
New York: Wiley. → See especially pages 89-116,
“Motivation: Principles,” and pages 117-154, “Moti- 
vation: Some Applications.” 

STAGNER, ROSS 1961 The Nature of Personality Struc-
ture. Pages 69-86 in Ross Stagner, Psychology of Per- 
sonality. 3d ed. New York: McGraw-Hill. 




STAGNER, ROSS; and KARWOSKI, T. F. 1952 Psychology.
New York: McGraw-Hill.

WALKER, NIGEL 1956 Freud and Homeostasis. British
Journal for the Philosophy of Science 7:61-72.


HOMICIDE 
See under CRIME. 
HONESTY
See MORAL DEVELOPMENT. 


HONOR 


The notion of honor has several facets. It is a 
sentiment, a manifestation of this sentiment in 
conduct, and the evaluation of this conduct by 
others, that is to say, reputation. It is both internal 
to the individual and external to him—a matter 
of his feelings, his behavior, and the treatment 
that he receives. Many authors have stressed one 
of these facets at the expense of the others; how- 
ever, from the point of view of the social sciences 
it is essential to bear in mind that honor is simul- 
taneously all of these, for both its psychological 
and social functions relate to the fact that it stands 
as a mediator between individual aspirations and 
the judgment of society. 

Like the other self-regarding sentiments, honor 
expresses an evaluation of self in the terms which 
are used to evaluate others—or as others might be 
imagined to judge one. It can, therefore, be seen 
to reflect the values of the group with which a 
person identifies himself. But honor as a fact, 
rather than as a sentiment, refers not merely to 
the judgment of others but to their behavior. The 
facets of honor may be viewed as related in the 
following way: honor felt becomes honor claimed, 
and honor claimed becomes honor paid. The pay- 
ment of honor involves the expression of respect 
which is due to a person either by virtue of his 
role on a particular occasion, as when a guest is 
honored in accordance with the laws of hospitality, 
or by virtue of his status or rank, which entitles
him to a permanent right to precedence marked by 
honorific insignia, expressed in modes of address 
and titles and demonstrated in deference. Honor is 
also exchanged in mutual recognition: in saluta- 
tions and the return of invitations and favors. 

The same principles that govern the transac- 
tions of honor are present in those of dishonor, 
though in reverse: the withdrawal of respect dishonors,
since it implies a rejection of the claim to
honor and this inspires the sentiment of shame. 
To be put to shame is to be denied honor, and it 
follows that this can only be done to those who 
have some pretension to it. He who makes no such 
claim has nothing to lose; he cannot be denied 
precedence if he prefers to go last. Those who 
aspire to no honor cannot be humiliated. Honor 
and dishonor, therefore, provide the currency in 
which people compete for reputation and the 
means whereby their appraisal of themselves can 
be validated and integrated into the social system 
—or rejected, thus obliging them to revise it. 
Hence, honor is not only the internalization of the 
values of society in the individual but the external- 
ization of his self-image in the world. 

The sentiment of honor seeks validation, but 
from what quarter? From God or from the con- 
science of the person himself? From friends or 
from kinsmen? From persons in authority or from 
the crowd? Public opinion, allegedly the arbiter of 
reputations, arbitrates with anything but a firm 
hand, since it varies according to the activity and 
the context. Consensus is not easily established 
in a complex society; individual views differ, and 
different groups have different standards. The sig- 
nificance of the acts of public honor and the grant- 
ing of dignities is, therefore, this: they place the 
seal of public recognition on reputations that would 
otherwise stand in doubt and endow them with 
permanence. It is the function of authority to impose
consensus, and it does this with regard to
the worth of persons: it converts prestige into
status.

Theories of honor have varied greatly as to the 
relative importance which they accord to different 
qualities, and this is due to the different social 
contexts and reference groups from which they 
derive. 

Honor as a moral concept. Honor is commonly 
considered by moral philosophers to be a state of 
the individual conscience and, as such, equivalent 
to the absence of self-reproach. It relates to in- 
tentions rather than to the objective consequences 
of action, and a man is therefore said to be the 
only judge of his own honor. If he knows his in- 
tentions to be “above reproach,” then he is in- 
different to the comments of others, who cannot 
evaluate the quality of his motives. He is com- 
mitted by his honor to the fulfillment of duties 
that are recognized as attaching to social roles. 
The casuists recognized honor as a personal re- 
sponsibility and admitted the defense of honor as 
a licit form of self-defense which could excuse 
actions that would otherwise be sinful. Nevertheless,
the churches have always considered that, in
the evaluation of his own motives, a man is bound 
to refer to their authority, which claims for itself 
the right to define honor in terms of religious 
virtue. 

Somewhat different is the view of honor which 
derives it from civic virtue, for here it is ratified 
not by religious doctrine but by the market place. 
Popular acclaim comes in to qualify, if not to 
guide, the judgments of the individual conscience, 
and honor becomes the reputation for virtue. (This 
was Aristotle’s view.) But once the notion of 
reputation is admitted as a constituent of honor, 
its value as a purely moral concept faces ambiguities:
a potential conflict emerges between the
dictates of conscience and the facts of recognition. 
This is brought to the forefront by those moralists 
(not by any means confined to churchmen) who 
have upbraided the code of honor of their day from 
the standpoint of morality. Public opinion cannot 
be trusted to confirm even the most modest man’s 
claim to civic virtues or even to accord reputation 
on that basis at all. It is apt to give its applause to 
more spectacular qualities and to pay honor to 
other sorts of excellence. With regard to dishonor, 
it is yet more capricious, for here its great weapon 
is ridicule, which seldom employs a moral criterion 
at all, but destroys reputations on the grounds of 
a man’s pretentiousness, foolishness, or misfortune,
not his wickedness. As Molière observed,
one would rather be Tartuffe than Orgon. Public 
opinion, in its sympathy for the successful, betrays 
the notion of honor as a purely moral concept. 

Honor as precedence. In contrast to the moral 
view of honor, which relates it to merit—whether 
religious, civic, or professional—other writers have 
insisted upon its factual aspect. For example, 
Hobbes (Leviathan, chapter 10) saw no more in 
honor than the achievement of precedence and the 
competition for worldly honors. Honor, in his view, 
is not a matter of sentiment and aspirations (since 
all men would like to be honored), but one of in- 
dividual preferment, to the attainment of which 
virtue is quite irrelevant. 

It is, moreover, from this aspect that the word 
honor derives etymologically: it first applied to 
grants of land or the privilege of levying taxes 
ceded by a sovereign to his eminent servants and 
supporters. Royal favor was, therefore, not only the 
“fountain of honor,” as in the view of many later 
political theorists, but literally the origin of the 
word. As might be expected, honor was particularly 
to be earned through military prowess, and orig- 
inally, it should be noted, it conceded not only 
social dignity but economic advantage as well. 


Honor in this sense accorded precedence. This, 
however, is not only conferred by royal statute but 
derives from social interaction at every level in a 
society; the claim to honor depends always, in the 
last resort, upon the ability of the claimant to im- 
pose himself. Might is the basis of right to prece- 
dence, which goes to the man who is bold enough 
to enforce his claim, regardless of what may be 
thought of his merits. 

Worldly honor validates itself by an appeal to 
the facts, submitting always to the reality of power, 
whether military, political, social, or economic and 
whether it rests upon the consensus of a commu- 
nity, the favor of superiors, or the control of sanc- 
tions. For this reason courage is the sine qua non 
of honor, and cowardice is always its converse. This 
fact is inherent in its nature and not merely the 
heritage of a class which once earned its status, or 
so it claimed, through feats of arms. The mottoes 
of the nobility (which may serve as a gloss upon 
their conception of honor) commonly emphasize 
not only the claim to prestige and status but, above 
all, the moral quality necessary to win and to retain 
them; when “all is lost save honor,” this at least 
has been preserved, and while this is so, a return 
of fortune is always possible. Willingness to stand 
up to opposition is essential to the acquisition, as 
to the defense, of honor, regardless of the mode 
of action that is adopted. No amount of moral justi- 
fication validates the honor of a coward, even where 
retaliation to an affront is ruled out on moral 
grounds; to turn the other cheek is not the same 
thing as to hide it. Indeed, in terms of the code of 
honor, turning the other cheek is simply a means 
of demonstrating contempt. Christian forgiveness 
cloaks a claim to superiority that cannot but irri- 
tate the sensitivities of the forgiven, for it not only 
implies disdain, it attempts to alter the rules where- 
by honor is achieved or lost. 

However well courage may promote the claims 
of the courageous, the fact of triumph is ultimately 
what counts. The distribution of the spoils is the 
privilege of the victor; where claims to precedence 
conflict, the decision goes to the big battalions. It 
is the fact of precedence that establishes the right 
to command and the privilege of speaking first or 
last. In this sense, therefore, honor and leadership 
imply one another, for both are subject to the reality
of power.

Honor as a personal attribute. Since honor is 
felt as well as demonstrated, it is allied to the con- 
ception of the self in the most intimate ways. It 
is a state of grace. It is liable to defilement. It is 
linked to the physical person in terms of the sym- 
bolic functions attached to the body: to the blood, 
the heart, the hand, the head, and the genitalia.
Honor is inherited through the “blood,” and the 
shedding of blood has a specially honorific value in 
transactions of honor—the stains of honor, it was 
said, could be cleansed only by blood. The heart 
is the symbol of sincerity, since it is thought of—in 
the European tradition at least—as the seat of the 
intentions and therefore the home of the true self, 
which lies behind all worldly disguises. The right 
hand is the purveyor of honor: it touches; it shakes 
or is shaken; it is kissed or waved; it wins honor, 
for it wields the sword and pulls the trigger. The 
head is the representation of the self in social life, 
that by which a man is recognized, that which is 
placed in effigy on coins, and that which is touched
in salute, crowned, covered or uncovered, bowed, 
or shorn. The private parts are the seat of shame, 
vulnerable to the public view and represented sym- 
bolically in the gestures and verbal expressions of 
desecration. In their association with the excretory 
function they are the source of pollution, yet as 
the means of procreation they are intimately con- 
nected with honor, for they signify the extension of 
the self in time. Sexual purity is, therefore, often 
regarded as the essence of honor in women, whose 
feminine status precludes their striving for it by 
might. The body as a whole is especially associated 
with honor, since physical contact implies intimacy 
and makes explicit the honorific relationship with 
another, whether to express attachment, obeisance, 
or contempt. 

Because honor centers in the physical being, a 
person’s presence bears a particular significance, 
as Simmel observed when he spoke of a sacred aura 
surrounding each individual. Whatever occurs in 
a person's presence obliges him to react in one way 
or another, positively or negatively, for he cannot 
hide his cognizance of it: he is inescapably a party 
to what he witnesses, and his will is thereby com- 
mitted. This is important because the essence of 
the social person comprises his will and (as the 
moralists understood, though in a different context)
his intentions. Hence, apologies for an affront 
normally take the form of denying the intention, 
thereby making the affront in a sense fortuitous 
because not willed. The true affront to honor must 
be intended as such. For the same reason oaths are 
binding only if freely sworn and, like the rites of 
the church, are invalid if devoid of good intention. 
The oath commits the honor of the swearer by 
guaranteeing his intentions, for it is not dishonor- 
ing to deceive another man, only to “break faith” — 
that is to say, to rescind an established commit- 
ment, for this implies cowardice. This is under- 
lined by the mottoes of the aristocracy (so often 
devoted to the theme of steadfastness), the broken
weapons which clutter up their arms, and the de- 
fiant invitations to fate to put them to the test. 

This emphasis on intention or will marks an es- 
sential point: the essence of honor is personal 
autonomy. All men are bound by certain irrevocable 
ties, but the man of honor cannot otherwise be 
committed; he can only commit himself. To be 
forced by whatever circumstance to revoke his in- 
tentions once they are committed is to abnegate 
his personal autonomy. Thus, the concept of honor 
is tied to precedence, for to command over others 
enhances it, while obedience restricts it.

Collective honor. The group possesses collec- 
tive honor, in which the individual members par- 
ticipate. It affects their honor and is affected by 
their behavior. The honor of a collectivity is vested 
in its head and in symbolic representations; flags, 
crests, coats of arms, badges, uniforms, and all the 
insignia whereby members of the group are rec- 
ognized are the objects of a greater or lesser de- 
gree of reverence and are treated as though they 
possess honor in themselves. In the anthropological 
language of an earlier generation, such objects 
have mana, a concept that also expresses the idea 
of honor when it is applied to individuals or to parts 
of their person. 

While some types of groups are joined volun- 
tarily and may be left in the same way (or may 
expel one of their number), membership in other 
groups is ascribed at birth. In the latter type of 
group, honor is bound up integrally, for the group 
defines a person’s essential nature. The family (and 
in some societies the kin group) and the nation are 
the most fundamental of these collectivities, and 
thus traitors to their fathers or their sovereigns are 
the most execrable of all. (Parricide and regicide 
are sacrilegious but homicide is not.) The family 
is the repository of personal honor, for honor is 
hereditary, not merely in its aspect as social status 
but also with regard to the moral qualities which 
attach to it. Therefore, the dishonor cast on one 
member is felt by all. 

Male and female honor are clearly differentiated 
with regard to conduct. A moral division of labor 
operates within the family, especially in the Latin
countries: the aspect of honor as precedence be-
comes, according to this system of values, the pre-
rogative of the male, while honor as sexual purity
is restricted to the female. Hence, sexual conquest 
enhances the prestige of men; sexual liberty defiles 
the honor of women. (Congruently, a high value 
is attached to virginity in unmarried girls.) The 
defense of female purity, however, is a male re- 
sponsibility, and men are therefore vulnerable to 
dishonor not through their own sexual misconduct 
but through that of their womenfolk—that is to 
say, members of the same nuclear family, includ- 
ing mother, wife, unmarried sister, and daughter. 
Hence, sexual insults that impugn the honor of 
men refer not to them but to their women. 

The aspect of honor that is associated with so- 
cial status descends preferentially in the male line 
—as hereditary titles, for example—but in its 
moral aspect a man’s honor comes to him primarily 
from his mother, and further, his sister's honor 
reflects upon his through their mother. However, 
if a man inherits his honor in this way, he is, above 
all, actively responsible for the honor of his wife, 
and the cuckold (expressed in the Mediterranean 
by an analogy with the he-goat) is the paragon of 
dishonor, equal only, if in a different way, to the 
traitor. It should be noted that the cuckolder is 
exculpated: though he represents the cuckoo in the 
analogy on which the word is based, the title is 
reserved for the man whose marital right is usurped 
(and the same transposition is performed in the 
symbolism of the Mediterranean by endowing the 
wronged husband with the horns). Although the 
cuckolder may be thought immoral, he bears no 
stigma of dishonor. Once more dishonor goes to 
the defeated, and ridicule is visited on the defiled 
one rather than on the guilty. This code of honor 
is attenuated to the degree that women are regarded 
as independent of male authority. 

Honor and the sacred. The connection between 
honor and the sacred does not derive simply from 
the ambition of the church to stand as the arbiter 
of honor, a role that it has never entirely succeeded 
in achieving in the popular view, but rather from 
the sacred nature of honor itself, which, as the 
essence of the social personality and the personal 
destiny of the individual, stands in a preferential 
relation to the deity; a man’s true self is known only 
to God, from whom nothing can be hidden and in 
whose eyes honor is ultimately vindicated. Honor is 
committed by invoking the sacred, whether in the 
form of a conditional curse witnessed by the spirit- 
ual powers or by implicating any other agency from 
which honor derives. The appeal to the sacred acts 
as a guarantee that the swearer will accept his 
shame under the prescribed circumstances. Hence, 
even in modern law courts oaths are required. The 
commitment of honor to the sovereign derives also 
from the fact that he stands, as divinely authorized 
ruler, in a preferential relationship to the Deity, a 
conception that has survived centuries of rebellion 
and even the passing of the monarchy altogether 
in many instances. Sacredness attaches to the no- 
tion of legitimate rule, regardless of the doctrine 
in which this is expressed. For this reason republics 
have tended to conserve the sanctifying rituals of 
the monarchies they superseded. 


Honor and social status 


The distribution of honor in communities of 
equals, whether local groups or specialized institu- 
tions, tends to relate to eminence, recognized virtue, 
and age. However, in a stratified society it accords 
with social status, and as such, it is more often 
ascribed by birth than achieved. Thus, medieval 
society was ranked in terms of honor, from the 
aristocracy, who had the most—on account of their 
power, their valor, and their proximity to the king 
—to those who had none at all, the heretics and the 
outcasts, those who indulged in infamous occupa- 
tions or had been convicted of infamy. 

Honor is hereditary, as we have seen; the excel- 
lence that once brought honor to the forebears is 
credited to their offspring, even though it may no 
longer be discernible in their conduct. Titles and 
property, the honorific and effective forms of hered- 
itary power, endow the social system with con- 
tinuity in this regard; honor is ascribed. Yet, the 
supposition of excellence derived from birth is con- 
stantly betrayed, and a conflict therefore emerges 
between the criterion of prestige derived from per- 
sonal worth and that which looks to social origins. 
This has provided one of the favorite themes of 
European literature from the twelfth-century Poem 
of the Cid onward, a theme which is matched in 
popularity only by its counterpart: the role of 
money rather than excellence in the acquisition of 
honor. The ideal unity of honor is apt to fragment 
when it strikes the facts, and the different bases 
for according it become opposed to one another. 

Moreover, the criteria by which honor is be- 
stowed have greatly varied in time: occupation and 
religious orthodoxy were once important, although 
they are no longer. With the evolution from a sys- 
tem of legal status to a class system, economic 
privileges ceased to be attached to status, and the 
contrary came to be true. In the modern world the 
criteria appear to be changing again: titles can no 
longer be purchased and wealth gives way to fame 
as the measure of reputation—a fact not uncon- 
nected with the development of the communica- 
tions industry. The theater, which was once a dis- 
honorable occupation, now presents candidates for 
the British sovereign’s Honours List, which also 
finds a place for jockeys and football players. 

The struggle for honor is palliated by a provision 
that saves society from anarchy. Indeed, this strug- 
gle is not only the basis upon which individuals 
compete but also that on which they cooperate. 
Here the notion of steadfastness is crucial, whether
it binds together those who recognize their mutual 
equality or those whose relationship is one of patron 
and client. The reciprocal demonstrations of favor, 
which might be called mutual honoring, establish 
relationships of solidarity. The notion of a commu- 
nity of honorable men replaces the competition for 
honor; reputation attaches to honesty, and the 
steadfastness that honor enjoins is seen in financial 
reliability and contractual faith—the “honoring” of 
a check or bond displays this sense. Honor then 
comes to be the guarantor of the credit system. 
This notion of honor was, in particular, the ideal 
of the Puritans, who attempted to suppress all its 
competitive and flamboyant aspects in favor of 
equating it to conscience (only to see them re- 
emerge triumphant with the Restoration, whose lit- 
erature pays more tribute to cuckoldry than that of 
any other period in the English language). The 
relation of the puritan ethic to capitalism, first 
stressed by Max Weber and R. H. Tawney, restricts 
competition to the field of wealth and, while giving 
financial probity a prime place in the notion of 
honor, makes financial success the arbiter of pres- 
tige. Under such conditions social status inevi- 
tably becomes increasingly an economic matter, to 
the exclusion of all other criteria. 

Honor is always bound to wealth and posses-
sions, for they provide an idiom in which to express
relationships of relative inequality. Thus, hospital- 
ity, charity, and generosity are highly honorific; 
provided that they derive from free will, they gain 
honor as expressions of magnanimity. Beneficence 
transforms economic power into honor. However, 
honor is thus gained at the potential expense of 
the recipient. By doing someone a favor you hu- 
miliate him unless he may reciprocate. Hence 
comes the necessity for the return of hospitality 
and for accepting its return. The Northwest Coast 
Indians of America carried this principle to its 
limits in the potlatch, in which the demonstrations 
of largesse and the destruction of property were 
performed with the open intention of humiliating 
a rival. Through the challenge to reciprocate hos- 
pitality, they expressed their hostilities and, as they 
put it, “fought with property.” 

Honor is the backbone of the system of patron- 
age. He who admits his inferiority and accepts pa- 
tronage is not dishonored by attaching himself to 
a superior. On the contrary, his honor is enhanced 
by participating, through this attachment, in his 
patron’s honor. The honor of a patron is equally 
enhanced by the possession of clients; he gains 
prestige in return for the protection that he affords 
to those who recognize his power. Honor accrues 
through being paid and is lost through being de-
nied where it is due. By giving it away, you show
that you have it; by striving for it, you imply that 
you need it. This is the meaning of magnanimity. 

The struggle for honor takes place, therefore, 
only where precedence is both of value and in 
doubt. Competition for it has understandably been 
more acute in the higher reaches of society, where 
family pride and also the practical importance of 
precedence are greatest (this can be clearly seen, 
for example, in the descriptions of the French aris-
tocracy in the memoirs of the duke of Saint-Simon).
It must not, however, be thought that there is no 
competition for honor in plebeian communities. 
The agonistic quality of personal and kin-group 
relations in the villages of the Mediterranean is 
most striking, especially among pastoralists, whose 
unstable fortunes urge them to compete. This com- 
petition occurs, nonetheless, within a framework of 
moral values that public opinion upholds. The point 
of honor, whether among the aristocracies of for- 
mer times or the modern Greek shepherds or Arab 
peasants, imposes a code for the distribution of 
honor that contains conflict within boundaries set 
by the ethical code of the community. 


The point of honor 


In the courts of Renaissance Italy there arose 
a code of behavior which regulated the exchange 
of honor and the competition for it and whose ulti- 
mate sanction was the duel. This code spread, not 
without changes and adaptations, throughout Eu- 
rope and America and persisted in many places 
into the twentieth century. A flow of published 
works four centuries long defined the modes and 
pretexts for taking offense, the formalities of chal- 
lenge and the duel, and the circumstances in which 
honor could be judged to be lost or redeemed. A 
veritable jurisprudence of honor was elaborated, 
which makes explicit certain fundamental char- 
acteristics of the concept. 

The point of honor, as this code of behavior was 
called, was confined to the upper class. The honor 
of a man could not be impugned by someone who 
was not a social equal—that is, someone with 
whom he could not compete without loss of dignity. 
The impudence or the infidelity of an inferior could 
be punished, but honor was not attained by react- 
ing to the action of an inferior. Thus, honor was 
impregnable from below. Willingness to enter a 
duel depended upon the recognition of equality of 
class (but not of rank).

An affront depends upon being made public, for 
repute is lost only in the eyes of others. Hence, it 
was even sometimes maintained that no affront 
could be given in a purely private conversation. On 
the other hand, against public ridicule there is no 
recourse, since an affront must be performed in- 
dividually to be resented by an individual. The act 
of resentment is always an individual responsibility; 
whether or not a man wishes to accept humiliation 
depends upon his own will. According to the code 
of honor, an insult could only be resented by the 
recipient himself, unless he were impeded from 
doing this because of age, sex, infirmity, or cleri- 
cal status. Even then it could be resented only by 
a close kinsman—that is, one who participated in 
his honor. Briefly, the duel decided quarrels be- 
tween individuals who competed for precedence 
and repute in the eyes of a public composed of their 
social equals. It was even at one time suggested 
that by destroying the honor of another man one 
might add to one’s own, just as knights once added 
to their fame by their victories in single combat. 

The affront placed honor in jeopardy, a state 
of threatened desecration from which it could only 
be saved by the demand for satisfaction. By show- 
ing his readiness to fight, a man restored his honor 
to a state of grace. Honor, however, was indifferent 
to the result of the duel, which demonstrated only 
the prowess of the victor and the choice of fate. 
The form of the act of resentment which consti- 
tuted a challenge was the mentita, “giving the lie.” 
Since deception was not in itself considered dis- 
honorable behavior, it seems anomalous that the 
accusation of lying constituted the paramount in- 
dictment of honor. However, first of all, the accu- 
sation constituted a counterinsult and bound the 
man who delivered the original insult to issue the 
challenge, thereby permitting his antagonist to gain 
the choice of weapons. Yet, other than as a tactical 
device, this form of challenge relates to the obliga- 
tion to tell the truth: one is under no obligation 
to tell it to an inferior; one owes it to a superior. 
(Children are required to tell the truth to adults, 
who feel no reciprocal duty.) To lie to an equal is 
an affront, since it represents an attempt to treat 
him as an inferior. On the other hand, lying could 
be interpreted as the subterfuge of one who lacked 
the courage to tell the truth or to act openly. The 
mentita, therefore, committed the response of the 
accused person, since failure to respond confirmed 
this implication of cowardice, while honorable re- 
sponse invalidated it at the cost of making mani- 
fest the intention to insult. 

Failure to react to a slight was open to two con- 
flicting interpretations. Either it implied cowardice 
and the acceptance of humiliation, which entailed 
the loss of honor, or it implied contempt, the denial 
that the author of the slight possessed sufficient 
honor to affront, that is, a denial of his equality.
In the latter case the author, not the victim, of the 
slight was dishonored. Disregarding a slight could 
even be treated as magnanimity, on condition that 
the insulted person possessed the right to forgive 
or to overlook—that is to say, that he was indeed 
superior. On the other hand, to pick quarrels over 
nothing, to exploit the humiliation of others beyond 
the point that public opinion recognized as legiti- 
mate in order to establish precedence, was consid- 
ered dishonorable, for it spelled lack of magnanim-
ity. It was the court of public opinion that adjudicated
between rival interpretations of the failure to react 
to a slight; in the last resort, a man was dishonored 
only at the point where he was forced to realize that 
he had been, where his shame was brought home 
to him. Those who possessed impregnable honor 
could afford not to compete for precedence, but to 
show those magnanimous qualities which are as- 
sociated with it in the figurative senses of the words 
that denote high status: nobility, gentlemanliness, 
etc. Words which denote low status imply the con- 
trary of magnanimity: meanness, villainy. 

Although the point of honor was much criticized 
from the moral viewpoint, it provided a means of 
settling disputes. It did not permit the unbridled
use of violence. Once satisfaction was accorded, 
the quarrel could not rightfully be taken up again. 
In this way it can be contrasted with the moral 
obligation to seek vengeance in order to redeem 
honor and with the state of feud that this com- 
monly leads to. The point of honor was, therefore, 
a pseudolegal institution governing the sphere of 
social etiquette where the law was either not com- 
petent or not welcome. 


Honor and the law 

The code of honor associated with the duel ap- 
peared in history at a time when the state was 
endeavoring to suppress private violence; dueling 
was made illegal, as well as condemned by the 
church, almost from its inception as a formalized
mode of settling disputes. Yet, the duel was also, 
in some ways, a continuation of an earlier tradition. 
The right of knights to prove their worth in single 
combat was at one time inherent in the right to 
bear arms, and jousting provided a festive occa- 
sion on which to do so (though it finally developed 
into a sport). During the Middle Ages the private 
encounter was also recognized as a form of legal 
process. The judicial combat allowed for the settle- 
ment of disputes by remitting the decision to divine 
judgment; it was a form of ordeal. 

With the development of the legal system and 
the increased centralization of power, sovereigns 
aspired to take the settlement of disputes out of
the unpredictable hands of the Deity and submit it 
to the adjudication of courts. The courts, however, 
are ill-designed to fulfill the requirements of the 
man of honor, in that, first of all, they oblige him 
to place his jeopardized honor in the hands of 
others and thus prevent him from redeeming it for 
himself—the only way in which this can be done. 
The legal process involves delay (prejudicial to its 
state), expense (unwarranted for one who would 
settle accounts at once and for nothing), and pub- 
licity, which aggravates instead of mitigating the 
affront which is the cause of the dispute. Moreover, 
honor is not commutable into payment, so the com- 
pensation that courts impose offers no valid satis- 
faction. Finally, the settlement of a dispute in court 
excludes the possibility of demonstrating personal 
worth through the display of courage. The law has 
never, therefore, appealed to adherents to the code 
of honor, even where it has provided a means of 
redress against the kind of conduct that constitutes 
an affront. This it cannot easily do in any case, 
since it operates according to a different reason- 
ing. Thus, although in all the countries of Europe 
legislation against dueling was passed repeatedly 
from the sixteenth century onward, the custom 
continued, with a large measure of connivance 
from the judicial authorities, until the twentieth 
century. 

Although all the countries of Europe use the con- 
cept of honor in their ceremonial pronouncements, 
it is explicitly recognized only in the laws of the
countries of southern Europe, where honor appears 
not only as a factor in the cultural background of 
court proceedings but also as a legal concept. In 
these countries the legal codes define the jural sig- 
nificance of honor. However, honor relates to con- 
duct as a disculpating factor by making otherwise 
reprehensible behavior justifiable in terms of the 
legitimate motive which derives from the sense of 
honor. The right of a man to defend his honor is 
far more clearly recognized in the judicial proce- 
dures of southern European countries than in 
Anglo-Saxon law, which generally requires the 
demonstration of material damage for an affront 
to be actionable. 

The constancy required by honor prefigures the 
law’s demand for regularity; both establish a com- 
mitment upon the future. Nevertheless they differ 
in the way they commit the future, for honor de- 
mands fidelity to individuals, law to abstract prin- 
ciples. Therefore, where they exist together, they 
are liable to conflict, for one relates to persons and 
is centered in the will; the other aspires to reduce 
persons to legal categories, which involves attack-
ing the fundamental principle of personal auton-
omy. The man of honor is a law, but a law unto
himself. Wherever the authority of law is ques- 
tioned or ignored, the code of honor re-emerges to 
allocate the right to precedence and dictate the 
principles of conduct: among aristocracies and 
criminal underworlds, schoolboy and street-corner 
societies, open frontiers and those closed commu- 
nities where reigns “the Honorable Society,” as the 
Mafia calls itself. 


Honor in literature 


As we have seen, the different aspects of honor 
are liable to come into conflict with each other and 
to present individuals with an unenviable choice. 
This was the theme of the Spanish theater of the 
“Golden Century” in which concern for honor had 
become exacerbated. The dramatists presented this 
theme in the form of tragedy in an aristocratic set- 
ting while the novelists tended to treat it with comic 
satire in a plebeian context. Hence, the teatro de 
honor concerns such matters as marital honor, 
honor as precedence, honor as social status, civic 
virtue represented by plebeian honor, female purity, 
fidelity to the king, the duty to redeem honor by 
vengeance, the difficulty of hiding dishonor, and 
the power and perfidy of public opinion. The 
Spanish dramatists were, on the whole, critical of 
aristocratic honor and sometimes betrayed a re- 
formatory intention, while the satires of the nov- 
elists frequently attacked the very notion of honor 
itself, contrasting it with the real power of money. 
The greatest and most savage satire on honor is 
assuredly the Lazarillo de Tormes, which antedates 
Falstaff’s famous tirade by nearly half a century. 

Every ruling power claims the right to distribute 
honor; to lay down the principles by which it is to 
be won is the essence of authority and the process 
of legitimation. Authority, like honor, is allied both 
to the sentiments and to the reality of social status 
and looks both to the possession of force and to 
popular consent, for authority is, as it were, the 
judicial aspect of honor, whose transactions and 
rituals set the seal of legitimacy upon the social 
order, making its commands appear necessary and 
right. The king can do no wrong. His word imposes 
consensus. Yet, if what is done in his name is un- 
popular, it does not escape criticism. His legitimacy 
is eventually brought into question. The frailty in 
authority qualifies its sacredness by exposing it to 
the danger that a counterconsensus will emerge to 
brand as infamous the fount of honor. The with- 
drawal of consent reveals the Achilles’ heel of 
authority; the loss of respect foreshadows delegiti- 
mation. 


Therefore, the polemics with regard to the nature 
of honor mirror the social conflicts of their age, for 
they reflect the interests of different groups and 
classes striving to impose their evaluations of be- 
havior. The polemics both reflect and promote the 
struggle between these groups. The facts of power 
decide the moral arguments. Through its social 
function as a mediator, honor dictates the modes 
of allegiance that obtain throughout the social 
structure, and each particular notion of honor 
favors a certain faction. Hence, the aristocracy and 
the church adhered to quite different definitions 
of honor throughout the history of modern Europe. 
The new middle classes had a different notion of 
it, again, as Speier (1935) has pointed out. Differ- 
ent classes differ in their concepts of honor, and 
the concepts of rural communities differ from those 
of the city, though not always in accordance with 
that romanticized image of Arcadia which has 
stirred the literary imagination ever since Horace. 
A particular code of honor must be seen against its 
social background in order to be understood. 

Modern urban society accords precedence largely 
on an economic basis. The independence of women 
has relieved men to some extent of their respon- 
sibility for them and, thus, of their vulnerability 
through them. The power of the law and the range 
of its competence have greatly increased, thus 
eliminating the possibility of winning honor through 
physical courage, save in certain sports and in war. 
The power of personal patronage has declined in 
favor of impersonal and institutional allegiances. 
The vocabulary of honor has acquired archaic over- 
tones in modern English, yet the principles of hon- 
or remain, for they are not, like the particular con- 
ceptions in which they are manifested, the product 
of a given culture at a given time, but universal 
principles of social action that may be found clothed 
in the idiom of head-hunting, social refinement, fi- 
nancial acumen, religious purity, or civic merit. 
Whatever the form the principles of honor may 
take, they serve to relate the ideal values of a 
society to its social structure and to reconcile the 
world as its members would see it with the world 
as it is. 

JULIAN PITT-RIVERS
HOOTON, EARNEST A.


From 1913 until he died in 1954, Earnest Albert 
Hooton taught anthropology at Harvard. Born in 
Clemansville, Wisconsin, in 1887, he lived in a suc- 
cession of small towns in the state, moving from 
one to another as his father, a Methodist minister, 
received calls from various churches. His mother, 
of Scottish origin, remained in his memory as the 
decisive influence on his childhood. 

An unathletic and bookish boy, young Hooton 
won distinction in school and was class valedic- 
torian at the age of 15, when he graduated from 
the Manitowoc High School. He was already an in- 
structor in anthropology at Harvard when he mar- 
ried Mary Camp in 1915. 

Hooton was trained initially as a classicist at 
Lawrence College (B.A., 1907) and the University 
of Wisconsin (M.A., 1908; Ph.D., 1911). While a
graduate student he was awarded a Rhodes scholar- 
ship and spent three years at New College, Oxford. 
He was strongly influenced by the Oxford anthro- 
pologists, particularly R. R. Marett, whom he 
greatly admired; and it was probably an interlude 
of study with Sir Arthur Keith, the distinguished 
British anatomist and student of human evolution,
that bent his general interest in anthropology to a 
special concern with its physical or biological 
aspect. 

Although Hooton never altogether lost his inter- 
est in cultural anthropology and for many years 
taught courses on European prehistory and African 
ethnology, he did become more and more involved 
with the problems presented by physical anthro- 
pology, the field with which his name is now prin- 
cipally associated. 

He was perhaps the first in the United States to 
offer a systematic coverage of physical anthro- 
pology. During his lifetime Hooton’s laboratory be- 
came the principal center in the United States for 
training specialists in physical anthropology, and 
as a result his students were established widely 
throughout the nation and even abroad. His suc- 
cess as a university lecturer, however, extended far 
beyond the corps of graduate students who worked 
closely with him. He possessed a witty, informal 
style that was extremely attractive and that drew 
large numbers of undergraduates and others to his 
courses. 

Although it was Hooton’s influence as a teacher 
that was especially noteworthy, it was his scientific 
and popular writing that made him known to a 
much wider circle than his professional activity 
alone could reach. His pungent and graceful style 
was notably successful in Up From the Ape (1931), 
in which he covered the traditional themes of 
physical anthropology: human evolution and the 
development of racial differentiation. In a similar 
vein he produced Man’s Poor Relations (1942), 
which deals with the primate relatives of man; as 
well as Apes, Men, and Morons (1937), Twilight 
of Man (1939a), and Why Men Behave Like Apes, 
and Vice Versa (1940). In “Young Man, You Are 
Normal” (1945) he concerned himself with the 
dysgenic effects of modern society and the rela- 
tionship between body build and behavior. These 
books were written for the general reader and were 
widely distributed. They did much to introduce the 
subject matter of physical anthropology to a pub- 
lic unfamiliar with it. But because they were solidly 
written, several of them were quickly adopted as 
standard textbooks; Up From the Ape, for example, 
went into a number of editions. 

Hooton’s scientific writing embraced a wide 
variety of topics. He wrote on human evolution, 
developing among other ideas a concept of asym- 
metry in the morphological evolution of man. Origi- 
nally intended to account for the discrepancies in 
that now discredited fossil, the Piltdown skull, the 
principle of asymmetrical evolution has proven 
valid and useful. Instead of the widely held belief
of that time that all segments of the body had 
evolved pari passu, Hooton envisioned an evolu- 
tionary process capable of producing apparent dis- 
harmonies by affecting one part more than another. 
This implied that the adaptive process is selective 
rather than necessarily general and total. He was a 
frequent contributor to discussions on the evolution 
of human dentition, a subject on which he also 
lectured at the Harvard Dental School. 

The classification of human populations and 
their description was one of Hooton’s abiding inter- 
ests. Much of his work reveals a persistent con- 
cern with typology, exploring and refining the 
methods of analysis as well as the correlates of 
type. In his first major study, The Ancient Inhabi- 
tants of the Canary Islands, published in 1925, he 
had already come to grips with the problem. Influ- 
enced by Goring’s applications of statistics to the 
analysis of a sampling of English convicts (1913) 
and by the voluminous contributions of Pearson to 
anthropometric data, he enthusiastically adapted 
these new mathematical tools to a relatively large 
series of prehistoric Guanche craniums and skele- 
tons. By these statistical procedures he succeeded 
in identifying a number of coexistent physical 
types that he then attempted to derive from vari- 
ous related population groups in north Africa. On 
the basis of such type analyses and supported by 
archeological evidence he reconstructed a popula- 
tion history of the Canary Islands. 

A similar orientation is clearly discernible in 
The Indians of Pecos Pueblo (1930) where, apply- 
ing similar but more detailed methods of type 
analysis, he recognized a variety of distinct racial 
or subracial groups. Based on this typology, which 
was derived from the study of an Indian population
in the Southwest, he reconstructed a tentative
aboriginal history of the New World. 

His interest in body build led him into the field 
of applied anthropology, in which he was a pioneer. 
He published studies on body build for use in de- 
signing furniture, and he carried out investigations 
for the government on other parameters of body 
build that were intended for guidance in selecting 
personnel for specialized functions. Later this inter- 
est was to involve him in problems of the associa- 
tion of body build with variations in behavior. 

The attempt to isolate the various physical types 
in a random sample of a population was not original
with Hooton. His colleague Dixon had embarked
on a similar survey, and others had tried
the same thing. But Hooton’s distinction in this 
type of research was his far greater sophistication 
in statistical methods and his use of subjective sortings
that were subsequently validated by mathematical
tests. Type analysis, however, has not
been very fruitful, since it assumes gene behavior 
that modern genetics does not support. In other 
words, type analysis implies that the various strains 
that compose a population somehow survive intact 
in the mixture of a breeding population; it does not 
take into account that some of the types defined 
may be artifacts of the mixture itself rather than 
survivals of original components. 

For the remainder of his research career Hooton 
was primarily concerned with studies on living 
populations. During the 1930s he was mainly en- 
gaged in a very large-scale survey of the criminal 
population of the United States. In this study he 
sought to apply to a major social problem any insights
to be derived from a racial and constitutional
approach. But unlike the earlier researchers, he
was primarily seeking to determine if relationships 
existed between body type and behavior, and, in 
this case, if criminal behavior or any of its varie- 
ties was linked with racial or physical criteria. His 
conclusion that such correlations did in fact exist 
recalled the work of Cesare Lombroso in the nine- 
teenth century, which had long been repudiated. 
Hooton’s investigations, however, were far more 
sophisticated and were free of the gross errors that 
had marred Lombroso’s work. The reception of 
Hooton’s results, published in The American Crimi- 
nal (1939b) and in Crime and the Man (1939c), 
was negative on the whole. Some critics, in fact, 
were extremely harsh, attacking him primarily 
from the conviction that social behavior is deter- 
mined more significantly by environmental factors 
than by biological ones. Nevertheless, when re- 
examined, his results cannot be summarily dis- 
missed on such theoretical grounds. The questions 
raised by this monumental study have yet to be 
resolved, and Hooton’s service in bringing them 
forward has not been fully appreciated. 

His last researches were a natural outgrowth of 
the study of body build and crime, for he now 
turned to a broader application of these same theo- 
retical principles. Using the principle of somato- 
type he investigated the association of varieties of 
temperament and behavior with body build in a 
carefully selected sample of Harvard students. He 
also followed the same procedures in analyzing 
large bodies of data derived from military sources. 
Much of this work has never been published. 

Hooton was notable for his warmth and his 
ready responsiveness to people, his humor, and his 
acute perception. Among his other gifts he had a 
remarkable flair for light verse. 

HARRY L. SHAPIRO 


[For the historical context of Hooton’s work, see PHYSICAL
ANTHROPOLOGY and the biographies of DIXON;
HRDLIČKA; MARETT.]

HORNEY, KAREN


The work of Karen Horney (1885-1952) influ- 
enced the course of development of psychoanalysis 
decisively. Although her revisions of psychoanalytic 
principles resemble those of Alfred Adler—which 
has led some to label her as no more than an 
updated Adlerian—she went well beyond his inno- 
vations to formulate a theory of psychopathology 
that was at once more comprehensive in its scope 
and more penetrating in its insights. Her ideas, 
grounded in clinical experience, are almost totally 
devoid of the dramatic speculation that frequently 
marked the writings of the man she always ac- 
knowledged to be her indispensable forerunner, 
Sigmund Freud. 

Karl Abraham, Freud’s student and close ad- 
herent, supervised Horney’s psychoanalytic train- 
ing in Berlin. After completing her training in 
1915, she spent five years in clinical and outpatient 
work in Berlin hospitals and 12 years as instructor 
at the Berlin Psychoanalytic Institute, supple- 
menting her work with private practice. In 1932 
Franz Alexander invited her to become associate 
director of the Chicago Institute for Psychoanal- 
ysis; two years later she moved to New York to 
pursue her own practice. Her dissatisfaction with 
orthodox psychoanalytic theory is stated most explicitly
in New Ways in Psychoanalysis (1939).
This was such a thoroughgoing critique of the 
rigidities of Freudian doctrine that it placed her 
in the forefront of that wing of psychoanalysis 
which has since become known as the Neo- 
Freudian movement. 

Critique of orthodox Freudianism. Horney 
took exception to several components of Freud’s 
system. The feature she found most objectionable 
was the libido theory. She could not accept the 
notion that an allegedly somatic, yet impercepti- 
ble, source of erotic instinctual energy called 
libido could account not only for individual person- 
ality traits and individual behavior but also for 
the character development of a lifetime. She 
thought that this theory made most human rela- 
tionships exclusively sexual relationships; that it 
explained adult behavior as a mere repetition of 
the experiences of infancy, to the virtual neglect 
of the influence of later experiences; that it made 
fixed and ineradicable instincts the controlling 
forces of human destiny. Horney deplored the 
psychic determinism Freud’s libido theory inflicted
on psychology, as well as the excessive orientation 
of that psychology toward biology and anatomy. 
Thus, Freudian, or orthodox, psychoanalysis gave 
the Oedipus complex a strictly sexual interpreta- 
tion—that is, the child’s relationships with his 
parents, later transferred to others, are to be under- 
stood as symbolic of sexual desires and frustrations 
—while Horney’s clinical experience led her to 
claim that this did not correspond to the facts of 
psychopathology. Again, Horney disagreed with the 
orthodox view that deprived the ego of its directive 
and discretionary role, reducing it to a sentinel 
eternally on guard against instinctual excesses and 
dependent on the id (representing instincts) or 
the superego (representing conscience) or objec- 
tive reality for its activation and purpose. Finally, 
orthodox psychoanalysis considered man to be 
doomed by the “death instinct” to destructive im- 
pulses which must be turned against others if he 
is to avoid self-destruction; Horney countered that 
the prevalence of destructiveness or hostility did 
not prove its instinctual nature and that hostility 
might be an appropriate reaction to a provoking 
social environment. These and other of Horney’s 
objections amounted to a revision of accepted 
psychoanalytic principles and led, in 1941, to her 
disqualification as training analyst by the New 
York Psychoanalytic Institute and an abrupt sever- 
ance from the New York Psychoanalytic Society. 
Independence, however, brought the opportunity 
to found the Association for the Advancement of 
Psychoanalysis, whose very name indicates her
insistence on pushing forward the frontiers of the
still young science. 

Anxiety. Crucial to Horney’s thought is the 
concept of anxiety. Conditions of life in infancy 
may generate a basic anxiety in which the child 
feels himself helpless and isolated in the face of 
a world conceived of as hostile and as jeopardizing 
his safety and security. The sources of all anxiety 
are to be found in interpersonal relations, chiefly 
those encountered in early family life, but at all 
events those resulting from the culture in which 
one lives. The unconsciously developed neurotic 
trends in personality—trends which enable the 
individual to cope with a demoralizing array of 
conflicting values presented to him by the patterns 
of his culture—assume a compulsory character, 
deflecting him from bringing his real self to matu- 
rity and substituting the vain pursuit of fantastic 
and unrealizable images of himself. 

Contemporary Western society’s contradictory
demands of success (obtaining power) versus love 
(obtaining affection), of the arousal of desires 
versus their frustration, and of individual freedom 
versus the limitations imposed on freedom by real- 
ity, produce, as Horney described vividly in all her 
writings, an almost incredible variety of neurotic 
behavior patterns. The value given to individual- 
istic achievement makes competition pervasive and 
leaves a heritage of isolation, fear, insecurity, 
hostility, and anxiety, which continually under- 
mines self-esteem. The experience of the inade- 
quacy and falsity of love is especially likely to turn 
healthy affection into neurotic defensive patterns. 
Clearly, the etiology of neuroses must be referred 
not to instincts and their vicissitudes, but to inter- 
personal relations and cultural exigencies. Psycho- 
analysis must become more a social than a medical 
science, Horney implied, and so make itself avail- 
able as a basis for social and economic reform. 
The writings of her closest colleagues, Harry Stack 
Sullivan and Erich Fromm, point in the same direc- 
tion; the three comprised the nucleus of the Neo- 
Freudian school of psychoanalysis, with Sullivan 
developing the concept of anxiety and its roots in
interpersonal relations and Fromm exploring the
deficiencies and incongruities in modern capitalist
society that make it what he considers an insane
society. 

Alienation. The concept of anxiety enabled 
Horney to connect psychopathology with culture,
thereby giving her work the dimension of a social 
philosophy. The concept of alienation, prominent 
in her later writings, coupled with that of conflict, 
strengthened the connection of psychopathology
with the inner dimension, the psyche, which some 
critics hold to be the true, if not the exclusive, 
concern of psychoanalytic psychology. Alienation 
refers to a condition in which the individual is 
divorced from his real self, a condition in which 
not only his standards and values but also his 
capacities for judgment, initiative, and self-direc- 
tion, his very feelings and thoughts, are imputed 
to a false image of himself which he manufactures 
unconsciously to allay his basic anxiety. His psy- 
chic energies are diverted to the realization of this 
phantom ideal of himself, which is happily free 
of conflicting neurotic trends, while his real self 
comes to be regarded as an indwelling stranger 
who is to be hated and repressed. An internal 
numbness to genuine feelings, compounded into 
a loss of identity, supplants the vital dynamism of 
psychic development. Concealed from awareness 
are conflicts among neurotic trends, such as com- 
pliance, aggressiveness, and detachment, or the 
central inner conflict between the idealized image 
and the real self, whose healthy trend is to strive 
for its unfettered growth. As Horney’s conception 
of neurosis matured, she saw neurosis not simply 
as a conflict among neurotic trends but as a proc- 
ess, culminating in alienation, in which these 
neurotic trends are pitted against healthy ones. 

With her unusual insight into psychic malfunc- 
tioning, Horney saw that a neurosis, far from being 
a temporary maladjustment, has a dynamism of 
its own: it dominates the whole personality and 
becomes a way of life. The conditions of life in 
contemporary Western society, as often as not, 
compel the individual to relinquish his true identity 
if he is to survive and have any identity at all, to 
maintain an inferior type of psychic activity rather 
than undergo a total breakdown. Neurosis, in other 
words, has become virtually equivalent to “nor- 
mality,” by virtue of social conditions largely 
beyond individual control. But if mental illness has 
become a way of life, mental health may be at- 
tained not so much by sponsoring a crusade for 
social reform as by a reintegration of personality 
that enables each person to rely on his own inner 
capacities for psychic development. For therapeutic 
purposes, having satisfactory interpersonal rela- 
tions in general takes second place to resolving 
intrapsychic conflicts by intensive self-analysis. 
Ultimately, psychoanalysis had for Horney more 
of a clinical than a political application, so that, 
surprisingly enough, this most outspoken of rebels 
against the conservatism of orthodox Freudian 
psychoanalysis in the end accepted its more con- 
ventional uses in psychotherapy. 


Psychic growth. Implicit in the foregoing is 
the idea of psychic growth, the postulate that under 
favorable circumstances—notably interpersonal re- 
lations featuring warmth and cordiality—the real 
self emerges, stabilizes, and develops; that is, the 
latent potentialities for constructive change in the 
personality are able to bring about a condition of 
inner freedom, unity, happiness, and self-direction 
that for want of a better term may be called nor- 
mality. In a world where normality is an ambigu- 
ous standard, Horney believed that this tendency 
toward psychic growth provides the most nearly 
absolute criterion for human values. It is a 
morality of evolution, as she called it, an evolution 
of human personality whose terminus ad quem, 
however, can neither be prescribed uniformly 
for all people nor specified for any single individ- 
ual. If, as critics charged, psychic growth provides 
a nebulous standard for ethics and if, as was also 
charged, it gives psychoanalysis an unwarranted— 
meaning unwonted—optimistic bias, Horney could 
retort to the first charge that a preconceived notion 
of what is good or bad presumes a universality of 
cultural norms that is contrary to fact, and to the 
second that clinical experience confirmed repeat- 
edly the existence of a drive toward mental health 
making possible the liberation of developmental 
forces in the personality. Without it, psychoana- 
lytic therapy itself could not proceed. 

Psychotherapy. Horney was led by both theory 
and practice to revise the principles of psycho- 
analytic therapy. A return to mental health in- 
volves more than bringing unconscious conflicts to 
the level of consciousness and more than the 
removal of overt symptoms, as Freud had thought. 
Since every neurosis is a disorder of personality, 
she held, therapy requires the reorganization of 
personality and the analysis of the entire structure 
of the neurosis and the current conditions necessi- 
tating it. While this idea also stems from the earlier 
work of Franz Alexander, Horney made it the 
linchpin of her innovations in therapy. Psycho- 
therapy that proceeded by seeking a direct causal 
connection between infantile experiences and the 
present neurosis, on the assumption that the neu- 
rosis is an elaborate repetition of early instinctual 
life, tended to neglect the function performed by 
anxiety and neurotic trends in the current psychic 
situation, as well as the ramifications of the neurosis
in all facets of the patient’s daily life. Horney
sought to return the individual to his own sponta- 
neous self-direction and cognizance, free of the 
grip of neurotic trends and idealized illusions, by 
developing his emotional self-awareness as well as
by rational or intellectual introspection. The interaction
between patient and analyst, known technically
as transference, also received a significant
modification in Horney’s hands. Since it is but 
one type of interpersonal relation, and since all 
interpersonal relations involve feelings, the inevi- 
table emotional confrontation of analyst and pa- 
tient should not be avoided by constructing a 
façade of professional impersonality, as recom- 
mended by Freud, but should be used to obtain a 
more intimate understanding of the pervasive 
character of the neurosis. Love, hostility, defiance, 
pride, self-accusation, and so forth, permeate the 
analytic session; under the guidance of an alert 
and sensitive analyst they can be studied inten- 
sively and fruitfully. Horney accepted, without 
appreciable modification, free association and the 
interpretation of dreams and parapraxes, the other 
conventional tools of psychoanalysis. 

Feminine psychology. Not the least of Horney’s 
contributions to a revised psychoanalysis was her 
reappraisal of feminine psychology. Unwilling to 
accept Freud’s opinion of it as an offshoot of 
masculine psychology, an accidental, inferior, and 
necessarily warped by-product of women’s genital 
differences from men, Horney argued that Freud 
had ignored the impact of cultural conditions on 
the psychic orientation of women. In a predomi- 
nantly masculine culture women are deprived of 
considerable conjugal and maternal fulfillment by 
the undervaluing of love, and consequently they 
have neurotic problems peculiar to their sex. More 
than anything else, this protest against Freud’s 
negative attitude toward women and their psy- 
chology seems to have been responsible for driving 
Horney from the Freudian fold and for giving her 
thought its characteristic turn. If women could and 
should have lives of their own, once the distorting 
pressures of culture were relaxed, by the same 
token other culture-produced neuroses might be 
eliminated by reducing cultural pressures.


The goal of mental health and growth during a 
lifetime of psychic integrity and independence per- 
sisted in Horney’s thought and received ever more 
resonant expression in her writings. To the blasé 
and world-weary generation of the 1920s her affir- 
mation of positive and humanistic values was 
refreshing. The years of economic hardship and 
international tension that followed created an audi- 
ence for her writings that responded to her sympa- 
thetic understanding of its plight. She gave a re- 
newed impulse to the psychoanalytic movement 
and by her rebellion reaffirmed the principle of
scientific inquiry free of the confines of dogma.
The value of her work will be estimated in terms 
of the reduction of human suffering and the ex- 
pansion of human living that it made possible. 


MARTIN BIRNBACH 


[For the historical context of Horney’s work, see PSYCHOANALYSIS
and the biographies of ABRAHAM;
ADLER; ALEXANDER; FREUD. Directly related are ANXIETY;
NEUROSIS; and the biography of SULLIVAN.]

I
SOCIAL ASPECTS 


Although it varies according to climatic differ- 
ences, housing constitutes one of the most universal 
forms of material culture, being found in all except
nomadic societies. Housing also represents an im- 
portant element in all capital formation and the 
largest single component in the total building effort 
of any nation. From a sociological point of view, 
housing has a major part to play in ensuring con- 
tinuity of community life. 

There is also a close interrelationship between 
housing and family organization. In all cultures 
and at all times the type of housing has corre- 
sponded in some way to the organization of the 
family and has in turn sustained and reinforced 
existing forms of family organization. In many
parts of the world, the extended family system is 
reflected in various clusters of rooms within which 
there are sections reserved for the basic units. The 
long house, for example, is found in a variety of 
cultures, from the Iroquois of North America to 
the peoples of southeast Asia; while the polyga- 
mous family structure still found in many parts of 
Africa requires a complex of huts, one for the hus- 
band and one for each wife and her children, ar- 
ranged in order of rank. 

The home is a ménage for production as well as 
for consumption and may therefore also reflect fea- 
tures of economic organization; for example, in the 
eighteenth-century wool centers in England many 
cottages included an upstairs room in which family 
members worked, and rural cottages everywhere 
tend to confuse the housing of humans and of livestock.
The greater systematization of modern Western
life has eliminated most of this form of provision,
at least in urban communities; but new
dwellings may still incorporate doctors’ offices, art- 
ists’ studios, or scholars’ libraries, and new shops 
still often include living accommodations for owner 
or manager. 

Control of house building. Until the nineteenth 
century there were few specialist builders, and in 
most cases members of the family unit built their 
own houses. This is still the case in many parts of 
the world, with the result that there is a close cor- 
respondence between the dwellings built and the 
way of life of those who build them. However, in 
more industrialized societies this identity of aim is 
easily lost. Indeed there are typically three parties 
to the house-building operation: the builder and his 
workmen, the owner or entrepreneur who com- 
missions the dwelling, and the family that will in 
due course occupy it. The situation is further com- 
plicated by the fact that in many cases the new 
development will be associated with the migration 
of families into towns, a move which in any case 
requires them to urbanize their way of life and 
may involve renouncing the extended family or
tribal system to which they had previously been
accustomed.

Where an entrepreneur or landlord places him- 
self between the occupier and the designer and 
builder, it is inevitable that the housing type will 
to some extent be imposed on the occupier. The 
motives of a landlord can vary considerably; but 
whatever they may be, his control of the means and 
nature of house provision is very powerful. 

Although public authority housing takes an im- 
portant and increasing share in the total building 
effort of most industrialized countries, the share of 
the private landlord is declining even faster and 
is being largely replaced by a growth in owner occu- 
pancy. Even in the socialist countries of eastern 
Europe, an important proportion of new house 
building is being undertaken privately. But the 
number of custom-built houses in such cases is 
relatively small, and most owner-occupiers move 
into houses that were, for all practical purposes, de- 
signed without consulting them. If there is a greater 
correspondence in the private sector between de- 
sign and family needs, it is because the unsuitable 
houses prove more difficult to sell and their design 
is therefore not repeated; such houses, however, 
will not be destroyed but will merely be passed 
down to a householder who is less able to pay for 
what he wants. 

Trends in housing standards. In every indus- 
trialized and industrializing country in the world 
there is an acute shortage of urban dwellings, and 
a substantial proportion of the world’s population 
is living in severely substandard housing: slums 
that have survived the uncontrolled building of the 
nineteenth century or shacks that reflect the un- 
controlled urbanization of the twentieth century. 
But in spite of overcrowding and the physical de- 
cay of buildings, it is often difficult to demonstrate 
that bad housing is directly responsible for bad 
physical health; although bad health is often as- 
sociated with bad housing, this is because both of 
these are generally secondary effects of poverty 
(Schorr 1964, pp. 6-7, 143-145). 

Space standards. There is clearer evidence that 
inadequate space standards can disorganize family 
life. The current expert consensus in European 
countries is that the lower limit for mental health 
is 170 square feet of floor area per person (Chom- 
bart de Lauwe 1955; Musil 1962), and the desira- 
ble standards established by the American Public 
Health Association (1950) are twice the above 
figures. But these represent standards for new construction;
and many European and North American
families probably have a current space standard
of less than 80 square feet per person, while
it is certain that families are to be found in Latin 
America, Africa, and Asia with less than 20 square 
feet of floor area per person. 

There is also evidence that an equally important 
criterion is the number of rooms in the dwelling. 
A careful survey by Loring (1956), using paired 
groups of “well adapted” and “disorganized” fami- 
lies, showed that significant differences were as- 
sociated with number of rooms available, floor 
area, and general surroundings, but not with other 
supposedly significant factors such as possession of 
a bath or the physical condition of the dwelling. 

In spite of the gigantic backlog of substandard 
housing, it is expected that new construction will 
continue to improve its minimum standards of 
equipment and space. In Britain during the past 
sixty years, the recommended minimum space 
standard has increased by about 40 per cent (or 
50 square feet per person), and it is believed that 
the increase may have been greater in most other 
industrialized societies, which started from a lower 
base. On the basis of a sustained 3.5 per cent 
annual growth in productivity, it seems very pos- 
sible that the minimum standard of new housing 
will double in the next forty years; even if space 
standards increase at only half this rate, a standard 
for new construction of 250 square feet of floor 
area per person would then be commonplace by the 
early twenty-first century. 
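
The arithmetic behind these projections can be sketched as follows. This is only an illustrative reading of the figures, not a calculation given in the sources cited: it assumes that the British rise of 40 per cent, equal to 50 square feet, implies a present minimum of about 175 square feet per person, and that "half this rate" means half the compound annual rate at which a standard would double in forty years. The function and variable names are hypothetical.

```python
# Illustrative arithmetic only; the interpretation of the figures is an assumption.

def compound(base, annual_rate, years):
    """Value of `base` after growing at `annual_rate` per year for `years` years."""
    return base * (1 + annual_rate) ** years

# A 40 per cent rise equal to 50 sq ft implies an earlier standard of 125 sq ft
# and a present standard of 175 sq ft per person.
earlier = 50 / 0.40
present = earlier + 50

# Compound annual rate at which a standard doubles in forty years
# (roughly 1.75 per cent, close to half of the 3.5 per cent productivity growth).
doubling_rate = 2 ** (1 / 40) - 1

# Projected standards after forty years at the full and at half the doubling rate.
doubled = compound(present, doubling_rate, 40)
half_rate = compound(present, doubling_rate / 2, 40)

print(round(earlier), round(present), round(doubled), round(half_rate))
```

On these assumptions the half-rate projection falls just under 250 square feet per person, which is consistent with the figure given in the text.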


The family and the home 


In affluent countries, the rise of general space 
standards to the level previously reserved for 
middle-class homes has coincided with, and may 
have helped to stimulate, the growth of individua- 
tion in the home. Whereas in the traditional work- 
ing-class home family life was lived collectively in 
the “living-kitchen,” today there is more stress on 
individual privacy and a greater tendency for mem- 
bers of the family to follow their own pursuits. 
There is a consequent trend toward the use of 
study-bedrooms and a greater willingness to recog- 
nize the distinctive needs of children in the home. 

At the same time, new dwellings continue to play 
a substantial part in bringing about changes in 
family organization. This is illustrated particularly 
vividly in the context of urbanization. In eastern 
Europe, for example, there is a traditional type of 
peasant cottage consisting essentially of two rooms: 
the white room, which contains all the most valued 
family possessions and also acts as a store for grain 
and other nonperishable food; and the black room, 
which is used for all normal household purposes
and includes niches for sleeping. Comparable arrangements
are found in most other rural economies
in Europe. Obviously such a subdivision is
not appropriate or even possible in a multistory 
apartment building; the application of urban be- 
havior standards requires more individual privacy, 
for example, for sleeping, but also requires less 
space for food storage and for other functions 
around which the traditional rural home rotates. 

The urban wife is increasingly employed outside 
the home, and the development of substitutes 
for household tasks—launderettes, partly prepared 
foods, ironing and mending services—is proceed- 
ing rapidly alongside industrialization. Meal-taking 
is unlikely to disappear from the home, but the 
habit of taking some meals in company cafeterias 
and in restaurants is likely to grow. The care of 
children will probably be increasingly shared be- 
tween the family and the social and educational 
agencies, and there may even be an increase in the 
use of boarding schools (cf. Musil 1962, p. 555). 
This should not, however, be interpreted as indi- 
cating a reduction in the space requirements in the 
home. It is regarded as important for family cohe- 
sion that the child should feel a member of the 
household even when absent; few authorities antic- 
ipate a widespread adoption of the deliberate segre- 
gation of parents and children practiced in many 
Israeli kibbutzim. 

It is anticipated that a decreasing proportion of 
households will contain three generations; if pres- 
ent trends continue, more separate accommoda- 
tions will be provided for elderly couples and single 
or widowed old people. There will probably be a 
concurrent increase in the number of young people 
living alone or with their peers. The net result of 
these and other changes will probably be that the 
proportion of households consisting solely of par- 
ents and dependent children will further increase. 


The ecology of housing 


In preindustrial countries, the majority still live 
in rural settlements, which have in most cases de- 
veloped a close internal communication network 
commensurate with their internal social cohesion.
With industrialization and the accompanying ur- 
banization, the traditional close-knit integrated 
community is increasingly replaced by a pluralistic 
type of community comprising a number of sub- 
cultures, at once interlocking and independent of 
each other. To a considerable extent the ecology of 
the community will reflect these divisions among 
its population. 

Invasion and succession. The ecological theories
and studies of the early Chicago school were
to a large extent concerned with what classes of 
people occupied different zones of the city (for ex- 
ample, Zorbaugh 1929). Similarly the ecological 
principles of invasion and succession were applied 
by these authors to the successive waves of occu- 
pants that passed through old existing stocks of 
houses. That succession took place at all is a reflec- 
tion of the durability of houses, which so regularly 
outlive their original purpose, so that mansions 
originally built for wealthy families and their serv- 
ants survive in multiple family occupation or as 
small hotels and rooming houses. 

As has been stated, the main tendency in indus- 
trialized countries in the past fifty years has been 
the convergence in space and other standards of 
middle- and lower-class houses. Particularly in town 
centers, the relatively small row houses built for 
lower-class families three or more generations ago 
are coming within the size range of the urban 
middle-class family; and if they survive the hazard 
of urban renewal they are liable to experience 
“succession in reverse,” reappearing with brightly 
painted doors as bijou residences for those who can 
afford to buy out their less affluent lower-class
occupants. Even in the United States, with its tendency
to tear down and replace, succession is now
sometimes a two-way process.

Slums and ghettos. One of the forms of suc- 
cession that has been most documented and stud- 
ied, particularly in the United States, has been the 
process by which immigrants or visible races such 
as Negroes have been forcibly segregated into 
slum-ghettos. This has come about because earlier 
inhabitants have typically first resisted invasion 
and then fled, leaving the newcomers in possession. 
Housing for nonwhites and for underprivileged 
immigrants is generally substandard in space, 
amenities, and repair; out-of-bounds for mortgage 
investment or other private financing; overpriced 
for what it is; and underprovided with public and 
social services (Abrams 1955, pp. 74-75). 

Areas with the above characteristics are not in- 
evitably turned into ghettos. Slums and substand- 
ard areas occur in almost all cities of any size, 
particularly if the population is growing. If obsoles- 
cent housing ripe for slum formation does not 
already exist, shacks are often built on the perim- 
eter of the town to act as a substitute. This type of 
development is resisted by civic authorities with 
varying vigor and success. 

Urban renewal and relocation. Unlike perim- 
eter shack development, substandard areas are 
often well located near the city center, and some 
offer tempting possibilities for urban renewal. The 
extent to which this situation has been exploited for
private gain has varied greatly, but there have been
notorious examples in American cities and else- 
where in which whole neighborhoods of underpriv- 
ileged families have been displaced in favor of 
luxury apartment buildings, creating even greater 
pressures on surviving slums. Until comparatively 
recently the net effect of urban renewal has prob- 
ably been to increase the amount of segregation; 
and while, by the mid-1960s (except in countries 
such as the Republic of South Africa, which re- 
tained a philosophy of segregation) there appeared 
to be a tide toward integration and interracial 
housing, it seems probable that the general trend 
toward social stratification and class division is 
being accentuated. 

Whether urban renewal reduces or increases the 
problem of the slums, it is commonly believed that 
its net effect is to harm the community spirit of the 
long-established neighborhoods. Young and Will- 
mott (1957) and Gans (1962) have argued that 
relocation programs almost inevitably break up the 
cohesion provided by the kinship network, while 
Jacobs (1961) has stressed the inhumanity and 
monotony of many renewal projects. In discussing 
older neighborhoods, it is necessary to distinguish
clearly between those sections containing a fluid,
disorganized, and often no-family population, and
those sections containing close-knit but poor fam- 
ilies, often recent immigrants or otherwise under- 
privileged. 

The success of relocation largely depends on how 
it is administered. The “melting-pot” theory, based 
on the idea that slum dwellers can be “reformed” 
by placing them in a project where they will be 
influenced by respectable neighbors, can produce a 
calamitous and quite unnecessary destruction of 
social cohesion and lead to loneliness, anxiety, and 
temporary neurosis. For all families that move 
there is a painful period of adjustment which is 
only bearable if the move was wanted and if estab- 
lished neighbors are welcoming. It has been shown 
(for example, by Festinger et al. 1950) that friend- 
ships can depend on physical layout and that satis- 
faction and morale in the neighborhood are maxi- 
mized if the layout as well as the social climate 
encourage interaction. European research workers 
(for example, Kuper 1953) support this thesis but 
tend to set more store on the need for the individual 
to be able to select and regulate the extent of his 
interactions. 

Suburban development. Whatever the density 
of redevelopment, a normal result of renewal is an 
increment of pressure on suburban housing. Even 
without these pressures, improvements in trans- 
port and communications are rapidly increasing the 
range and popularity of the commuter areas around
cities [see TRANSPORTATION, article on COMMUTA- 
TION]. It is in the most urbanized and industrialized 
countries that the suburban belt is most clearly in 
evidence. The original “new towns,” although 
planned as communities and occupationally more 
or less self-sufficient, have so far reflected in a 
more organized way the same demographic tend- 
encies as have elsewhere led to peripheral suburban 
development. Their populations are generally to a 
large extent self-selected and consist mainly of 
upwardly mobile middle-class families. It has been 
claimed (for example, by Whyte 1956) that such 
families have developed a new interactive way of 
life that leaves little room for permanence or pri- 
vacy, but more recent studies (Berger 1960; Dob- 
riner 1963) have shown that new communities set 
up under different circumstances can develop along 
more traditional working-class lines. 

Beyond the city suburbs and the “green belts” 
there is a still newer wave of community develop- 
ment. This may have started with the quest for 
exclusiveness of upper-class “exurbanites,” but 
emancipation by automobile has brought whole 
counties into range, and the sprawl of an even 
looser suburbanism lacking either physical or social 
focus is extending every year. This type of develop- 
ment is occurring throughout the Western world 
in all regions of relatively high density, and it has 
so far overwhelmed the resistances of even the 
most assiduous planners. In the more industrial- 
ized counties it is becoming difficult to find a gen- 
uine rural village. The old types of social activity 
that generations of country dwellers have gradually 
built up are largely inappropriate, and the older 
rural institutions are unable, if not unwilling, to 
adapt to the new pattern of social demand. 


Status attributes of housing 


In spite of the convergence of social class char- 
acteristics, there are distinctive class differences in 
ideology and behavior that transcend differences 
in wealth and can be readily detected by objective 
comparisons. The two main differences between 
middle-class and working-class ideologies that are 
reflected in their homes are the former's greater 
stress on individualism, which requires a home 
that gives some scope for privacy and self-expression 
to every member of the household, and the latter's 
greater stress on saving and repairing as opposed 
to spending and discarding, which requires greater 
provision for storage of possessions. The middle- 
class home is also used for more or less discreet 
displays of “conspicuous consumption.” These early 
insights, including those of Veblen (1899) and the 
Lynds (1929), were followed by the systematic
attempts of Warner and Lunt (1941), Chapin 
(1947), Guttman (1942), and others to construct 
a scale of social status. In spite of various anom- 
alies, it is clear that the importance of space and 
material possessions to the individual family, and 
the state of repair in which the home is kept, can 
quite properly be used as one component in the 
perennial search for an objective index of social 
class.

There is naturally a general correlation between 
the type of house and the neighborhood, both be- 
cause of the tendency toward uniformity of stand- 
ards in any particular section and also because of 
the physical fact that adjacent dwellings will often 
have about the same date of construction. As a 
general rule, the location of a home is likely to be 
even more revealing of the social status of a family 
than the type of house and its contents. In the 
United States a large number of ecological reports 
have been prepared since the early 1930s, with the 
help of the census tract data provided by the 
Bureau of the Census, showing social class and 
other related characteristics of cities and city neigh- 
borhoods all over the country. Similar information 
is becoming available in Britain and elsewhere and 
is being used for sophisticated forms of ecological 
analysis. 


Social policy 

On a world view, very great variations are to be 
found in the extent to which different governments 
take responsibility for housing their populations. 
Some intervention is found necessary wherever 
urbanization is occurring, if only in order to pre- 
vent the outbreak of epidemics and other illnesses 
caused by unsanitary and overcrowded conditions. 
Furthermore, if the government is promoting in- 
dustrial development, it will be under pressure to 
provide, or stimulate the building of, workers’ hous- 
ing within reach of the new plant; the incentive 
of good housing is an important lever in attracting 
suitable families into the towns from rural areas.
In addition to these motives of enlightened self- 
interest, governments may also be impelled on 
grounds of social welfare to make provision for 
their less privileged citizens, for example, by pro- 
ducing houses for special groups such as old peo- 
ple, by creating new interracial neighborhoods, or 
by subsidizing the provision of new houses in order 
to raise housing standards.

Historically, the provision of housing for special 
groups in industrialized countries has been one of 
the responsibilities assumed by private charities. 
As still occasionally happens, the close-knit industrial
and professional groups looked after the housing
needs of the less privileged of their members’
or ex-members’ families. Other more general pri- 
vate charities in a great variety of countries have 
built almshouses and continue to build and main- 
tain housing projects for low-cost rentals. In the 
nineteenth century money for these purposes was 
raised in Great Britain from so-called “five per cent 
philanthropists,” who were willing to loan money 
for a return at less than the current rate of interest 
so that these projects could be financed. 

Subsidized housing. It is only in the last fifty 
years that the burden of subsidizing the housing of 
the poor has been partly assumed by the state and 
the city authorities. Even today there are some in- 
dustrialized countries in which the public provision 
of housing is negligible. The outstanding example 
is the United States, where new public housing 
represents only 3 or 4 per cent of new permanent 
starts (U.S. Bureau of the Census, as quoted in
Van den Broek 1964), so that the total share of 
public housing is probably now only 1 or 2 per cent. 
Some states have even passed legislation contain- 
ing clauses that effectively prevent any additional 
provision of public housing (in 1966 the constitu- 
tionality of such clauses was being tested in the 
courts). 

In the United States and in other countries with 
similar policies, very substantial public funds are 
invested in the housing program; but they are 
used not as a form of social service but to help 
citizens to buy their own houses. This policy is 
intended to benefit those most able to help them- 
selves; the assumption is that the houses thus 
vacated will become available for poorer families 
and that a general reshuffling will take place so 
that virtually all sections of the community may 
benefit. 

In spite of the great divergence in philosophy 
between this kind of assisted self-help and the 
welfare-state policies now being implemented over 
much of the globe, the difference in practice has 
been far less striking. Governments have generally 
shied away from making direct provision by build- 
ing new houses for the poorest families. This fol- 
lows the experience of the private philanthropists 
that it is unattractively expensive to subsidize new 
housing to the point at which really poor families 
can afford the rents. 

Even in countries with a relatively low economic 
growth rate, much of the current demand for hous- 
ing at any date is from newly married couples and 
from upwardly (and, in the case of the urban 
population, outwardly) mobile families, and the 
provision of new housing is predominantly attuned
to their needs. It is only in cities undertaking a
vigorous urban renewal and slum-clearance pro- 
gram that a substantial number of really poor 
families are displaced and forced to seek new 
homes. Even with generous subsidies many of these 
underprivileged families find the new rents too high 
and tend to gravitate back to the remaining slums, 
thereby further overcrowding these areas. 

There is by now a fair understanding in some 
countries of human needs in housing and also of 
the means by which these needs can be satisfied. 
This will have to be extended to other areas and 
also kept up to date as new housing forms are 
developed. What is now needed in addition is a 
fuller knowledge of what conditions are most favor- 
able to satisfactory social organization and, in par- 
ticular, what types of social planning should ac- 
company the massive relocation and urban renewal 
programs that we shall have to undertake in the 
coming years. In other words, the balance of effort 
should swing from static to dynamic studies, from 
social investigation toward participation and social 
action. 

JOHN MADGE 


[See also PLANNING, SOCIAL, articles on REGIONAL AND 
URBAN PLANNING and WELFARE PLANNING; SEGRE- 
GATION.] 


BIBLIOGRAPHY 

ABRAMS, CHARLES 1955 Forbidden Neighbors: A Study 
of Prejudice in Housing. New York: Harper. 

AMERICAN PUBLIC HEALTH ASSOCIATION, COMMITTEE ON
THE HYGIENE OF HOUSING 1950 Planning the
Home for Occupancy. Chicago: Public Administration
Service.

BERGER, BENNETT M. 1960 Working-class Suburb: A
Study of Auto Workers in Suburbia. Berkeley: Univ.
of California Press.

CHAPIN, FRANCIS S. (1947) 1955 Experimental Designs
in Sociological Research. Rev. ed. New York: Harper.

CHOMBART DE LAUWE, P. H. 1955 Le logement, le
ménage et l'espace familial. Informations sociales
9:956-980.

DOBRINER, WILLIAM M. 1963 Class in Suburbia. Englewood
Cliffs, N.J.: Prentice-Hall.

FESTINGER, LEON; SCHACHTER, STANLEY; and BACK, KURT
(1950) 1963 Social Pressures in Informal Groups: A
Study of Human Factors in Housing. Stanford Univ.
Press.

GANS, HERBERT J. 1962 The Urban Villagers: Group
and Class in the Life of Italian-Americans. New York:
Free Press.

GUTTMAN, LOUIS 1942 A Revision of Chapin’s Social
Status Scale. American Sociological Review 7:362-369.

JACOBS, JANE 1961 The Death and Life of Great American
Cities. New York: Random House. → A paperback
edition was published in 1962 by Random House and
by Cape.

KUPER, LEO (editor) 1953 Living in Towns. London:
Cresset.




LORING, WILLIAM C. JR. 1956 Housing Characteristics
and Social Disorganization. Social Problems 3:160-168.

LYND, ROBERT S.; and LYND, HELEN M. (1929) 1930
Middletown. New York: Harcourt. → A paperback edition
was published in 1959.

MUSIL, JIŘÍ 1962 The Sociological Approach in Planning
Workers’ Housing: The Experience of Czechoslovakia.
International Labour Review 86:545-566.

SCHORR, ALVIN L. 1964 Slums and Social Insecurity.
London: Nelson.

VAN DEN BROEK, J. H. (editor) 1964 Habitation. 3 vols.
Amsterdam: Elsevier.

VEBLEN, THORSTEIN (1899) 1953 The Theory of the
Leisure Class: An Economic Study of Institutions.
Rev. ed. New York: New American Library. → A
paperback edition was published in 1959.

WARNER, W. LLOYD; and LUNT, PAUL S. 1941 The Social
Life of a Modern Community. New Haven: Yale
Univ. Press.

WHYTE, WILLIAM H. JR. 1956 The Organization Man.
New York: Simon & Schuster. → A paperback edition
was published in 1957 by Doubleday.

YOUNG, MICHAEL; and WILLMOTT, PETER 1957 Family
and Kinship in East London. Institute of Community
Studies, Report No. 1. London: Routledge; Glencoe,
Ill.: Free Press. → A paperback edition was published
in 1963 by Penguin.

ZORBAUGH, HARVEY W. 1929 The Gold Coast and the
Slum. Univ. of Chicago Press.


II
ECONOMIC ASPECTS 


Economic problems, analyses, and policies con- 
cerning housing differ considerably between the 
various types of national economies. Dissimilar 
needs and suggested solutions appear among de- 
veloped as compared to emerging economies. The 
market structure, whether planned, quasi-planned, 
or more laissez-faire, also strongly influences the 
manner in which problems arise. 

Economists have approached the housing question
from three rather disparate points of view:
those of real investment, consumption, and technology.
Most analysis has been concerned with construction
and housing as investment problems.
Major interest has focused on the fact that con- 
struction is a large, significant industry with one 
of the greatest year-to-year variances. This litera- 
ture has dealt with fluctuations and possible cycles. 
However, other investment questions such as ex- 
ploring optimum rates of construction in relation 
to a country’s existing and planned expansion of 
capital and present and expected levels of consump- 
tion have received more attention in recent years. 

For over one hundred years, housing has also 
played a prominent role in the theory of consump- 
tion and social reform. Housing takes a large share 
of consumption budgets. Individual standards of 
living are heavily influenced by the level of housing 
and the amount paid for it. Criticisms of the hous-
ing level achieved are common, and economists 
have sought the reasons why better standards of 
housing do not prevail. 

Other economic problems arise with respect to 
the methods of production. Housing techniques 
have developed over thousands of years, but many 
feel that their rate of improvement has lagged be- 
hind other fields. Studies of production have at- 
tempted to find the economic rationale for existing 
techniques while searching for methods to improve 
the production process. 

New investment in housing. Table 1 shows the 
relative importance of construction and housing in 
different economies. Total construction expendi- 
tures range from 3.5 to about 15 per cent of the 
gross national product (GNP) and from 40 to 70 
per cent of gross fixed investment. Depending on 
the economy, housing production varies from 2 to 
5 per cent of the GNP. In each case, the magnitude 
of housing production is sufficiently large for its 
movements to cause related repercussions through- 
out the economy. The impact is heightened because 
construction in general, and particularly housing 
construction, is among the most volatile of all 
major industries. Housing fluctuations have had 
large amplitudes. In the United States, for example, 
the value of housing production in 1933 was less 
than 13 per cent of its value in 1929. In the post- 
World War II period, while economies experienced
relative general stability, housing remained ex- 
tremely unstable. 

Economists have debated whether the fluctuation 
pattern of building differs basically from that of 
the rest of the economy. The evidence is not clear- 
cut, but most observers note that long swings or 
cycles of 18 to 20 years appear to mark the indexes 
of building construction in both the United States
and the United Kingdom. Some of the literature
on the subject attaches the name of “the building 
or construction cycle” to these observed movements. 
Abramovitz (1962) holds that since the Civil War 
there has been a succession of long swings in ag- 
gregate construction activity, consisting of upsurges 
followed by either protracted declines or pro- 
nounced retardations, in which all the major sec- 
tors of the industry have usually if not invariably 
participated. 

Causes of fluctuations. Many have attempted 
to describe the special factors which could make 
housing fluctuations differ from the rest of the 
economy. As in other areas of cycle theory, some 
have tried to derive simple, constant models for all 
past movements; the majority of theories, however, 
are eclectic. They hold that while housing fluctua- 
tions do take a unique form because of the nature 
of the industry, the proximate cause or causes of 
past movements differ. 

The theories ascribe unique demand, supply, and 
market interactions to the housing market. Unique- 
ness arises from special demand factors, the size 
and durability of houses, the few times the average 
family enters the market and the family’s relative 
ignorance, the importance of credit in facilitating 
construction and purchases, the entrepreneurs’ 
small size and lack of capital, and the difficulties of 
market adjustment. 

A high percentage of construction is that re- 
quired to house or service net additions to house- 
holds. An expanding population, either through 
natural growth as births exceed deaths or through 
migration, requires more dwellings. Another im- 
portant demographic force is the splitting of ex- 
tended families into separate households. Young 
couples may seek their own homes earlier in life 
rather than share units with their parents. The
number of single individuals, particularly widows,
maintaining their own homes may rise.


Table 1 — Patterns of investment, 1960

                                   GROSS FIXED     TOTAL CONSTRUCTION        DWELLING CONSTRUCTION
                       GNP         INVESTMENT      AS PER CENT OF:           AS PER CENT OF:
                   (millions of   (as per cent               Gross fixed               Gross fixed
                     dollars)       of GNP)        GNP       investment      GNP       investment

Argentina             9,444.1        20.9           7.8         37.6          *            *
Canada               37,046.6        22.7          14.9         65.6         4.0         17.7
Chile                 4,892.2         8.7           3.6         41.2          *            *
Germany (F.R.G.)     67,708.2        24.0          12.0         50.2         5.4         22.4
Japan                38,845.0        31.4            *            *          2.3          7.4
Kenya                   630.0        18.4          10.3         55.8         3.0         16.4
South Africa          7,433.5        20.7          11.8         57.2          *            *
Sweden               12,287.0        21.9          14.1         64.5         5.0         22.8
United Kingdom       71,068.8        16.1           7.9         48.7         3.0         18.6
United States       504,404.0        16.4          10.9         66.6         4.5         27.4

* Not available.

Source: Year Book of National Accounts Statistics: 1962. Copyright,
United Nations (1963). Reproduced by permission.
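
The columns of Table 1 are arithmetically linked: construction as a share of gross fixed investment is approximately construction as a share of GNP divided by gross fixed investment as a share of GNP. The short sketch below checks this for three rows of the table; the function name and the 0.5-point tolerance are illustrative assumptions, and small gaps are to be expected because the published figures are rounded.

```python
# Figures copied from Table 1 (1960). Each tuple holds:
# (gross fixed investment as % of GNP,
#  total construction as % of GNP,
#  total construction as % of gross fixed investment, as reported)
ROWS = {
    "Canada": (22.7, 14.9, 65.6),
    "Kenya": (18.4, 10.3, 55.8),
    "Sweden": (21.9, 14.1, 64.5),
}


def implied_share_of_investment(pct_of_gnp, investment_pct_of_gnp):
    """Convert a share of GNP into a share of gross fixed investment."""
    return 100.0 * pct_of_gnp / investment_pct_of_gnp


for country, (gfi, con_gnp, con_gfi) in ROWS.items():
    implied = implied_share_of_investment(con_gnp, gfi)
    print(f"{country}: implied {implied:.1f} per cent, reported {con_gfi} per cent")
```

The same division applied to the dwelling-construction columns relates the last two columns of the table in the same way.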

In the past, waves of household additions appear 
to have resulted from variations in migration and 
from an uneven age distribution in the population. 
Even within given population structures the rate 
of family formation has varied significantly as a 
result of both social and economic forces. 

The need to replace existing units is another 
demand factor. Net replacements have ranged from 
zero to a third or more of total demand. Several 
theories posit a replacement cycle. Required re- 
placements could fluctuate as an echo of bunched 
dwelling construction in the past. However, re- 
placement cycles are not evident in the data. Since 
the housing stock has been expanding rapidly, the 
percentage of the stock old enough to require replacement
is minor. The useful life of dwellings is
extremely long. Few buildings require replacement 
on merely physical grounds; most replacement 
takes place because the location is needed for other 
uses. Replacements appear more closely related to 
the size of the stock and to economic conditions 
than to the age and quality of existing units. 

Over periods such as a decade, most of the de- 
mand for new construction is directly related to the 
number of net additions to the households in a 
locality, plus the net number of units demolished,
replaced, or removed. Movements in unoccupied
units (vacancies) are significant in short periods, 
but their importance diminishes the longer the time 
span examined. 

The short-run movements in demand have 
causes similar to those of fluctuations in other 
spheres. Movements in employment, in assets (par- 
ticularly equities of existing dwellings), in prices, 
costs, and in credit all change the demand for new 
units. Insofar as these variables move with business 
conditions, the resulting housing fluctuations re- 
semble those of other spheres. 

There are also logical reasons to expect that 
construction will at times fluctuate in a pattern 
differing from the remainder of the economy. 
Many authors place particular emphasis on credit. 
Because of size and cost of buildings, construction 
is highly dependent on the existence and terms 
of debt financing. Movements in credit availability, 
the required down payment, the period of amor- 
tization, and the interest rate can force individual 
owners and builders in and out of the market. 
Credit terms may move separately from other eco- 
nomic variables. The length of lag between the 
movements of credit terms and their impact in the 
construction market differs from that of other in- 
dustries. Many writers believe that the post-World 
War II experience in both the United States and
western Europe gave evidence that credit did cause 
special fluctuations. 

Housing demand has two dimensions: one 
consists of the number of separate dwellings re- 
quired; the second measures the amount spent 
per dwelling. This latter amount varies with the 
size, quality, and features included in a house. 
Economic variables appear to have a much larger 
impact on the amount spent per dwelling than 
they have on the number of dwellings demanded. 

When the demand for the services of dwellings 
shifts, still larger fluctuations occur in required 
new investment. This creates greater amplitude 
in the movements of building. The acceleration 
principle is at work: a 2 per cent increase in the 
desire for housing may cause new construction 
demand to double, while in response to a similar 
decrease new construction might fall almost to 
zero. [See INVESTMENT, article on THE AGGREGATE 
INVESTMENT FUNCTION.]
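
The arithmetic behind the acceleration principle can be sketched as follows. The sketch assumes, purely for illustration, that new construction in a normal period equals 2 per cent of the existing stock and that builders respond fully and at once to a change in the desired stock; the function name, the parameter names, and the stock of 1,000 units are hypothetical.

```python
def required_construction(stock, desired_change_pct, baseline_rate_pct=2.0):
    """New units needed in one period under a simple accelerator.

    stock              -- existing number of dwellings
    desired_change_pct -- percentage change in the desired stock
    baseline_rate_pct  -- assumed normal construction as a percentage
                          of the stock (an illustrative figure)
    """
    baseline = stock * baseline_rate_pct / 100
    adjustment = stock * desired_change_pct / 100
    # Construction cannot be negative: an excess of stock shows up
    # as vacancies, not as negative building.
    return max(0.0, baseline + adjustment)


print(required_construction(1000, 0))    # steady demand
print(required_construction(1000, 2))    # desired stock rises 2 per cent
print(required_construction(1000, -2))   # desired stock falls 2 per cent
```

Under these assumptions a 2 per cent rise in the desired stock doubles required construction and a 2 per cent fall removes it entirely, which is the pattern described in the text; in practice the adjustment would be spread over several periods.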

Other reaction differences arise because of pe- 
culiar factors in the market and the real estate 
industry. Supply adjusts slowly. Because of lack 
of information much building, particularly of 
apartment houses, may occur even after an excess 
of supply exists. The “gestation period” for large 
structures is long. Apparent increases in demand 
can easily be multiplied through high expectations, 
easy credit, and the large number of small firms. 
Furthermore, builders or promoters who sell to 
others can often make a profit simply through the 
building process, even if the completed unit is 
unneeded. 

When the new excess supply reaches the market, 
vacancies result. As these build up, new construction
is slowed down. If a demand drop occurs with
vacancies already high, as happened after 1929, 
the disruption of the market is heightened. Since 
the number of units required has a low price 
elasticity, no fall in prices is sufficient to fill the 
vacant space.

Stabilization policies. Because the industry's 
structure makes large-scale fluctuations probable, 
interest has attached to governmental policies 
which might dampen some cycles. Suggested poli- 
cies have included better market information, 
credit controls, varying subsidies, and direct gov- 
ernmental action either to remove old units from 
the stock or to purchase new ones. 

Stabilization programs may attempt to increase 
the long-run basic demand or to concentrate more 
demand in particular periods. Attempts to offset 
short-run movements are difficult. Forecasting 
problems, the complicated lag situation, and a
slow reaction time all make it extremely difficult
to inject the stabilizing demand into the market 
at the necessary times. 

Other investment problems. Developed coun- 
tries have studied housing because of its fluctua- 
tions. In developing economies, housing investment 
has been looked at primarily as a major drain on 
limited funds. Industrialization and urbanization 
go hand in hand; individuals and families leave 
the farm village to work in the city, where they 
must be housed. Each industrial job requires addi- 
tional investment in the urban infrastructure. 
Analysts cannot consider merely the direct invest- 
ment requirements and the payoff for a transfer 
of employment from farm to factory. They must 
take into account many additional costs, of which 
housing is a major item. Investments in hous- 
ing tend to be expensive and slow. In developing 
economies, slums and urban growth are virtually 
synonymous. Decisions as to how to meet these 
additional needs are difficult and complex. Sig- 
nificant social costs result when investment is 
channeled to housing rather than to seemingly 
more productive spheres. On the other hand, slums 
and housing shortages create social costs which 
may in turn reduce the productive effort.

Within developed economies, major debates have 
occurred as to the proper amount of investment to 
be allocated to housing when the economy is at 
full employment. Some feel that housing needs 
were given too low a priority as western Europe 
attempted to fight inflation in the post-World 
War II period. New housing space was rationed
during a period when other parts of the economy 
were booming. 

Consumption of housing. Housing standards 
vary greatly from country to country, and even 
wider ranges of consumption standards exist 
within countries. Almost everywhere some people 
live in shacks while others occupy mansions. Large 
numbers of families would occupy both more 
and higher-quality space if either their incomes 
were increased or the costs of housing lowered. 
The problem of why adequate housing is not made 
available to all income groups has received consid- 
erable analysis, but no generally accepted answers 
are available (Grigsby 1963). 

Table 2 shows the percentage of expenditures 
going to rent. It shows large variations; the average 
spent on rent varies from 1 to nearly 15 per cent
among the countries listed. Within individual coun- 
tries the highest percentage spent is commonly 
double or triple the lowest. 

Table 2 — Percentage of household consumption expenditures spent on rent

                                        AVERAGE (MEAN)      RANGE OF
                                        PER CENT SPENT      PER CENT SPENT
                                        ON RENT             ON RENT

British Guiana          1955-1956            2.9             0.6 to  5.9
France                  1956-1957            5.2             4.3 to  6.4
India                   1950-1951            1.1             0.5 to  1.7
Nigeria                 1953-1955           10.0             9.5 to 10.6
Philippines             1956-1957            9.1             6.8 to 16.5
South Africa            1955                14.6            11.2 to 16.5
Sweden                  1958                11.6             9.6 to 13.5
United Arab Republic    1958                12.4            10.5 to 17.0
United Kingdom          1959                10.3             5.9 to 15.6
United States           1950                11.9             9.5 to 17.9

Source: Compendium of Social Statistics: 1963.


Considerable effort has been expended on the
analysis of these variations. The reasons for some
differences are evident. One area’s rents may
appear to exceed others purely because of defini- 
tions—what is included in rent and the services 
furnished vary. The relative level of housing costs, 
frequently based on differences in climate and 
standards, will be important. Tastes in housing
are also extremely broad, so that what is demanded 
and paid for will differ. Within a given country, 
factors influencing family expenditures are known 
to be numerous. Maisel and Winnick (1960) re- 
port results similar to those of many other studies. 
Tenure (rental or ownership) and the type of 
credit available for ownership cause significant 
expenditure variations. Among a given tenure 
group, income is the most important variable. The 
cross-section income elasticity of housing appears 
to fall between .5 and .6. 
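
As a reading aid, an elasticity in this range can be written out explicitly. The symbols H (a family's housing expenditure) and Y (its income) are notation assumed here for illustration and do not come from the studies cited.

```latex
\[
  \varepsilon_Y \;=\; \frac{\partial \ln H}{\partial \ln Y}
  \;\approx\; \frac{\Delta H / H}{\Delta Y / Y},
  \qquad 0.5 \le \varepsilon_Y \le 0.6 ,
\]
\[
  \frac{\Delta Y}{Y} = 10\%
  \;\Longrightarrow\;
  \frac{\Delta H}{H} \approx 5\% \text{ to } 6\% .
\]
```

Because the elasticity is below one, a family with a higher income tends to spend more on housing in absolute terms but a smaller share of its income.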

Family demographic features are also impor- 
tant. The type of head, the age of head, and race 
are all related to differences in tenure and in 
expenditures within tenure groups. Other factors 
which influence housing demanded and received 
include education, occupation, and the location of 
the family by size and type of community. 

Use of subsidies. Many value judgments have 
been made that people underspend for housing. 
The level of housing purchased in a free market 
by many groups of the population is held to be 
below a desirable standard based on concepts of 
national health and welfare. It is argued that since 
housing of good quality is an expensive item, low- 
income families can purchase or rent sound dwell- 
ings only at the expense of other budget items. 
Since their income is inadequate for all wants, 
they tend to sacrifice the quality of housing. Com- 
pared to sacrifices of food, clothing, or other items, 
poorer housing can more easily be accepted. 

In almost all countries the problem of inadequate 
housing has been met through subsidies. 
These tend to take the form of direct housing 
subsidies rather than general income subsidies 
which could be spent as the low-income family 
desired. Among existing types of subsidies are 
public ownership of dwellings with rents charged 
at less than costs, rent payments, government 
payment of part of the costs of construction, and 
government subsidies in the form of easier or 
cheaper credit. 

The use of housing subsidies rather than income 
subsidies would appear to be based on the assump- 
tion that the housing market is special. It is feared 
that general income subsidies would be dissipated 
through higher rents with only minor improve- 
ments in the quality or quantity of housing. It is 
argued that unless the government uses its ex- 
penditures to assure an increased supply of ade- 
quate housing, the money will be wasted. Various 
reasons for such possible poor results are cited. 
Improper transmission of information in the hous- 
ing market is one. The market structure may be 
such that unless the quality is controlled through 
the form of the subsidy most of the increased 
supply will be poor. Furthermore, if subsidies are 
general the recipients might choose to continue 
to live in housing deemed unsatisfactory by those 
favoring the subsidies. 

Low-income housing programs have frequently 
been teamed with those for slum clearance. It 
is argued that the normal market cannot clear 
slums unless governmental powers are utilized. 
The external economies and diseconomies are such 
that a few property owners could frustrate the 
will of the majority by threatening or attempt- 
ing to gather most of the joint values for them- 
selves. Tying a housing subsidy program to slum 
clearance is not a necessary condition, but it 
simplifies the process. Joint programs make re- 
placement units available for some of the displaced 
households. 

Effects of rent controls. In many periods of 
housing shortages, rent controls have been adopted. 
Arguments for controls depend primarily on con- 
cepts of a fair distribution of income. Additional 
housing can be brought into the market only 
slowly, or, in periods of war or disaster, not at 
all. Since the price elasticity of household forma- 
tion appears to be low, even rapidly rising rents 
would not have much impact on the number of 
dwellings demanded. Higher rents would have to 
be met from already inadequate budgets. These 
higher rents would increase the income of land- 
lords, who are assumed to be among the wealthier 
classes. Rent controls are also used because they 
are comparatively simple to administer. Potential 
capital gains on dwellings can be limited. Whether 
or not it is economically sound, the ability to 
contain one area of prices has obvious practical 
advantages. 

Most arguments against rent controls consider 
difficulties that arise if controls are extended be- 
yond immediate emergencies. It is claimed that 
controls halt market adjustments. They artificially 
raise demand because of lower prices, while they 
hold down supply since higher returns cannot be 
earned. Also over longer periods, the redistribu- 
tion effects tend to alter. Instead of tenants being 
favored over landlords, old tenants are favored over 
new and some industries over others. The costs of 
operating in an artificial market rise with time, 
while the economic advantages fall. 

Production of housing. The types of houses 
produced vary widely. The United States and Eng- 
land mix single-family units and larger apartments, 
with most emphasis given to single-family units. 
In Europe a higher percentage of apartments is 
produced. In many other areas small, simply con- 
structed individual units predominate. 

In most countries, however, the housing industry 
has roughly similar characteristics. Entry is simple, 
and the number of firms is large. The amount of 
overhead and of mechanical equipment required is 
low. The rate of technological innovation is slow, 
and labor tends to be skilled in specific crafts. Dif- 
ficulties arise in training and in planning the pro- 
gression of jobs. People compare housing to other 
industries and usually conclude that it is unpro- 
gressive or backward. 

In many countries popular comment seems to 
attribute the difficulties of the industry to a “devil 
theory.” Progress is thought to be blocked by some 
entrenched group, be it builders, labor, financial 
institutions, bureaucratic government, or others. 
The few economic studies of the housing industry 
(Maisel 1953; Kelly 1959) place much greater 
emphasis on its structure as a normal response to 
the unique problems of building dwellings. 

Houses are primarily assembled and attached 
to individual plots of land. Land costs are signifi- 
cant, and each lot is unique. The joining of the 
land and building is complex and expensive. The 
buildings are large, heavy, bulky, and durable. A 
great deal of the expense in housing construction 
results simply from moving materials from point 
to point rather than from fabrication. In the as- 
sembly process, many different skills must be used 
because the variety of materials and finishes is 
great. 

The industry produces a wide variety of custom- 
made products. Its demand is fickle and fluctuates 
widely; builders must be prepared to expand and 
contract production rapidly. Flexibility is possible 
through drawing on the labor and materials avail- 
able in the much broader area of total construction; 
however, such flexibility tends to be at the expense 
of specialization and large-scale automated pro- 
duction. 

Probably the rate of innovation has been some- 
what greater than is commonly believed. On the 
other hand, it has certainly been slower than in 
most highly industrialized parts of the economy. It 
is virtually impossible to separate the structure of 
the industry and the uniqueness of the product and 
market from the rate of change. Many different 
types of control, of management, and of labor exist 
in various countries; nevertheless, no country or 
form of organization has developed a vastly differ- 
ent product or obviously more efficient techniques. 

There is some indication that because of the in- 
dustry’s fragmented structure, research expendi- 
tures and results are less than elsewhere. Firms 
work on their own product but pay little attention 
to the process as a whole. No firm controls enough 
of the final product to make an over-all point of 
view worthwhile. While there appear to be potential 
external economies, governments have done small 
amounts of research; and their efforts and results 
thus far are minor. 

Many observers conclude that any major changes 
in structure and technique will depend upon the 
discovery of new materials and vastly different 
methods of production. As long as houses continue 
to be built approximately as they were in the past, 
changes can only be gradual. No single part of the 
structure, or individual material, or task is impor- 
tant enough for a saving in its cost to be truly 
significant. More rapid progress appears to require 
the elimination of whole groups of materials and 
functions. 


SHERMAN J. MAISEL 


HOVLAND, CARL I. 


Carl I. Hovland (1912-1961), American pioneer 
in communications research, began his career as 
an experimental psychologist working on classical 
problems of conditioning and human learning. By 
the age of 30, when he turned to the newly devel- 
oping field of research on attitude change, he had 
already become one of the most eminent psycholo- 
gists of his generation. 

The most important of Hovland’s early research 
studies were focused on the generalization of con- 
ditioned responses. During the 1930s, he also made 
significant discoveries concerning factors that in- 
fluence reminiscence effects in human memory 
functioning, the efficiency of alternative methods 
of rote-learning, and the modes of resolution of 
motor conflicts. From 1942 until his untimely death 
from cancer in 1961, Hovland devoted the major 
part of his time to careful investigations of the 
effects of social communication, using research 
designs and analytic methods derived from the 
more highly developed fields of experimental psy- 
chology. It is primarily for his contributions in this 
area that he is regarded as one of the foremost 
social scientists of the twentieth century. Wilbur 
Schramm (1963, p. 5), in reviewing communica- 
tions research in the United States, refers to the 
work that came out of Hovland’s research program 
at Yale University between 1950 and 1961 as “the 
largest single contribution . . . [to this field] any 
man has made.” The Distinguished Scientific Con- 
tribution Award was presented to Hovland by the 
American Psychological Association, in 1957, “for 
his original and provocative contributions to the 
scientific study of persuasive communications and 
the modification of beliefs and attitudes.” The 
citation states further: 


. . . Combining a sensitive use of controlled experi- 
mentation with penetrating logical analysis, he has 
done much to isolate the major factors at work when 
an individual is confronted with the complex informa- 
tional input of a persuasive argument. By judicious use 
of psychological theory, he has been able to relate this 
area of social psychology to basic investigations of the 
higher mental processes. His work has been of central 
importance in advancing attitude research from the 
early stage of merely demonstrating that changes can 
be produced to the point of making predictions about 
when and where they will occur. His work has pro- 
vided a convincing demonstration of the values of a 
sustained and integrated program of research. (Amer- 
ican Psychological Association 1958, p. 158) 


Formative years. A native of Chicago, Hovland 
attended nearby Northwestern University, where 
he devoted himself to acquiring as thorough a 
background as possible in mathematics, physics, 
and biology, as well as in experimental psychology. 
After obtaining his M.A. degree in 1933, he completed 
his graduate studies in psychology at 
Yale University. He remained affiliated with Yale 
throughout his entire academic career, starting as 
an instructor in 1936 (immediately after receiving 
his Ph.D. degree), attaining the rank of professor 
of psychology in 1945 and the chair of Sterling 
professor two years later. 

As a graduate student and junior faculty mem- 
ber during the prewar years at Yale, Hovland par- 
ticipated in the stimulating intellectual environ- 
ment of the Yale Institute of Human Relations, 
which helped to shape his interests and approach 
to the study of human behavior. Of particular importance 
in Hovland’s training was the influence of 
the great American psychologist Clark L. Hull. 
Using a rigorous empirical approach in conjunc- 
tion with analytic theory construction, Hull was 
highly successful during the late 1930s in organ- 
izing and stimulating talented young psychologists 
at Yale to carry out research on significant prob- 
lems of motivation and learning. After serving as 
Hull’s research assistant for several years, Hovland 
became a coinvestigator in the series of studies on 
human learning, which led to his being a coauthor 
of the well-known book by Hull and his collaborators, 
Mathematico-Deductive Theory of Rote 
Learning (1940). Although not sharing Hull's 
predilection for far-reaching theoretical formula- 
tions, Hovland acquired an extraordinary degree of 
methodological sophistication, both from Hull him- 
self and from other specialists whom Hull had re- 
cruited to participate in his research program. Of 
equal importance was the optimistic vision he ac- 
quired that led him to extend the analytic approach 
of experimental psychology to other research areas 
in the human sciences—particularly those suffer- 
ing from a dearth of dependable generalizations in 
the midst of an abundance of vague theoretical 
speculations. 

After he had received his Ph.D. in 1936, Hovland’s 
outlook and approach to social science research 
continued to be fostered by collaboration 
with the staff of Yale’s Institute of Human Rela- 
tions, which was at the height of its influence dur- 
ing the late 1930s and early 1940s. Outstanding 
social scientists from all over the world were 
brought together and given ample time and re- 
sources to pursue the inquiries of their choice. 
Hopes ran high that this would make for rapid 
cross-fertilization among traditionally isolated fields 
and lead to vital new breakthroughs, comparable 
to those emerging from interdisciplinary develop- 
ments in the physical and biological sciences. 
Among the outstanding personalities with whom 
Hovland came in contact were Dusser de Barenne, 
Mark May, Walter Miles, Edward Sapir, and 
Robert Yerkes. 

Although the senior members of the institute 
rarely achieved their high aspirations for interdisciplinary 
advances, the intellectual ferment created 
among the research assistants and junior staff 
members of Hovland’s generation did produce un- 
expected gains. Coming from different social sci- 
ence disciplines, these well-trained young men 
began to influence each other as they examined the 
implications of generalizations that purported to 
account for complex aspects of human behavior. 
Among Hovland’s contemporaries at the institute 
were John Dollard, Leonard Doob, Clellan S. Ford, 
Neal Miller, O. Hobart Mowrer, George P. Murdock, 
Robert R. Sears, and John W. M. Whiting. With 
these men he formed bonds of personal friendship 
and often participated with them in lively semi- 
nars. One well-known product was the collabora- 
tive Yale volume Frustration and Aggression (Dol- 
lard et al. 1939). Several members of this group, 
together with Donald Marquis, Ernest R. Hilgard, 
and Kenneth W. Spence, who were also at Yale 
during the early 1940s, played an important role 
in the development of learning theory. 

Following the lead of Hull, the Yale group at- 
tempted to formulate unambiguous behavioral laws 
concerning the conditions under which habits are 
strengthened and weakened. These laws were then 
used as a basis for explaining complex social phe- 
nomena, such as the displacement of hostility from 
the family to outsiders, observed by specialists in 
such diverse fields as anthropology, psychoanalysis, 
and social psychology. Hovland contributed to the 
work of this group not so much by suggesting com- 
prehensive theoretical insights as by focusing on 
rigorous analysis of empirical evidence. His origi- 
nality took the form of discovering new functional 
relationships by working closely with the available 
findings, noting inconsistencies and reversals that 
others might be inclined to overlook, and then pro- 
ceeding to unravel the puzzles by ingeniously test- 
ing a series of alternative explanations with a new 
set of data. These qualities also characterized his 
later work on communication effects. 

Research on mass communication. In 1942 Hov- 
land took a leave of absence from Yale in order to 
serve as a research expert on morale problems for 
the United States government. He became chief 
psychologist and director of experimental studies in 
the research branch of the information and edu- 
cation division of the War Department. In this 
role, he worked closely with two eminent sociolo- 
gists, Samuel Stouffer, who was then the research 
director of the research branch, and Leonard C. 
Cottrell, senior social analyst in the same organiza- 
tion. For four years Hovland participated in the 
planning of a series of large-scale investigations on 
social psychological factors in military morale; the 
empirical findings from these studies were subse- 
quently incorporated into the American Soldier 
volumes by Stouffer and his collaborators. 

Hovland’s main role in the military research or- 
ganization, however, was to conduct psychological 
experiments on the effectiveness of training and 
information programs, including the series of “Why 
We Fight” films that were intended to influence the 
motivation of men in the American armed forces. 
In his own experimental section of the research 
branch, Hovland assembled a group of six psychology 
graduate students, who worked with him 
on these studies for several years: John Finan, 
Irving L. Janis, Arthur A. Lumsdaine, Nathan 
Maccoby, Fred D. Sheffield, and M. Brewster Smith. 
Although partly oriented toward meeting the prac- 
tical needs of the military services, the studies con- 
ducted by Hovland and his group embodied a re- 
search approach that led to major advances on 
many basic problems in social psychology. 

Following the pattern of his earlier work at Yale, 
Hovland set up investigations designed to test 
hypotheses concerning the conditions under which 
mass communications are effective and to explore 
fully the implications of all the relevant data. But 
instead of confining the research to restricted 
laboratory settings, the mainstay of experimental 
social psychology up to that time, Hovland took 
advantage of the unique opportunities afforded by 
his military research mission. He and his group 
investigated the effects of different types of com- 
munication on “live” issues by conducting experi- 
mental studies with equated groups of soldiers at 
U.S. Army training centers. One of the most widely 
cited of these pioneering communications experiments 
involved testing the effects of a one-sided 
versus a two-sided presentation of a controversial 
issue. The results contradicted some of the well- 
publicized contentions of Nazi propaganda strate- 
gists who claimed that to be successful a com- 
munication should never mention the opposing side 
of an argument. Among men initially hostile to 
the point of view fostered by a communication 
(and particularly those familiar with cogent op- 
posing arguments), it was found to be more effec- 
tive to include mention of the opposing arguments 
than to give a strictly one-sided presentation. 

Many of the investigations by Hovland and his 
group provide systematic data bearing on the 
sources of audience resistance to persuasive efforts 
and call attention to factors that help to overcome 
such resistance. These wartime studies formed the 
basis for a book entitled Experiments on Mass 
Communication by Hovland, Lumsdaine, and Shef- 
field (1949), which was published as part of the 
same series as the American Soldier volumes, 
jointly sponsored by the U.S. War Department 
and the Social Science Research Council. 

The Yale communication studies. After the war 
Hovland returned to Yale University as chairman 
of the department of psychology and was awarded 
a Sterling professorship. Having recruited for his 
department several members of his wartime research 
team, Hovland continued to devote his energies 
to systematic research on communication 
effects. With the support of the Rockefeller Foundation, 
he organized and directed the Yale studies 
in attitude and communication, which enabled a 
large number of junior faculty members and gradu- 
ate students to participate in collaborative research 
on a variety of communication problems of their 
own choice. 

The main purpose of the research project was 
to explore systematically the factors that influence 
the effectiveness of social communications. Hov- 
land himself continued to take a leading role as an 
active research worker, and his own experiments 
set a high standard as models of analytic precision. 
Among his best-known studies are those elucidating 
the influence of the communicator’s prestige and 
the ways that prestige effects disappear with the 
passage of time. Following a lead obtained from 
the wartime research reported in Experiments on 
Mass Communication, Hovland and his collabo- 
rators showed that when a persuasive message is 
presented by an untrustworthy source it tends to be 
discounted by the audience, so that immediately 
after exposure there is little or no attitude change; 
but then, after several weeks, the source is no 
longer associated with the issue in the minds of 
the audience and positive attitude changes appear 
(Hovland & Weiss 1951). This delayed or “sleeper” 
effect was shown to vanish, as predicted, when 
after several weeks the unacceptable communicator 
was “reinstated” by reminding the audience about 
who had presented the earlier persuasive material 
(Kelman & Hovland 1953). 

For more than fifteen years Hovland system- 
atically investigated factors that determine the 
effectiveness of persuasive communications, in- 
cluding studies of different sequential arrange- 
ments of arguments, the retention of arguments 
and conclusions, and judgmental processes that 
enter into attitude change. While pursuing his own 
research, Hovland continually encouraged his as- 
sociates on the Yale project to select other variables 
in line with their own research interests, such as 
the influence of group affiliation, role playing, emo- 
tional appeals, and personality predispositions. The 
major research findings of the first five years of 
the communications research project, together with 
theoretical analyses of the problems under investi- 
gation, were summarized in a volume entitled 
Communication and Persuasion by Hovland, Janis, 
and Kelley (1953). This volume was followed dur- 
ing the next eight years by a series of four multi- 
authored monographs on more specific topics: 
The Order of Presentation in Persuasion (Hovland 
et al. 1957); Personality and Persuasibility (Janis, 
Hovland et al. 1959); Attitude Organization and 
Change (1960b); and Social Judgment (Sherif & 
Hovland 1961). The series of works by Hovland 
and his co-workers, according to Nathan Maccoby 
(1963), furnishes the empirical core of “the new 
scientific rhetoric,” the body of psychological knowl- 
edge accumulated from objective description and 
analysis of the processes of persuasion. 

Research on thought processes. In the last dec- 
ade of his life Hovland’s research on verbal con- 
cepts and judgment led him into an intensive 
analysis of symbolic processes. Once again he 
played a pioneering role in developing a new 
field of research—computer simulation of human 
thought processes. His first major contribution in 
this field was a “communication analysis” of con- 
cept learning (1952) which showed how a newly 
developed mathematical theory could be applied to 
computer simulation of the ways in which people 
form new concepts. His general method of analyz- 
ing concept learning and his notational system 
were soon adopted by many other research workers 
who were conducting experiments on human learn- 
ing and cognitive processes. 

Several years after Hovland’s pioneering paper, 
there were some breakthroughs in the program- 
ming of digital computers, which Hovland immedi- 
ately applied in constructing a computer simulation 
model of the steps a person goes through as he 
thinks out the solution of problems requiring the 
attainment of a new concept. Supported by gener- 
ous research grants from the Ford Foundation and 
the Bell Telephone Laboratories, Hovland and his 
collaborators began devising a series of experi- 
ments in order to obtain some of the missing in- 
formation needed for an adequate theory to account 
for human acquisition of complex concepts through 
experience. One of the main findings, reported in a 
paper by Hunt and Hovland (1960), was that most 
human learners readily make use of information 
about conjunctive concepts (for example, all mem- 
bers of the given class share two characteristics, A 
and B) but tend to ignore information pointing to 
disjunctive concepts (for example, all members of 
the given class possess either characteristic A or 
B). Accordingly, Hovland developed a computer 
model of concept formation in which a hierarchy 
of responses was programmed in such a way that 
the conjunctive concepts would be the first type 
tried out and disjunctive concepts would be scanned 
only after other approaches consistently failed. In 
a highly influential paper entitled “Computer Simulation 
of Thinking” (1960a), Hovland pointed out 
the potential advantages of making use of new developments 
in mathematics and computer technology 
for advancing the human sciences. Many 
research workers are now implementing the mixed 
research strategy he recommended, combining ex- 
perimental studies of human thinking with the 
development of computer programs that simulate 
human psychological processes. 
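
The ordering of hypotheses described above, conjunctive concepts first and disjunctive concepts only after these fail, can be illustrated with a minimal sketch. It is not a reconstruction of Hovland's program: the stimulus features, the function names, and the restriction to two-attribute rules are all assumptions made here for illustration.

```python
from itertools import combinations, product

# Hypothetical binary stimulus features, invented for this illustration.
ATTRIBUTES = ["large", "red", "round"]


def all_stimuli(attributes):
    """Enumerate every stimulus as a dict mapping attribute name to True/False."""
    return [dict(zip(attributes, values))
            for values in product([False, True], repeat=len(attributes))]


def label_with(rule, attributes):
    """Pair each stimulus with its class membership under a given rule."""
    return [(x, rule(x)) for x in all_stimuli(attributes)]


def learn_concept(examples, attributes):
    """Return the first two-attribute rule consistent with every example.

    Conjunctive rules (A and B) are tried first; disjunctive rules
    (A or B) are scanned only if no conjunctive rule fits.
    """
    def consistent(rule):
        return all(rule(x) == label for x, label in examples)

    # Stage 1: conjunctive hypotheses.
    for a, b in combinations(attributes, 2):
        if consistent(lambda x, a=a, b=b: x[a] and x[b]):
            return ("and", a, b)

    # Stage 2: disjunctive hypotheses, reached only after stage 1 fails.
    for a, b in combinations(attributes, 2):
        if consistent(lambda x, a=a, b=b: x[a] or x[b]):
            return ("or", a, b)

    return None  # no two-attribute rule accounts for the examples


if __name__ == "__main__":
    conjunctive = label_with(lambda x: x["large"] and x["red"], ATTRIBUTES)
    disjunctive = label_with(lambda x: x["large"] or x["red"], ATTRIBUTES)
    print(learn_concept(conjunctive, ATTRIBUTES))
    print(learn_concept(disjunctive, ATTRIBUTES))
```

The fixed search order is the point of the sketch: a disjunctive concept is found only after every conjunctive candidate has been rejected, which parallels the reported tendency of human learners.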

Other contributions. Hovland’s influence on 
the methodology of social science research was con- 
sistently directed toward integrating seemingly di- 
vergent lines of research. One of his best-known 
papers deals with the problems of reconciling con- 
flicting results derived from experimental and sur- 
vey studies of attitude change (1959). He pointed 
out that one gets the impression from survey re- 
search that very few people are affected by mass 
communications, whereas experiments on opinion 
change show that from one-third to one-half of the 
audience is influenced by a single exposure to a 
persuasive message. This apparent divergence can 
be accounted for by a number of well-known fac- 
tors that are often overlooked, such as the use of 
captive audiences and remote or unfamiliar issues 
in experimental studies, in contrast to the audi- 
ence’s self-selective exposure and high ego-involve- 
ment in the issues typically studied by survey 
research. Hovland’s recommendation was that the 
two research approaches should be used conjointly, 
“combining their virtues so that we may develop a 
social psychology of communication with the con- 
ceptual breadth provided by correlational study of 
process and with the rigorous but more delimited 
methodology of the experiment” (1959, p. 17). 

At a memorial session of the New England Psy- 
chological Association, held a year after Hovland’s 
premature death, his former students and associates 
recalled his “uncanny ability to integrate and 
focus knowledge” and “to discern the central 
aspects of a problem” while at the same time ful- 
filling his leadership role in a “gentle and sup- 
portive” way. As Herbert Kelman put it, he was 
“the world’s most non-authoritarian leader.” Indeed, 
Hovland welcomed diverse theoretical viewpoints 
and encouraged those working with him to try out 
new research strategies. His incisive comments 
stimulated his co-workers and students to make 
their studies as rigorous as possible and to pursue 
fully the substantive inferences that could be 
drawn from the data. This type of direction, com- 
bined with the atmosphere of freedom of inquiry 
which he consistently fostered, nurtured the talents 
of the many younger psychologists whose names 
appear as coauthors of his books and articles, most 
of whom have subsequently become leading figures 
in American social psychology. 

Not the least of Hovland’s contributions was the 
public service he rendered in the role of a “statesman 
of the social sciences.” As one of the few psychologists 
of his generation elected to the National 
Academy of Sciences, Hovland was invited to be a 
committee member or consultant to the Rockefeller 
Foundation, the Ford Foundation, the Russell Sage 
Foundation, the Bell Telephone Laboratories, the 
Social Science Research Council, the National Re- 
search Council, the Research Development Board, 
the Fund for Adult Education, and a number of 
other private national research organizations, as 
well as to some of the social research agencies of 
the U.S. government. Hovland fulfilled his con- 
sultant role by consistently working to improve the 
standards and quality of research in psychology 
and related fields. 


Perhaps the most comprehensive statement of 
the scope of Hovland's substantive contributions to 
social science research is contained in the citation 
of the Warren medal, awarded by the Society of 
Experimental Psychologists in the last year of his 
life: “For his systematic analyses . . . [in] four areas 
of research—verbal learning, conditioning, concept 
formation and attitude change.” 

IRVING L. JANIS 


[Directly related are the entries ATTITUDES, article on 
ATTITUDE CHANGE; PERSUASION; SOCIAL PSYCHOL- 
OGY; SUGGESTION. Other relevant material may be 
found in COMMUNICATION, MASS; and in the biographies 
of HULL; SAPIR; STOUFFER; YERKES.] 


HRDLIČKA, ALEŠ 


Aleš Hrdlička (1869-1943), a leading American 
physical anthropologist, was born in Humpolec, 
Bohemia (now Czechoslovakia), the oldest of sev- 
en children of Maximilian and Caroline Hrdlička. 
In 1882 he immigrated with his father to New York 
City where he first worked in a cigar factory while 
learning English in night schools. Later he attend- 
ed the Eclectic Medical College in New York City, 
from which he graduated in 1892 at the head of 
his class. To improve his medical standing he then 
attended the New York Homeopathic Medical Col- 
lege and Hospital, also in New York City, gradu- 
ating in 1894, and again topping his class. He 
immediately began interning in the State Homeo- 
pathic Hospital for the Insane at Middletown, New 
York. In 1896, after accepting a research position 
in the Pathological Institute of the New York State 
Hospitals, he went to Paris where, among other 
things, he studied anthropology under Léonce 
Manouvrier. 

Hrdlička’s program at the institute brought him 
into contact with F. W. Putnam of the department 
of anthropology of the American Museum of 
Natural History. This connection led to an invita- 
tion to accompany the explorer and author Carl 
Lumholtz on a trip to Mexico in 1898. The resulting 
experience with American Indians was a turning 
point in Hrdlička’s career; when the Pathological 
Institute closed in 1899, he turned his 
attention entirely to anthropology. From that time 
until the end of 1902, he participated in the Hyde 
expeditions of the American Museum to the South- 
west and to Mexico. 

In 1903 Hrdlička accepted an invitation from 
William H. Holmes to become assistant curator in 
charge of a new division of physical anthropology 
of the United States National Museum in Wash- 
ington. In 1910 he was made curator, which posi- 
tion he held until his retirement in 1942. 

One of Hrdlička’s first acts as assistant curator
was to arrange for the transfer to the National 
Museum of human skeletal material found at 
archeological sites which the Smithsonian Institu- 
tion had deposited at the Army Medical Museum. 
He then devoted his energies to extending the geo- 
graphical, chronological, and racial coverage of the 
collection. At the same time he continued his 
studies of living peoples. The extent of these activi- 
ties can best be seen in the record of his travels: 
1905—southern Arizona and New Mexico (Apache 
and Pima); 1906—Florida; 1909—Egypt and Eu- 
rope; 1910—South America and Mexico; 1912— 
Europe, Siberia, and Mongolia; 1913—Peru; 1915 
—Minnesota and Missouri (Chippewa and Sioux);
1916—Florida; 1917—New England, Virginia, and 
Tennessee (Old Americans); 1918—Florida; 1920 
—Japan, Korea, China, and Hawaii; 1922—Brazil 
and Europe; 1923—Europe; 1925—Europe, India, 
southeast Asia, Australia, and South Africa; 1926- 
1938—Alaska and Commander Islands (Eskimo 
and Aleut); 1939—Europe and Siberia. 

Ultimately Hrdlička accumulated for the museum
one of the world’s largest skeletal collections.
By studying this material he was able to publish a 
large number of scientific reports, including cata- 
logues of crania and monographs on the antiquity 
of man in America and on the most ancient skeletal 
remains from the Old World. He also published 
several reports on living American Indians, two
books on the American white population (1925; 
1931), and a book on anthropometry. 

Hrdlička influenced anthropology through the 
American Journal of Physical Anthropology, which 
he founded in 1918 and edited until 1942, and 
through the American Association of Physical 
Anthropologists, which he founded in 1929, serving
as president in 1930-1931. In these roles,
unfortunately, he tried too much to control and
dominate the thinking in the field.

Hrdlička also gave much of his time to general
scientific organizations and international con- 
gresses. He was largely responsible for the organ- 
ization of the 19th International Congress of 
Americanists held in Washington in 1919. He was
elected a member of the American Philosophical 
Society in 1918 and of the National Academy of 
Sciences in 1921. In 1922 he received an honorary 
SC.D. from the University of Prague, in 1926 the
same degree from the University of Brno, and in 
1927 the Huxley Memorial Medal from the Royal 
Anthropological Institute of Great Britain and 
Ireland. 

On his way to Europe in 1939 Hrdlička suffered 
a heart attack but, after a month in a hospital in 
England, continued on to Siberia. He died of a 
subsequent heart attack in Washington. 


T. D. STEWART 


[For the historical context of Hrdlička’s work, see
PHYSICAL ANTHROPOLOGY.] 
HUIZINGA, JOHAN


Johan Huizinga (1872-1945), the Dutch cul- 
tural historian, was born in Groningen and died 
at De Steeg, a village outside Arnhem. His family 
combined the deep piety of a long line of Baptist 
ministers with a scientific devotion to factual truth, 
exemplified by Huizinga’s father, a doctor and
professor of physiology at Groningen. Although in his
mature years Huizinga subscribed to no religious 
confession, he was a lifelong believer; his noncon- 
formist background was partly responsible for his 
toleration, his respect for the privacy of others, 
his strong ethical stoicism, and his moderation in 
political (even in historical) issues. His respect 
for factual integrity is everywhere apparent in his 
work and became the basis for his attack upon 
totalitarianism. 

After studying Dutch language and literature at 
Groningen and taking his degree in 1896 in com- 
parative philology (on a Sanskrit subject), Huizinga 
began teaching history to high school students in 
Haarlem. Here he did the research for his first— 
and only archival—historical work, a study of the 
rise of Haarlem (1905-1906). Through his men- 
tor, the historian Petrus Johannes Blok, Huizinga 
was appointed professor of history at Groningen 
in 1905: his inaugural lecture, “Het aesthetische 
bestanddeel van geschiedkundige voorstellingen” 
(1905; “The Aesthetic Element in Historical Pres- 
entation”), gave promise of his use of literature 
and the visual arts to interpret history. Until his 
appointment as professor of history at Leiden, how- 
ever, the harvest of this approach remained only 
a promise. 

It was in his Leiden years, from 1915 until his 
death, that Huizinga’s special talents were revealed.
His youthful interest in fin-de-siècle literature
(French, English, and Dutch) and his
knowledge of Flemish “primitive” and modern 
painting provided him with a fresh view of the 
social significance of aesthetic forms. The Waning 
of the Middle Ages (1919), his masterpiece, is a 
study of the forms of life of fifteenth-century 
France and Burgundy, based upon his critical read- 
ing of aesthetic and philological sources normally 
overlooked by the working historian. Huizinga at- 
tempted to do for the late medieval North what 
his acknowledged master, Jacob Burckhardt, had 
done for the Italian Renaissance. 

In 1918 Huizinga published “Mensch en menigte 
in Amerika” (“Man and Mass in America”), an 
extrapolation from American materials worked up 
to introduce his Leiden students to American his- 
tory—itself an event of considerable cultural im- 
portance. After several studies of Renaissance his- 
tory (1920; 1924), he published “De taak der 
cultuurgeschiedenis” (“The Task of Cultural His- 
tory”) in 1929. It was a description and prescrip- 
tion for his students and other historians, in which 
he plainly advocated subsuming to history other 
“philological” disciplines—the law, philosophy, his- 
tories of music, literature, and the fine arts—as 
well as pillaging from the social sciences
conceptualizations to facilitate both the setting of
historical questions and the organization of historical
answers. 

Except for his study of Haarlem, Huizinga made 
no great contributions to political or economic
history; unlike Burckhardt, he did not seem to
regard even the state as a work of art. Huizinga
was primarily interested in “intellectual” history,
the history of the literate classes; but he was
further interested in every aspect of “social” history,
the forms through which men in the past conducted
their lives. From texts normally regarded
as nonhistorical, Huizinga produced evidence for 
forms of collective and private life of past periods 
and places. Like an archeologist, he excavated life 
“as it really was.” His methodological writings from 
this period emphasize the usefulness to historians 
of the insights and methods of sociology and an- 
thropology. 

Although Huizinga never collected quantitative 
data for his own research, he nonetheless respected 
conclusions about social and cultural habits based 
on quantitative analysis. His own preference for 
the work of Max Weber and Ernst Troeltsch, among 
sociologists, and for Marcel Mauss, among anthro- 
pologists, was the result of his admiration of the 
hypotheses they framed and their use of hypoth- 
eses in the analysis of various data. His own com- 
parable work is his study of seventeenth-century 
Dutch culture (1932), the best epitome of that 
complex period ever written; there he advanced 
the thesis that Dutch Calvinism was not of pri- 
mary importance in the formation of Golden Age 
culture. The Waning of the Middle Ages and Homo 
ludens (1938) extrapolate cultural manifestations 
from surrounding historical data in order to draw 
up, more schematically than is usual in historical 
writing, the structure of, in the first instance, 
northern European society in the fifteenth century 
and, in the second, the functions and variations 
of play, playing, and games in human culture as 
a whole. Huizinga was much interested in the
historical work of the group headed by Lucien 
Febvre and Marc Bloch, to be found in the periodical
Annales d’histoire économique et sociale, with
its self-conscious use of social materials and socio- 
logical method. However, despite this sociological 
emphasis, Huizinga reacted unfavorably to the 
work of Freud and seems to have made almost no 
use of Marxian ideas throughout his work. 

From 1935 on, Huizinga’s principal work was 
not historical, although it was certainly cultural, 
or moral-sociological. In that year, only two years 
after Hitler's coup, he published In the Shadow of 
Tomorrow: A Diagnosis of the Spiritual Distemper
of Our Time, a highly polemical study of contem- 
porary mass culture. Although his critics attacked 
his disgust with the cheapness of the radio and of 
yellow journalism as reactionary, Huizinga re- 
garded these as signs of a general loss of value far 
more seriously manifest in the political, social, and
moral behavior of contemporary totalitarianism.
Ten years later, his posthumous Geschonden 
wereld: Een beschouwing over de kansen op herstel 
van onze beschaving (1945; “World in Ruins: A 
Consideration of the Chances of Rebuilding Our 
Civilization”) was published, both a sequel and a 
corrective to In the Shadow of Tomorrow. Like de 
Tocqueville (really, even more than Burckhardt, 
Huizinga’s model) in Democracy in America, 
Huizinga was conservative in his insistence upon 
conscientious individual responsibility and opti- 
mistic in his conviction that by individual recom- 
mitment a collective responsible world could be 
reconstructed.

Homo ludens, Huizinga’s most extraordinary and 
original book, is a study of the “play-element” in 
culture—occidental, oriental, primitive, ancient, 
and modern. He considered all sorts of playing, 
from children’s games to the dialectic of philos- 
ophy and of the law courts. By no means a history 
book—its lesson is psychosociological or even meta- 
social rather than historical—Homo ludens is 
nonetheless the book of a historian who relates 
each specific form of such an activity as play to the 
temporal and local culture of which it is a part. 

A generation earlier, Huizinga might have be- 
come a literary and artistic critic; a generation 
later, he might have become a sociologist. As it 
was, he practiced cultural history as if “high” cul- 
ture and “sociological” or “anthropological” culture 
were mutually dependent. Although he preferred 
the man to the mass, the individual to the crowd, 
he saw the task of the cultural historian to be the 
interpretation of societies, or groups made up of 
individuals. Huizinga’s work has made it difficult 
for historians to ignore the signs of nonpolitical 
social forms within which human beings have al- 
ways lived their lives. 


ROSALIE L. COLIE


[For the historical context of Huizinga’s work, see 
HISTORY, articles on INTELLECTUAL HISTORY and
SOCIAL HISTORY; and the biographies of BURCKHARDT;
MAUSS; TOCQUEVILLE; TROELTSCH; WEBER, MAX.]
HULL, CLARK L.


Clark L. Hull (1884-1952) was the most influ- 
ential figure in the experimental psychology of 
learning during the decades immediately preceding 
and following World War II [see LEARNING; LEARNING
THEORY]. This was not because his researches
were unusually definitive or his theory universally 
accepted. If there was a single cause, it was that 
Hull presented his theoretical ideas with a degree 
of rigor and analytical detail then unfamiliar in 
psychology. A substantial proportion of the re- 
search in learning in this era was formulated in 
terms of Hull’s theory, whether designed to sus- 
tain or to disprove it, and virtually all such research 
at least acknowledged a relationship to Hullian 
conceptions. Although Hull’s theory contained some 
ambiguities and inconsistencies, it set a challeng- 
ing level of explicitness that contributed in an im- 
portant way to the developing image of psychology 
as a science. 

Hull’s ideas drew on many sources. Darwin had 
proposed the principle of natural selection that 
formed a rationale for much of Hull’s theory.
Many of the phenomena of conditioning that Hull 
described and some of the theoretical constructs 
that he espoused had been discussed by Pavlov. 
Freud had asserted the central role of motivation 
in behavior, a position Hull increasingly adopted. 
Watson had established the behavioristic departure 
from earlier introspectionism; Hull followed in the 
less radical, neobehavioristic tradition. Thorndike’s 
law of effect, asserting the important role of rein- 
forcement in learning, was central to Hull's major 
theorizing. Tolman had described the place of the 
intervening variable in behavior theory and had 
drawn the critical distinction between learning and 
performance, which Hull later formalized. Not only 
was Hull indebted to these predecessors, but he was 
unusually careful also to acknowledge the impor- 
tant contributions of his colleagues, notably Neal 
E. Miller and Kenneth W. Spence, in developing 
his approach. 

To this potpourri of influences, Hull added inte- 
gration and purpose. He made more explicit the 
ideas of these and other men and in so doing added 
a wealth of his own conceptual innovations. He 
believed that controlled studies of simple learning 
situations would reveal a core set of principles of 
behavior and that these could then be applied to 
the more complex conditions of individual and 
group behavior. If successful, this program, he felt, 
would have inestimable value for understanding, 
predicting, and, where appropriate, controlling 
behavior.

Hull was born in a log cabin on a farm in New
York State in 1884. His education, which began in 
a rural school, was interrupted by a job as a mining 
engineer and an attack of poliomyelitis that made 
the use of a cane necessary thereafter. He was 34 
when he received a PH.D. from the University of 
Wisconsin. Early in life, Hull began a series of 
personal notebooks in which he recorded a variety 
of facts, ideas, and reflections, hoping to preserve 
the thoughts of his youth for later, less creative 
years. These “idea books” show an early sense of 
destiny and a developing obsession with urgency. 
They also contain many detailed analyses of his 
theory-in-the-making which illustrate his conviction 
that the explicit, formal statement of an idea is 
most likely to reveal its weaknesses. As he said in 
Principles of Behavior, “It is believed that a clear 
formulation, even if later found incorrect, will 
ultimately lead more quickly and easily to a correct 
formulation than will a pussyfooting statement 
which might be more difficult to convict of falsity” 
(1943, p. 398). Hull was uncompromising in his 
expectation of speed, effort, and punctuality from 
his associates and intolerant of slipshod thinking 
by his students. He was generous both with per- 
sonal attention and professional credit to all who 
worked with him. He consciously avoided vindic- 
tive arguments, to which some of the reviews of 
his work could easily have led, but accepted, indeed 
welcomed, considered criticism. 

Hull's early scientific activities are noteworthy 
in themselves. Continuing at the University of 
Wisconsin after obtaining his degree, Hull worked 
in the area of the measurement and prediction of 
achievement. His first major work, Aptitude Test- 
ing (1928), reveals his principal concern—the 
problems of validation of tests. To reduce compu- 
tational time and errors, Hull designed an auto- 
matic correlation machine, a beautifully complex 
unit that, as he said in his autobiography, “really 
worked,” and that illustrates his unusual talent for 
gadgetry. He then became interested in hypnosis 
and conducted a series of experimental studies, 
published in his second major work, Hypnosis and 
Suggestibility (1933), that brought scientific rigor 
to this often quasi-scientific area. Hull moved to 
the Institute of Human Relations at Yale University 
in 1929 and there, at the age of 45, started to 
formalize a mechanistic theory of behavior. He was 
to make his major contribution to psychology in 
this area. [See ACHIEVEMENT TESTING; APTITUDE 
TESTING; HYPNOSIS; SUGGESTION. ] 

The institute housed a vigorous group of workers 
of varying ages, fields, and persuasions, all dedi- 
cated to the concept of a science of behavior. 
Nevertheless, it was 11 years before Hull and several
colleagues formalized a set of learning principles
in Mathematico-Deductive Theory of Rote
Learning (1940), a work less well known than it 
deserves to be, probably because of its use of the 
language of formal logic and mathematics. Three 
years later a more general and readable statement 
of the fundamental principles appeared in Hull’s 
third and most influential major work, Principles 
of Behavior (1943). Another nine years passed be- 
fore Hull's A Behavior System (1952a) described 
selected examples of more complex individual be- 
havior in terms of his theoretical principles. This 
last work was published posthumously; the intend- 
ed application to social and cultural problems re- 
mained to be written at the time of Hull’s death 
in 1952. 

Reinforcement theory of learning. Hull’s theory
can be characterized as a mechanistic stimulus-response
conception of behavior. Learning, Hull
believed, consists in the bonding together of some 
stimulus event (a light, a tone, a maze) with some 
response of the organism (salivating, running, 
turning). Behavior in general is describable in 
terms of hierarchies and constellations of such con- 
nections. Hull did not, however, deny the reality of 
ideas, knowledge, and purpose; indeed, he accepted 
these as obvious descriptive properties of molar be- 
havior. He argued, rather, that these are not the 
basic units of behavior and that they are understandable
in terms of more fundamental stimulus-response
events. He introduced, therefore, the notion
of the fractional, anticipatory goal response,
which assumes that a portion of the reinforcing 
event at the end of a behavior chain can have an 
effect at the beginning of the chain. This response, 
or more properly, its response-produced stimulus, 
serves to guide overt behavior, as if purposefully, 
toward the goal. Whatever the complexity of be- 
havior apparent to direct inspection, it could be 
reduced to simpler principles by the further dis- 
covery of response mechanisms, often hypothetical 
internal responses with no other function than to 
provide the organism with guiding stimuli. 

Principles of Behavior provides the epitome of 
a reinforcement theory of learning: the formation 
of stimulus-response connections is assumed to 
depend on a subsequent reinforcing state of affairs, 
and the degree to which the connection is strength- 
ened depends upon the quantitative properties of 
the reinforcement. That is to say, learning occurs 
only if the response is rewarded, and the larger and 
more immediate the reward, the better the learning. 
Recognizing the possible circularity of this theory 
in the absence of an independent definition of
reinforcement, Hull proposed the drive reduction
hypothesis [see DRIVES]. He assumed that a hungry
rat learns to run through a maze because the turn- 
ing responses at the choice-points are reinforced 
by the eventual reduction in hunger drive oc- 
casioned by eating in the goal box. Furthermore, 
since choice-points near the goal are followed most 
quickly by this reinforcement, Hull correctly in- 
ferred that errors would be eliminated in backward 
fashion, the appropriate responses at the beginning 
of the maze being learned last. These two explicit 
postulates—that learning occurs only if responses 
are reinforced and that reinforcement occurs only 
if a drive is reduced—generated a substantial pro- 
portion of the research of Hull and his colleagues. 

Habit and drive. The stimulus-response connection,
which Hull called habit strength and symbolized
${}_{S}H_{R}$, is itself not the sole determinant of
behavior. In order to appear in overt performance,
habits must be energized by drive. This latter construct,
symbolized $D$, is related to presumed biological
and survival needs of the organism and
species (hunger, thirst, pain, sex) and is assumed
to combine multiplicatively with habit to determine
excitatory potential (${}_{S}E_{R}$), which in turn determines
performance. This equation, ${}_{S}E_{R} = {}_{S}H_{R} \times D$,
formalizes the distinction between learning and
performance and inextricably entangles learning
with motivation. When elaborated and expanded,
the formulation became a powerful source of
hypotheses.
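The multiplicative rule can be illustrated with a small sketch. The function name and the numeric values are hypothetical, chosen only to show that a strong habit with no drive yields no excitatory potential; they are not quantities Hull measured.

```python
def excitatory_potential(habit_strength: float, drive: float) -> float:
    """Return sE_R = sH_R * D, the multiplicative rule described above."""
    if habit_strength < 0 or drive < 0:
        raise ValueError("habit strength and drive are assumed non-negative")
    return habit_strength * drive


# Illustrative values only (arbitrary units, not Hull's measurements).
well_learned_but_sated = excitatory_potential(habit_strength=0.9, drive=0.0)
well_learned_and_hungry = excitatory_potential(habit_strength=0.9, drive=2.0)
poorly_learned_and_hungry = excitatory_potential(habit_strength=0.1, drive=2.0)
```

Because the two terms multiply rather than add, a well-learned response does not appear in performance when drive is absent, which is one way of reading the learning-performance distinction.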

The proper use of this equation requires a so- 
phisticated understanding of both habit and drive. 
Consider first ${}_{S}H_{R}$. The actual stimulus to which a
response is attached is not the distal event as it 
occurs in the environment or even the proximal 
event as it occurs on the receptor. Rather, it is a 
trace of this latter event, which, according to Hull, 
decays over time. Moreover, because stimuli are 
never exactly identical, Hull further hypothesized 
that habit learned to one stimulus generalizes some 
habit strength to other similar stimuli. The postu- 
lates of the stimulus trace and habit generaliza- 
tion illustrate the way Hull derived behavioral 
phenomena. 

Pavlov had shown that if the sound of a bell 
regularly occurs, say, ten seconds before food is 
delivered to a hungry dog, the salivation that had 
occurred only after the food was in the dog’s mouth 
becomes anticipatory, i.e., begins to occur when 
the bell sounds and before the food arrives. Hull's 
theoretical interpretation is that the bell initiates 
a stimulus trace in the nervous system of the dog 
which immediately begins to decay, but which is 
present in some unique intensity ten seconds later
when the food is delivered. Since the food elicits
salivation as an unlearned reflex, and since it also 
provides reinforcement by reducing hunger, the 
appropriate conditions are present for the forma- 
tion of a habit connection between the ten-second 
trace of the bell and salivation. Now, because earlier 
traces of a bell are similar to the ten-second trace 
of that same bell, the habit being formed to the 
latter will generalize to the former. Hence, the dog 
salivates anticipatorily as a result of generalization. 
By making explicit quantitative assumptions about 
the shape of the stimulus trace function and the 
degree of generalization between different traces, 
one can make detailed derivations about the ex- 
pected latency of salivation at different stages of 
training and the importance of the time interval 
separating the ringing of the bell and the presenta- 
tion of the food. 
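A derivation of this kind might be sketched as follows. The exponential decay of the trace, the exponential generalization gradient, the threshold, and every constant are assumptions made for illustration; the text says only that explicit quantitative assumptions are needed, not what they were.

```python
import math

CS_US_INTERVAL = 10.0       # seconds between bell and food, as in the example above
TRACE_DECAY = 5.0           # hypothetical time constant of the stimulus trace
GENERALIZATION_RATE = 4.0   # hypothetical steepness of the generalization gradient
THRESHOLD = 0.5             # hypothetical reaction threshold


def trace_intensity(seconds_since_bell: float) -> float:
    """Assumed exponential decay of the stimulus trace after the bell."""
    return math.exp(-seconds_since_bell / TRACE_DECAY)


def generalized_habit(habit_at_interval: float, seconds_since_bell: float) -> float:
    """Habit conditioned to the ten-second trace, generalized to an earlier trace."""
    difference = abs(trace_intensity(seconds_since_bell)
                     - trace_intensity(CS_US_INTERVAL))
    return habit_at_interval * math.exp(-GENERALIZATION_RATE * difference)


def predicted_latency(habit_at_interval: float, drive: float = 1.0,
                      step: float = 0.1):
    """First moment after the bell at which excitatory potential exceeds threshold."""
    steps = int(round(CS_US_INTERVAL / step))
    for i in range(steps + 1):
        t = i * step
        if generalized_habit(habit_at_interval, t) * drive > THRESHOLD:
            return t
    return None  # no anticipatory response before the food arrives


early_training = predicted_latency(habit_at_interval=0.6)
late_training = predicted_latency(habit_at_interval=0.95)
```

Under these assumptions the predicted latency shortens as habit strength grows with training, which is the qualitative pattern the derivation is meant to capture.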

Hull also expanded the construct of drive. Per- 
haps his most important elaboration is the concept 
of the association of any particular intensity of any 
particular drive (e.g., a rat deprived of food for 
24 hours) and a characteristic drive stimulus which 
differs to some extent from that produced by other 
drives (e.g., deprivation of water) or other inten- 
sities of the same drive (e.g., 12 hours of food 
deprivation). Hence, although Hull’s conception of
drive is that of a general energizer—hunger can as 
well motivate a response learned under thirst— 
drives do have a directing function, because each 
drive stimulus will tend to become selectively at- 
tached to those responses most likely to reduce it. 
Thus, a rat in a maze can learn to turn right when 
hungry and left when thirsty even though, as mo- 
tivators, these drives act indiscriminately on all 
habits. 

Given the fact that there is experimental extinc- 
tion—the weakening and disappearance of re- 
sponses when reinforcement is terminated—Hull 
was faced with two theoretical alternatives. One 
was the assumption that ${}_{S}H_{R}$ itself is weakened by
nonreinforcement. However, there were a variety 
of reasons for believing that learning is a relatively 
permanent process, not the least of which is the 
dramatic evidence of memory Hull had obtained 
using hypnosis. Hence, Hull followed Pavlov in as- 
suming that experimental extinction does not rep- 
resent the loss of habit, but rather the accumula- 
tion of inhibition. Excitatory potential is reduced 
by inhibition resulting from nonreinforcement; the 
dog does not forget how to salivate to the bell; 
instead, learning is inhibited if food no longer fol- 
lows the bell. 

The system was completed by relating excitatory 
potential to performance. One common response 
measure is the probability that a response will occur
on any presentation of a stimulus. To adapt this 
measure to his theory, Hull borrowed the notion 
of a threshold from physiology and assumed that 
excitatory potential is subject to oscillation from 
moment to moment according to an essentially random
distribution. A response will occur only if ${}_{S}E_{R}$
is above threshold at the moment and, if this is so, 
the speed and amplitude of response will depend 
directly upon the momentary value of excitatory 
potential. 
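The link between excitatory potential and response probability can be sketched by assuming that the oscillation is zero-mean and normally distributed. The Gaussian form, the function name, and the numbers are assumptions for illustration; the text specifies only an essentially random distribution.

```python
import math


def response_probability(excitatory_potential: float,
                         threshold: float,
                         oscillation_sd: float) -> float:
    """P(momentary potential > threshold) under assumed Gaussian oscillation."""
    if oscillation_sd <= 0:
        raise ValueError("oscillation_sd must be positive")
    z = (excitatory_potential - threshold) / oscillation_sd
    # Standard normal cumulative distribution evaluated at z.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


at_threshold = response_probability(1.0, threshold=1.0, oscillation_sd=0.5)
well_above = response_probability(2.0, threshold=1.0, oscillation_sd=0.5)
well_below = response_probability(0.0, threshold=1.0, oscillation_sd=0.5)
```

When the mean potential sits exactly at threshold, the response occurs on half of the presentations; the probability then rises smoothly as the potential moves above threshold.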

One of the persistent problems that Hull faced 
in developing his theory was that the intervening 
variables such as habit and drive are inherently 
unobservable by direct inspection and hence cannot 
be measured. Since the theory is cast in quanti- 
tative terms, Hull’s derivations typically consist of 
illustrative examples, with arbitrary values for the 
important variables. He foresaw the importance of 
quantifying these variables in units that might be 
applied to the wide range of situations to which the 
theory was intended to apply. Toward this end, Hull 
undertook a meticulous series of studies employing 
the Thurstone scaling technique to quantify habit 
and drive. Hull valued this work greatly, in keeping 
with his belief that mathematics is a fundamental 
tool of behavioral as well as physical science. 

Stimulus dynamism and incentive motivation. 
Almost as soon as Principles of Behavior was pub- 
lished, major modifications of its theory began. One 
unsolved problem was the role of stimulus inten- 
sity; Hull became convinced that not only do drive 
stimuli have energizing and directing functions, 
but that all stimuli have dynamogenic as well as 
cue properties. Furthermore, the rapid changes in 
an animal's response after a change in the amount 
of reward suggested that reinforcement is a mo- 
tivational variable as well as a learning variable. 
A new equation for excitatory potential therefore 
appeared. It included the concepts of stimulus 
dynamism based on the intensity of the eliciting 
stimulus, and of incentive motivation based on the 
amount and time of delay of reward as additional 
motivational factors multiplying habit. According 
to this new conception, a dog salivates more to a
strong than a weak bell not because he has learned
to salivate better, but because the stronger stimulus
energizes a more vigorous response. A rat runs a
maze faster for a larger reward than for a smaller 
not because he has learned the maze better, but 
because the larger reward provides more incentive 
to run. Motivational factors thus began to dominate 
Hull's learning theory, although he clung to the 
assumption that some reinforcement is required 
for the accretion of habit strength. 
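
In symbols, the revised rule can be sketched as below. The letters $V$ for stimulus dynamism and $K$ for incentive motivation are labels assumed here for illustration, since the text above does not give the symbols.

```latex
{}_{S}E_{R} = {}_{S}H_{R} \times D \times V \times K
```

With ${}_{S}H_{R}$ and $D$ held fixed, a stronger bell (larger $V$) or a larger reward (larger $K$) raises ${}_{S}E_{R}$, and hence performance, without any change in what has been learned.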

Discrimination learning. All these hypotheses
were generated on the basis of data obtained in 
relatively simple learning situations. The logic of 
Hull's program, it will be recalled, is that these 
principles will be applicable, when properly inter- 
preted and possibly expanded, to more complex 
situations as well. 

In A Behavior System, Hull described the fol- 
lowing case, among others: If an organism is con- 
fronted with two stimuli differing, let us say, in 
brightness, and is consistently rewarded if he 
chooses one of these, he will learn to behave adap- 
tively; he will learn to discriminate between the 
stimuli and consistently to choose the rewarded 
alternative. In Hullian language, choice between 
responses is somehow based on excitatory potential. 
The simplest hypothesis is that if several responses 
are in competition the one that has the largest 
excitatory potential will occur. With the choice 
measure specified in this way, discrimination learn- 
ing follows readily from the theory. 

It is first noted that the excitatory potential of 
the positive stimulus will be increased by the re- 
inforcement following responses to it. Although 
generalization will produce some excitatory poten- 
tial for the nonreinforced stimulus, this will be 
reduced by the inhibition developed whenever re- 
sponse is made to it. Thus, continued training will 
lead to a gradual separation of the excitatory poten- 
tials of the two stimuli and hence a developing 
preference for the reinforced one. A number of 
additional and more detailed implications also fol- 
low. For example, the discrimination should be 
easier the more different the two stimuli are, since 
there will be less generalization between them and 
more rapid separation of their excitatory potentials. 
As is true of most useful theories, this analysis has 
more implications than appear at first glance, 
many of which have been derived by Spence 
(1960) from his related theory and tested with 
tolerably good results.
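The separation of excitatory potentials can be illustrated with a toy simulation. The linear growth rules for habit and inhibition, the rates, and the criterion are all hypothetical; the sketch is meant only to show why less generalization between the stimuli should make the discrimination easier, not to reproduce Hull's or Spence's equations.

```python
def trials_to_criterion(generalization: float,
                        criterion: float = 0.5,
                        drive: float = 1.0,
                        learning_rate: float = 0.2,
                        inhibition_rate: float = 0.2,
                        max_trials: int = 200):
    """Count paired trials until the two potentials differ by `criterion`."""
    habit = 0.0        # habit strength conditioned to the rewarded stimulus
    inhibition = 0.0   # inhibition accumulated to the nonrewarded stimulus
    for trial in range(1, max_trials + 1):
        # A reinforced response to the positive stimulus strengthens habit.
        habit += learning_rate * (1.0 - habit)
        positive = drive * habit
        # The negative stimulus borrows habit through generalization ...
        generalized = drive * generalization * habit
        # ... and each nonreinforced response to it adds inhibition.
        inhibition += inhibition_rate * (generalized - inhibition)
        negative = generalized - inhibition
        if positive - negative >= criterion:
            return trial
    return None  # criterion not reached within max_trials


similar_stimuli = trials_to_criterion(generalization=0.8)
different_stimuli = trials_to_criterion(generalization=0.3)
```

In this sketch the pair with less generalization reaches the criterion in fewer trials, in line with the implication stated above.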

Assessment of Hull's theory. Hull correctly fore- 
saw that the details of his theory were tentative 
and likely to require revision as relevant data were 
collected. His principal lasting contribution lies
more in his extensive illustration of an approach to
behavior theory than in the specific theory itself. 
This approach leans heavily upon the availability 
of empirical data, since it involves the postulation 
of intervening variables or empirical constructs 
which are intended to summarize the existing data 
and to generalize upon them for predicting new 
data. For example, it is known that rats run a maze 
to water faster the longer they have been previously
deprived of water. The intervening variable
approach assumes that deprivation of water affects
the organism by producing thirst that potentiates 
the response. It is important to note the two essen- 
tial criteria of an adequately formulated interven- 
ing variable: it must be anchored to some inde- 
pendent variable such as deprivation, and it must 
also be anchored to some dependent variable such 
as response speed. 

Hull believed that intervening variables actually 
exist in the neurophysiological structure of the 
organism, and many of his postulates are couched 
in physiological-sounding terms. Some critics may 
therefore incorrectly assume that the usefulness 
of the theory is dependent upon the adequacy of 
such physiological speculations. But Hull intended 
only to stress the ultimate reducibility of molar 
behavioral events to more predictable molecular 
physiological events and to suggest possible rather 
than specific avenues toward such reduction. For 
example, he felt that his concept of drive might be 
related to endocrine and chemical mechanisms, and 
if this were true, an important link would be estab- 
lished between psychology and physiology. But such 
a result would be a bonus; the value of the ap- 
proach is assessed by its ability to predict behav- 
ioral facts and not by its physiological validity. 

The construct “thirst” integrates a wider set of 
findings than the single one that longer water 
deprivation leads to faster running. The same
independent variable (deprivation of water) also
affects behavior in a number of other ways—the 
amount the rat will drink, the vigor with which he 
will operate a manipulandum, the intensity of elec- 
tric shock he will tolerate in order to obtain water, 
etc. Furthermore, a number of operations other 
than the withholding of water can be found to pro- 
duce the same pattern of behavioral consequences 
(responses), for example feeding dry or salty food, injecting
drugs into the rat’s brain, etc. All of these
empirical relationships are tied together by the single
notion of thirst. A quite similar constellation of 
findings appears with deprivation of food, and the 
construct “drive” integrates at a more abstract level 
the energizing and directing functions common to 
hunger, thirst, and other needs. 

The postulation of the combination rule that 
drive multiplies habit permits novel predictions. 
For example, since the choice between two alter- 
natives depends upon the difference between their 
excitatory potentials, rats will learn a maze faster 
under high than under low drive. This is because 
the developing differential in habit, resulting from 
the reinforcement of only one alternative, will be 
amplified in proportion to the prevailing level of 
drive. When such implications are tested and the
results fail to conform to prediction, the theory
necessarily is modified. 
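
The reasoning behind the prediction of faster learning under high drive can be set out as a short worked equation. The notation is an assumption adopted here for illustration (E for excitatory potential, H for habit strength, D for drive, with subscripts 1 and 2 for the two alternatives), and the numerical values are hypothetical, not data.

```latex
% Combination rule stated in the text: drive multiplies habit.
E_1 = D \cdot H_1, \qquad E_2 = D \cdot H_2
% Choice depends on the difference between the excitatory potentials:
E_1 - E_2 = D \cdot (H_1 - H_2)
% Hypothetical values: the same habit differential under low and high drive.
H_1 - H_2 = 0.2:\quad D = 1 \Rightarrow E_1 - E_2 = 0.2, \qquad D = 3 \Rightarrow E_1 - E_2 = 0.6
```

On these assumptions, the same differential in habit yields a difference in excitatory potential three times as large under the higher drive, which is the sense in which the differential is amplified in proportion to the prevailing level of drive.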

The extension of the simple principles to more 
complex situations follows a similar progression. 
For example, when it is observed that a previously 
neutral stimulus paired temporally with a noxious 
stimulus takes on some of the properties ascribed 
to drive, the richness of the theory can be drawn 
upon to develop the concept of learned drive. The 
principles of learning should describe the way 
drives are acquired, and the principles of perform- 
ance should describe the way such acquired drives 
affect behavior. Dollard and Miller (1950) have 
carefully extended the application of the basic prin- 
ciples along these lines, and they have applied the 
resulting theory to psychotherapy. 

Hull assumed further that these same principles 
could be applied to still more molar social and 
economic situations. This might require some additional
principles not apparent in the simple situations
from which the theory sprang. A boy and a
girl on a date are reacting in accordance with their 
drives and habits just as much as a rat in a maze; 
the drives include a variety of social pressures 
toward conformity, approval, and morality, and the 
habits involve fine discriminations of the cues pro- 
vided by the other person. Hull believed that the 
reduction of such situations to his theoretical con- 
ception was in principle possible, and certainly 
could not be proved impossible a priori. 

Hull’s program, even as far as he carried it, was 
certainly not flawless, and it led psychologists to 
overestimate the imminent attainment of a general 
behavior theory. Hull tended to oversell the pro- 
gram, to offer more than could be delivered, to be 
willing to guess at systematic behavioral facts in 
order to construct the framework of a theory. Partly 
as a reaction to this, subsequent workers have 
tended to have more modest goals, to build mini- 
ature models, or to delve in detail into circum- 
scribed aspects of the larger problem. When and 
where behavior theory on the grand scale will again 
be promoted cannot be predicted. Hull, for one, was 
confident that it could be achieved, that it would be 
achieved, and that his work would provide a useful 
stepping stone to the goal. His last book concludes 
with this hope: 


And finally, the crowning achievement of all will be 
the creation of a really quantitative system of social 
behavior. . . . It seems incredible that nature would 
create one set of primary sensory-motor laws for the 
mediation of individual behavior and another set for 
the mediation of group behavior. Presumably, then, the 
laws which are derived for social behavior will be 
based for the most part on the same postulates as those 
which form the basis of individual behavior. If this
turns out to be true, we are even now an appreciable 
distance on our way toward the ultimate goal of inte- 
grating the individual-social sciences with the group- 
social sciences. (1952a, pp. 355-356) 


FRANK A. LOGAN


[See also the biographies of THORNDIKE and TOLMAN.]
HUMAN RIGHTS


The subject of this article is the international 
concern with human rights. The article cannot, 
however, concentrate exclusively on the activi- 
ties of international organizations and on con- 
ferences which purport to promote respect for 
human rights. The instruments, institutions, and 
operations of the international organizations of our 
day have roots in the philosophical, constitutional, 
and legal developments of many nations spread 
over many centuries. The work on the international 
plane has in its turn had considerable influence on 
constitutional, legal, and political developments 
within nation-states, not least on the written law 
of many of the states of Asia, Africa, and the Carib- 
bean which have recently acceded to independence. 


Origins and basic concepts 


The expression “human rights,” as a term of art, 
is of recent origin. Even in its French-inspired form 
“rights of man” (droits de l'homme), it goes back 
only to the last decades of the eighteenth century. 
The idea, however, of the law, or the lawgiver, 
defining and protecting the legal rights of men— 
mainly the mutual rights of the members of the 
community—is very old indeed. It would, perhaps, 
be somewhat farfetched to look for elements of the 
protection of human rights in the Code of the 
Babylonian king Hammurabi (about 2130 to 2088 
B.C.), the most ancient code of law at present 
known. The sanctions which it provides in trying 
to protect worthy human-rights objectives (such as 
the administration of justice, marriage, and the 
family) are so disproportionately cruel that it is 
preferable to disregard this legislator in our con- 
text. However, as Rudolph von Jhering pointed out 
a century ago, the law of ancient republican Rome 
guaranteed to the Roman citizen (not to the for- 
eigner or to the slave) the right to take part in 
the government of his country by participating in 
the exercise of the power of legislation, in the ad- 
ministration of criminal justice, in electing public 
officials, and even in having a share in the police 
power (Jhering 1852-1878).

An eminent American scholar has said (Yntema 
1958) that the concepts of the Roman civil law,
which were formulated by the Roman juridical
genius in order to render justice in the mutual 
relations of individual men, are in essence a prac- 
tical definition of the rights of man and a reason- 
able and authoritative criterion to which those who 
seek justice and protection of the inherent dignity 
of the human person can appeal. This holds true, 
in Yntema’s view, of the Roman law as it was 
applied on the continent of Europe and of the 
common law of the Anglo-Saxon countries. The 
common law and the civil law, so different in their 
institutions and techniques and, at the same time, 
so similar in their criteria of what is fair, offer 
an objective yardstick for judging conduct in terms 
of individual rights and freedoms. These systems 
have, of course, also tolerated institutions and prac- 
tices which are inconsistent with the modern con- 
ception of a public order protecting human dignity. 
Nevertheless, through many centuries communities 
have existed where at least part of what are now 
considered to be fundamental human rights were 
well protected by elaborate and refined bodies of 
law. 

In England of the seventeenth century battles 
were fought against the nonobservance of the an- 
cient rights of Englishmen. Out of these struggles 
there came two great documents: the Petition of 
Right of 1628 and the Bill of Rights of 1689. These 
did not purport to define the basic human rights of 
all mankind. They were intended to give relief for 
specific grievances by limiting the power of the 
king and by strengthening the power of Parliament 
and of the courts. Their ideas and even their texts 
are, however, reflected in the work of the American 
and French revolutionaries of the eighteenth cen- 
tury: in the immortal passages of the American 
Declaration of Independence, in the Virginia bill 
of rights of 1776, in the French Declaration of the 
Rights of Man and of the Citizen, and in the 
American bill of rights. 

In the course of the nineteenth and twentieth 
centuries, the example set by the United States and 
France of adopting bills of rights or otherwise em- 
bodying such rights in their constitutions, was fol- 
lowed on the entire continent of Europe, and the 
movement spread to the Americas, Asia, and Africa, 
but until very recently Britain and the British 
dominions and possessions remained aloof from 
this movement. 

The Russian Revolution of 1917, while following 
the American-French precedent in the form of its
pronouncements, gave it a fundamentally different 
substance. The difference lies not in the emphasis 
on economic and social rights in addition to the 
traditional political and civil rights. Such provisions
are also found in other constitutions, for example,
in the Mexican constitution of 1917, in the Wei- 
mar constitution of Germany of 1919, and in the 
constitution of the Republic of Spain of 1931. The 
difference between the Soviet pronouncements and 
their Western predecessors lies in the complete 
transformation of the meaning of political and civil 
rights. 

This basic difference in concepts becomes clear 
when one compares the provisions of the American 
bill of rights with the corresponding provisions of 
the first Soviet constitution. The first amendment 
to the American constitution provides, among other 
things, that Congress shall make no law abridging 
the freedom of speech or of the press; the Russian 
constitution of 1918 “for the purpose of securing 
freedom of expression to the toiling masses,” abol- 
ishes all dependence of the press upon capital, and 
turns over to the working people and the poorest 
peasantry “all technical and material means for 
the publication of newspapers, pamphlets, books,” 
etc. The American constitution prohibits abridg- 
ment of the right of the people peaceably to assem- 
ble, while the Lenin constitution, in order to ensure 
complete freedom of assembly to the working class 
and to the poorest peasantry, offers “all premises 
convenient for public gatherings together with 
lighting, heating, and furniture.” 

The Soviet constitution of 1936, while changing 
the wording, has, by and large, maintained this 
general approach. The Western constitutional ideas 
were designed to prevent interference with funda- 
mental rights mainly, though not exclusively, by 
the public authorities. The Soviet concept does not 
treat of this aspect at all. It promises to make avail- 
able technical facilities; it does not promise freedom 
in the choice of the purposes for which they will 
be used. In trying to draw conclusions from the 
existence in so many legal systems of catalogues of 
rights, one must keep in mind these differences in 
the basic concepts. 


Human rights and international law 


Attempts to seek a foundation for the rights of 
the individual in the law of nations go back to an 
early stage in the history of international law. The 
work of the Spanish theologian-lawyer Francisco
de Vitoria (born 1480, died 1546) is perhaps the
first attempt to use legal reasoning, moral principle, 
and political courage in support of a cause which 
we might consider as involving human rights as 
well as international law. He lectured about “the 
Indians recently discovered” and pleaded for their 
rights vis-à-vis the Spanish conquerors. The great- 
est figures in the literature of international law in 
the seventeenth and eighteenth centuries exercised
a powerful influence on the growth of the concept 
of the inalienable rights of man. In 1950 Sir Hersch 
Lauterpacht characterized the close connection be- 
tween human rights and international law by say- 
ing that “the law of nations, in itself conceivable 
only as being above the legal order of sovereign 
States, is not only a law governing their mutual 
relations but is also, upon final analysis, the uni- 
versal law of humanity in which the individual 
human being as the ultimate unit of all law rises 
sovereign over the limited province of the State” 
(p. 120). 

In the field of action, as distinct from scholar- 
ship, the international concern with human rights 
has manifested itself in two different ways: by so- 
called “humanitarian intervention” and by the 
adoption of international treaties. 

In traditional international law it was assumed 
that a state had the authority to treat its own 
nationals as it saw fit. When, however, the ill- 
treatment by a state of its own population was of 
such an intensity that it shocked the conscience 
of mankind, other states, usually the great powers 
of the period, took it upon themselves to threaten, or 
even to use, force in order to come to the rescue of 
the oppressed population. While resort to this type 
of action was not infrequent, the doctrine under- 
lying it has never become a generally recognized 
part of international law because of the abuse in- 
herent in the concept of “humanitarian interven- 
tion.” 

The protection of oppressed or endangered groups 
by international treaty started in the seventeenth 
and eighteenth centuries in matters of religious 
liberty. In the course of the nineteenth century the 
international treaty was used also to protect ethnic 
and racial groups and to combat the slave trade and 
slavery; in the twentieth century it has been used 
to improve labor conditions, to arrange for the 
supervision of the administration of mandated ter- 
ritories, and to provide under the supervision of the 
League of Nations for certain rights of racial, re- 
ligious, or linguistic minorities in a number of 
states, mainly in central and eastern Europe. 


The United Nations and human rights 


Not until the world had passed through the 
tragic events of World War II and had witnessed
the barbarous acts committed by the totalitarian 
regimes of that period was the universal organiza- 
tion of the international community, the United 
Nations, charged with some responsibilities in the 
matter of human rights, and its members pledged 
themselves to take action for the achievement of 
universal respect for and observance of human
rights and fundamental freedoms for all. 

The San Francisco Charter of 1945 through 
which the peoples of the United Nations reaffirmed 
their faith in “fundamental human rights” did not 
define these rights. The charter has made it abun- 
dantly clear, however, that one particular activity 
at least is repugnant to it: discrimination on the 
grounds of race, sex, language, or religion.

Nor has the charter established specific inter- 
national machinery for the enforcement of its 
human-rights provisions, except for arrangements 
under the trusteeship system. But it has created 
organs of general and wide competence—the Gen- 
eral Assembly, the Security Council, and the Eco- 
nomic and Social Council—and it has laid the 
foundations for the establishment of an unlimited 
number of subsidiary bodies, including commis- 
sions in the field of human rights. 

The General Assembly, and to some extent also 
the Economic and Social Council, have not hesi- 
tated to use their general powers of investigation 
and recommendation to take action of varying 
character and intensity in such situations as viola- 
tions of religious or political freedom in eastern 
Europe, race conflict in South Africa, forced labor 
in various parts of the world, infringement of trade 
union freedom, practices violating the human dig- 
nity of women in less developed communities, the 
status of Buddhists in South Vietnam, and many 
others. The fight against colonialism has been one 
of the characteristics of the international scene in 
the post-World War II world. Rightly or wrongly,
the majority of governments have, to a large ex- 
tent, identified the struggle against colonial domi- 
nation with the struggle for human rights. 

International bill of rights. Since the United 
Nations Charter had not defined human rights and 
had not created special international institutions 
for their enforcement, it was widely assumed in 
1945 that this would soon be done in an “Interna- 
tional bill of rights.” In 1947-1948 it was decided 
that this “bill” would consist of two or more docu- 
ments: a declaration, a covenant, and “measures 
of implementation.” In 1948 the General Assembly 
proclaimed in a resolution the first part of this bill 
of rights as the Universal Declaration of Human 
Rights. The drafting of the other parts of the bill 
has not yet been completed; according to later de- 
cisions, they are to consist of two covenants, one 
on civil and political rights, the other on economic, 
social, and cultural rights, with provision for inter- 
national supervision of their implementation. (By 
the end of 1963 all the general and substantive 
provisions of the two covenants had been approved
on the General Assembly level, while, as of 1965,
the international procedural arrangements have
still to be agreed upon.)

The declaration of 1948 is the only world-wide 
official document where the human rights of which 
the charter speaks are set forth. Its range is very 
wide. It proclaims not only the traditional political 
and civil rights and freedoms of its national prede- 
cessors but also “economic, social and cultural 
rights.” The declaration has, to some extent, filled 
the gap created by the delay in completing the 
covenants and acquired a status different from and 
more important than the one which was originally 
intended for it. It has been used by the United 
Nations, by other international organizations and 
conferences, and by governments as a yardstick to 
measure the compliance by governments with the 
obligations deriving from the charter in matters of 
human rights. It has penetrated into international 
conventions, national constitutions, and legislation, 
and even, in isolated cases, into court proceedings. 

The technique of developing international stand- 
ards by proclaiming instruments of the declaration 
type has been frequently used, the most potent 
post-1948 example being the Declaration on the 
Granting of Independence to Colonial Countries 
and Peoples, of 1960. Additional human-rights dec- 
larations were also adopted: the Declaration of 
the Rights of the Child, in 1959, and the United 
Nations Declaration on the Elimination of All 
Forms of Racial Discrimination, in 1963. In other 
fields examples are the declarations on Permanent 
Sovereignty Over Natural Resources, 1962, and on 
Legal Principles Governing the Activities of States 
in the Exploration and Use of Outer Space, 1963. 
Many more are in various stages of drafting and 
preparation. 

Conventions on specific human-rights subjects. 
While the work of drafting the comprehensive cov- 
enants on human rights has been going on, the 
United Nations and its specialized agencies have 
produced a considerable body of treaty law on the 
subject of human rights by adopting and putting 
into force conventions on more limited subjects. 
Important examples are the Freedom of Association 
and Protection of the Right to Organize Conven- 
tion, 1948; the Genocide Convention, 1948; the 
Convention on the Political Rights of Women, 
1952; the Conventions on the Status of Refugees, 
1951, and of Stateless Persons, 1954; the Conven- 
tions on the Reduction of Statelessness, 1961, and 
on the Nationality of Married Women, 1957; the 
Supplementary Convention on the Abolition of 
Slavery, 1956, and the Convention on the Abolition 
of Forced Labor, 1957; the Discrimination (Employment
and Occupation) Convention, 1958, and
the Convention against Discrimination in Educa- 
tion, 1960; the Convention on Consent to Marriage, 
Minimum Age for Marriage and Registration of 
Marriages, 1962; and the International Convention 
on the Elimination of All Forms of Racial Discrimi- 
nation, 1965. 

Regional arrangements. In the years during 
which the United Nations has been slowly working 
its way toward the completion of the world-wide 
covenants, much more rapid progress was achieved 
in western Europe. Under the auspices of the Coun- 
cil of Europe, the European Convention of Human 
Rights of 1950 came into being, supplemented by 
protocols of 1952 and 1963 and by the European 
Social Charter of 1961. By the convention, the 15 
European states which are parties to it have under- 
taken to secure to everyone within their jurisdiction 
the rights which it defines—most of the traditional 
political and civil rights. The convention estab- 
lished two new organs to ensure the observance of 
the engagements undertaken by the parties: the 
European Commission on Human Rights and the 
European Court of Human Rights. 

The existence of the court is significant pri- 
marily as a symbol. Its jurisdiction is very nar- 
rowly circumscribed. It was established in 1959, 
and by the beginning of 1965 only two cases had 
gone before it. The commission, however, has 
played an important and effective part. Ten of the 
European states (not including the United King- 
dom and France, the latter not being a party to the 
convention at all) have accepted the right of indi- 
viduals to petition the commission and to claim to 
be the victims of a violation of the convention by 
one of the parties. It has been the commission’s 
examination of petitions, and especially the exam- 
ination of their admissibility, which offered the 
commission the opportunity to build up an impres- 
sive body of case law. And although in the over- 
whelming majority of cases the commission has 
had to reject the petition, its work has transformed 
the abstract idea of the international protection of 
human rights into a concrete, tangible, day-to-day 
task. This operation has laid bare the formidable 
problems and pitfalls inherent in so novel an 
experiment. 

For many years, the Organization of American 
States has tried to establish inter-American institu- 
tions for the promotion of respect for human rights. 
In 1948 the “American Declaration of the Rights 
and Duties of Man” was proclaimed at Bogota; con- 
ventions on political and civil rights of women and 
on territorial and diplomatic asylum were signed 
in 1948 and 1954 respectively. An Inter-American 
Convention on Human Rights providing for the
establishment of a commission and a court of hu- 
man rights, on the European model, was drafted 
by the Inter-American Council of Jurists in 1959. 
At the beginning of 1965 final action on the draft 
had not yet been taken by the political organs. 

Program of practical action. The effectiveness 
and desirability of the international treaty as an 
instrument for the promotion of human rights was 
challenged early in the history of the United Na- 
tions. The most weighty challenge came in 1953 
when U.S. Secretary of State John Foster Dulles 
stated that the new Eisenhower administration did 
not believe in “treaty coercion” and did not favor 
“formal undertakings” as the proper and most effec- 
tive way to achieve throughout the world the goals 
of human liberty. As a consequence the United 
States declared its intention not to become a party 
to any covenant on human rights or to sign human- 
rights conventions of a more limited scope. As we 
have seen, the majority of governments has con- 
tinued to hold that multilateral conventions have 
their place among the endeavors to give effect to 
the human-rights provisions of the charter. Those 
governments consented, however, to a new program 
proposed by the United States, provided it was 
adopted in addition to treaties and not instead of 
them. 

Thus, in 1955-1956 the United Nations started 
a series of new activities: (1) periodic (triennial) 
reporting by states on developments; (2) studies of 
specific rights or groups of rights; and (3) advisory 
services in the field of human rights. A great num- 
ber of governments have participated in the report- 
ing procedure in the first three triennial cycles; the 
Commission on Human Rights has, however, not 
yet arrived at a final policy about the use to be 
made of the reports. The first of the studies was a 
world-wide inquiry and investigation into the status 
of the right to be free from arbitrary arrest, deten- 
tion, and exile. The most fruitful branch of the 
program of advisory services has been the conven- 
ing of mostly regional conferences (called sem- 
inars) to exchange views and experiences on 
important and topical problems, such as the pro- 
tection of human rights in criminal law and pro- 
cedure, the role of the police in the protection of 
human rights, or the status of women in family 
law. It was a series of seminars on judicial and 
other remedies against the illegal exercise or abuse 
of administrative authority which made the institu- 
tion of the “ombudsman” known outside the Scandi- 
navian countries and led to its acceptance in at 
least two other countries (England and New Zea- 
land) and to its consideration elsewhere (Canada 
and some United States jurisdictions). One such
regional conference, held in Kabul, Afghanistan, in 
1964, was devoted to “human rights in developing 
countries,” a basic problem which deserves more 
detailed discussion (Seminar . . . 1964). 


Human rights in developing countries 


Observance of human rights poses problems 
everywhere, and no country’s record, be it highly 
industrialized or economically underdeveloped, is 
faultless. In most of the countries which have ac- 
ceded to independence only recently the problems 
are aggravated. Many of these countries have im- 
placable enemies: poverty, ignorance, disease, and 
inertia. Western commentators have tended to 
underestimate the importance of illiteracy and ig- 
norance, particularly in unsophisticated rural soci- 
eties (De Smith 1964, p. 237). At the Kabul 
conference it was agreed that there could be no 
meaningful exercise of many human rights in a 
country where economic resources were scarce and 
the bulk of the population lived on the margin of 
subsistence. It was argued that in such matters as 
the development of the whole economy and the 
improvement of health, education, and housing, 
the state needed the necessary powers to implement 
its plans even though, meanwhile, the rights of in- 
dividuals or groups might be temporarily curtailed. 
Some speakers wondered, however, whether the 
governments of some developing countries did not 
show an excessive concern for the internal and 
external security of the state and apply unnecessary 
restrictions on human rights.

Developing countries experience difficulties not 
only in regard to economic and social rights, the 
enjoyment of which clearly presupposes a certain 
economic standard, but also in regard to the “classi- 
cal” civil and political rights. A basic prerequisite 
of the right to a fair trial is the existence of an 
adequate number of well-trained lawyers, and 
“crash programs” for their training were recom- 
mended at the Kabul conference. The scarcity of 
competent and efficient public officials is also an 
obstacle to progress in the human-rights field. An- 
other specific problem of the underdeveloped world 
arises in the very sensitive area of “labor mobili- 
zation.” In some countries, it was alleged at the 
conference, it is not yet possible to rely exclusively 
upon systems of voluntary labor to satisfy the basic 
needs of the nation. Recourse has sometimes to be 
had to certain forms of labor mobilization involving 
the compulsory removal of people to areas other 
than those of their residences. It was claimed, however,
that such national labor services should be
distinguished from the practices of forced labor
which were forbidden by international conventions 
(see also Cowen 1964). 


Neither the human-rights provisions of the United 
Nations Charter and the Universal Declaration of 
Human Rights nor the other treaties and declara- 
tions mentioned above have brought about the mil- 
lennium. It is therefore not the purpose of this 
article to convey the idea that the rights set forth 
in these instruments are, in fact, respected and 
observed throughout the world. In 1952, in 1957, 
and, again, in 1962 the General Assembly of the 
United Nations reiterated that notwithstanding the 
obligations arising from the charter and notwith- 
standing the Universal Declaration of Human 
Rights, violations of human rights continue to occur 
in various parts of the world, and that notwith- 
standing some progress the situation in many parts 
of the world was unsatisfactory. 

The task of making the protection of human 
rights general, permanent, and effective still lies 
ahead. This should not lead us, however, to belittle 
the progress that has been achieved on the inter- 
national and national levels since World War 1. 
Whatever the technical position might be in par- 
ticular instances, and however difficult it might 
often be to achieve redress through international 
or national remedies, the traditional concept, that 
the way a state treats those subject to its power and 
jurisdiction is within its unfettered discretion, is a
thing of the past. A vast but asymmetrical structure 
of international obligations has been built, some 
vague, some precise; and injunctions and exhorta- 
tions have been issued, some authoritative, others 
less so. The fate of the individual may still in fact 
be at the mercy of his State, but it is also a matter 
of continuing and increasing international concern. 


EGON SCHWELB 


[See also INTERNATIONAL CRIMES. Other relevant ma- 
terial may be found in CONSTITUTIONAL LAW, articles 
on CIVIL LIBERTIES and CIVIL RIGHTS; INTERNA- 
TIONAL LAW; INTERNATIONAL ORGANIZATION.] 
HUMBOLDT, ALEXANDER VON


Alexander von Humboldt (1769-1859), the most 
famous German of his time, was celebrated as a 
geographer, explorer, and naturalist; he was less 
well known for his valuable contributions to the 
development of the social sciences. He came from a 
Pomeranian family that had been lately ennobled. 
After the early death of his father, a major in the 
Prussian Army, his bourgeois mother ceased to 
keep up the family’s connections with the court; 
instead, she employed excellent private tutors to 
set her two sons on the road traveled by the bour- 
geois elite—the sciences. Humboldt studied at 
Frankfurt on the Oder, Göttingen, Hamburg, and 
Freiberg (in Saxony). C. W. Dohm, Karl Ludwig 
Willdenow, and Georg Forster, his principal teach- 
ers, awakened his interest in political and botanical 
geography and in exploration. In his youth he also 
associated with Goethe, Schiller, and many other 
writers. 

With this background he was exceedingly well 
prepared for his first voyage of exploration, which 
took place between 1799 and 1804. He was accom- 
panied on this voyage by Aimé Bonpland, the 
French physician and botanist. Humboldt explored 
the territories of what are now Venezuela, Cuba, 
Colombia, Ecuador, Peru, and Mexico, sailed up 
the Casiquiare and determined its longitude and 
latitude, climbed Mount Chimborazo to the height 
of 17,900 feet, and suggested various improve- 
ments in mining technology and other aspects of 
the economy to the Spanish colonial authorities. 

After a visit to the United States, where he met 
not only various scientists in Philadelphia but also 
President Thomas Jefferson, he returned to Europe. 
He then settled in Paris, where he proceeded to 
evaluate the results of his expedition and to prepare 
for a new project of exploration in Asia. This latter 
expedition did not take place until 1829, after he 
had spent two years in Berlin. 

The preparation, execution, and evaluation of 
Humboldt’s expeditions were exemplary and won 
him world fame. The 23 volumes of his travel descriptions
(1805-1834) are the most comprehensive
ever published by a private individual. He collaborated
with German, French, and British scholars
to produce the last major achievement of the repub- 
lic of letters of the eighteenth century and so com- 
bined in characteristic fashion the idée encyclopé- 
dique with his belief in the division of labor among 
specialists. 

Humboldt early rejected the purely utilitarian 
point of view in science, stressing instead the value 
of research for the sake of knowledge. This per- 
mitted him to give unbiased consideration to social 
scientific problems. In the field of anthropology, for 
example, he was able to appreciate without preju- 
dice the culture of the Indian tribes of South Amer- 
ica, since he did not believe in higher or lower races 
but rather in the unity of mankind and its destiny 
of liberty. In his celebrated Political Essay on the 
Kingdom of New Spain (1811-1812), he laid the 
foundations of modern regional geography on the 
basis of physical geography, using Mexico as his 
example. His portrait of colonial rule is based on 
statistical evidence, for Humboldt managed to gain 
access to the jealously guarded Spanish colonial 
archives. Both for the social sciences and for geog- 
raphy this early description of Mexico is still a use- 
ful source book. 

Humboldt convincingly refuted Friedrich List’s 
theory of economic stages. Whereas List conceived 
of the transition from hunters and gatherers to 
shepherds and peasants in terms of the develop- 
ment of material capital under Old World condi- 
tions, Humboldt demonstrated that in America 
agriculture must have evolved from gathering, with- 
out the intermediate stage of pastoral life, since no 
large domesticable animals had existed in the New 
World. 

Political scientists should appreciate that Hum- 
boldt was one of the first Europeans to predict 
a great future for the United States. Publication of 
his travels also gave intellectual aid and moral sup- 
port to the South American wars of independence. 

Humboldt’s efforts in the field of demography 
are of enduring value. He introduced the exam- 
ination of the quotient of extremes into population 
statistics, thus making it possible to supplement
the abstract figure of population density by citing 
the low quotient in countries uniformly densely 
(or sparsely) settled and the high quotient in coun- 
tries unevenly settled. He also made allowance for 
population dynamics, furnishing birth and mortality
rates for Mexico. These figures were long unavailable
for such regions.

Humboldt’s old age was devoted to completion 
of his Cosmos: A Sketch of a Physical Description 
of the Universe (1845-1858), in which he expanded
his geographical thinking into a more general
“description of the world in physical terms.”
His great prestige in Germany did much to shift 
attention from predominantly philological studies 
to the natural sciences. 


HANNO BECK 


[See also GEOGRAPHY and POPULATION, article on THE 
FIELD OF DEMOGRAPHY.] 
HUME, DAVID


David Hume was one of the most distinguished 
writers of the eighteenth century. Although he was 
only partly appreciated in his own period, with the 
passage of time his stature has grown, and his 
writings have attracted increasing attention from 
students in many disciplines. 

Born in Scotland in 1711, Hume died in 1776, 
the year of the publication of Adam Smith’s Wealth 
of Nations. His major work, A Treatise of Human 
Nature (1739-1740), lies in the field of philos- 
ophy. But the range of Hume’s writings is vast. In 
a large number of essays (which in 1758 were in- 
corporated into a volume entitled Essays Moral, 
Political and Literary) Hume dealt with the fields 
of economics, politics, and aesthetics, as well as 
with much that would now be regarded as socio- 
logical material. Hume was also a historian. His 
interest in history is reflected in many of his writings,
and in the later years of his life he published
(1754-1762), in serial volumes, a History of Eng- 
land that was a pioneering work in the area. 

Owing to the subsequent compartmentalization 
of the fields that Hume explored, students of his 
thought have tended to treat his work in a com- 
partmentalized fashion. This is difficult to avoid, 
since Hume dealt with a large variety of problems, 
and their analysis often requires specialized train- 
ing. An unfortunate consequence of this, however, 
has been the tendency to overlook the important 
unifying elements in Hume's thought. Philosophers 
have argued that in turning to the essays, Hume 
was abandoning philosophy—allegedly because of 
the poor reception accorded his Treatise, and be- 
cause of his passion for literary fame. Economists, 
on the other hand, have generally treated Hume's 
economic essays (which first appeared in 1752 
under the title Political Discourses) as a substan- 
tially self-contained segment of his thought, bear- 
ing little or no relation to his philosophy. 

Hume, however, made it clear that he intended 
his philosophy to serve as the “capital or center” of 
all the “moral” (i.e., psychological and social) sci- 
ences, or as the center of a general science of 
human experience ([1739-1740] 1958, p. xix). And
as is evident from the prefatory “Advertisement” 
to the Treatise, he hoped, if the Treatise met with 
success, to incorporate a study of the various “moral 
sciences” into a subsequent edition of this work. 
Many of Hume's essays thus display relations to 
the substance of the Treatise; and this is clearly 
the case in his treatment of economics. Hume was 
pre-eminently, and notably more so than his close 
friend Adam Smith, the philosopher-economist of 
the eighteenth century. 

A brief sketch of Hume's general system of 
thought will help clarify the relation between his 
philosophy and his economic thought. The “capital 
or center” of Hume's system consists of the “prin- 
ciples of human nature,” those qualities and rela- 
tions pertaining to the human understanding and 
human passions that Hume believed to be irreduci- 
ble and common to all mankind. Discussed in Books 
I and II of the Treatise, these principles constitute
the analytical phase of Hume's thought. The sec- 
ond and synthetic phase of Hume’s thought consists 
of laws of human behavior in which Hume sought 
to show that various aspects of man’s behavior are 
products of environmental forces operating on 
“human nature.” The framing of these general uni- 
formities is Hume’s concern in his study of the 
various “moral sciences.” An emphasis on psychol- 
ogy is thus a major distinctive characteristic of 
Hume’s treatment of these sciences. 
Hume's interest in history is likewise of major
importance in his thought. This interest emerged 
early in Hume's life and was integral to his approach
to experience. As a philosophical empiricist,
Hume repeatedly stressed the importance of the 
study of history for achieving understanding of 
human experience, pointing out that history con- 
stitutes the exclusive source of our “experiments” 
concerning human nature and human behavior 
(ibid., p. xxiii). Moreover, as both “moral scientist” 
and historian, Hume, by employing his principles of 
human nature, sought to frame historical laws of 
behavior, or laws that may explain basic trans- 
formations in human behavior. This approach he 
termed “natural history” (the word “natural” here 
denoting the “usual” or “probable”). Representing 
an attempt at “scientific history,” this differs from 
conventional historiography, with its stress on 
unique particulars, which predominates in Hume’s 
own History of England. Historical generalizations, 
reflecting the “natural history” approach, interlace 
much of Hume's work. And within the context of 
his economic writings the method of “natural his- 
tory” is of central importance. Its role becomes fully 
apparent when Hume’s economic thought is con- 
sidered at three different levels of analysis: his 
economic psychology, political economy, and eco- 
nomic philosophy. 

Before turning to this, however, it is well to 
consider one other aspect of Hume's thought: his 
treatment of the methodology of science, which, as 
one of the most important of Hume's contributions, 
is likewise relevant to his economic thought. Hume 
considered this question in Book I of the Treatise,
where he was concerned with the basis of our 
“beliefs” concerning empirical events. His position 
represents an attack on the rationalist acceptance 
of a “necessary connection” between such events. 
As Hume argued, all beliefs concerning matters of 
fact are reducible to associations of ideas that are 
separable from each other. All causal relations take 
this form. We thus can “conceive any object to be 
non-existent this moment, and existent the next, 
without conjoining to it the distinct idea of a cause 
or productive principle.” What is conceivable, more- 
over, is possible. And the demonstrability of a “nec- 
essary connection” requires that the opposite be 
shown to be impossible. Hence, “necessary connec- 
tion” between matters of fact (unlike the logical 
relations between propositions, where validity depends
wholly on the principle of noncontradiction)
cannot be demonstrated (ibid., pp. 79-80). 

On what, then, are our beliefs in matters of fact 
based? All that Hume could find here was a psy- 
chological process. More specifically, we come to 
“believe,” say, that a causal relation exists between
two events because, as a result of our repeated 
observation of their contiguity and reoccurrence 
(their “constant conjunction”), the mind—simply 
through a nonrational associative mechanism—so 
firmly links the two that the occurrence of one 
leads us to expect its usual attendant (ibid., 
p. 96). For Hume, “belief” is, in a word, a “habit.” 
The implication of this for “science” is clear: since 
we have no a priori justification for beliefs in em- 
pirical events, the only way we can enlarge our 
understanding of experience is through further 
investigation with a view to establishing additional 
“constant conjunctions.” It must be recognized, 
however, that the notion of the uniformity of nature 
itself, or the view that the future will resemble the 
past, is based on faith. 

Hume’s observations on the prospects for dis- 
covering uniformities in the “moral sciences” are 
also noteworthy. Although he believed that history 
had already provided an adequate basis for framing 
“principles of human nature” ([1739-1779] vol. 4, 
p. 69 in 1898 edition), he was acutely aware of 
the wide historical variability of human behavior 
and the difficulty, especially in view of the range 
and complexity of human passions, of specifying 
how man would respond to changing circumstance 
(ibid., vol. 3, p. 163; [1739-1740] 1958, p. 131). 
With considerable caution, he sought to discrimi- 
nate between different areas of human behavior in 
terms of their tractability to scientific treatment. 
It is very difficult, he pointed out, to discover uni- 
formities in aspects of behavior which are peculiar 
to small numbers of individuals, since in such cases 
the passions involved are delicate and subtle and 
are often affected by imperceptible influences 
([1739-1779] vol. 3, p. 175 in 1898 edition). It is
difficult, also, to establish reliable generalizations 
in the field of “politics” (ibid., vol. 3, pp. 156-157). 
Since mass behavior, on the other hand, is fre- 
quently governed by passions that are “gross” and 
“stubborn,” reliable uniformities may be found 
here. Its dependence upon such “gross” passions 
as avarice, for example, would make it easier to 
account for the “rise and progress of commerce” 
than for the growth of “learning” (ibid., vol. 3, p. 
176). Significantly, it is with a justification of the 
use of generalization in economics (which he con- 
trasted with the chance-ridden field of foreign 
diplomacy) that Hume introduced his economic 
essays ([1752-1758] 1955, pp. 3-4). 

With a view to the central role of psychology and 
history, let us turn now to the substance of Hume’s 
economic thought and consider first his economic 
psychology, which links his economic thought to 
the “capital or center” of all the “moral sciences,”
as developed in the Treatise. Here the analysis
takes the form of a natural history of the “rise and 
progress of commerce” in which Hume sought to 
explain the economic growth of his own general 
period in terms of the impact of changing historical 
circumstance on human passions. In the economic 
essays these passions are regarded as “causes of 
labour.” Because the analysis stems directly from 
Hume’s general treatment of human nature, it is 
notably multidimensional and is considerably more 
complex than the economic psychology typical of 
Hume’s own period. Beyond considering the impor- 
tance of the desire for consumption pleasures as an 
incentive to economic activity, Hume called atten- 
tion to the role played by the desire for “action” as 
well as a desire for “liveliness” (a state of lively 
passion that is common to the experience both of 
consumption pleasures and interesting activity). 
In turn, the desire for “gain,” joined to the desire 
for action, is treated in large part as a desire for 
the trophies of success in the “economic game.” 
All these motives are integral to the “rise and 
progress of commerce,” the initial stimulus to which 
is found in the opening of foreign trade and whose 
self-perpetuating character is traced to the growth 
of new habits of industry (ibid., chapter 2; Rotwein 
1955). 

Political economy, the second level of Hume’s 
economic thought, comprises the main portion of 
his economic essays. Here, where Hume was con- 
cerned with specific aspects of market relations, he 
dealt with several of the principal controversial 
questions of his period: monetary theory, interest 
theory, the issue of free trade, the shifting and 
incidence of taxes, and fiscal policy. At this level 
the role of the natural history of the “rise and 
progress of commerce” is seen in Hume’s critical 
evaluation of the doctrines of his period; for his 
analysis is predominantly concerned with one ques- 
tion: Are prevailing beliefs regarding market rela- 
tions acceptable when considered in the light of 
economic growth and the psychological and other 
factors involved in the growth process? 

A brief summary of several of the major aspects 
of Hume's political economy will serve to make 
clear the general importance of this perspective. 
One of the most frequently emphasized of Hume’s 
contributions to later classical doctrine is his analysis
of the quantity theory-specie flow mechanism.
Here Hume sought to show that there is no ground 
for the mercantilist fear that without trade restric- 
tions a nation will lose its money supply. The 
quantity theory-specie flow doctrine is thus intro- 
duced to show that it is ultimately a nation’s level 
of economic development that determines the quan- 
tity of money it can attract and retain. “I should 
as soon dread, that all our springs and rivers
should be exhausted, as that money should abandon
a kingdom where there are people and industry”
([1752b] 1955, p. 61). Hume's interest theory, in its
emphasis on the importance of real capital (as against
the mercantilist stress on the quantity of money), also
anticipated the classical position. But, more fundamentally,
Hume was concerned to show that the availability of
real capital
is itself determined by the impact of the growth of 
industry on economic motivation. And here he 
made use of all the “causes of labour” introduced 
in his economic psychology. In an industrially ad- 
vancing economy, he argued, there is increasing 
opportunity for “action” in the form of “lucrative 
employment.” This induces frugality both by pro- 
viding an alternative to pleasure seeking (the pre- 
occupation of the idle landowner in an agrarian 
economy) as a mode of gratifying the desire for 
liveliness, and by intensifying the desire for gain, 
or the desire to accumulate the tokens of economic 
success. As the income of the industrious class 
grows, along with its contribution to output, there 
is thus a substantial increase in the supply of real 
savings, and the interest rate inevitably falls 
([1752-1758] 1955, pp. 53-54).

In his treatment of the issue of trade restric- 
tions, Hume recognized that free trade increases 
real income (at any given level of economic devel- 
opment) by bringing about a more efficient alloca- 
tion of resources (ibid., pp. 66, 75, 79)—the 
argument Adam Smith was later to elaborate more 
fully and systematically. Growth considerations, 
however, occupy the center of Hume's analysis: 
meeting the mercantilist argument for trade restric- 
tions on its own ground, he sought to show that, 
far from inhibiting the economic growth of other 
countries, one nation’s economic growth commonly 
contributes to the development of its neighbors. 
An expansion of industry abroad, he emphasized, 
yields new technology that the home country can 
emulate, and since such an expansion increases 
foreign income, it also increases the foreign de- 
mand for home exports. In his detailed treatment 
of the growth of foreign industry that competes 
directly with domestic output, Hume stressed the 
stimulating effects of foreign competition on the 
spirit of industry at home: he argued, for example, 
that such competitive pressure encourages a nation 
to divert unemployed resources to new uses and, 
further, that the resulting diversification will mini- 
mize the impact on employment of any subsequent 
fluctuations of demand in particular markets (ibid., 
pp. 78-81). 

In some respects Hume’s doctrine is a complex
of classical elements and elements exhibiting an
affinity to the mercantilist position (with inconsistencies
between the two remaining unresolved).
But this complex can itself be traced mainly to 
Hume's interest in stressing the importance of de- 
velopmental considerations. For example, Hume 
generally employed the quantity theory of money 
to show that it is economic growth rather than the 
quantity of money that is of basic significance to a 
nation. But with a view to growth considerations 
he was also led to argue that gradual changes in 
the money supply (as against large absolute quan- 
tities of money as such) are desirable. By repeatedly 
stimulating employment in the short run, such 
changes, he contended, will in the long run en- 
hance the spirit of industry itself (ibid., pp. 37-40). 
Similarly, although on quantity theory grounds 
Hume argued that money has no effect on interest 
rates, he also acknowledged that if it affects eco- 
nomic growth, a long-run increase in the money 
supply will lower interest rates (ibid., p. 59). 

Hume's economic philosophy—the third level of 
his economic thought—is contained in his essay 
“Of Refinement in the Arts.” Here, Hume presented 
a moral justification for a commercial and indus- 
trial society and, as a basis for judgment, invoked 
the utilitarian ethic, which he had earlier elaborated
and extensively defended in Book III of the
Treatise and in his Inquiry Concerning the Prin- 
ciples of Morals (1751). On this level of Hume's 
economic thought the perspective of “natural his- 
tory” played an especially important role. In this 
context, where Hume considered the relation be- 
tween economic growth and the happiness of the 
individual, the “causes of labour” (the desires for 
“pleasure,” “action,” and “liveliness”) are treated as
“ends” whose attainment is furthered by the development
of industry ([1752-1758] 1955, pp. 21-22).
His analysis here, both in its detail and in its adop- 
tion of a pluralistic position on human happiness, 
bears a direct relation to his earlier series of essays
on the “good life” entitled “The Epicurean,” “The 
Stoic,” “The Platonist,” and “The Sceptic.” Among 
all ends, Hume placed the greatest emphasis on 
“action” (the response to challenge); and a reading
of “The Stoic” makes clear that, when treating this 
end most broadly, Hume had in mind the value 
that is now generally stressed in justifying a free 
society—the striving for self-fulfillment (Rotwein 
1955, pp. xcv-xcix).

Hume also considered the influence of economic 
development on the intellectual, cultural, moral, 
and political life of society; with regard to politics, 
he argued (in a passage Adam Smith was later to 
cite for its central importance and originality) that 
the growth of political liberty itself is traceable to 
the economic decentralization and individualism 
brought about by the development of commerce
([1752-1758] 1955, pp. 28-29). Making all allowances
for gaps in Hume’s knowledge of medieval
society (with which he compared an industrial 
society) and for other shortcomings in his argu- 
ment, his analysis—in its statement of the case for 
a liberal social order—deserves recognition as an 
early classic. 

Compared to the Wealth of Nations, Hume’s 
treatment of economics is brief. Hume, moreover, 
gave relatively little attention to the questions of 
value and distribution which absorbed much of 
Smith’s analysis, and which were to become the 
dominant interest of the classical and neoclassical 
economists. In its theoretical treatment of psycho- 
logical and historical influences, Hume’s analysis, 
however, is both more comprehensive and more 
penetrating than Smith’s, although Smith was more 
concerned with such influences than the generality 
of the later classical economists. 

Since Hume’s time, however, others have given 
these influences a prominent place in their work. 
The “historical” and “institutional” economists are 
cases in point. Other important themes that occur 
in Hume's work are to be found in contemporary 
economic literature. Most conspicuous is the re- 
vived interest in economic growth and in the cul- 
tural forces associated with this process, a product 
of the concern with underdeveloped economies. 
Also, interest in the question of full employment 
has led to a consideration of the psychology under- 
lying aggregative behavior, while the shift in em- 
phasis from impersonal markets to the role of 
individual units or collective entities (including 
business organizations and trade unions) and to 
the area of imperfect competition generally has 
likewise directed attention to a variety of psycho- 
logical considerations not ordinarily stressed in 
“orthodox” economic analysis. In view of the rapid- 
ity of institutional change in recent decades, it is 
not surprising that the very question of the desir- 
ability of alternative institutional “systems” and the 
consequent normative and historical analysis of 
institutions should now be receiving increasing 
attention.

In various ways, these streams of thought differ 
from Hume's approach. Nonetheless, owing to 
Hume's remarkable capacity to treat important 
issues with a view to relevant psychological and 
historical factors, much of his economic thought 
still retains its freshness. In the standard literature 
it has long been the practice to give Hume relatively
little attention (as one of a group of “predecessors”
of Smith), although over the years several major 
technical aspects of his work have received general 
recognition. Hume’s contribution can be fully ap- 
preciated, however, only when the body of his 
economic analysis as a whole is related to the con- 
text of broad and systematic social inquiry in which 
it was developed. 

EUGENE ROTWEIN 


[Other relevant material may be found in ECONOMIC
GROWTH; HISTORY, article on THE PHILOSOPHY OF
HISTORY; MONEY; UTILITARIANISM; and in the biographies
of MANDEVILLE; RICARDO; SMITH, ADAM;
TURGOT.]